Data quality in biological databases has become a topic of great discussion. To provide high quality data and to deal with the vast amount of biochemical data, annotators and curators need to be supported by software that carries out part of their work in an (semi-) automatic manner. The detection of errors and inconsistencies is a part that requires the knowledge of domain experts, thus in most cases it is done manually, making it very expensive and time-consuming. This paper presents two tools to partially support the curation of data on biochemical pathways. The tool enables the automatic classification of chemical compounds based on their respective SMILES strings. Such classification allows the querying and visualization of biochemical reactions at different levels of abstraction, according to the level of detail at which the reaction participants are described. Chemical compounds can be classified in a flexible manner based on different criteria. The support of the process of data curation is provided by facilitating the detection of compounds that are identified as different but that are actually the same. This is also used to identify similar reactions and, in turn, pathways.
SEEK ID: https://publications.h-its.org/publications/22
PubMed ID: 18629066
Research Groups: Scientific Databases and Visualisation
Publication type: Journal
Journal: Comp Funct Genomics
Citation: Comp Funct Genomics. 2004;5(2):156-62. doi: 10.1002/cfg.387.
Date Published: 17th Jul 2008
Registered Mode: Not specified
Views: 6369
Created: 8th Nov 2017 at 10:45
Last updated: 5th Mar 2024 at 21:23
This item has not yet been tagged.
None