Classification of chemical compounds to support complex queries in a pathway database.

Abstract:

Data quality in biological databases has become a topic of great discussion. To provide high quality data and to deal with the vast amount of biochemical data, annotators and curators need to be supported by software that carries out part of their work in an (semi-) automatic manner. The detection of errors and inconsistencies is a part that requires the knowledge of domain experts, thus in most cases it is done manually, making it very expensive and time-consuming. This paper presents two tools to partially support the curation of data on biochemical pathways. The tool enables the automatic classification of chemical compounds based on their respective SMILES strings. Such classification allows the querying and visualization of biochemical reactions at different levels of abstraction, according to the level of detail at which the reaction participants are described. Chemical compounds can be classified in a flexible manner based on different criteria. The support of the process of data curation is provided by facilitating the detection of compounds that are identified as different but that are actually the same. This is also used to identify similar reactions and, in turn, pathways.

SEEK ID: https://publications.h-its.org/publications/22

PubMed ID: 18629066

Research Groups: Scientific Databases and Visualisation

Publication type: Journal

Journal: Comp Funct Genomics

Citation: Comp Funct Genomics. 2004;5(2):156-62. doi: 10.1002/cfg.387.

Date Published: 17th Jul 2008

Registered Mode: Not specified

Authors: U. Wittig, A. Weidemann, R. Kania, C. Peiss, I. Rojas

help Submitter
Activity

Views: 5042

Created: 8th Nov 2017 at 10:45

Last updated: 5th Mar 2024 at 21:23

help Tags

This item has not yet been tagged.

help Attributions

None

Powered by
(v.1.14.2)
Copyright © 2008 - 2023 The University of Manchester and HITS gGmbH