TY - GEN
T1 - The multiple alignment algorithm for metabolic pathways without abstraction
AU - Chen, Wenbin
AU - Rocha, Andrea M.
AU - Hendrix, William
AU - Schmidt, Matthew
AU - Samatova, Nagiza F.
PY - 2010
Y1 - 2010
N2 - Computational problems associated with metabolic pathways have been extensively studied in computational biology. The problem of aligning multiple metabolic pathways is very challenging. Tohsato et al.'s algorithm for aligning multiple metabolic pathways is based on similarities between enzymes, however, a metabolic pathway consists of three types of entities: reactions, compounds, and enzymes. In this paper, we propose the first algorithm for the problem of aligning multiple metabolic pathways based on the similarities among reactions, compounds, enzymes, and pathway topology. First, we compute a weight between each pair of like entities in different input pathways based on the entities' similarity score and topological structure using the methods by Ferhat Ay et al.. We then construct a weighted k-partite graph for the reactions, compounds, and enzymes. We extract a mapping between these entities by solving the maximum-weighted k-partite matching problem by applying a novel heuristic algorithm. By analyzing the alignment results of multiple pathways in different organisms, we show that the alignments found by our algorithm correctly identify common sub networks among multiple pathways.
AB - Computational problems associated with metabolic pathways have been extensively studied in computational biology. The problem of aligning multiple metabolic pathways is very challenging. Tohsato et al.'s algorithm for aligning multiple metabolic pathways is based on similarities between enzymes, however, a metabolic pathway consists of three types of entities: reactions, compounds, and enzymes. In this paper, we propose the first algorithm for the problem of aligning multiple metabolic pathways based on the similarities among reactions, compounds, enzymes, and pathway topology. First, we compute a weight between each pair of like entities in different input pathways based on the entities' similarity score and topological structure using the methods by Ferhat Ay et al.. We then construct a weighted k-partite graph for the reactions, compounds, and enzymes. We extract a mapping between these entities by solving the maximum-weighted k-partite matching problem by applying a novel heuristic algorithm. By analyzing the alignment results of multiple pathways in different organisms, we show that the alignments found by our algorithm correctly identify common sub networks among multiple pathways.
KW - Alignment
KW - K-partite graph matching
KW - Pathway
UR - http://www.scopus.com/inward/record.url?scp=79951759652&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79951759652&partnerID=8YFLogxK
U2 - 10.1109/ICDMW.2010.21
DO - 10.1109/ICDMW.2010.21
M3 - Conference contribution
AN - SCOPUS:79951759652
SN - 9780769542577
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 669
EP - 678
BT - Proceedings - 10th IEEE International Conference on Data Mining Workshops, ICDMW 2010
T2 - 10th IEEE International Conference on Data Mining Workshops, ICDMW 2010
Y2 - 14 December 2010 through 17 December 2010
ER -