TY - GEN
T1 - Systems biology via redescription and ontologies (III)
T2 - 2008 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008
AU - Mitrofanova, Antonina
AU - Kleinberg, Samantha
AU - Carlton, Jane
AU - Kasif, Simon
AU - Mishra, Bud
PY - 2008
Y1 - 2008
N2 - This paper addresses the protein classification problem, and explores how its accuracy can be improved by using information from time-course gene expression data. The methods are tested on data from the most deadly species of the parasite responsible for malaria infections, Plasmodium falciparum. Even though a vaccination for Malaria infections has been under intense study for many years, more than half of Plasmodium proteins still remain uncharacterized and therefore are exempted from clinical trials. The task is further complicated by a rapid life cycle of the parasite, thus making precise targeting of the appropriate proteins for vaccination a technical challenge. We propose to integrate protein-protein interactions (PPIs), sequence similarity, metabolic pathway, and gene expression, to produce a suitable set of predicted protein functions for P. falciparum. Further, we treat gene expression data with respect to various changes that occur during the five phases of the intraerythrocytic developmental cycle (IDC) (as determined by our segmentation algorithm) of P. falciparum and show that this analysis yields a significantly improved protein function prediction, e.g., when compared to analysis based on Pearson correlation coefficients seen in the data. The algorithm is able to assign "meaningful" functions to 628 out of 1439 previously unannotated proteins, which are first-choice candidates for experimental vaccine research.
AB - This paper addresses the protein classification problem, and explores how its accuracy can be improved by using information from time-course gene expression data. The methods are tested on data from the most deadly species of the parasite responsible for malaria infections, Plasmodium falciparum. Even though a vaccination for Malaria infections has been under intense study for many years, more than half of Plasmodium proteins still remain uncharacterized and therefore are exempted from clinical trials. The task is further complicated by a rapid life cycle of the parasite, thus making precise targeting of the appropriate proteins for vaccination a technical challenge. We propose to integrate protein-protein interactions (PPIs), sequence similarity, metabolic pathway, and gene expression, to produce a suitable set of predicted protein functions for P. falciparum. Further, we treat gene expression data with respect to various changes that occur during the five phases of the intraerythrocytic developmental cycle (IDC) (as determined by our segmentation algorithm) of P. falciparum and show that this analysis yields a significantly improved protein function prediction, e.g., when compared to analysis based on Pearson correlation coefficients seen in the data. The algorithm is able to assign "meaningful" functions to 628 out of 1439 previously unannotated proteins, which are first-choice candidates for experimental vaccine research.
UR - http://www.scopus.com/inward/record.url?scp=58149194996&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=58149194996&partnerID=8YFLogxK
U2 - 10.1109/BIBM.2008.82
DO - 10.1109/BIBM.2008.82
M3 - Conference contribution
AN - SCOPUS:58149194996
SN - 9780769534527
T3 - Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008
SP - 278
EP - 283
BT - Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008
Y2 - 3 November 2008 through 5 November 2008
ER -