Next-Generation Bioinformatics for Biomedical Research

While traditional bioinformatics has evolved from simple data management to data-interpretation, the emphasis today has shifted to high-throughput data collection, personal medicine, and the analysis of complex systems. This tendency is accompanied by an unprecedented development of new computer architectures and cloud computing that bring the power of supercomputers within arm’s reach of bench scientists and clinical practitioners. At the same time, bionic devices and on-line diagnostic tools open up new areas of applications.

In this fast-evolving scene of new technologies, integrating heterogeneous bioinformatics data is perhaps one of the most challenging tasks. Databases increase both in volume and in complexity, and public resources available on the Internet can not cope with a growing number of user groups, especially medical and industrial users concerned with data confidentiality. On the other hand many, if not most biomolecular mechanisms that translate the human genomic information into phenotypes are not known and as a consequence, most of the molecular and cellular data cannot be interpreted in terms of biomedically relevant conclusions. While personalized diagnostics and cures are likely to remain a dominant trend, the temperate view suggests biomedical applications relying on the comparison of biomolecular sequences and/or on the already known biomolecular mechanisms may have even greater chances to enter clinical practice. Developing stand-alone tools for genome annotation, personalized medicine and high throughput technologies is especially important in the analysis of complex diseases such as neurological and psychiatric disorders.

Project Participants:

Zsolt Gelencsér, PhD student
Áron Erdei, BSc student
Prof. Sándor Pongor, PI

Collaborators

Prof. Mária Judit Molnár
Clinical and Research Centre for Molecular Neurology
Semmelweiss University, Budapest, Hungary
Prof. Frank Eisenhaber
Bioinformatics Institute A*STAR Singapore
Dr. Michael P. Myers
International Centre for Genetic Engineering and Biotechnology, Trieste, Italy,
Roberto Vera
PhD student International Centre for Genetic Engineering and Biotechnology, Trieste, Italy

References

Kertész-Farkas, A., Kocsor, A., & Pongor, S. (2009). The Application of Data Compression-Based Distances to Biological Sequences. In F. Emmert-Streib & M. Dehmer (Eds.), Information Theory and Statistical Learning (pp. 83–100). Springer US. https://doi.org/10.1007/978-0-387-84816-7_4

Kuznetsov, V., Lee, H. K., Maurer-Stroh, S., Molnár, M. J., Pongor, S., Eisenhaber, B., & Eisenhaber, F. (2013). How bioinformatics influences health informatics: usage of biomolecular sequences, expression profiles and automated microscopic image analyses for clinical needs and public health. Health Information Science and Systems, 1(1), 2. https://doi.org/10.1186/2047-2501-1-2

Busa-Fekete, R., Kertész-Farkas, A., Kocsor, A., & Pongor, S. (2008). Balanced ROC (BaROC) analysis for portien classification. Journal of Biochemical and Biophysical Methods, 70(6), 1210–1214. http://www.ncbi.nlm.nih.gov/pubmed/17689617

Kuzniar, A., van Ham, R. C. H. J., Pongor, S., & Leunissen, J. A. M. (2008). The quest for orthologs: finding the corresponding gene across genomes. Trends in Genetics: TIG, 24(11), 539–551. https://doi.org/10.1016/j.tig.2008.08.009

Kertész-Farkas, A., Dhir, S., Sonego, P., Pacurar, M., Netoteia, S., Nijveen, H., Kuzniar, A., Leunissen, J. A. M., Kocsor, A., & Pongor, S. (2008). Benchmarking protein classification algorithms via supervised cross-validation. Journal of Biochemical and Biophysical Methods, 70(6), 1215–1223. https://doi.org/10.1016/j.jbbm.2007.05.011

Sonego, P., Kocsor, A., & Pongor, S. (2008). ROC analysis: applications to the classification of biological sequences and 3D structures. Briefings in Bioinformatics, 9(3), 198–209. https://doi.org/10.1093/bib/bbm064

Kocsor, A., Busa-Fekete, R., & Pongor, S. (2008). Protein classification based on propagation of unrooted binary trees. Protein and Peptide Letters, 15(5), 428–434. https://doi.org/10.2174/092986608784567492

Vera, R., Perez-Riverol, Y., Perez, S., Ligeti, B., Kertész-Farkas, A., & Pongor, S. (2013). JBioWH: an open-source Java framework for bioinformatics data integration. Database, 2013, bat051. https://doi.org/10.1093/database/bat051