Towards efficient data integration and knowledge management for rice research
发布时间 :2018-05-30  阅读次数 :5823

报告题目:Towards efficient data integration and knowledge management for rice research

报告人:Pierre Larmande1,2,3教授 

1 ICT Lab & LMI Rice, USTH, Hanoi, Vietnam. 

2 UMR DIADE,       IRD, Montpellier, France. 

3 South Green Bioinformatics Platform, Montpellier France.

时  间:2018年6月5日,1:30 - 3:00 pm

地 点:树华多功能报告厅



CV: Prof Pierre Larmande got his PhD degree in computer science – University of Montpellier in France at 2007, Master degree in computer science – University of Montpellier in France at 2000 and Master degree in biochemstry at 1996. Since 2016 he is the Visiting Scientist of USTH University of Hanoi (Vietnam) and Agricultural Genetics Institute (AGI) at LMI RICE & ICTLab. He is also the Chair of the Rice Data Interoperability Working Group ( and the Chair of the 2016 PhenoHarmonIS workshop ( 

Unit DIADE (Plant Diversity Adaptation  and Development) - IRD     

911 avenue Agropolis, BP 64501, 34394 Montpellier cedex 5, France,  

Email: Phone : +84 1 666 018 725  

Web :  

ORCID :      


Abstract: Recent advances in high-throughput technologies have resulted in tremendous increase in the amount of data in the rice research domain. This data explosion in-conjunction with its heterogeneity presents a major challenge in adopting an integrative approach towards research. There is an urgent need to effectively integrate and assimilate complementary information to understand the biological system in its entirety. To this end, the Semantic Web and machine learning approaches offer a stack of powerful technologies for the integration of information from diverse sources and make knowledge explicit thanks to ontologies. We have developed AgroLD, an RDF knowledge-based system that exploits the Semantic Web technology and some of the relevant standard domain ontologies, to integrate information on rice species and in this way facilitating the formulation of new scientific hypotheses. RDF describes a resource and its relationships/properties in the form of simple triples, i.e., Subject-Predicate-Object offering a very convenient framework for integrating data across multiple platforms assuming the platforms share some common vocabularies to describe their objects. These triples can be combined to construct large networks of information (also known as RDF graphs). The objective of this effort is to provide the community with a platform for domain specific knowledge, capable of answering complex biological questions. We present some integration results of the project, which initially focused on genomics, proteomics and phenomics. Currently, AgroLD contains hundreds millions of triples created by annotating more than 50 datasets coming from 10 data sources such as [1] and TropGeneDB [2] with 10 ontologies such as Gene Ontology [3] and Plant Trait Ontology [4]. Our objective is to offer a domain specific knowledge platform to solve complex biological and agronomical questions related to the implication of genes/proteins in, for instances, plant disease resistance or high yield traits. We expect the resolution of these questions to facilitate the formulation of new scientific hypotheses to be validated with a knowledge-oriented approach.


1. Monaco MK, Stein J, Naithani S, Wei S, Dharmawardhana P, Kumari S, et al. Gramene 2013: Comparative plant genomics resources. Nucleic Acids Res. 2014;42. 

2. Hamelin C, Sempere G, Jouffe V, Ruiz M. TropGeneDB, the multi-tropical crop information system updated and extended. Nucleic Acids Res. 2013;41. 

3. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet [Internet]. 2000;25:25―29. Available from:

4. Cooper L, Walls RL, Elser J, Gandolfo MA, Stevenson DW, Smith B, et al. The plant ontology as a tool for comparative plant anatomy and genomic analyses. Plant Cell Physiol. 2013;54:e1.