Disease and Phenotype Track
The 2021 edition is composed by two tasks requiring the pairwise alignment of:
Currently, mappings between these ontologies are mostly curated by bioinformatics and disease experts who would benefit from automation of their workflows supported by implementation of ontology matching algorithms.
Table 1 summarizes the ontology versions and sources of the ontologies used in the OAEI 2021 (same as 2017-2010). Note that the version and source of HP, MP, DOID and ORDO are different from the ones used in 2016.
We have extracted "baseline" reference alignments based on the available BioPortal mappings (July 8, 2017). Most of the BioPortal mappings are generated automatically by the LOOM system; which should only be considered as a baseline since it is incomplete or may contain erroneous mappings. Consensus alignments can be found here.
The ontologies and baseline mappings for the OAEI 2021 campaign can be downloaded as zip files: Ontologies [25Mb] and BioPortal mappings [0.6Mb].
The evaluation of the Disease and Phenotype Track will be run with support of the MELT framework. This requires that you wrap your matching system in a way that allows us to execute it using MELT (see OAEI 2021 evaluation details).
The phenotype track is a built-in track within the MELT framework. It can also be accessed via the MELT client (see dataset identifiers below).
Required input for the MELT CLI:
This task consists of matching the HP (31,034 classes) and MP (30,273 entities) ontologies. The BioPortal-based alignment contains 696 mappings.
This task consists of matching the DOID (38,240 classes) and ORDO (13,504) ontologies. The BioPortal-based alignment contains 1,237 mappings.