OAEI 2018 results available here.
Special issue announced. See 2016 special issue papers

Ontology Alignment Evaluation Initiative

2018 Campaign

Since 2004, the OAEI has organised campaigns to evaluate ontology matching technologies. This year we will combine tracks running under the SEALS platform and tracks running under the HOBBIT platform. Some tracks will allow both types of participation.

Please check the organizing committee and main contacts of the OAEI 2018 campaign.

We are having a special issue devoted to the participants and evaluation of the OAEI campaigns in the Knowledge Engineering Review journal. As usual, participants will also be invited to present their results during the Ontology Matching Workshop 2018.

See the list of papers published in the Journal of Biomedical Semantics as part of the 2016 special issue on Ontology Alignment in Life Sciences.

Problems

The OAEI 2018 campaign will once again confront ontology matchers with ontologies and data sources to be matched. This year, the following test sets are available:

anatomy: The anatomy real world case is about matching the Adult Mouse Anatomy (2744 classes) and the NCI Thesaurus (3304 classes) describing the human anatomy.
conference: The goal of the track is to find alignments within a collection of ontologies describing the domain of organising conferences. 'Complex correspondences' are also very welcome. Alignments will be evaluated automatically against reference alignments, also considering their uncertain version presented at ISWC 2014. Summary results, along with detailed performance results for each ontology pair (test case) and a comparison with tools' performance from previous years, will be provided.
Multifarm: This dataset is composed of a subset of the Conference dataset, translated into nine different languages (Arabic, Chinese, Czech, Dutch, French, German, Portuguese, Russian, and Spanish), and the corresponding alignments between these ontologies. Based on these test cases, it is possible to evaluate and compare the performance of matching approaches with a special focus on multilingualism.
Complex: This track evaluates the detection of complex correspondences between ontologies of four different domains: conference, hydrography, geography and species taxonomy. Each dataset has its particularities and evaluation modalities.
Interactive matching evaluation (interactive): This track offers the possibility to compare different interactive matching tools which require user interaction. The goal is to show whether user interaction can improve the matching results, which methods are most promising, and how many interactions are necessary. All participating systems are evaluated using an oracle based on the reference alignment. Using the SEALS client, a matching system needs only slight adaptation to participate in this track.
Large Biomedical Ontologies (largebio): This track consists of finding alignments between the Foundational Model of Anatomy (FMA), SNOMED CT, and the National Cancer Institute Thesaurus (NCI). These ontologies are semantically rich and contain tens of thousands of classes. UMLS Metathesaurus has been selected as the basis for the track reference alignments.
Disease and Phenotype (phenotype): The Pistoia Alliance Ontologies Mapping project team organises and sponsors this track based on a real use case where it is required to find alignments between disease and phenotype ontologies. Specifically, the selected ontologies are the Human Phenotype (HP) Ontology, the Mammalian Phenotype (MP) Ontology, the Human Disease Ontology (DOID), and the Orphanet and Rare Diseases Ontology (ORDO).
Biodiversity and Ecology (biodiv): The goal of the track is to find pairwise alignments between the Environment Ontology (ENVO) and the Semantic Web for Earth and Environment Technology Ontology (SWEET), and between the Plant Trait Ontology (PTO) and the Flora Phenotype Ontology (FLOPO). These ontologies are particularly useful for biodiversity and ecology research and are being used in various projects. They have been developed in parallel and overlap substantially. They are semantically rich and contain tens of thousands of classes.
SPIMBENCH (spimbench): The goal of this track is to determine when two OWL instances describe the same Creative Work. The datasets are generated and transformed using SPIMBENCH by altering a set of original data through value-based, structure-based, and semantics-aware transformations (simple combination of transformations).
Link Discovery (link): In this track, two benchmark generators are proposed to deal with link discovery for spatial data, where spatial data are represented as trajectories (i.e., sequences of (longitude, latitude) pairs).
IIMB (IIMB): IIMB is an OWL-based dataset that is automatically generated by introducing a set of controlled transformations in an initial OWL ABox, in order i) to provide an evaluation dataset for various kinds of data transformations, including value transformations, structural transformations, and logical transformations, and ii) to cover a wide spectrum of possible techniques and tools.
Knowledge graph: The Knowledge Graph Track contains nine isolated knowledge graphs with instance and schema data. The goal of the task is to match both the instances and the schema.
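
Several of the tracks above evaluate a system's alignment against a reference alignment, and the interactive track uses an oracle based on that reference. The sketch below illustrates the general idea only; it is not the SEALS or HOBBIT evaluation code. It assumes a correspondence can be modelled as a (source entity, target entity, relation) triple and that precision, recall, and F-measure are the measures of interest. The names Correspondence, Scores, evaluate, and oracle, as well as the entity identifiers, are hypothetical.

```python
from typing import NamedTuple, Set, Tuple

# Assumption: a correspondence is a (source entity, target entity, relation) triple.
Correspondence = Tuple[str, str, str]


class Scores(NamedTuple):
    precision: float
    recall: float
    f_measure: float


def evaluate(alignment: Set[Correspondence],
             reference: Set[Correspondence]) -> Scores:
    """Compare a system alignment with a reference alignment."""
    correct = len(alignment & reference)  # correspondences found in both sets
    precision = correct / len(alignment) if alignment else 0.0
    recall = correct / len(reference) if reference else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return Scores(precision, recall, f_measure)


def oracle(candidate: Correspondence,
           reference: Set[Correspondence]) -> bool:
    """Simulated user: accept a candidate only if the reference contains it."""
    return candidate in reference


# Hypothetical entity identifiers, for illustration only.
reference = {
    ("mouse:Heart", "nci:Heart", "="),
    ("mouse:Lung", "nci:Lung", "="),
    ("mouse:Liver", "nci:Liver", "="),
}
alignment = {
    ("mouse:Heart", "nci:Heart", "="),
    ("mouse:Lung", "nci:Kidney", "="),
}

scores = evaluate(alignment, reference)

# An interactive matcher could ask the oracle about each candidate and keep
# only the confirmed ones; the number of questions is the interaction count.
confirmed = {c for c in alignment if oracle(c, reference)}
```

In this toy case one of the two proposed correspondences is correct and one of the three reference correspondences is found, so filtering through the oracle raises precision without changing recall.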
