October 15, 2020: (1) Results for some tracks already available. (2) OAEI-related special issue papers.

Ontology Alignment Evaluation Initiative

2020 Campaign

Since 2004, OAEI has organised evaluation campaigns aimed at evaluating ontology matching technologies. This year we will combine tracks running under the SEALS platform with tracks running under the HOBBIT platform. We will also have support from the MELT framework to facilitate the SEALS and HOBBIT wrapping and evaluation.

See the organizing committee and main contacts of the OAEI 2020 campaign here.

Relevant material

OAEI-related special issue papers.

Public OAEI systems (SEALS tracks) for the latest campaigns.

Participation: forum and registration

We have a discussion group for the campaign where we share the latest news with the participants and discuss issues raised during the evaluation.

Please register your system using this form.

Detailed instructions about system submission are given below.

Problems

The OAEI 2020 campaign will once again confront ontology matchers with ontologies and data sources to be matched. This year, the following test sets are available:

anatomy: The anatomy real-world case is about matching the Adult Mouse Anatomy (2744 classes) and the NCI Thesaurus (3304 classes) describing the human anatomy.
conference: The goal of the track is to find alignments within a collection of ontologies describing the domain of organising conferences. Additionally, 'complex correspondences' are also very welcome. Alignments will be evaluated automatically against reference alignments, also considering their uncertain version presented at ISWC 2014. Summary results will be provided, along with detailed performance results for each ontology pair (test case) and a comparison with the tools' performance in previous years.
Multifarm: This dataset is composed of a subset of the Conference dataset, translated into nine different languages (Arabic, Chinese, Czech, Dutch, French, German, Portuguese, Russian, and Spanish), and the corresponding alignments between these ontologies. Based on these test cases, it is possible to evaluate and compare the performance of matching approaches with a special focus on multilingualism.
Complex: This track evaluates the detection of complex correspondences between ontologies of four different domains: conference, hydrography, geography and species taxonomy. Each dataset has its particularities and evaluation modalities.
Interactive matching evaluation (interactive): This track offers the possibility to compare different interactive matching tools which require user interaction. The goal is to show whether user interaction can improve the matching results, which methods are most promising, and how many interactions are necessary. All participating systems are evaluated using an oracle based on the reference alignment. Using the SEALS client, a matching system needs only slight adaptation to participate in this track.
Large Biomedical Ontologies (largebio): This track consists of finding alignments between the Foundational Model of Anatomy (FMA), SNOMED CT, and the National Cancer Institute Thesaurus (NCI). These ontologies are semantically rich and contain tens of thousands of classes. UMLS Metathesaurus has been selected as the basis for the track reference alignments.
Disease and Phenotype (phenotype): The Pistoia Alliance Ontologies Mapping project team organises and sponsors this track based on a real use case where it is required to find alignments between disease and phenotype ontologies. Specifically, the selected ontologies are the Human Phenotype (HP) Ontology, the Mammalian Phenotype (MP) Ontology, the Human Disease Ontology (DOID), and the Orphanet and Rare Diseases Ontology (ORDO).
Biodiversity and Ecology (biodiv): The aim of this track is to motivate ontology matching systems to work on ontologies used in the biodiversity and ecology domain. It consists of finding alignments between 4 OWL ontologies and 4 SKOS thesauri that are particularly useful to this domain. These resources are used in various projects; they are semantically rich and highly overlapping.
SPIMBENCH (spimbench): The goal of this track is to determine when two OWL instances describe the same Creative Work. The datasets are generated and transformed using SPIMBENCH by altering a set of original data through value-based, structure-based, and semantics-aware transformations (simple combination of transformations).
Link Discovery (link): In this track, two benchmark generators are proposed to deal with link discovery for spatial data, where spatial data are represented as trajectories (i.e., sequences of (longitude, latitude) pairs).
GeoLink Cruise (geolink cruise): The goal of this track is to determine if two instances from different ontologies describe the same cruise. The datasets are collected from the Geolink project, which was funded under the U.S. National Science Foundation's EarthCube initiative. The datasets and alignments are guaranteed to contain real-world use cases to solve the instance matching problem in practice.
Knowledge graph: The Knowledge Graph Track contains nine isolated knowledge graphs with instance and schema data. The goal of the task is to match both the instances and the schema.
SemTab (TD→KG special track): Tabular data to Knowledge Graph (KG) matching is the process of assigning semantic tags from Knowledge Graphs (e.g., Wikidata or DBpedia) to the elements of a table (e.g., a web table or an arbitrary csv file). Ontology alignment and link discovery systems are welcome to participate.
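
Several of the tracks above evaluate a system alignment automatically against a reference alignment. As a rough illustration only, the sketch below assumes that an alignment is a set of (source entity, target entity, relation) correspondences and that quality is summarised with precision, recall and F-measure. The `Correspondence` type, the `evaluate` function and the entity identifiers are hypothetical names introduced for this example; they are not part of the SEALS, HOBBIT or MELT tooling, and individual tracks may use different evaluation modalities.

```python
from typing import NamedTuple, Set, Tuple


class Correspondence(NamedTuple):
    """One correspondence between two entities (hypothetical representation)."""
    source: str
    target: str
    relation: str = "="


def evaluate(system: Set[Correspondence],
             reference: Set[Correspondence]) -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) of a system alignment
    with respect to a reference alignment."""
    # Correspondences found by the system that also appear in the reference.
    true_positives = len(system & reference)
    precision = true_positives / len(system) if system else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Made-up identifiers, for illustration only.
reference = {
    Correspondence("mouse:heart", "human:Heart"),
    Correspondence("mouse:lung", "human:Lung"),
    Correspondence("mouse:liver", "human:Liver"),
    Correspondence("mouse:kidney", "human:Kidney"),
}
system = {
    Correspondence("mouse:heart", "human:Heart"),
    Correspondence("mouse:lung", "human:Lung"),
    Correspondence("mouse:tail", "human:Spine"),  # not in the reference
}

precision, recall, f_measure = evaluate(system, reference)
print(f"precision={precision:.3f} recall={recall:.3f} f-measure={f_measure:.3f}")
```

In this toy case the system returns three correspondences, two of which are in the four-element reference, so precision is 2/3 and recall is 1/2.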

