Ontology Alignment Evaluation Initiative - OAEI-2010 CampaignOAEI

Since 2010, it is not necessary anymore to download the data set (it has always been better to get it on the web. The SEALS platform will provide the data sets. In the future, it is even planned that it will generate the data set on the fly.

Benchmark test library

The goal of the benchmark test library is to offer a set of tests which are wide in feature coverage, progressive and stable. It serves the purpose of evaluating the strength and weakness of matchers (by being progressive and wide coverage) and measuring the progress of matchers (by being stable and reusable over the years).

Appart from that, it is made of a set of 111 pairs of ontologies for which the participants have to return an alignment in the alignment format. These results are compared against reference alignments which are available from the beginning.

Data sets

Reference ontology

The domain of this first test is Bibliographic references. It is, of course, based on a subjective view of what must be a bibliographic ontology. There can be many different classifications of publications (based on area, quality, etc.). We choose the one common among scholars based on mean of publications; as many ontologies below (tests #301-304), it is reminiscent to BibTeX.

This reference ontology contains 33 named classes, 24 object properties, 40 data properties, 56 named individuals and 20 anonymous individuals.

The complete ontology is that of test #101.

The reference ontology is based on the one of the first EON Ontology Alignment Contest. It is improved by comprising a number of circular relations that were missing from the first test. In 2006, we have put the UTF-8 version of the tests as standard, the ISO-8859-1 being optional. In 2007, the tests are the same as in 2006.

The reference ontology is put in the context of the semantic web by using other external resources for expressing non bibliographic information. It takes advantage of FOAF (http://xmlns.com/foaf/0.1/) and iCalendar (http://www.w3.org/2002/12/cal/) for expressing the People, Organization and Event concepts. Here are the external reference used:

This reference ontology is a bit limited in the sense that it does not contain attachement to several classes.

Similarly the kind of proposed alignments is still limited: they only match named classes and properties, they mostly use the "=" relation with confidence of 1.

The ontologies are described in OWL-DL and serialized in the RDF/XML format.

Tests at a glance

The tests are systematically generated to as to start from some reference ontology and discarding a number of information in order to evaluate how the algorithm behave when this information is lacking.

Test numbering (almost) fully preserves the numbering of the first EON contest.

The table below summarize what has been retracted from the reference ontology. There are here 6 catagories of alteration:

Name
Name of entities that can be replaced by (R/N) random strings, (S)ynonyms, (N)ame with different conventions, (F) strings in another language than english.
Comments
Comments can be (N) suppressed or (F) translated in another language.
Specialization Hierarchy
can be (N) suppressed, (E)xpansed or (F)lattened.
Instances
can be (N) suppressed
Properties
can be (N) suppressed or (R) having the restrictions on classes discarded.
Classes
can be (E)xpanded, i.e., relaced by several classes or (F)latened.

#NameComHierInstPropClassComment
101000000Reference alignment
102Irrelevant ontology
103000000Language generalization
104000000Language restriction
201R00000No names
202RN0000No names, no comments
2030N0000No comments (was missspelling)
204C00000Naming conventions
205S00000Synonyms
206FF0000Translation
207F00000
208CN0000
209SN0000
210FN0000
22100N000No specialisation
22200F000Flatenned hierarchy
22300E000Expanded hierarchy
224000N00No instance
2250000R0No restrictions
226No datatypes
227Unit difference
2280000N0No properties
229Class vs instances
23000000FFlattened classes
231*00000EExpanded classes
23200NN00
23300N0N0
236000NN0
23700FN00
23800EN00
23900F0N0
24000E0N0
24100NNN0
24600FNN0
24700ENN0
248NNN000
249NN0N00
250NN00N0
251NNF000
252NNE000
253NNNN00
254NNN0N0
257NN0NN0
258NNFN00
259NNEN00
260NNF0N0
261NNE0N0
262NNNNN0
265NNFNN0
266NNENN0
301Real: BibTeX/MIT
302Real: BibTeX/UMBC
303Real: Karlsruhe
304Real: INRIA

These transformations generate the set of tests displayed below with their derivation relations. The upper the test, the easier it is supposed to be.

Alignments

Below are provided the set of tests with expected alignments in the Alignment format described here. Each ontology is to be aligned with the reference ontology (i.e., that of test #101).

The only interesting alignments are those involving classes and properties of the given ontologies. So the alignments should not align individuals, nor entities from the external ontologies.

There is some chance that the final test be improved by adding entity expansion and reduction. It is also possible that there will be a lot of more instances in each ontology.


101) Concept test: Id

This test compares the ontology to itself.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


102) Concept test: ?

This test compares the ontology to a totally irrelevant one. We ussed the food ontology given in the OWL guide (verbatim).

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]

NOTE: The onto.rdf file is provided for compatibility purposes. However, it is possible to run the test directly on the true file, i.e., http://www.w3.org/TR/2003/PR-owl-guide-20031209/wine. Indeed, wine imports food which imports wine and some parsers do not realize that onto.rdf is the same ontology as wine.


103) Concept test: Language generalisation

This test compares the ontology with its generalisation in OWL Lite (i.e., unavailable constraints are replaced by the more general available). The generalization basically removes owl:unionOf and owl:oneOf and the Property types (owl:TransitiveProperty).

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


104) Concept test: Language restriction

This test compares the ontology with its restriction in OWL Lite (where unavailable constraints have been discarded).

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


201[-2-4-6-8]) Systematic: No names

Each label or identifier is replaced by a random one.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


202[-2-4-6-8]) Systematic: No names, no comment

Each label or identifier is replaced by a random one. Comments (rdfs:comment and dc:description) have been suppressed as well.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


203) Systematic: Misspelling

A random, but consistent, typo generator should be applied to labels and comments.

Not available in this test (if you know how to do it, contact me).


204) Systematic: Naming conventions

Different naming conventions (Uppercasing, underscore, dash, etc.) are used for labels. Comments have been suppressed.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


205) Systematic: Synonyms

Labels are replaced by synonyms. Comments have been suppressed.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


206) Systematic: Foreign names

The complete ontology is translated to another language than english (French in the current case, but other languages would be fine).

Ontology : [RDF/XML] [RDF/XML in ISO-8859-1] [HTML]
Alignment : [RDF/XML] [RDF/XML in ISO-8859-1] [HTML]

NOTE: You can use alternatively the ISO-Latin-1 (ISO-8859-1) version of the tests by renaming them after their UTF-8 version.


207) Systematic:

Each label or identifieris translated to another language than english (French in the current case, but other languages would be fine).

Ontology : [RDF/XML] [RDF/XML in ISO-8859-1] [HTML]
Alignment : [RDF/XML] [RDF/XML in ISO-8859-1] [HTML]

NOTE: You can use alternatively the ISO-Latin-1 (ISO-8859-1) version of the tests by renaming them after their UTF-8 version.


208) Systematic:

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


209) Systematic:

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


210) Systematic:

Ontology : [RDF/XML] [RDF/XML in ISO-8859-1] [HTML]
Alignment : [RDF/XML] [RDF/XML in ISO-8859-1] [HTML]

NOTE: You can use alternatively the ISO-Latin-1 (ISO-8859-1) version of the tests by renaming them after their UTF-8 version.


221) Systematic: No hierarchy

All subclass assertions to named classes are suppressed.

Ontology : [RD/XML] [HTML]
Alignment : [RDF/XML] [HTML]

(variation: compile inheritance)


222) Systematic: Flattened hierarchy

A hierarchy still exists but has been strictly reduced.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]

The alignment here contains relations which are not "=" but "<".


223) Systematic: Expanded hierarchy

Numerous intermediate classes are introduced within the hierarchy.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


224) Systematic: No instances

All individuals have been suppressed from the ontology.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


225) Systematic: No restrictions

All local restrictions on properties have been suppressed from the ontology.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]

(variation: no property nor global restrictions on properties)


226) Systematic: No datatypes

In this test all datatypes are converted to xsd:string.

Not available in this test


227) Systematic: Unit differences

(Measurable) values are expressed in different datatypes.

Not available in this test


228) Systematic: No properties

Properties and relations between objects have been completely suppressed.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]

(variation: leave the properties in instances)


229) Systematic: Class vs instances

Some classes have become instances.

Not available in this test.


230) Systematic: Flattening entities

Some components of classes are expanded in the class structure (e.g., year, month, day attributes instead of date).

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]

Here one limitation of the proposed format is that it does not cover alignments such as: journalName = name o journal.


231) Systematic: Multiplying entities

Some classes are spreaded over several classes.

Not available in this test.


232) Systematic: no hierarchy + no instance

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


233) Systematic: no hierarchy + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


236) Systematic: no instance + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


237) Systematic: flattened hierarchy + no instance

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


238) Systematic: expanded hierarchy + no instance

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


239) Systematic: flattened hierarchy + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


240) Systematic: expanded hierarchy + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


241) Systematic: no hierarchy + no instance + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


246) Systematic: flattened hierarchy + no instance + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


247) Systematic: expanded hierarchy + no instance + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


248[-2-4-6-8]) Systematic: scrambled labels + no comments + no hierarchy

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


249[-2-4-6-8]) Systematic: scrambled labels + no comments + no instance

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


250[-2-4-6-8]) Systematic: scrambled labels + no comments + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


251[-2-4-6-8]) Systematic: scrambled labels + no comments + flattened hierarchy

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


252[-2-4-6-8]) Systematic: scrambled labels + no comments + expanded hierarchy

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


253[-2-4-6-8]) Systematic: scrambled labels + no comments + no hierarchy + no instance

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


254[-2-4-6-8]) Systematic: scrambled labels + no comments + no hierarchy + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


257[-2-4-6-8]) Systematic: scrambled labels + no comments + no instance + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


258[-2-4-6-8]) Systematic: scrambled labels + no comments + flattened hierarchy + no instance

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


259[-2-4-6-8]) Systematic: scrambled labels + no comments + expanded hierarchy + no instance

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


260[-2-4-6-8]) Systematic: scrambled labels + no comments + flattened hierarchy + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


261[-2-4-6-8]) Systematic: scrambled labels + no comments + expanded hierarchy + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


262[-2-4-6-8]) Systematic: scrambled labels + no comments + no hierarchy + no instance + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


265) Systematic: scrambled labels + no comments + flattened hierarchy + no instance + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


266) Systematic: scrambled labels + no comments + expanded hierarchy + no instance + no property

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]

Note that the 3xx tests are only here for comparability with previous years. We know that the reference alignments for these tests are not perfect (especially because the ontologies sometimes contain flaws).


301) Real ontology: BibTeX/MIT

For a computer scientist, BibTeX is the starting point for a useful bibliographic ontology. It is of wide use and relatively well thought out. This ontology can be found at and is documented in BibTex in OWL.

This is a test of comparing our test ontology with an actual ontology, simpler and closer to the initial BibTeX ontology. The alignment result contains some inclusion (<) alignment relations.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


302) Real ontology: BibTeX/UMBC

This ontology is very similar to the previous one, even closer to the genuine BibTeX, with different extensions and naming conventions. It can be found at http://ebiquity.umbc.edu.

The alignment result also contains some inclusion (<) alignment relations.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


303) Real ontology: Karlsruhe

This is a test of comparing our test ontology with an actual ontology which contains more items than the actual items used in the current ontology.

The Karlsruhe ontology (http://www.aifb.uni-karlsruhe.de/ontology) is used in the Ontoweb portal. It is a refinement from other ontologies such as (KA)2. As such it does not only defines bibliographic items but many other items.

The alignment contains < as well as > relations.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


304) Real ontology: INRIA

This is a test of comparing our test ontology with an actual ontology which is not equivalent but quite close (it can be though of as a previous version).

This INRIA ontology (fr.inrialpes.exmo.rdf.bib.owl) has been designed by Antoine Zimmermann from the BibTeX in OWL ontology and our Bibliographic XML DTD. Its goal was to gather easily a number of RDF items. These items were BibTeX entries found on the web and transformed in RDF according to this ontology.

The actual hierarchy of this ontology contains classes which are subclasses of several other classes.

Ontology : [RDF/XML] [HTML]
Alignment : [RDF/XML] [HTML]


Versions

The release notes of the previous test versions used in the OAEI-2010 can be found here.

Modalities

The test set is made of a set of directories (one per test) containing an ontology (onto.rdf) in OWL. Participants must match each of these ontologies with the ontology found in 101/onto.rdf and output the resulting alignment, in the alignment format, in a file participant.rdf in the same directory (participant is replaced by the name of the system: no more than 8 alphanumeric characters, lowercase).

The test set also contains the reference alignments against which the results will be evaluated. It is, of course, forbiden to use any for there results for performing the matching task.

The resulting alignments in the same directory structure can be zipped (please, zipped) and sent to the organisers.

Schedule

The schedule is that of http://oaei.ontologymatching.org/2010/.

Tools

A zip file contains all the data about the test. That is:

We would like to find a simple converter from OWL/XML/RDF to N3 (and maybe RDFS).

You can use the Alignment API for manipulating and generating your alignments (in particular for computing evaluation of your results).

Acknowledgements

Many resources have been used for setting up this test:

Various people helped testing or suggested improvements and tests:

Contacts

Contact address is Cassia : Trojahn # inrialpes : fr

$Id: index.html,v 1.28 2010/06/01 15:09:09 euzenat Exp euzenat $