The Anatomy track of the 2010 campaign consists of finding alignments between the Adult Mouse Anatomy and a part of the NCI Thesaurus (describing the human anatomy). The task is placed in a domain where we find large, carefully designed ontologies that are described in technical terms. Besides their large size and a conceptualization that is only to a limited degree based on the use of natural language, they also differ from other ontologies with respect to the use of specific annotations and roles, e.g. the extensive use of the partOf relation. The manual harmonization of the ontologies leads to a situation, where we have a high number of rather trivial mappings that can be found by simple string comparison techniques. At the same time, we have a good share of non-trivial mappings that require a careful analysis and sometimes also medical background knowledge.
Results are already available for 2010 evaluation: Link to the results page
Most of the evaluations (subtask #1 to #3) will be run with support of SEALS. This will not only allow an automized evaluation approach, but also the chance for each participant to easily test your system. We clarify this point below.
Again we fixed several flaws in the reference alignment and in the ontologies to be matched. Further we carefully added some disjointness axioms into the ontologies. These changes have partially been announced for 2009, however, we had no time to validate our changes. Therefore, the 2009 evaluation was only based a restricted set of changes. In 2010 we will now use this improved dataset
Use the following ontologies as input for your matching system. You can already test with this data and report probems (see email address below). Caution! The formal representation of some OBO specific properties has been changed. You should test whether your system is still capable of reading the relevant information.
The anatomy track consists of four subtasks. Subtask #1 is obligatory for all participants of the anatomy track, while subtasks #2, #3, and #4 are optional.
Subtask #1, #2, and #3 are standard matching tasks with respect to the input (two ontologies to be matched). For all of these subtasks your matching system should generate an alignment between the mouse-anatomy and the human-anatomy that differs with respect to recall and precision.
In the evaluation we compare the results of subtask #1 with the results of #4 to see whether or not the additional information had positive effects.
Due to our 2007 and 2008 experiences we know that certain correspondences in the partial reference mapping are hard to detect by a matching system. If some of these correspondences are part of the submissions for #1, #2 or #3, we will ask the authors of the matching system to explain how these correspondences could be detected by the implemented algorithms. If it cannot be shown or at least be suggested how these correspondences have been generated automatically, we will exclude the system from taking part in the anatomy track!
ATTTENTION: Changes have been announced by accident only few days prior to submission deadline. Now also #2 and #3 have to be submitted manually.
[#1] In case you participate only in subtask #1, you do not have to sent any submissions (opposed to the last years!), but have to implement a simple interface. Please follow the instructions given here.
[#2-#4] In case you participate in subtask #2 and #3 and/or #4, you have to submit a zip file that contains the result (we plan to automize evaluations related to this task also on the long term). Your submission should contain the following folders and files.
+- anatomy | +- 4 | | +- participant.rdf
The files participant.rdf (replace 'partcipant' by the name of your system) contain the mapping generated by your system for subtask #4. These files have to follow the format described here (standard format for submissions to the OAEI).
The reference mapping contains only equivalence correspondences between concepts of the ontologies (more precisely: only the equivalence correspondences of the reference alignment are used for evaluation purpose). Furthermore, no correspondences between properties (roles) are specified in the reference alignment.
Please submit the files (preliminary and final results) for subtask #4 directly to the email address specified under 'Contact' (below). Send the results zipped in a file participant.zip or participant.rar and let the name of your matching systems occur somewhere in the subject heading of the mail.
The schedules given at http://oaei.ontologymatching.org/2010/index.html#schedule is also binding for the anatomy track
We would like to gratefully thank Martin Ringwald and Terry Hayamizu (Mouse Genome Informatics - http://www.informatics.jax.org/), who provided us with an intial reference mapping for the matching task of this track.
We would like Elena Beisswanger for improving the datasets with respect to several problems (see description available here). Thanks a lot!
In addition, we would like to thank all of the participants of the OAEI-07 and OAEI-08 and OAEI-09 anatomy track for hints and discussions with respect to the realization and evaluation of the last year.
This track is organized by Christian Meilicke and Heiner Stuckenschmidt and supported by the SEALS project. If you have any problems working with the ontologies, any questions related to SEALS, or any suggestions related to the anatomy track, feel free to write an email to christian [at] informatik [.] uni-mannheim [.] de.