This track aims at evaluating the ability of systems to deal with the schema metadata matching task, in particular, with a subset of a collection of crosswalks from fifteen research data schemas to Schema.org [1].
This year a subset of the 16 metadata schemas aligned to schema.org has been considered. This subset involves: Data Catalogue Vocabulary (DCAT-v3), Data Catalogue Vocabulary - Application Profile (DCAT-AP), DataCity, Dublin Core (DC), ISO19115-1 schemas (ISO) and RIFCS.
We have conducted an open evaluation. The systems have been executed on a Ubuntu Linux machine configured with 32GB of RAM running under a Intel Core CPU 2.00GHz x8 processors. All measurements are based on a single run. 4 systems have registered to participate in the track: AMD, Matcha, LogMap, LogMapLite and Matcha.
Matcha | |||||
correct | output | expected | |||
dcat3 | 3 | 17 | 42 | ||
datacity | 0 | 4 | 34 | ||
LogMap | |||||
correct | output | expected | |||
dcat3 | 0 | 12 | 42 | ||
datacity | 0 | 3 | 34 | ||
rifcs | 0 | 11 | 24 | ||
dcat-ap | 0 | 2 | 34 | ||
LogMapLite | |||||
correct | output | expected | |||
dcat3 | 3 | 41 | 42 | ||
datacity | 0 | 4 | 34 | ||
rifcs | 0 | 9 | 24 | ||
dcat-ap | 0 | 4 | 34 | ||
iso | 0 | 2 | 42 |
This task mostly deal with properties of metadata schemas. This year, we have used the schemas for which an RDF serialization is available. A first future improvement is to provide an OWL serialisation and/or provide a task dedicate do those specific types of format.
This track is organized by
[1] Wu, M., Hagan, P., Cecconi, B., Richard, S. M., Verhey, C., & RDA Research Metadata Schemas WG. (2022). A Collection of Crosswalks from Fifteen Research Data Schemas to Schema.org. Research Data Alliance. https://doi.org/10.15497/RDA00069