Ontology Alignment Evaluation Initiative - OAEI-2019 Campaign OAEI OAEI

Results for OAEI 2019 - Knowledge Graph Track

Matching systems

As a pre-test, we executed all systems submitted as SEALS packages to OAEI (even if they are not registered for the track) on a very small matching example (dataset) with a similar structure and shape like the real knowledge graphs (in fact, they are a small subset of them). It showed that not all matching systems are able to complete this small task due to exceptions or other failures. The following matching systems produced an exception: The following matchers produced an empty alignment: We wrote an email to the respective system developers to give them the possibility to modify the system configuration to complete this task. However, none of them were ultimately capable of producing results on the test task of the Knowledge Graph track. Thus, we executed the following systems: The source code for the baseline matchers is available. The baselineLabel matcher matches all resources which share the same rdfs:label. (in case multiple resources share the same label, all of them are matched). BaselineAltLabel is additionally using skos:altLabel. Again, in cases where multiple resources share a common label, all those resources are matched in a cross product manner.

Experimental setting

The evaluation is executed on a virtual machine(VM) with 32GB of RAM and 16 vCPUs (2.4 GHz). The operating system is debian 9 with openjdk version "1.8.0_212".

We used the "-o" option in SEALS (version 7.0.5) to provide the two knowledge graphs which should be matched. The two given URLs are file URLs and not HTTP URLs because downloading the knowledge graphs from another server would be a huge time overhead (downloading 500 MB multiple times for different tasks). Thus the call of the SEALS client looks like the following:

java -Xmx25g -Xms15g -jar ../seals-omt-client.jar ${MATCHER_DIR} -z -o file:///data/ont1.xml file:///data/ont2.xml -f out.xml

This also means that the reported time also includes the environment preparation of SEALS (copying the configuration folder of each matcher at the start of the matching task) as well as copying the resulting alignment to the output file. We could not use the "-x" option of SEALS because we had to modify the evaluation routine based on two reasons. The first one is to differentiate between class, property and instance mappings. The second reason is to deal with the partial gold standard of this track.

The alignments were evaluated based on Precision, Recall and F-Measure for classes, properties and instances (each in isolation). Our partial gold standard consist of 1:1 mappings extracted from links contained in wiki pages (cross wiki links). The schema was matched by ontology experts. We assume that in each knowledge graph, only one representation of one concept exists. This means if we have the mapping in our gold standard we can count the mapping as a false positive (the assumption here is that in the seconds knowledge graph no similar concept to B exists). The value of false negatives is only increased if we have a 1:1 mapping and it is not found by a matcher. The source code for generating the evaluation results is also available.

We imposed a maximum execution time of 4h per task, however, that time limit was never exceeded.

Generated dashboard

We also generated an online dashboard with the help of the MELT framework. Have a look at the knowledge graph results here (it may take some seconds to load due to 200 000 correspondences).
It would be nice if you could fill out a questionnaire about this tool.

Results overview

class property instance overall
SystemTime#testcases SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec.
AGM10:47:385 14.60.23 (0.23)0.09 (0.09)0.06 (0.06) 49.40.66 (0.66)0.32 (0.32)0.21 (0.21) 5169.00.48 (0.48)0.25 (0.25)0.17 (0.17) 5233.20.48 (0.48)0.25 (0.25)0.17 (0.17)
AML0:45:464 27.50.78 (0.98)0.69 (0.86)0.61 (0.77) 58.20.72 (0.91)0.59 (0.73)0.49 (0.62) 7529.80.72 (0.90)0.71 (0.88)0.69 (0.86) 7615.50.72 (0.90)0.70 (0.88)0.69 (0.86)
baselineAltLabel0:11:485 16.41.00 (1.00)0.74 (0.74)0.59 (0.59) 47.80.99 (0.99)0.79 (0.79)0.66 (0.66) 4674.20.89 (0.89)0.84 (0.84)0.80 (0.80) 4739.00.89 (0.89)0.84 (0.84)0.80 (0.80)
baselineLabel0:12:305 16.41.00 (1.00)0.74 (0.74)0.59 (0.59) 47.80.99 (0.99)0.79 (0.79)0.66 (0.66) 3641.20.95 (0.95)0.81 (0.81)0.71 (0.71) 3706.00.95 (0.95)0.81 (0.81)0.71 (0.71)
DOME1:05:264 22.50.74 (0.92)0.62 (0.77)0.53 (0.66) 75.50.79 (0.99)0.77 (0.96)0.75 (0.93) 4895.20.74 (0.92)0.70 (0.88)0.67 (0.84) 4994.80.74 (0.92)0.70 (0.88)0.67 (0.84)
FCAMap-KG1:14:495 18.61.00 (1.00)0.82 (0.82)0.70 (0.70) 69.01.00 (1.00)0.98 (0.98)0.96 (0.96) 4530.60.90 (0.90)0.84 (0.84)0.79 (0.79) 4792.60.91 (0.91)0.85 (0.85)0.79 (0.79)
LogMap0:15:435 26.00.95 (0.95)0.84 (0.84)0.76 (0.76) 0.00.00 (0.00)0.00 (0.00)0.00 (0.00) 0.00.00 (0.00)0.00 (0.00)0.00 (0.00) 26.00.95 (0.95)0.01 (0.01)0.00 (0.00)
LogMapBio2:31:015 26.00.95 (0.95)0.84 (0.84)0.76 (0.76) 0.00.00 (0.00)0.00 (0.00)0.00 (0.00) 0.00.00 (0.00)0.00 (0.00)0.00 (0.00) 26.00.95 (0.95)0.01 (0.01)0.00 (0.00)
LogMapKG2:26:145 26.00.95 (0.95)0.84 (0.84)0.76 (0.76) 0.00.00 (0.00)0.00 (0.00)0.00 (0.00) 29190.40.40 (0.40)0.54 (0.54)0.86 (0.86) 29216.40.40 (0.40)0.54 (0.54)0.84 (0.84)
LogMapLt0:07:284 23.00.80 (1.00)0.56 (0.70)0.43 (0.54) 0.00.00 (0.00)0.00 (0.00)0.00 (0.00) 6653.80.73 (0.91)0.67 (0.84)0.62 (0.78) 6676.80.73 (0.91)0.66 (0.83)0.61 (0.76)
POMAP++0:14:395 2.00.00 (0.00)0.00 (0.00)0.00 (0.00) 0.00.00 (0.00)0.00 (0.00)0.00 (0.00) 0.00.00 (0.00)0.00 (0.00)0.00 (0.00) 19.40.00 (0.00)0.00 (0.00)0.00 (0.00)
Wiktionary0:20:145 21.41.00 (1.00)0.80 (0.80)0.67 (0.67) 75.80.97 (0.97)0.98 (0.98)0.98 (0.98) 3483.60.91 (0.91)0.79 (0.79)0.70 (0.70) 3581.80.91 (0.91)0.80 (0.80)0.71 (0.71)
Aggregated results per matcher, divided into class, property, instance, and overall alignments. Time is displayed as HH:MM:SS. Column #testcases indicates the number of testcases where the tool is able to generate (non empty) alignments. Column size indicates the averaged number of system correspondences. Two kinds of results are reported: (1) those not distinguishing empty and erroneous (or not generated) alignments, and (2) those considering only non empty alignments (value between parenthesis).
The following matchers could output results on all 5 test cases: AGM, baselineAltLabel, baselineLabel, FCAMap-KG, LogMap, LogMapBio, LogMapKG, POMAP++, and Wiktionary. The longest runtime was observed for AGM with more than 10 hours for all 5 test cases. The systems which did not finish a task did so because of exceeding the amount of main memory they were granted.

Test case specific results

Overall results

This result table shows the overall performance (without dividing into class, property or instance) of the matchers.
marvelcinematicuniverse-marvel memoryalpha-memorybeta memoryalpha-stexpanded starwars-swg starwars-swtor
SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec.
AGM 90850.140.110.09 114490.470.320.24 36840.440.300.23 11010.700.300.19 8470.650.200.12
AML 00.000.000.00 188420.900.880.87 39960.900.920.93 35780.900.810.74 40460.920.910.90
baselineAltLabel 25740.860.760.68 135140.880.890.89 32300.880.900.92 17120.920.740.63 26650.920.910.90
baselineLabel 18790.900.690.56 105520.950.850.77 25820.980.900.83 12450.960.680.53 22720.950.890.84
DOME 00.000.000.00 124750.910.880.84 27270.930.910.89 20240.920.820.74 27530.930.910.89
FCAMap-KG 26820.840.730.65 131710.920.880.85 31740.940.910.89 21400.900.800.71 27960.930.900.87
LogMap 121.000.000.00 400.750.000.00 331.000.010.00 141.000.010.00 311.000.020.01
LogMapBio 121.000.000.00 400.750.000.00 331.000.010.00 141.000.010.00 311.000.020.01
LogMapKG 410820.240.350.71 545390.430.580.87 155620.480.630.92 153370.580.670.80 195620.270.420.92
LogMapLt 00.000.000.00 166880.900.830.77 35770.940.870.82 28070.910.740.62 36350.910.870.83
POMAP++ 90.000.000.00 200.000.000.00 250.000.000.00 140.000.000.00 290.000.000.00
Wiktionary 18650.860.680.56 98380.920.790.69 21670.940.870.80 15920.920.750.63 24470.930.880.84

Class results

All matchers were able to generate class correspondences.
marvelcinematicuniverse-marvel memoryalpha-memorybeta memoryalpha-stexpanded starwars-swg starwars-swtor
SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec.
AGM 210.000.000.00 180.670.240.14 150.500.240.15 90.000.000.00 100.000.000.00
AML 00.000.000.00 360.910.800.71 321.000.820.69 121.000.890.80 301.000.930.87
baselineAltLabel 81.001.001.00 191.000.440.29 191.000.630.46 91.000.570.40 271.000.890.80
baselineLabel 81.001.001.00 191.000.440.29 191.000.630.46 91.000.570.40 271.000.890.80
DOME 00.000.000.00 221.000.670.50 251.000.820.69 150.750.670.60 280.930.900.87
FCAMap-KG 81.001.001.00 211.000.440.29 241.000.760.62 121.000.890.80 281.000.890.80
LogMap 121.001.001.00 400.750.690.64 331.000.700.54 141.000.890.80 311.000.890.80
LogMapBio 121.001.001.00 400.750.690.64 331.000.700.54 141.000.890.80 311.000.890.80
LogMapKG 121.001.001.00 400.750.690.64 331.000.700.54 141.000.890.80 311.000.890.80
LogMapLt 00.000.000.00 231.000.440.29 271.000.700.54 121.000.750.60 301.000.850.73
POMAP++ 20.000.000.00 20.000.000.00 20.000.000.00 20.000.000.00 20.000.000.00
Wiktionary 91.001.001.00 241.000.440.29 311.000.760.62 141.000.750.60 291.000.930.87

Property results

While last year, most matchers struggled on creating mappings typed as rdf:Property (instead of owl:ObjectProperty or owl:DatatypeProperty, many matchers were able to produce property mappings also in that case. Only POMAP++ and all LogMap variants could not return any property mappings.
marvelcinematicuniverse-marvel memoryalpha-memorybeta memoryalpha-stexpanded starwars-swg starwars-swtor
SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec.
AGM 660.220.200.18 710.540.350.26 450.740.470.34 311.000.180.10 340.820.270.16
AML 00.000.000.00 840.760.730.70 640.890.850.80 121.000.400.25 730.980.820.71
baselineAltLabel 71.000.530.36 411.000.510.34 460.970.800.68 421.001.001.00 1031.000.940.89
baselineLabel 71.000.530.36 411.000.510.34 460.970.800.68 421.001.001.00 1031.000.940.89
DOME 00.000.000.00 880.980.940.91 700.970.950.93 421.001.001.00 1021.000.940.89
FCAMap-KG 191.000.950.91 931.000.970.94 730.980.980.98 481.001.001.00 1121.000.990.98
LogMap 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00
LogMapBio 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00
LogMapKG 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00
LogMapLt 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00
POMAP++ 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00
Wiktionary 221.001.001.00 1020.930.930.94 800.950.960.98 581.001.001.00 1170.980.980.98

Instance results

Only POMAP++, LogMap, and LogMapBio did not return any instance matches.
marvelcinematicuniverse-marvel memoryalpha-memorybeta memoryalpha-stexpanded starwars-swg starwars-swtor
SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec. SizePrec.F-m.Rec.
AGM 89970.140.110.09 113600.470.320.24 36240.430.300.23 10610.710.300.19 8030.640.200.12
AML 00.000.000.00 187220.900.880.87 39000.900.920.94 35540.900.820.75 39430.920.910.90
baselineAltLabel 25590.860.760.68 134510.880.890.89 31650.870.900.93 16610.920.740.62 25350.920.910.90
baselineLabel 18640.900.690.56 104890.950.850.77 25170.980.910.84 11940.950.670.52 21420.950.890.84
DOME 00.000.000.00 123600.910.880.84 26320.930.910.89 19660.910.820.74 26230.930.910.89
FCAMap-KG 26030.840.730.65 124740.920.880.85 30080.940.910.89 20040.900.790.70 25640.930.890.86
LogMap 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00
LogMapBio 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00
LogMapKG 410700.240.350.71 544990.430.580.87 155290.470.630.94 153230.580.680.82 195310.270.430.96
LogMapLt 00.000.000.00 166650.900.830.77 35500.940.890.84 27950.910.750.63 36050.910.890.87
POMAP++ 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00 00.000.000.00
Wiktionary 18320.860.680.56 97090.920.790.69 20560.940.860.80 15200.920.740.63 23010.920.880.84

Runtime

AGM had the highest runtimes followed by LogMapBio, and LogMapKG. LogMapLt were quite fast and returned class as well as instance matches. POMAP++ is also fast but returns only class mappings.
marvelcinematicuniverse-marvel memoryalpha-memorybeta memoryalpha-stexpanded starwars-swg starwars-swtor
AGM 2:59:49 1:58:09 2:59:35 1:29:48 1:20:17
AML 0:00:00 0:28:05 0:05:15 0:06:27 0:05:59
baselineAltLabel 0:02:57 0:01:53 0:01:11 0:02:51 0:02:56
baselineLabel 0:03:03 0:02:07 0:01:12 0:02:53 0:03:15
DOME 0:00:00 0:08:45 0:05:45 0:28:29 0:22:27
FCAMap-KG 0:10:56 0:11:08 0:05:20 0:23:46 0:23:39
LogMap 0:03:57 0:02:09 0:01:20 0:04:12 0:04:05
LogMapBio 0:23:31 0:36:58 0:40:36 0:23:56 0:26:00
LogMapKG 1:14:09 0:17:52 0:11:44 0:30:30 0:11:59
LogMapLt 0:00:00 0:01:29 0:00:58 0:02:31 0:02:30
POMAP++ 0:04:23 0:01:58 0:01:15 0:03:37 0:03:26
Wiktionary 0:05:43 0:02:41 0:01:44 0:04:55 0:05:11

Organizers

References

[1] Sven Hertling, Heiko Paulheim: DBkWik: A Consolidated Knowledge Graph from Thousands of Wikis. International Conference on Big Knowledge 2018. [pdf]

[2] Alexandra Hofmann, Samresh Perchani, Jan Portisch, Sven Hertling, and Heiko Paulheim. DBkWik: Towards Knowledge Graph Creation from Thousands of Wikis. International Semantic Web Conference (Posters & Demos) 2017. [pdf]