Results for OAEI 2022 - Knowledge Graph Track

Matching systems

As a pre-test, we executed all systems submitted to OAEI (even those not registered for the track) on a very small matching example whose structure and shape are similar to those of the real knowledge graphs (in fact, it is a small subset of them). The pre-test showed that not all matching systems can complete this small task, due to exceptions or other failures. The following matching systems produced an exception:

ALIN (NullPointerException, similar to last year)
SEBMatcher (Endpoint not available)
WomboCombo (TypeError)
graph-matcher (dependency problem)

Thus, we executed the following systems:

AMD
ATMatcher
BaselineAltLabel
BaselineLabel
KGMatcher
LSMatch
LogMap
Matcha

The source code for the baseline matchers is available. The BaselineLabel matcher matches all resources that share the same rdfs:label (if multiple resources share the same label, all of them are matched). BaselineAltLabel additionally uses skos:altLabel. Again, when multiple resources share a common label, all those resources are matched in a cross-product manner.
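The cross-product behaviour described above can be sketched in a few lines of Python. This is only an illustration, not the actual baseline implementation: the function name `match_by_label`, the dictionary-based input, the example URIs, and the use of exact string equality (no label normalization) are all assumptions made for the sketch.

```python
from collections import defaultdict
from itertools import product


def match_by_label(source_labels, target_labels):
    """Match every source resource with every target resource sharing a label.

    Both arguments map a resource URI to a list of label strings
    (hypothetical input format; rdfs:label only for a BaselineLabel-style
    matcher, rdfs:label plus skos:altLabel for a BaselineAltLabel-style one).
    """
    # Group the resources of each knowledge graph by label.
    source_index = defaultdict(set)
    for uri, labels in source_labels.items():
        for label in labels:
            source_index[label].add(uri)

    target_index = defaultdict(set)
    for uri, labels in target_labels.items():
        for label in labels:
            target_index[label].add(uri)

    alignment = set()
    for label, sources in source_index.items():
        # Cross product: all resources sharing this label are matched.
        # Assumption: labels are compared by exact string equality.
        alignment.update(product(sources, target_index.get(label, ())))
    return alignment


# Hypothetical resources: two source and two target resources share the label "Yoda".
source = {"s:Yoda": ["Yoda"], "s:Yoda_(clone)": ["Yoda"], "s:Endor": ["Endor"]}
target = {"t:Yoda": ["Yoda"], "t:Master_Yoda": ["Yoda"], "t:Naboo": ["Naboo"]}
alignment = match_by_label(source, target)
print(sorted(alignment))
```

In this toy input the shared label yields 2 x 2 = 4 correspondences, which illustrates why such a matcher can lose precision against a 1:1 gold standard.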

Experimental setting

The evaluation was executed on a virtual machine (VM) with 32 GB of RAM and 16 vCPUs (2.4 GHz). The operating system is Debian 9 with OpenJDK version "1.8.0_265".

We used the MELT toolkit for the evaluation, which internally uses the SEALS client (version 7.0.5) to execute matchers packaged with SEALS. Matching systems that use the web packaging are executed with the MatcherHTTPCall class. The reported times include the environment preparation of SEALS as well as the file upload to the Docker container (the start of the container is not timed). The alignments were evaluated based on precision, recall, and F-measure for classes, properties, and instances (each in isolation). Our partial gold standard consists of 1:1 mappings extracted from links contained in wiki pages (cross-wiki links). The schema was matched by ontology experts. We assume that in each knowledge graph, only one representation of each concept exists. This means that if our gold standard contains a mapping from A to B and a matcher maps A to a different concept C, we can count that mapping as a false positive (the assumption here is that in the second knowledge graph no other concept similar to B exists). The number of false negatives is only increased if we have a 1:1 mapping in the gold standard and it is not found by a matcher. The source code for generating the evaluation results is also available.
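To make the counting rules for the partial gold standard concrete, here is a minimal Python sketch. It is not the MELT evaluation code: the function name `evaluate_partial` and the representation of alignments as sets of (source, target) pairs are assumptions, and checking the target side as well as the source side for conflicts is an assumption made by symmetry.

```python
def evaluate_partial(system, gold):
    """Precision, recall, and F-measure against a partial 1:1 gold standard.

    `system` and `gold` are sets of (source, target) pairs (hypothetical format).
    """
    gold_sources = {s for s, _ in gold}
    gold_targets = {t for _, t in gold}

    tp = fp = 0
    for s, t in system:
        if (s, t) in gold:
            tp += 1
        elif s in gold_sources or t in gold_targets:
            # One side already has a 1:1 partner in the gold standard,
            # so this correspondence contradicts it: false positive.
            # Assumption: the rule is applied to both source and target side.
            fp += 1
        # Otherwise neither side is covered by the partial gold standard,
        # and the correspondence is not judged at all.

    # A false negative is a gold-standard mapping the matcher did not find.
    fn = len(gold - system)

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f_measure = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    return precision, recall, f_measure


gold = {("A", "B"), ("D", "E")}
system = {("A", "B"), ("A", "C"), ("X", "Y")}
precision, recall, f_measure = evaluate_partial(system, gold)
print(precision, recall, f_measure)
```

In the example, ("A", "C") counts as a false positive because A is already mapped to B, ("X", "Y") is ignored because neither X nor Y occurs in the gold standard, and ("D", "E") is a false negative.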

We imposed a maximum execution time of 12 h per task; however, that time limit was never exceeded.

Generated dashboard / CSV file

We also generated an online dashboard with the help of the MELT framework. Have a look at the knowledge graph results here (it may take a few seconds to load due to the 200 000 correspondences).
Moreover, we generated a CSV file that allows each matcher to be analyzed at the correspondence level. This should help matcher developers improve the performance of their matchers.

Alignment results

The generated alignment files are also available.

Results overview

Aggregated results per matcher, divided into class, property, instance, and overall alignments. Time is displayed as HH:MM:SS. Column *#testcases* indicates the number of test cases for which the tool is able to generate (non-empty) alignments. Column *Size* indicates the average number of system correspondences. Two kinds of results are reported: (1) those not distinguishing empty and erroneous (or not generated) alignments, and (2) those considering only non-empty alignments (values in parentheses).
			class				property				instance				overall
System	Time	#testcases	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.
AMD	00:22:08	2	0.0	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.0	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.0	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	24.0	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)
ATMatcher	00:18:48	5	25.6	0.97 (0.97)	0.87 (0.87)	0.79 (0.79)	78.8	0.97 (0.97)	0.96 (0.96)	0.95 (0.95)	4856.6	0.89 (0.89)	0.84 (0.84)	0.80 (0.80)	4961.0	0.89 (0.89)	0.85 (0.85)	0.80 (0.80)
BaselineAltLabel	00:11:37	5	16.4	1.00 (1.00)	0.74 (0.74)	0.59 (0.59)	47.8	0.99 (0.99)	0.79 (0.79)	0.66 (0.66)	4674.8	0.89 (0.89)	0.84 (0.84)	0.80 (0.80)	4739.0	0.89 (0.89)	0.84 (0.84)	0.80 (0.80)
BaselineLabel	00:11:27	5	16.4	1.00 (1.00)	0.74 (0.74)	0.59 (0.59)	47.8	0.99 (0.99)	0.79 (0.79)	0.66 (0.66)	3641.8	0.95 (0.95)	0.81 (0.81)	0.71 (0.71)	3706.0	0.95 (0.95)	0.81 (0.81)	0.71 (0.71)
KGMatcher	03:01:17	5	21.2	1.00 (1.00)	0.79 (0.79)	0.66 (0.66)	0.0	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	3789.6	0.94 (0.94)	0.82 (0.82)	0.74 (0.74)	3810.8	0.94 (0.94)	0.82 (0.82)	0.72 (0.72)
LogMap	00:55:52	5	19.4	0.93 (0.93)	0.81 (0.81)	0.71 (0.71)	0.0	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	4012.4	0.90 (0.90)	0.78 (0.78)	0.69 (0.69)	4031.8	0.90 (0.90)	0.77 (0.77)	0.68 (0.68)
LSMatch	04:17:13	5	23.6	0.97 (0.97)	0.78 (0.78)	0.64 (0.64)	85.6	0.73 (0.73)	0.71 (0.71)	0.69 (0.69)	5872.2	0.66 (0.66)	0.63 (0.63)	0.60 (0.60)	5981.4	0.66 (0.66)	0.63 (0.63)	0.61 (0.61)
Matcha	02:40:21	4	0.0	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.0	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	32844.2	0.53 (0.66)	0.61 (0.76)	0.72 (0.90)	32844.2	0.53 (0.66)	0.60 (0.76)	0.70 (0.88)
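The difference between the two kinds of aggregated values can be illustrated with a short Python sketch. It assumes that the aggregation is a plain mean over test cases, which is consistent with the numbers reported here but is not confirmed by this page; the function name `aggregate` is hypothetical. The input values are Matcha's overall size and precision per test case, taken from the test case specific results in this report.

```python
def aggregate(per_test_case):
    """Return (mean over all test cases, mean over non-empty alignments only).

    `per_test_case` is a list of (size, score) pairs, one per test case.
    Assumption: aggregated values are simple means over test cases.
    """
    all_scores = [score for _, score in per_test_case]
    non_empty = [score for size, score in per_test_case if size > 0]
    mean_all = sum(all_scores) / len(all_scores) if all_scores else 0.0
    mean_non_empty = sum(non_empty) / len(non_empty) if non_empty else 0.0
    return mean_all, mean_non_empty


# (size, precision) of Matcha's overall alignment on the five test cases.
matcha_precision = [
    (0, 0.00),      # marvelcinematicuniverse-marvel (empty alignment)
    (64900, 0.56),  # memoryalpha-memorybeta
    (15970, 0.67),  # memoryalpha-stexpanded
    (23555, 0.70),  # starwars-swg
    (26952, 0.73),  # starwars-swtor
]
mean_all, mean_non_empty = aggregate(matcha_precision)
print(f"{mean_all:.3f} ({mean_non_empty:.3f})")
```

Because the inputs are already rounded to two decimals, the result only approximates the reported 0.53 (0.66); the point is that the empty alignment lowers the first value but is excluded from the value in parentheses.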

Test case specific results

Overall results

This result table shows the overall performance (without dividing into class, property or instance) of the matchers for each test case.

	marvelcinematicuniverse-marvel				memoryalpha-memorybeta				memoryalpha-stexpanded				starwars-swg				starwars-swtor
	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.
AMD	0	0.00	0.00	0.00	22	0.00	0.00	0.00	26	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00
ATMatcher	3515	0.67	0.59	0.52	12999	0.96	0.93	0.91	3279	0.96	0.94	0.92	2243	0.93	0.84	0.76	2769	0.95	0.93	0.91
BaselineAltLabel	2574	0.86	0.76	0.68	13514	0.88	0.89	0.89	3230	0.88	0.90	0.92	1712	0.92	0.74	0.63	2665	0.92	0.91	0.90
BaselineLabel	1879	0.90	0.69	0.56	10552	0.95	0.85	0.77	2582	0.98	0.90	0.83	1245	0.96	0.68	0.53	2272	0.95	0.89	0.84
KGMatcher	1909	0.89	0.69	0.56	10764	0.94	0.85	0.77	2577	0.98	0.89	0.82	1483	0.94	0.75	0.62	2321	0.94	0.88	0.83
LogMap	2255	0.84	0.59	0.46	11648	0.89	0.82	0.76	2491	0.88	0.81	0.75	1577	0.94	0.79	0.68	2188	0.94	0.84	0.75
LSMatch	2147	0.63	0.50	0.42	19073	0.59	0.66	0.75	5065	0.53	0.63	0.79	888	0.76	0.37	0.24	2734	0.81	0.82	0.82
Matcha	0	0.00	0.00	0.00	64900	0.56	0.69	0.90	15970	0.67	0.78	0.93	23555	0.70	0.74	0.80	26952	0.73	0.80	0.89

Class results

	marvelcinematicuniverse-marvel				memoryalpha-memorybeta				memoryalpha-stexpanded				starwars-swg				starwars-swtor
	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.
AMD	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00
ATMatcher	11	1.00	1.00	1.00	39	0.83	0.77	0.71	34	1.00	0.87	0.77	13	1.00	0.75	0.60	31	1.00	0.93	0.87
BaselineAltLabel	8	1.00	1.00	1.00	19	1.00	0.44	0.29	19	1.00	0.63	0.46	9	1.00	0.57	0.40	27	1.00	0.89	0.80
BaselineLabel	8	1.00	1.00	1.00	19	1.00	0.44	0.29	19	1.00	0.63	0.46	9	1.00	0.57	0.40	27	1.00	0.89	0.80
KGMatcher	8	1.00	1.00	1.00	27	1.00	0.44	0.29	29	1.00	0.70	0.54	12	1.00	0.75	0.60	30	1.00	0.93	0.87
LogMap	10	1.00	1.00	1.00	21	0.88	0.64	0.50	26	0.78	0.64	0.54	12	1.00	0.89	0.80	28	1.00	0.85	0.73
LSMatch	8	1.00	1.00	1.00	26	1.00	0.44	0.29	25	1.00	0.70	0.54	19	1.00	0.75	0.60	40	0.86	0.83	0.80
Matcha	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00

Property results

	marvelcinematicuniverse-marvel				memoryalpha-memorybeta				memoryalpha-stexpanded				starwars-swg				starwars-swtor
	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.
AMD	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00
ATMatcher	24	0.91	0.91	0.91	103	0.98	0.95	0.92	85	0.95	0.95	0.95	61	1.00	1.00	1.00	121	1.00	0.99	0.98
BaselineAltLabel	7	1.00	0.53	0.36	41	1.00	0.51	0.34	46	0.97	0.80	0.68	42	1.00	1.00	1.00	103	1.00	0.94	0.89
BaselineLabel	7	1.00	0.53	0.36	41	1.00	0.51	0.34	46	0.97	0.80	0.68	42	1.00	1.00	1.00	103	1.00	0.94	0.89
KGMatcher	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00
LogMap	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00
LSMatch	36	0.82	0.82	0.82	112	0.62	0.60	0.58	82	0.62	0.62	0.61	79	0.72	0.68	0.65	119	0.88	0.83	0.79
Matcha	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00

Instance results

	marvelcinematicuniverse-marvel				memoryalpha-memorybeta				memoryalpha-stexpanded				starwars-swg				starwars-swtor
	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.	Size	Prec.	F-m.	Rec.
AMD	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00	0	0.00	0.00	0.00
ATMatcher	3480	0.67	0.58	0.52	12857	0.96	0.93	0.91	3160	0.96	0.94	0.92	2169	0.93	0.83	0.76	2617	0.94	0.92	0.91
BaselineAltLabel	2559	0.86	0.76	0.68	13454	0.88	0.89	0.89	3165	0.88	0.90	0.93	1661	0.92	0.74	0.62	2535	0.92	0.91	0.90
BaselineLabel	1864	0.90	0.69	0.56	10492	0.95	0.85	0.77	2517	0.98	0.91	0.84	1194	0.95	0.67	0.52	2142	0.95	0.89	0.84
KGMatcher	1901	0.89	0.69	0.57	10737	0.94	0.85	0.78	2548	0.98	0.91	0.84	1471	0.94	0.76	0.63	2291	0.94	0.90	0.86
LogMap	2245	0.84	0.60	0.46	11627	0.89	0.82	0.76	2465	0.88	0.82	0.77	1565	0.94	0.80	0.69	2160	0.94	0.86	0.78
LSMatch	2103	0.63	0.50	0.41	18935	0.59	0.66	0.75	4958	0.53	0.63	0.80	790	0.76	0.36	0.23	2575	0.81	0.82	0.82
Matcha	0	0.00	0.00	0.00	64900	0.56	0.69	0.90	15970	0.67	0.79	0.95	23555	0.70	0.75	0.82	26952	0.73	0.82	0.94

Runtime

	marvelcinematicuniverse-marvel	memoryalpha-memorybeta	memoryalpha-stexpanded	starwars-swg	starwars-swtor
AMD	00:00:00	00:13:58	00:08:09	00:00:00	00:00:00
ATMatcher	00:04:36	00:03:23	00:02:04	00:04:24	00:04:20
BaselineAltLabel	00:02:45	00:01:54	00:01:11	00:02:54	00:02:50
BaselineLabel	00:02:40	00:01:50	00:01:11	00:02:52	00:02:51
KGMatcher	00:39:34	00:25:35	00:25:42	00:46:27	00:43:57
LogMap	00:32:40	00:05:09	00:03:09	00:07:44	00:07:10
LSMatch	01:46:01	00:57:37	00:20:38	00:38:50	00:34:05
Matcha	00:00:00	01:10:55	00:16:06	00:47:07	00:26:11
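The Time column in the results overview appears to be the sum of the per-test-case runtimes; this is an inference from the numbers, not something stated on this page. A small Python sketch for checking it, with hypothetical helper names `to_seconds` and `to_hms`, using the ATMatcher row above:

```python
def to_seconds(hms):
    """Convert an HH:MM:SS string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def to_hms(total_seconds):
    """Convert a number of seconds back to an HH:MM:SS string."""
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


# ATMatcher runtimes for the five test cases, copied from the runtime table.
atmatcher = ["00:04:36", "00:03:23", "00:02:04", "00:04:24", "00:04:20"]
total = to_hms(sum(to_seconds(t) for t in atmatcher))
print(total)
```

The sum differs from the reported total of 00:18:48 by one second, presumably because the per-test-case values are displayed without fractions of a second.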

Organizers

Sven Hertling (University of Mannheim, Germany), main contact for the track, sven at informatik dot uni-mannheim dot de
Heiko Paulheim (University of Mannheim, Germany)
