Matthäus Zloch

M.Sc. Computer Science
  • Oracle Certified Professional Programmer, Java
  • Best Student Paper Award Winner, ESWC 2019

Current Affiliation
  • GESIS - Leibniz Institute for the Social Sciences, Cologne
  • Heinrich-Heine-Universität, Düsseldorf


Hi! This is Matthäus Zloch, a computer science Ph.D. student and software developer working at GESIS - Leibniz Institute for the Social Sciences in Cologne, Germany. I have about 10 years of (web) software development experience with J2EE, Node.js, and Python.

Besides my software development and research activities, I administer several services at GESIS, such as

  • an instance of GitLab, with over 400 users and 1200 projects, and
  • SPARQL endpoints to query TweetsKB and ClaimsKG, backed by two RDF database management systems holding over 2.2 billion triples.

Vita in a nutshell:


  • 2020-05: We updated our TweetsKB corpus with 5% of Tweets from 2019 and 2020; check it out.
  • 2020-03: I was invited to the PC of the Joint International Conference on Knowledge Graphs (JIST-KG 2020) to review papers.
  • 2020-02: I was invited to the PC of the International Semantic Web Conference 2020 (ISWC 2020) to review Industry/In-use papers.
  • 2019-10: I received an honorable mention within the Best Reviewer award at ISWC 2019.
  • 2019-08: New claims were added to the ClaimsKG database.
  • 2019-07: I received a Best Student Paper award for my resources paper at ESWC 2019.
  • 2019-04: I was invited as a PC member to review Industry/In-use papers at ISWC 2019.
  • 2019-02: My paper "A Software Framework and Datasets for the Analysis of Graph Measures on RDF Graphs" was accepted at the ESWC 2019 resources track.
  • 2016-10: I was invited to the CESSDA Expert-Seminar to introduce Git and Git workflows.

Research Interests / Papers

My current research focuses on (RDF) graphs and the analysis of graph topologies of real-world and synthetic/benchmark datasets. I investigate features of these graph topologies for classification and domain-class prediction, for instance for the datasets in the LOD cloud. My aim is (a) to improve the quality of synthetic datasets in terms of their similarity to real-world data, and (b) to use the similarity of graph topologies as a measure for dataset search.
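As a rough illustration of what "features of a graph topology" can mean, the following minimal Python sketch computes a few degree-based measures from a list of RDF triples. It is not the lodcc framework's API; the function name `degree_measures`, the choice of measures, and the example triples are assumptions made purely for illustration. Each triple is treated as a directed edge from subject to object.

```python
from collections import Counter


def degree_measures(triples):
    """Compute simple degree-based measures for a directed multigraph.

    `triples` is an iterable of (subject, predicate, object) tuples;
    every triple counts as one edge from subject to object.
    (Hypothetical helper, not part of any existing framework.)
    """
    out_deg = Counter()
    in_deg = Counter()
    vertices = set()
    edges = 0
    for s, _p, o in triples:
        out_deg[s] += 1
        in_deg[o] += 1
        vertices.update((s, o))
        edges += 1

    n = len(vertices)
    return {
        "vertices": n,
        "edges": edges,
        # Mean in-degree equals mean out-degree: edges / vertices.
        "mean_degree": edges / n if n else 0.0,
        "max_in_degree": max(in_deg.values(), default=0),
        "max_out_degree": max(out_deg.values(), default=0),
        # Directed density; parallel edges can push this above 1.
        "density": edges / (n * (n - 1)) if n > 1 else 0.0,
    }


# Made-up example data, for illustration only.
example = [
    ("ex:alice", "foaf:knows", "ex:bob"),
    ("ex:alice", "foaf:knows", "ex:carol"),
    ("ex:bob", "foaf:knows", "ex:carol"),
    ("ex:carol", "rdf:type", "foaf:Person"),
]
measures = degree_measures(example)
print(measures)
```

Feature vectors of this kind, computed per dataset, are one possible input for comparing graph topologies; at the scale of billions of edges a dedicated graph library would be needed instead of plain Python dictionaries.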

  • Characterizing RDF Graphs through Graph-based Measures - Framework and Assessment
    Zloch, Matthäus and Acosta, Maribel and Hienert, Daniel and Conrad, Stefan and Dietze, Stefan. 2020.
    In The Semantic Web Journal. SWJ, under review
  • A Software Framework and Datasets for the Analysis of Graph Measures on RDF Graphs
    Zloch, Matthäus and Acosta, Maribel and Hienert, Daniel and Dietze, Stefan and Conrad, Stefan. 2019, Portoroz, Slovenia.
    In The Semantic Web, 16th Extended Semantic Web Conference. ESWC 2019, Best Student Paper Award
  • ClaimsKG: A Knowledge Graph of Fact-Checked Claims
    Tchechmedjiev, Andon and Fafalios, Pavlos and Boland, Katarina and Gasquet, Malo and Zloch, Matthäus and Zapilko, Benjamin and Dietze, Stefan and Todorov, Konstantin. 2019, Auckland, New Zealand.
    In The Semantic Web, 18th International Semantic Web Conference. ISWC 2019
  • Towards a Use Case Driven Evaluation of Database Systems for RDF Data Storage - A Case Study for Statistical Data
    Zloch, Matthäus and Hienert, Daniel and Conrad, Stefan. 2017, Vienna, Austria.
    In 16th International Semantic Web Conference, 2nd International Workshop on Benchmarking Linked Data. ISWC 2017
  • Methods for the Automatic Selection of Database Systems for Optimized Query Performance
    Zloch, Matthäus. 2016, Klagenfurt, Austria.
    In Ph.D Symposium of INFORMATIK 2016: Computer Science of humans for humans. INFORMATIK 2016
  • Improving web search results with explanation-aware snippets: an experimental study
    Wira-Alam, Andias and Zloch, Matthäus. 2013, Aachen, Germany.
    In 9th International Conference on Web Information Systems and Technologies. WEBIST 2013
  • Utilizing regular expressions for instance-based schema matching
    Zapilko, Benjamin and Zloch, Matthäus and Schaible, Johann. 2012.
    In 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference. ISWC 2012

Source Code

This is a collection of code and scripts that I've developed over time.

  • lodcc - framework
    A software framework to prepare and perform a graph-based analysis of the graph topology of large-scale RDF datasets. It lets you extract a range of measures from an RDF graph at large scale (e.g. the English DBpedia with 2.6 billion edges). See an example study, which was conducted on 280 datasets from the LOD Cloud.
  • spkosc03-slack-bot
    A Slack bot that notifies you via private message when your process has finished. This is convenient when you have long-running (scientific) processes on a server and have to wait for them to finish. It can also be used to monitor server swapping and low disk space.
  • Synthetic Dataset Generator
    "datazilla" - Generate realistic test data for any schema. Supports highly normalized and polymorphism-enabled schemas.
  • Metadata Discovery Vocabulary
    Implementation of the disco-spec, the DDI Discovery Vocabulary for social science data. The implementation lets you use the DISCO model as an object-oriented model in Java. An example implementation can be found in MISSY.


  • Working address: Matthäus Zloch, GESIS - Leibniz Institute for the Social Sciences, Unter Sachsenhausen 6-8, 50667 Cologne, Germany
  • Email: firstname.lastname at gesis dot org
  • Twitter: @matzlo
  • Skype: firstname.lastname