My research interest is web science, “the emergent science of the
people, organizations, applications, and of policies that shape and
are shaped by the Web, the largest informational artifact constructed
by humans in history” (from the call for
papers
of the ACM Web Science conference). Thereby, my research is situated
in computer science, with multi-disciplinary connections to
information science, psychology, sociology and economics. Relevant projects:Unknown Data,
Web Collections
Our collaborative tagging system BibSonomy is both a
valuable tool for researchers to organize their literature as well as
a test-bed for our methods and results. In that context, I am
interested in the development and integration of recommendation
methods for tags and scholarly articles for social bookmarking
systems. Further topics of interest include citation and link
analysis, entity matching and resolution, and social network analysis.
I extensively leverage big data technologies like Hadoop, HBase, or
Elasticsearch for my research, for instance, to analyze crawled web
pages of academic institutions in the context of our German Academic
Web archive.
“Liebe & Tod in der Deutschen Nationalbibliothek: Der DNB-Katalog
als Forschungsobjekt der digitalen Literaturwissenschaft” held at
Erschließen, Forschen, Analysieren –
EFA22@DNB, September 8, 2022,
Frankfurt, Germany
(Slides)
“What matters? Key passages in literary works” held at DHLunch@GS, March 21, 2022, University of Texas at Austin, USA
“What matters? Key passages in literary works” held at DH Coffee Talks, December 16, 2021, Humboldt-Universität zu Berlin, Germany (Video)
“Datenschätze selber heben: Data Science und Bibliotheken” held at Universitätsbibliothek Zürich,
June 5, 2021, Zürich, Switzerland (Slides)
“Datenschätze selber heben: Data Science und Bibliotheken” held at Universitätsbibliothek Johann Christian Senckenberg, February 25, 2020, Frankfurt, Germany (Slides)
“Uncovering Hidden Treasures: Libraries and Data Science” held at
KNVI Smart Humanity
Conference, November 15,
2019, Amsterdam, The Netherlands
(Slides)
“Datenschätze selber heben: Data Science und Bibliotheken” held at 107. Deutscher Bibliothekartag, June 13, 2018, Berlin, Germany (Slides)
“Identifying and Analyzing Researchers on Twitter” held at Twitter
Workshop, Göttingen Center for Digital Humanities, June 12 2014,
Göttingen, Germany (PDF)
Tutorial “Ontology Learning from Folksonomies” held at
Notebooks – A
collection of Jupyter Notebooks showing different features and
analyses; including exemplary and excellent computational essays of
our students from the data analysis module.
FCA4DH – A collection of slides and information for the application of Formal Concept Analysis (FCA) in, and by, Digital Humanities (DH).
DH@HU – The Digital Humanities Network at Humboldt-Universität zu Berlin.
Our social bookmark and publication sharing system
BibSonomy is online since 2006 with me being the main
developer from 2005 to 2012. Since 2009 I am leading the development
and operation of the system together with Andreas
Hotho. If you
are interested in a cooperation, just let me know.
Together with the University Library Kassel we have extended the
BibSonomy platform in the DFG-funded
PUMA project for academic publication
management. If you are interested in using PUMA, please
contact us ().
As part of the BMBF-funded AI-SKILLS
project we build an
application-oriented infrastructure for AI communities in
teaching-learning settings at Humboldt-Universität zu Berlin
(2021-2025).
Together with LZI (home of
DBLP) and GESIS we are
mining and consolidating research dataset metadata from the Web in
this
DFG-fundedresearch project (2021-2025).
Is Expert Knowledge Key? Scholarly Interpretations as Resource for the Analysis of Literary Texts in Computational Literary Studies #
We are developing novel methods to identify and extract Vossian
antonomasia from large newspaper
corpora. The approaches based on deep learning enable us to study this
linguistic device on a large scale. Code, data, statistics, and many
examples are available on our project page and GitHub repository.
Together with Frank Fischer and Mathias
Göbel we are writing about Digital
Humanities in general and our research on world literature in
particular on weltliteratur.net – a
black market for the digital humanities.
FolkRank is an algorithm for search and
ranking
in collaborative tagging systems. It has been integrated into the
community support architecture of the social semantic desktop
developed by the NEPOMUK
project. The source code is
available from the project’s SVN
repository.