All publications

Abusaleh, A., Verma, B., & Mehler, A. (2026). TTLab at AraSentEval: SARF (صرف) Sentiment Analysis via Root-based Fusion for Multi-Dialectal Arabic . Proceedings of the 7th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT7).

Abusaleh, A., Verma, B., & Mehler, A. (2026). TTLab at AraSentEval: SARF (صرف) Sentiment Analysis via Root-based Fusion for Multi-Dialectal Arabic. Proceedings of the 7th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT7), Co-Located with the Language Resources and Evaluation Conference (LREC 2026). Palma, Mallorca, Spain.

ENTAILab - 2 (CIRCLET)

Arabic sentiment analysis is challenged by morphological complexity and lexical variation across Arabic dialects, compounded by subjectivity in how speakers and writers express sentiment. In this paper, we present our submission for the AraSentEval 2026 Shared Task on Arabic Dialect Sentiment Analysis. We propose SARF (صرف) a multi-view architectural framework that integrates surface-level context with stemmed and rooted morphological perspectives using a shared MARBERTv2 encoder. Our system employs a hybrid BERT-CNN-BiLSTM-Attention architecture to capture both local sentiment n-grams and global sequential dependencies. Experimental results show that while individual morphological normalization strategies (stemming or rooting) may degrade performance, their joint integration via cross-morphological attention provides robust features across diverse dialects. Our final system achieved a competitive macro-F1-score of 0.9263, ranking 2nd out of 15 participating teams.

Weiss, J., Burger, A., Roßmann, J., Meurer, J. E., Abusaleh, A. (2026). From Images to Topics: Evaluating Vision-Language Models for Topic Classification of Election Advertising . Companion Publication of the 2026 18th ACM Web Science Conference.

Weiss, J., Burger, A., Roßmann, J., Meurer, J. E., & Abusaleh, A. (2026). From Images to Topics: Evaluating Vision-Language Models for Topic Classification of Election Advertising. Companion Publication of the 2026 18th ACM Web Science Conference, 10–14. doi:10.1145/3795513.3807426

ENTAILab - 2 (CIRCLET) SmartDyn

Manual topic coding of election advertising images is highly time-consuming, yet increasingly required in survey-based research that collects photos and screenshots of campaign materials. We evaluate privacy-compliant, locally deployable vision–language models for automated topic classification of election advertising collected via a smartphone-based high-frequency panel survey during the 2025 German federal election. We compare two approaches: (1) direct image-to-topic classification using Llama 4 (109B) and Qwen2.5-VL-7B, and (2) a modular two-step pipeline in which these models first generate structured image descriptions that are subsequently classified by topic using Llama 4 (109B), Qwen2.5-VL-7B, and the gpt-oss-20b text-only model. Model outputs are evaluated against a human-coded reference of 500 images. Results show that the two-step pipeline substantially improves topic classification when combined with a strong text-based classifier, increasing Macro-F1 from 0.43 (best direct model) to 0.54 (best two-step model). The study provides methodological guidance for designing transparent and privacy-aware pipelines for automated analysis of heterogeneous political visuals in survey research.

Abusaleh, A., Hammerla, L. & Mehler, A. (2026). Learning to Detect Cross-Modal Negation: An Analysis of Latent Representations and an Attention-Based Solution . Proceedings of the 8th International Conference on Natural Language Processing (ICNLP).

Abusaleh, A., Hammerla, L., & Mehler, A. (2026). Learning to detect cross-modal negation: An analysis of latent representations and an attention-based solution. Paper accepted at the 8th International Conference on Natural Language Processing (ICNLP), 2026.

ENTAILab - 2 (CIRCLET)

Detecting high-level semantic concepts like negation across modalities remains a challenge for current multimodal systems. We analyze this as a fundamental representation learning problem, providing the first evidence that negation does not form a linearly or non-linearly separable class in the latent spaces of standard vision-language models (VLMs). We demonstrate that pretrained embeddings primarily encode modality-specific features, lacking a generalizable negation signal. To overcome this, we propose a novel cross-modal attention architecture that explicitly models inter-modal dependencies, achieving performance gains of up to +7.03% F1 over unimodal baselines. Our analysis reveals a key asymmetry: while textual negation often appears independently, visual negation is semantically dependent on linguistic context, a finding validated through our statistical analysis of 3,222 political video-text pairs automatically annotated via Qwen2.5-VL. By combining this analysis with self-supervised video representations (JEPA2), we advance the modeling of temporal negation. This work provides new methods and insights for learning robust, semantically-aligned representations in multimodal systems.

Artelt, C., Schenck-Fontaine, A., Kleinert, C., Liebig, S., Mehler, A., & Pollak, R. (2026). Infrastructure Priority Programme "New Data Spaces for the Social Sciences" (SPP 2431) - Programme Overview . New Data Spaces | Reports, No. 1.

Artelt, C., Schenck-Fontaine, A., Kleinert, C., Liebig, S., Mehler, A., & Pollak, R. (2026). Infrastructure Priority Programmme New Data Spaces for the Social Sciences (SPP 2431) – A Programme Overview (New Data Spaces Reports No. 1). Leibniz-Institute for Educational Trajectories.

This white paper provides a comprehensive programme description for the Infrastructure Priority Programme "New Data Spaces for the Social Sciences" (SPP 2431), explaining the scientific context and the urgent need to modernize traditional survey research. It outlines a strategic response to declining participation rates and rising fieldwork costs by integrating technological innovations into empirical social research.

The paper describes the four core research areas where methodological innovation is critical:

Exploration and Integration
Respondent-Driven Designs
Instrument Validity
Multimodal Data Acquisition

Additionally, the paper provides a detailed description of the programme’s structure, including its governance framework and the ENTAILab. As the central research hub, the ENTAILab facilitates the integration of these innovations into major existing panel studies and provides supports to individual projects in the areas of data quality, reproducibility, data protection, and ethical standards.

Borkowski, C., Abrami, G., Terefe, D., Baumartz, D. & Mehler, A. (2026). DUUIgateway: A Web Service for Platform Independent, Ubiquitous Big Data NLP . SoftwareX.

Borkowski, C., Abrami, G., Terefe, D., Baumartz, D., & Mehler, A. (2026). DUUIgateway: A web service for platform-independent, ubiquitous big data NLP. SoftwareX, 34, 102549.

ENTAILab - 2 (CIRCLET)

Distributed processing of unstructured text data is a challenge in the rapidly changing and evolving natural language processing (NLP) landscape. This landscape is characterized by heterogeneous systems, models, and formats, and especially by the increasing influence of AI systems. While many of these systems handle text data, there are also unified systems that process multiple input and output formats, while allowing for distributed corpus processing. However, there are hardly any user-friendly interfaces that allow existing NLP frameworks to be used flexibly and extended in a user-controlled manner. Due to this gap and the increasing importance of NLP for various scientific disciplines, there has been a demand for a web and API based flexible software solution for deploying, managing and monitoring NLP systems. Such a solution is provided by Docker Unified UIMA Interface-gateway. We introduce DUUIgateway and evaluate its API and user-driven approach to encapsulation. We also describe how these features improve the usability and accessibility of the NLP framework DUUI. We illustrate DUUIgateway in the field of process modeling in higher education and show how it closes the latter gap in NLP by making a variety of systems for processing text and multimodal data accessible to non-experts.

Bundan, D., Abrami, G., & Mehler, A. (2025). Multimodal Docker Unified UIMA Interface: New Horizons for Distributed Microservice-Oriented Processing of Corpora using UIMANew Horizons for Distributed Microservice-Oriented Processing of Corpora using UIMA. . Proceedings of the 21st Conference on Natural Language Processing (KONVENS 2025).

Bundan, D., Abrami, G., & Mehler, A. (2025, September). MULTIMODAL DOCKER UNIFIED UIMA INTERFACE: New Horizons for Distributed Microservice-Oriented Processing of Corpora using UIMA. In Proceedings of the 21st Conference on Natural Language Processing (KONVENS 2025): Long and Short Papers (pp. 257-268).

ENTAILab - 2 (CIRCLET)

In addition to textual corpora, there are multimodal corpora that contain a significant amount of data from a variety of codes (e.g., iconographic, textual) that are currently made processable by only a few tools. What the research community needs here is an effective, distributed system that provides a processing pipeline for the integration of reusable tools for analyzing such corpora. Such systems currently exist for text corpora, but rarely for video corpora. We present MULTIMODAL DOCKER UNIFIED UIMA INTERFACE as an extension of DUUI that fills this gap by enabling annotation and processing of video corpora based on the UIMA standard

Koch, T., Jaehne, M. F., Riediger, M., Rauers, A., & Holtmann, J. (2025). Idiographic Interrater Reliability Measures for Intensive Longitudinal Multirater Data. . PsychArchives.

Koch, T., Jaehne, M. F., Riediger, M., Rauers, A., & Holtmann, J. (2025). Idiographic Interrater Reliability Measures for Intensive Longitudinal Multirater Data. PsychArchives.

SHERPA

Interrater reliability plays a crucial role in various areas of psychology. In this article, we propose a multilevel latent time series model for intensive longitudinal data with structurally different raters (e.g., self-reports and partner reports). The new MR-MLTS model enables researchers to estimate idiographic (person-specific) rater consistency coefficients at both the dynamic and momentary level. Additionally, the model allows rater consistency coefficients to be linked to external explanatory or outcome variables. It can be implemented in Mplus as well as in the newly developed R package mlts. We illustrate the model using data from an intensive longitudinal multirater study involving 100 heterosexual couples (200 individuals) assessed across 86 time points. Our findings show that relationship duration and partner cognitive resources positively predict momentary, but not dynamic, rater consistency. Results from a simulation study indicate that the number of time points is critical for accurately estimating idiographic rater consistency coefficients, whereas the number of participants is important for accurately recovering the random effect variances. We discuss advantages, limitations, and future extensions of the MR-MLTS model.

Bönisch, K., Abrami, G., & Mehler, A. (2025). Towards Unified, Dynamic and Annotation-based Visualisations and Exploration of Annotated Big Data Corpora with the Help of UNIFIED CORPUS EXPLORER. . Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (System Demonstrations) .

Bönisch, K., Abrami, G., & Mehler, A. (2025, April). Towards Unified, Dynamic and Annotation-based Visualisations and Exploration of Annotated Big Data Corpora with the Help of UNIFIED CORPUS EXPLORER. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (System Demonstrations) (pp. 522-534).

ENTAILab - 2 (CIRCLET)

The annotation and exploration of large text corpora, both automatic and manual, presents significant challenges across multiple disciplines, including linguistics, digital humanities, biology, and legal science. These challenges are exacerbated by the heterogeneity of processing methods, which complicates corpus visualization, interaction, and integration. To address these issues, we introduce the Unified Corpus Explorer (UCE), a standardized, dockerized, open-source and dynamic Natural Language Processing (NLP) application designed for flexible and scalable corpus navigation. Herein, UCE utilizes the UIMA format for NLP annotations as a standardized input, constructing interfaces and features around those annotations while dynamically adapting to the corpora and their extracted annotations. We evaluate UCE based on a user study and demonstrate its versatility as a corpus explorer based on generative AI. Received Best Demo Award.

Hofmann, M., Jansen, M., Wigbels, C., Briesemeister, B., & Jacobs, A. (2025). Individual Text Corpora Predict Openness, Interests, Knowledge and Level of Education . Proceedings of the 8th Workshop on Cognitive Aspects of the Lexicon (CogALex-VIII), LREC/Coling 2024.

Hofmann, M., Jansen, M., Wigbels, C., Briesemeister, B., & Jacobs, A. (2024, March). Individual text corpora predict openness, interests, knowledge and level of education. In Proceedings of the 8th Workshop on Cognitive Aspects of the Lexicon (CogALex-VIII), LREC/Coling 2024 (Turin, Italy)

ITC

Here we examine whether the personality dimension of openness to experience can be predicted from the individual google search history. By web scraping, individual text corpora (ICs) were generated from 214 participants with a mean number of 5 million word tokens. We trained word2vec models and used the similarities of each IC to label words, which were derived from a lexical approach of personality. These IC-label-word similarities were utilized as predictive features in neural models. For training and validation, we relied on 179 participants and held out a test sample of 35 participants. A grid search with varying number of predictive features, hidden units and boost factor was performed. As model selection criterion, we used R2 in the validation samples penalized by the absolute R2 difference between training and validation. The selected neural model explained 35% of the openness variance in the test sample, while an ensemble model with the same architecture often provided slightly more stable predictions for intellectual interests, knowledge in humanities and level of education. Finally, a learning curve analysis suggested that around 500 training participants are required for generalizable predictions. We discuss ICs as a complement or replacement of survey-based psychodiagnostics.

Leonard, M. McKone (2025). Conducting Respondent-Driven Sampling with Ethnic Minority Populations: The State of the Field . Survey Practice.

Leonard, M. M. (2025). Conducting Respondent-Driven Sampling with Ethnic Minority Populations: The State of the Field. Survey Practice, 19.

RDS

Ethnic minorities are often underrepresented in survey research, due to the challenges many researchers face in including these populations. Respondent-driven sampling (RDS) was developed in the late 1990s in order to investigate populations otherwise "hidden" from researchers due to a lack of extant sampling frames. RDS relies on individuals who recruit their fellow population members, allowing samples to grow through network linkages. RDS holds promise for recruiting ethnic minority respondents, and its use has steadily increased since the early 2000s. However, practicable guidance for implementing RDS with these populations is scarce. To address this methodological gap, I present the results of a scoping review of RDS studies targeting ethnic minority populations. I find that it is possible to conduct successful RDS studies with a range of ethnic minority populations. However, researchers intending to work with these populations must consider the intersectional nature of these populations' "hiddenness", including economic, educational, linguistic, legal, political, and social vulnerabilities, through all stages of the study design process.

Abrami, G., Genios, M., Fitzermann, F., Baumartz, D. , Mehler, A. (2025). Docker Unified UIMA Interface: New perspectives for NLP on big data . SoftwareX.

Abrami, G., Genios, M., Fitzermann, F., Baumartz, D., & Mehler, A. (2025). Docker Unified UIMA Interface: New perspectives for NLP on big data. SoftwareX, 29, 102033.

ENTAILab - 2 (CIRCLET)

Processing large amounts of natural language text using machine learning-based models is becoming important in many disciplines. This demand is being met by a variety of approaches, resulting in the heterogeneous deployment of separate, partly incompatible, not natively scalable applications. To overcome the technological bottleneck involved, we have developed Docker Unified UIMA Interface, a system for the standardized, parallel, platform-independent, distributed and microservices-based solution for processing large and extensive text corpora with any NLP method. We present DUUI as a framework that enables automated orchestration of GPU-based NLP processes beyond the existing Docker Swarm cluster variant, and in addition to the adaptation to new runtime environments such as Kubernetes. Therefore, a new driver for DUUI is introduced, which enables the lightweight orchestration of DUUI processes within a Kubernetes environment in a scalable setup. In this way, the paper opens up novel text-technological perspectives for existing practices in disciplines that deal with the scientific analysis of large amounts of data based on NLP.

Hase, V., Jef, A., Laura, B., Nico, P., Heleen, J., Theo, A., Thijs, C., Claes, D.V., Jörg, H., Felicia, L. and Kmetty, Z. (2024). Fulfilling data access obligations: How could (and should) platforms facilitate data donation studies? . Internet Policy Review.

Hase, V., Jef, A., Laura, B., Nico, P., Heleen, J., Theo, A., ... & Mario, H. (2024). Fulfilling data access obligations: How could (and should) platforms facilitate data donation studies?. Internet policy review: Journal on internet regulation, 13(3).

Data Donation

Research into digital platforms has become increasingly difficult. One way to overcome these difficulties is to build on data access rights in EU data protection law, which requires platforms to offer users a copy of their data. In data donation studies, researchers ask study participants to exercise this right and donate their data to science. However, there is increasing evidence that platforms do not comply with designated laws. We first discuss the obligations of data access from a legal perspective (with accessible, transparent, and complete data as key requirements). Next, we compile experiences from social scientists engaging in data donation projects as well as a study on data request/access. We identify 14 key challenges, most of which are a consequence of non-compliance by platforms. They include platforms’ insufficient adherence to (a) providing data in a concise and easily accessible form (e.g. the lack of information on when and how subjects can access their data); (b) being transparent about the content of their data (e.g. the lack of information on measures); and (c) providing complete data (e.g. the lack of all available information platforms process related to platform users). Finally, we formulate four central recommendations for improving the right to access.

Publications

Call for Proposals for Project Period 2027-2023 now open!