Statistics and Data Science

We conduct research in three areas and their intersection: psychometric modeling, machine learning methods, and multilevel and longitudinal modeling. On the one hand, we assess existing tools, test their applicability, and demonstrate opportunities and pitfalls for analyses in psychological research. On the other hand, we develop new tools and extend existing tools to offer new possibilities for data analysis and hypothesis generation. As our main methodological approach, we use simulation studies conducted in the open-source software R and illustrations of the new methods using empirical data from psychology.


Starting in February 2025, we will investigate and extend machine learning models so that they can also be applied to hierarchically structured and longitudinal data in psychology. This allows empirical researchers in psychology to investigate nonlinear dynamics of psychological processes in an exploratory manner when their data is hierarchically structured.


Our Research in Detail

psychometrics

Psychometric Modeling

Psychometric modeling tackles challenges in the area of psychological testing and measurement. We examine response biases in self-report measures, e.g., in personality measurement. We also examine methods for differential item and differential step functioning. These phenomena occur when certain items are more difficult for respondents with certain background characteristics. In our research group, we examine and develop tools that support researchers to improve data quality by extending psychometric modeling approaches or identifying relevant background variables, for example using machine learning methods.

machine_learning

Machine Learning & Interpretable Machine Learning

Machine learning models, such as decision trees or random forests, are robust, yet powerful methods to capture and interpret complex dynamics and non-linear effects of predictor variables on outcomes. At the same time, many machine learning models are so-called "black boxes" that do not allow us to see how it has come to its prediction. In our research group, we assess machine learning models and interpretability techniques for their applicability in psychological research. We test how they react to data structures that are typical in psychology, such as correlated predictor variables or data with a multilevel structure. We also extend machine learning methods, for example by tools from psychometrics, to enhance interpretation.

multilevel

Multilevel and Longitudinal Modeling

Many data in psychology are hierarchically clustered, for example when students are clustered in classes, or multiple measurements are clustered in persons. Multilevel modeling has become an important statistical tool to study humans in contexts, but also to distinguish intra- and interindividual psychological processes. In our research group, we extend the toolbox for the analysis of multilevel data. We assess how machine learning models can be used as a reliable tool to analyze multilevel and longitudinal data. We also develop new psychometric approaches that help to assess data quality issues in intensive longitudinal assessments.