GESIS Training: Sequence Analysis in the Social Sciences

Online-Workshop vom 21. bis 23. September 2020

Dozent(en): Prof. Dr. Marcel Raab, Dr. Emanuela Struffolino


This workshop introduces sequence analysis for social science research. Sequence analysis, originally developed in biology to analyze strings of DNA, has attracted increasing attention in the social sciences for the analysis of longitudinal data. Most applications study life course processes, including labor market careers, transitions to adulthood, or family formation. This workshop covers longitudinal data management (only briefly; with Stata), basic techniques of sequence analysis (with Stata, but mainly with R), as well as recent methodological developments tailored at social science research questions. Topics include different ways of calculating distances between sequences, cluster analysis after sequence analysis, sequences visualization, techniques for analyzing sequences' multidimensionality and the association between sequences' unfolding over time and independent variables. All methods are demonstrated with hands-on examples using Stata (SQ package, for data preparation and basic sequence analysis) and R (TraMineR package).


Everyone who is interested in the description and analysis of life courses and longitudinal processes in general (e.g., with time-use data).


The participants will learn:

  • How to prepare longitudinal data in Stata to use them for sequence analysis in R.
  • How to describe and visualize sequences with the powerful and flexible TraMineR package in R
  • How to use the outcomes from sequence analysis (distance matrices, measures of turbulence and complexity) in further analysis (cluster analysis; regression analysis; discrepancy analysis; implicative statistic for sequences of typical states; sequence analysis multistate model procedure)

In short, the participants will learn all the tools required to conduct and publish their own research applying sequence analysis.


Participants are expected to have some basic knowledge in preparing handling (longitudinal) data in Stata or R and to be familiar with basic quantitative methods (e.g., regression analysis).

Further Information (Link)