Skip to content
forked from arnim/stmdemo

Topic- and Structured Topic Modeling

Notifications You must be signed in to change notification settings

gesiscss/stmdemo

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Topic- and Structured Topic Modeling

You can try it:

Binder

Contents

These notebooks provide an introduction to Natural Language Processing (NLP) in R with an emphasis on Topic Modeling. The notebooks touche on issues ranging from data pre-processing and import to model inference and visualization. Readers will get a first insight into the practical aspects of text mining and relevant R packages for Quantitative Analysis of Textual Data and Structural Topic Modeling. The notebooks are intended as an extension of the talk “Introduction to topic modeling research” by Cornelius Puschmann at the BIGSSS Summer Schools in Computational Social Science in Bremen. The Jupyter version of the notebook presented by Cornelius Puschmann is included.

Set up your environment

You can run this tutorial online using the GESIS Notebooks or the MyBinder service. You can start it by using this link or by clicking the 'launch binder' button below.

Binder

For running these notebooks on your machiene you will need to have installed the packages quanteda, stm and tidyverse. you can do this using RStudio menu, or executing the following commands in your R console

install.packages("quanteda")
install.packages("stm")
install.packages("tidyverse")

Related Resources

  • Practical Introduction to Text Mining (link)
  • Text Mining for Social Scientists and Digital Humanists (link)
  • Learning Structural Topic Modeling (link)
  • quanteda tutorials (link)
  • Text Mining with R by by Kohei Watanabe and Stefan Müller (link)

Lecturer: Arnim Bleier is a postdoctoral researcher in the Department Computational Social Science at GESIS. His research interests are in the field of Natural Language Processing and Computational Social Science. In collaboration with social scientists, he develops Bayesian models for the content, structure and dynamics of social phenomena.


Funded by the German Research Foundation (DFG). FKZ/project number: 324867496.

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 94.6%
  • R 5.4%