You May Also Enjoy
New Preprint: An Empirical Investigation into the Utility of Large Language Models in Open-Ended Survey Data Categorization
1 minute read
I have a new preprint out on SocArXiv: An Empirical Investigation into the Utility of Large Language Models in Open-Ended Survey Data Categorization.
Cleaning data with AI: An example with CatLLM
3 minute read
Presenting CatLLM at the University of Washington
3 minute read
On October 29th, 2025, I had the privilege of presenting at the University of Washington on how large language models can augment social science research. The presentation focused on CatLLM, an open-source Python package I developed to address a common challenge in demographic and social science research: analyzing open-ended survey responses and complex data at scale.
CatLLM Now Builds Custom Datasets from Web Data
1 minute read