methodology

Using copulas for making calibrated data generating processes (DGPs) for simulation

by Luke Miratrix

on May 16, 2025 · 18 min read · simulation ·

In a couple of simulation projects I have been working on, we wanted to generate synthetic data that closely mimicked the structure of a given empirical dataset. In one example, we wanted to generate new observations \((X, Y)\), where \(X\) was a vector of covariates (some continuous, some categorical) and \(Y\) was a …

Comparing ATE estimators in multisite and cluster randomized trials

by Luke Miratrix

on Apr 4, 2025 · 11 min read · multisite trials blocking randomized trials cluster randomized trials ·

It is a sad truth that the same data can be analyzed in different ways to get different results. Also, multiple models might be reasonable choices for a given dataset. Still, it is not a great feeling to wonder whether what seems like a subjective design decision is driving the results. In some ongoing work, we are …

Fine-Tuning ChatGPT for Essay Grading

by Youngwon Kim

on Nov 24, 2024 · 29 min read · Automated Essay Scoring ChatGPT coding Large Language Model Fine-tuning ·

Our “Essay Grading with ChatGPT” blog post series has unveiled the potential of ChatGPT for essay grading. We started with the fundamentals of the ChatGPT API and gradually explored the art of crafting effective prompts, building a solid …

The Art of Crafting Prompts for Essay Grading with ChatGPT

by Youngwon Kim

on Jun 3, 2024 · 24 min read · Automated Essay Scoring ChatGPT coding Large Language Model ·

This second entry in the 'Essay Grading with ChatGPT' series delves deeper into this challenge, comparing the outcomes of essay grading based on different prompts (the instructions we give to ChatGPT) to optimize AI for educational purposes.

How to Grade Essays with ChatGPT

by Youngwon Kim

on May 29, 2024 · 18 min read · Automated Essay Scoring ChatGPT Large Language Model Coding ·

The rise of large language models (LLMs) like OpenAI’s ChatGPT has opened exciting possibilities in essay grading. With its advanced natural language processing capabilities, ChatGPT offers a new dimension in assessing written work, potentially revolutionizing the grading …

Designing Experiments Toward Shrinkage Estimation

by Evan Rosenman and Luke Miratrix

on May 15, 2024 · 13 min read · MLM multisite visualizations coding ·

Estimating subgroup impacts in an RCT can be hard. An RCT by itself is usually underpowered for this task: we barely have enough data to estimate an overall average, and because subgroups are smaller, their estimates are noisier! One idea recently gaining increased traction is to augment an RCT with observational data. We might use …

Plotting distributions of site-level impact estimates (or other collections of noisily estimated things)

by Luke Miratrix

on Apr 23, 2024 · 19 min read · MLM multisite visualizations coding ·

Do you ever want to visualize the distribution of effects across sites in a multisite evaluation (or meta-analysis)? For example, consider a multisite trial with 30 sites, where each site is effectively a small randomized experiment. A researcher might fit a multilevel model with a random effect for the impact in each …

Drawing a Line Between Sample Statistics and Population Inferences

by Eddie Kim

on Sep 29, 2022 · 22 min read · interpretation inference ·

A shady figure presents you with a game. They have a deck of cards numbered 1, 2, 3, or 4, the exact distribution of which you do not know. They randomly shuffle and separate the deck into two equally sized piles, facedown, and after checking the piles for themselves, reveal some cards from each. From the first set: 4, …