Cares Blog
Blog
C.A.R.E.S. Lab
Publications
People
Resources
  • Using copulas for making calibrated data generating processes (DGPs) for simulation

    by Luke Miratrix
    on May 16, 2025 · 18 min read · simulation  ·
    Share on:
    Using copulas for making calibrated data generating processes (DGPs) for simulation

    In a couple of simulation projects I have been working on, we wanted to generate synthetic data that closely mimicked the structure of a given empirical dataset. In one example, we wanted to generate new observations \((X, Y)\), where \(X\) was a vector of covariates (some continuous, some categorical) and \(Y\) was a …

    Read More
  • Comparing ATE estimators in multisite and cluster randomized trials

    by Luke Miratrix
    on Apr 4, 2025 · 11 min read · multisite trials blocking randomized trials cluster randomized trials  ·
    Share on:
    Comparing ATE estimators in multisite and cluster randomized trials

    It is a sad truth that the same data can be analyzed different ways to get different results. Also, multiple models might be reasonable choices for a given dataset. Still, it is not a great feeling to wonder whether what seems like a subjective design decision is driving the results. In some ongoing work, we are …

    Read More
  • Fine-Tuning ChatGPT for Essay Grading

    by Youngwon Kim
    on Nov 24, 2024 · 29 min read · Automated Essay Scoring ChatGPT coding Large Language Model Fine-tuning  ·
    Share on:
    Fine-Tuning ChatGPT for Essay Grading

    A Comprehensive Guide to Fine-Tuning ChatGPT for Essay Grading Introduction Our “Esssay Grading with ChatGPT” blog post series have unveiled the potential of ChatGPT for essay grading. We started with the fundamentals of the ChatGPT API and gradually explored the art of crafting effective prompts, building a solid …

    Read More
  • The Art of Crafting Prompts for Essay Grading with ChatGPT

    by Youngwon Kim
    on Jun 3, 2024 · 24 min read · Automated Essay Scoring ChatGPT coding Large Language Model  ·
    Share on:
    The Art of Crafting Prompts for Essay Grading with ChatGPT

    This second entry in the 'Essay Grading with ChatGPT' series delves deeper into this challenge, comparing the outcomes of essay grading based on different prompts (the instructions we give to ChatGPT) to optimize AI for educational purposes.

    Read More
  • How to Grade Essays with ChatGPT

    by Youngwon Kim
    on May 29, 2024 · 18 min read · Automated Essay Scoring ChatGPT Large Language Model Coding  ·
    Share on:
    How to Grade Essays with ChatGPT

    How to Grade Essays with ChatGPT Introduction The rise of large language models (LLMs) like OpenAI’s ChatGPT has opened exciting possibilities in essay grading. With its advanced natural language processing capabilities, ChatGPT offers a new dimension in assessing written work, potentially revolutionizing the grading …

    Read More
  • Designing Experiments Toward Shrinkage Estimation

    by Evan Rosenman and Luke Miratrix
    on May 15, 2024 · 13 min read · MLM multisite visualizations coding  ·
    Share on:
    Designing Experiments Toward Shrinkage Estimation

    Estimating subgroup impacts in an RCT can be hard. An RCT by itself is usually underpowered for this task–we barely have enough data to give us an overall average, and as subgroups are smaller, they are noisier! One idea recently gaining increased traction is to augment an RCT with observational data. We might use …

    Read More
  • Plotting distributions of site-level impact estimates (or other collections of noisily estimated things)

    by Luke Miratrix
    on Apr 23, 2024 · 19 min read · MLM multisite visualizations coding  ·
    Share on:
    Plotting distributions of site-level impact estimates (or other collections of noisily estimated things)

    Do you ever want to visualize the distribution of effects across sites in a multi-site evaluation (or meta analysis)? For example, consider a multisite trial with 30 sites, where each site is effectively a small randomized experiment. A researcher might fit a multilevel model with a random effect for the impact in each …

    Read More
  • Drawing a Line Between Sample Statistics and Population Inferences

    by Eddie Kim
    on Sep 29, 2022 · 22 min read · interpretation inference  ·
    Share on:
    Drawing a Line Between Sample Statistics and Population Inferences

    A shady figure presents you with a game. They have a deck of cards numbered 1, 2, 3, or 4, the exact distribution of which you do not know. They randomly shuffle and separate the deck into two equally sized piles, facedown, and after checking the piles for themselves, reveal some cards from each. From the first set: 4, …

    Read More

HGSE Logo

A blog about Causality, Applications, and Research in Education and Statistics.

From the C.A.R.E.S. Lab at the Harvard Graduate School of Education
Director: Luke Miratrix, Associate Professor

Read More

Featured Posts

  • Using copulas for making calibrated data generating processes (DGPs) for simulation
  • Comparing ATE estimators in multisite and cluster randomized trials
  • Fine-Tuning ChatGPT for Essay Grading
  • The Art of Crafting Prompts for Essay Grading with ChatGPT
  • To block or not to block, that is the question
  • Drawing a Line Between Sample Statistics and Population Inferences
  • So, you decided to write your article in R Markdown

Recent Posts

  • Using copulas for making calibrated data generating processes (DGPs) for simulation
  • Comparing ATE estimators in multisite and cluster randomized trials
  • Fine-Tuning ChatGPT for Essay Grading
  • The Art of Crafting Prompts for Essay Grading with ChatGPT
  • How to Grade Essays with ChatGPT
  • Designing Experiments Toward Shrinkage Estimation
  • Plotting distributions of site-level impact estimates (or other collections of noisily estimated things)
  • Exploring power with the PUMP package

Categories

METHODOLOGY 8 TUTORIAL 6 REFLECTIONS 3

Tags

CODING 9 MULTISITE 4 AUTOMATED-ESSAY-SCORING 3 CHATGPT 3 INTERPRETATION 3 LARGE-LANGUAGE-MODEL 3 VISUALIZATIONS 3 BLOCKING 2 MATCHING 2 MLM 2 MULTISITE-TRIALS 2 CLUSTER-RANDOMIZED-TRIALS 1 DIFF-IN-DIFF 1 FINE-TUNING 1
All Tags
AUTOMATED-ESSAY-SCORING3 BLOCKING2 CHATGPT3 CLUSTER-RANDOMIZED-TRIALS1 CODING9 DIFF-IN-DIFF1 FINE-TUNING1 INDEX1 INFERENCE1 INTERPRETATION3 LARGE-LANGUAGE-MODEL3 LOGISTIC1 MATCHING2 MLM2 MULTISITE4 MULTISITE-TRIALS2 NEYMAN1 POWER1 RANDOMIZED-TRIALS1 RMARKDOWN1 SIMULATION1 VISUALIZATIONS3
[A~Z][0~9]
       Cares Blog © 2021 The President and Fellows of Harvard College * Privacy * Accessibility * Digital Accessibility * Report Copyright Infringement