Pythia (Hugging Face)
EleutherAI’s public scaling suite: GPT-NeoX–architecture models from 70M to 12B parameters, all trained on the public Pile dataset in the same order across sizes, built for interpretability research.
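Each model exposes its intermediate checkpoints as Hub revisions. A minimal loading sketch, assuming the published `stepN` revision naming on the EleutherAI organization (any size/step pair works the same way):

```python
from transformers import AutoTokenizer, GPTNeoXForCausalLM

# Load the 70M deduped model at training step 3000; omit `revision`
# to get the final checkpoint from the default branch.
model = GPTNeoXForCausalLM.from_pretrained(
    "EleutherAI/pythia-70m-deduped",
    revision="step3000",
)
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pythia-70m-deduped")
```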
Why it is included
Pythia checkpoints remain heavily downloaded for `text-generation`, and the suite is the gold-standard baseline for mechanistic interpretability work.
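A quick sketch of that `text-generation` path via the Transformers pipeline API; the 410M size is an arbitrary choice here:

```python
from transformers import pipeline

# Greedy decoding keeps the smoke test deterministic.
generate = pipeline("text-generation", model="EleutherAI/pythia-410m")
result = generate("The Pile is", max_new_tokens=30, do_sample=False)
print(result[0]["generated_text"])
```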
Best for
Researchers studying training dynamics, memorization, and layer-wise behavior.
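One concrete shape such a study takes is tracking loss on a fixed probe string across training steps. A sketch, assuming the published checkpoint schedule (revisions every 1000 steps up to `step143000`, plus log-spaced early steps):

```python
import torch
from transformers import AutoTokenizer, GPTNeoXForCausalLM

repo = "EleutherAI/pythia-70m-deduped"
tok = AutoTokenizer.from_pretrained(repo)
inputs = tok("The capital of France is Paris.", return_tensors="pt")

# Loss on the probe at a few points in training; a drop suggests
# when the pattern was acquired (or memorized).
for step in ("step1000", "step16000", "step143000"):
    model = GPTNeoXForCausalLM.from_pretrained(repo, revision=step)
    with torch.no_grad():
        loss = model(**inputs, labels=inputs["input_ids"]).loss
    print(f"{step}: {loss.item():.3f}")
```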
Strengths
- Public intermediate checkpoints (154 per model, published as Hub revisions)
- Documented training data: the public Pile, with reproducible data order
- Reproducible scaling ladder: eight sizes trained under one consistent recipe
Limitations
- Not competitive with frontier chat models for production use
Good alternatives
OLMo · GPT-NeoX · TinyLlama
Related tools
GPT-NeoX
EleutherAI framework and 20B-class models for training large autoregressive LMs with 3D parallelism—Apache-2.0 training stack.
OLMo
Allen AI’s fully open LLM pipeline: weights, training code, data mixes, and evaluation; a research-transparency flagship.
Hugging Face Transformers
State-of-the-art pretrained models for PyTorch, TensorFlow, and JAX.
OPT (Hugging Face)
Meta’s Open Pretrained Transformer suite (125M–175B), released with detailed training logbooks; canonical Hub org `facebook`, models under `facebook/opt-*`.
Axolotl
YAML-configured fine-tuning for LLMs: LoRA, QLoRA, FSDP, and many architectures on top of Hugging Face trainers.
BLOOM
BigScience 176B multilingual causal LM—landmark collaborative open training effort on Jean Zay (weights under BigScience Responsible AI License).
