The EleutherAI Research Notes are an experiment in sharing bites of preliminary research results through short and informal posts rather than the occasional paper. Our hope for this format is to encourage low-friction sharing of ideas in the fast-paced world of modern research.
Pretraining Data Filtering for Open-Weight AI Safety
Announcing Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs