Third-party evaluation to identify risks in LLMs’ training data

October 31, 2024 · Irina Bejan, Curtis Huebner

Llemma: An Open Language Model For Mathematics

October 16, 2023 · Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, Marco Dos Santos, Stephen McAleer, Albert Q. Jiang, Jia Deng, Stella Biderman, Sean Welleck

Minetester: A fully open RL environment built on Minetest

An overview of Minetester and preliminary work

July 8, 2023 · Curtis Huebner, Robert Klassert, Stepan Shabalin, Edwin Fennell, Delta Hessler

Alignment Research @ EleutherAI

A brief overview of EAI's approach to alignment

May 3, 2023 · Curtis Huebner

Exploratory Analysis of TRLX RLHF Transformers with TransformerLens

A demonstration of interpretability for RLHF models

April 2, 2023 · Curt Tigges

Announcing GPT-NeoX-20B

Announcing GPT-NeoX-20B, a 20 billion parameter model trained in collaboration with CoreWeave.