Contramont Research

The AI Safety Research Lab

Current projects: LM backdoors, real-world evals, scalable oversight



Recent work

Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits



Google for Nonprofits verification: yes Andrew Gritsevskiy actually is the president I swear I'm not scamming you please don't reject us a fifth time