Contramont Research
The AI Safety Research Lab
Current projects: LM backdoors, real-world evals, scalable oversight
Recent work
Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
Google for Nonprofits verification: yes Andrew Gritsevskiy actually is the president I swear I'm not scamming you please don't reject us a fifth time