
Best of AI safety research on frontier models. Obviously, subjective.
| Field | Value |
|---|---|
| Platform | |
| Pricing | Only free issues |
| Publishes | Monthly |
| Issues | 29 |
| Founded | 2 years ago |
| Last Issue | 4 months ago |
| Active | |

Papers of the month:
Activation probes achieve production-ready jailbreak robustness at orders-of-magnitude lower cost than LLM classifiers, with probe-first cascades now deployed at both Anthropic and Google DeepMind.
Paper of the month:
Auditing game shows that sandbagging detection remains difficult—only on-distribution finetuning can reliably remove sandbagging, while detection suffers from false positives.
Paper of the month:
Reward hacking in production RL can naturally induce broad misalignment including alignment faking and sabotage attempts.
Paper of the month:
Synthetic Document Finetuning creates deep, robust beliefs that withstand adversarial scrutiny, though egregiously false facts still make these beliefs detectable, as do other knowledge editing methods and narr...
Paper of the month:
Deliberative alignment substantially reduces scheming behaviors in reasoning models, but covert actions persist and improvements partly reflect evaluation awareness rather than genuine alignment.
Subscribers, engagement, traffic and sponsorship for AI Safety at the Frontier.
| Metric | Value |
|---|---|
| Subscribers | |
| Engagement | 64 |
| Monthly Web Visits | |
| Accepts Sponsors | |
| Estimated Cost per Ad | |
The writers behind this newsletter.
Research scientist working on AI safety & alignment