Reletter

AI Safety at the Frontier

Johannes Gasteiger

Best of AI safety research on frontier models. Obviously, subjective.

Platform: Substack
Pricing: Only free issues
Publishes: Monthly
Issues: 29
Founded: 2 years ago
Last Issue: 4 months ago
Active

Read this Newsletter

aisafetyfrontier.substack.com

Latest Issues

Paper Highlights of January 2026

tl;dr

Papers of the month:

Activation probes achieve production-ready jailbreak robustness at orders-of-magnitude lower cost than LLM classifiers, with probe-first cascades now deployed at both Anthropic and Google DeepMind.

Research h...

4 months ago

Paper Highlights of December 2025

tl;dr

Paper of the month:

Auditing game shows that sandbagging detection remains difficult—only on-distribution finetuning can reliably remove sandbagging, while detection suffers from false positives.

Research highlights:

  • Asynchr...
4 months ago

Paper Highlights of November 2025

tl;dr

Paper of the month:

Reward hacking in production RL can naturally induce broad misalignment including alignment faking and sabotage attempts.

Research highlights:

  • Increasing model honesty via finetuning works best but lie de...
6 months ago

Paper Highlights of October 2025

tl;dr

Paper of the month:

Synthetic Document Finetuning creates deep, robust beliefs that withstand adversarial scrutiny, though egregiously false facts still make these beliefs detectable, as do other knowledge editing methods and narr...

7 months ago

Paper Highlights, September '25

tl;dr

Paper of the month:

Deliberative alignment substantially reduces scheming behaviors in reasoning models, but covert actions persist and improvements partly reflect evaluation awareness rather than genuine alignment.

Research high...

8 months ago

Key Facts

Contact Information
Newsletter Author
Number of Subscribers
Find out how many people subscribe to this newsletter.

Audience Metrics

Subscribers, engagement, traffic and sponsorship for AI Safety at the Frontier.

Subscribers
Engagement
64
Monthly Web Visits
Accepts Sponsors
Estimated Cost per Ad

Authors

The writers behind this newsletter.

  • Johannes Gasteiger

    Research scientist working on AI safety & alignment

Frequently Asked Questions

    How can I access the email archive for AI Safety at the Frontier?

    You can find recent issues that have been published by AI Safety at the Frontier on Reletter by scrolling up to where it says Latest Issues. Tap on the link for any of the most recent emails or hit More Issues to see older ones.

    How many subscribers does AI Safety at the Frontier have?

    To see how many people subscribe to AI Safety at the Frontier, simply upgrade your Reletter account. We provide readership numbers and lots of other stats for this newsletter so you can decide if it's worth reaching out to.

    How can I advertise in AI Safety at the Frontier?

    Newsletter advertising can be extremely effective when it's done right. Before you pitch AI Safety at the Frontier as a potential sponsor or partner, make sure that you've done your research and checked its newsletter stats with Reletter.

    Then, personalize one of our winning pitching templates and send it to the right person using the contact info provided.

    How much does it cost to sponsor a publication like AI Safety at the Frontier?

    Newsletter ad rates (or CPM) vary depending on many factors, including industry, number of subscribers, open rate, ad placement and more.

    To find out how much an ad will cost, contact AI Safety at the Frontier using the contact information provided and ask for a copy of their media kit.

    How can I find newsletters related to AI Safety at the Frontier?

    Scroll up to where it says Related Newsletters to see other publications like AI Safety at the Frontier. You can also search our email newsletter directory to discover other newsletters that cover the topics you're interested in.

    How do I contact AI Safety at the Frontier?

    Reletter provides this newsletter's website URL above, where you will often find their contact information. We also provide links to associated social media accounts and pitching templates so you can reach out fast.