Reletter
Artwork for AI Safety Papers

AI Safety Papers

Xerxes Dotiwalla

Digests of new AI safety papers from arxiv, ~weekly. Full list here: https://tinyurl.com/ai-safety-papers

Platform
Substack
PricingOnly free issuesPublishesWeekly
Issues62SubscribersRead onexerxes.substack.com

Curious about how many subscribers AI Safety Papers has or want to find similar newsletters? Reletter has got you covered. We collated all the information we could find from across the web in our database of over three million newsletters.

Check the email archives, get traffic estimates, engagement scores and more to discover the best advertising opportunities.

Our search tool helps you locate relevant newsletters for any topic and compare their stats for better sponsorship decisions.

Contact Information
How Many Subscribers?
Reletter gives you subscriber numbers, contacts, chart rankings, traffic estimates and more across 3m+ newsletters.

Latest Issues

Recent posts by this newsletter. Browse the email archive.

Mythos system card, mitigating reward hacking for BoN with pessimism, ...

Also: dictatorship eval, how misalignment shapes collective behaviors in agent communities, sycophantic chatbots cause delusional spiraling

4 days ago
1
0

emotion concepts in LLMs, predicting when RL breaks CoT monitorability, metagaming, ...

Also: alignment eval case study, preferences of models that claim to be conscious, agentic intelligence explosion, detecting multi-agent collusion

11 days ago
2
0

measuring AI R&D automation, underestimating AI capabilities, disentangling model beliefs from CoT, ...

Also: automating post-training, monitoring coding agents for misalignment, training on documents about monitoring leads to CoT obfuscation, pro-human declaration, pitfalls in evaluating interp agents

18 days ago
0
0

how well models follow constitutions, preserving safety alignment during fine tuning, ...

Also: secret knowledge elicitation, activation oracles, visual self-fulfilling alignment, posttraining benchmark

a month ago
1
0

Authors

The writers behind this newsletter.

  • Xerxes Dotiwalla

    AGI Alignment @ Google DeepMind

  • Frequently Asked Questions

    How can I access the email archive for AI Safety Papers?

    You can find recent issues that have been published by AI Safety Papers on Reletter by scrolling up to where it says Latest Issues. Tap on the link for any of the most recent emails or hit More Issues to see older ones.

    How many subscribers does AI Safety Papers have?

    To see how many people subscribe to AI Safety Papers, simply upgrade your Reletter account. We provide readership numbers and lots of other stats for this newsletter so you can decide if it's worth reaching out to.

    How can I advertise in AI Safety Papers?

    Newsletter advertising can be extremely effective when it's done right. Before you pitch AI Safety Papers as a potential sponsor or partner, make sure that you've done your research and checked its newsletter stats with Reletter.

    Then, personalize one of our winning pitching templates and send it to the right person using the contact info provided.

    How much does it cost to sponsor a publication like AI Safety Papers?

    Newsletter ad rates (or CPM) vary depending on many factors, including industry, number of subscribers, open rate, ad placement and more.

    To find out how much an ad will cost, contact AI Safety Papers using the contact information provided and ask for a copy of their media kit.

    How can I find newsletters similar to AI Safety Papers?

    Scroll up to where it says Similar Newsletters to see other publications like AI Safety Papers. You can also search our email newsletter directory to discover other newsletters that cover the topics you're interested in.

    How do I contact AI Safety Papers?

    Reletter provides this newsletter's website URL above, where you will often find their contact information. We also provide links to associated social media accounts and pitching templates so you can reach out fast.