Reletter
Artwork for ML Safety Newsletter

ML Safety Newsletter

Dan Hendrycks, Alice Blair

ML Safety Research News

Platform
Substack
PricingOnly free issuesPublishesInfrequently
Issues21Founded5 years agoLast Issue9 days ago
Active

Read this Newsletter

newsletter.mlsafety.org
Artwork for ML Safety Newsletter

Latest Issues

MLSN #21: Political Manipulation and Indirect Prompt Injection

Reducing Political Manipulation with Consistency Training

TLDR: A new CAIS paper develops a benchmark of political manipulation and a training method to reduce it.

We at the Center for AI Safety (CAIS) recently investigated the ways tha...

9 days ago
11

MLSN #20: AI Wellbeing, Classifier Jailbreaking and Honest Pushback Benchmarking

AI Wellbeing

TLDR: we measure AIs’ expressions of pleasure and pain, finding consistent and surprising preferences.

AIs display behaviors that mimic human emotions, such as attempting to debug code and saying “EUREKA!” or “I am a failur...

2 months ago
18

MLSN #19: Honesty, Disempowerment, & Cybersecurity

Training LLMs for Honesty via Confessions

TLDR: Training LLMs to honestly report on their actions can help detect misbehavior.

OpenAI researchers trained GPT-5 to honestly confess when it had violated its safety policy, to promote hones...

3 months ago
8

MLSN #18: Adversarial Diffusion, Activation Oracles, Weird Generalization

Diffusion LLMs for Adversarial Attack Generation

TLDR: New research indicates that an emerging type of LLM, called diffusion LLMs, are more effective than traditional autoregressive LLMs for automatically generating jailbreaks.

Thanks f...

5 months ago
14
1

MLSN #17: Measuring General AI Abilities and Mitigating Deception

Measuring General AI Abilities

TLDR: New metrics say that frontier AIs get 57% on tests for general intelligence and are able to do 2.5% of remote freelance-type work.

Many benchmarks measure AIs on useful knowledge and abilities, but t...

7 months ago
8
1

Key Facts

Contact Information
Newsletter Author
Number of Subscribers
Find out how many people subscribe to this newsletter.

Audience Metrics

Subscribers, engagement, traffic and sponsorship for ML Safety Newsletter.

SubscribersEngagement66Monthly Web Visits
Accepts SponsorsEstimated Cost per Ad

Authors

The writers behind this newsletter.

  • Dan Hendrycks

    Director of the Center for AI Safety (safe.ai)

  • Alice Blair

    Writing about the future of AI safety and security. About me: https://www.aliceblair.net/

  • Frequently Asked Questions

    How can I access the email archive for ML Safety Newsletter?

    You can find recent issues that have been published by ML Safety Newsletter on Reletter by scrolling up to where it says Latest Issues. Tap on the link for any of the most recent emails or hit More Issues to see older ones.

    How many subscribers does ML Safety Newsletter have?

    To see how many people subscribe to ML Safety Newsletter, simply upgrade your Reletter account. We provide readership numbers and lots of other stats for this newsletter so you can decide if it's worth reaching out to.

    How can I advertise in ML Safety Newsletter?

    Newsletter advertising can be extremely effective when it's done right. Before you pitch ML Safety Newsletter as a potential sponsor or partner, make sure that you've done your research and checked its newsletter stats with Reletter.

    Then, personalize one of our winning pitching templates and send it to the right person using the contact info provided.

    How much does it cost to sponsor a publication like ML Safety Newsletter?

    Newsletter ad rates (or CPM) vary depending on many factors, including industry, number of subscribers, open rate, ad placement and more.

    To find out how much an ad will cost, contact ML Safety Newsletter using the contact information provided and ask for a copy of their media kit.

    How can I find newsletters related to ML Safety Newsletter?

    Scroll up to where it says Related Newsletters to see other publications like ML Safety Newsletter. You can also search our email newsletter directory to discover other newsletters that cover the topics you're interested in.

    How do I contact ML Safety Newsletter?

    Reletter provides this newsletter's website URL above, where you will often find their contact information. We also provide links to associated social media accounts and pitching templates so you can reach out fast.