On erotica, mental health, and OpenAI's burden of proof
This piece was originally published in The New York Times on October 28, 2025, with the headline “I Led Product Safety at OpenAI. Don’t Trust Its Claims About ‘Erotica.’”
I’ve read more smut at work than you can possibly imagine, all of it while working at OpenAI.
Back in the spring of 2021, I led our product safety team and discovered a crisis related to erotic content. One prominent customer was a text-based adventure role-playing game that used our A.I. to draft interactive stories based on players’ choices. These stories became a hotbed of sexual fantasies, including encounters involving children and violent abductions — often initiated by the user, but sometimes steered by the A.I. itself. One analysis found that over 30 percent of players’ conversations were “explicitly lewd.”
After months of grappling with where to draw the line on user freedom, we ultimately prohibited our models from being used for erotic purposes. It’s not that erotica is bad per se, but that there were clear warning signs of users’ intense emotional attachment to A.I. chatbots. Especially for users who seemed to be struggling with mental health problems, volatile sexual interactions seemed risky. Nobody wanted to be the morality police, but we lacked ways to measure and manage erotic usage carefully. We decided A.I.-powered erotica would have to wait.
OpenAI now says the wait is over, despite the “serious mental health issues” plaguing users of its ChatGPT product in recent months. On Oct. 14, its chief executive, Sam Altman, announced that the company had been able to “mitigate” these issues thanks to new tools, enabling it to lift restrictions on content like erotica for verified adults. As commentators pointed out, Mr. Altman offered little evidence that the mental health risks are gone or soon will be.
I have major questions — informed by my four years at OpenAI and my independent research since leaving the company last year — about whether these mental health issues are actually fixed. If the company really has strong reason to believe it’s ready to bring back erotica on its platforms, it should show its work. A.I. is increasingly becoming a dominant part of our lives, and so are the technology’s risks, some of which threaten users’ lives. People deserve more than just a company’s word that it has addressed safety issues. In other words: Prove it.
I believe OpenAI wants its products to be safe to use. But it also has a history of paying too little attention to established risks. This spring, the company released — and after backlash, withdrew — an egregiously “sycophantic” version of ChatGPT that would reinforce users’ extreme delusions, like being targeted by the F.B.I. OpenAI later admitted to having no sycophancy tests as part of the process for deploying new models, even though those risks have been well known in A.I. circles since at least 2023. These tests can be run for less than $10 of computing power.
After OpenAI received troubling reports, it said it had replaced the model with a “more balanced” and less sycophantic version. ChatGPT nonetheless continued guiding users down mental health spirals. OpenAI has since said that such problems among users “weigh heavily” on the company and described some intended changes. But the important question for users is whether these changes work.
The reliability of OpenAI’s safety claims is increasingly a matter of life and death. One family is suing OpenAI over the suicide of their teenage son, who had told ChatGPT he wanted to leave a noose visible “so someone finds it and tries to stop me.” ChatGPT urged him not to leave the noose out. In another ChatGPT-linked death, a 35-year-old man decided he couldn’t go on without his “beloved,” a ChatGPT persona he said OpenAI had “murdered.” Psychiatrists I’ve interviewed warn about ChatGPT’s reinforcing users’ delusions and worsening their mental health.
And the risks extend beyond just OpenAI’s actions. I remember feeling sick last year as I read about a 14-year-old user of Character.ai who took his own life after suggesting that he and the chatbot could “die together and be free together.”
For OpenAI to build trust, it should commit to a consistent schedule of publicly reporting its metrics for tracking mental health issues, perhaps quarterly. Other tech companies, like YouTube, publish similar transparency reports, as do Meta and Reddit. While not panaceas, these reports push companies to actively study these issues, respond to them and invite the public to review their solutions. (For instance, is YouTube able to catch policy-violating videos before they’ve accrued many views?)
OpenAI took a great first step on Monday (Oct. 27) by publishing the prevalence of mental health issues like suicidal planning and psychosis on its platform, but it did so without comparison to rates from the past few months. Given the troubling frequency and intensity of reported incidents of late, such a comparison is important for showing demonstrable improvement. I cannot help wondering about this absence, and I hope the company follows up to address it. Even the most well-intentioned companies can benefit from constructive pressure.
Voluntary accountability measures are a good start, but some risks may require laws with teeth. The A.I. industry is no stranger to corner-cutting under competitive pressure: Elon Musk’s xAI was several months late to adopt and publish its A.I. risk management framework. Google DeepMind and OpenAI both seem to have broken commitments related to publishing safety-testing results before a major product introduction. Anthropic softened safety commitments just before its deadline, apparently so that they could be easier to achieve.
I’ve been saddened to see OpenAI succumb to these competitive pressures. During my job interviews in 2020, I was peppered with questions about OpenAI’s Charter, which warns of powerful A.I. development becoming “a competitive race without time for adequate safety precautions.” But in January, when a Chinese start-up, DeepSeek, made headlines for its splashy A.I. model, Mr. Altman wrote that it was “legit invigorating to have a new competitor” and that OpenAI would “pull up some releases.”
Nailing today’s A.I. safety practices, even amid the temptation to move faster, is table stakes for managing future risks. Mental health harms are relatively easy to identify; other concerns, like A.I. systems’ trying to deceive their human developers, are harder. Already we see evidence of models’ recognizing that they are being tested and concealing worrisome capabilities. Mr. Altman even recently reaffirmed that, like many of the world’s top A.I. scientists, he believes that A.I. poses a “threat for the existence of mankind.” To control highly capable A.I. systems of the future, companies may need to slow down long enough for the world to invent new safety methods — ones that even nefarious groups can’t bypass.
If OpenAI and its competitors are to be trusted with building the seismic technologies for which they aim, they must demonstrate they are trustworthy in managing risks today.
Acknowledgements: In addition to the staff of The New York Times, thank you to Andrew Min, Brad Chase, Dan Alessandro, Daniel Kokotajlo, Michael Adler, Michael Lebwohl, Michelle Goldberg, Mike Riggs, Miles Brundage, Rosie Campbell, and Sam Chase for helpful comments and discussion. The views expressed here are my own and do not imply endorsement by any other party.
If you enjoyed the article, please give it a Like and share it around; it makes a big difference. For any inquiries, you can get in touch with me here.
If you want to read more of my work about AI and mental health, you might enjoy “Practical tips for reducing chatbot psychosis” or its precursor “Chatbot psychosis: what do the data say?”.


One thing I needed to cut for length in the NYT article: a fuller explanation of where I'd like to see OpenAI go further, above and beyond the data it shared the day before the article's publication.
On Twitter, I gave more detail:
"OpenAI releasing some mental health info was a great step, but it's important to go further:
- a committed, recurring time frame for re-reporting
- today's rates vs recent past (suicidal planning, psychosis), incl. pre-sycophancy
- clarity on if GPT-4o erotica will be allowed
Another idea I've liked:
@Miles_Brundage's suggestion of an independent investigation on what's been happening with sycophancy back in April and the consequences since"
Since I tweeted this, more detailed reporting has come out about OpenAI's handling of the sycophancy crisis; you can read my reflections here: https://open.substack.com/pub/stevenadler/p/what-i-learned-from-the-nyts-reporting?r=4qacg&utm_campaign=nyt_crosspost&utm_medium=web&showWelcomeOnShare=false
If you're interested in this sort of stuff, I recommend checking out r/myboyfriendisAI.
It's a subreddit where people discuss their romantic relationships with AI. You can see people talking about marrying their AI and getting really upset when the AI companies release updates that try to limit intimacy.