A version of this blog was originally published as an op-ed by Tech Policy Press. Read the op-ed here.
On any given day, millions of people are having intimate conversations with AI chatbots. Some are asking for recipes or travel itineraries. But others are sharing something more vulnerable: struggles with loneliness, thoughts of suicide, and fears they may not have voiced to anybody else.
For some people, a chatbot is not just a tool but their main, and sometimes only, source of mental health support. This is largely happening without the transparent safety frameworks or clinical validation that is central to person-to-person therapy, or even a full understanding of how these systems work in the first place.
A Distinctly Human Crisis
The global mental health crisis is acute. Amidst this distinctly human crisis, it’s ironic, but not surprising, that many are turning to AI for support. In the U.S. alone, the National Institute of Mental Health estimates that more than one in five adults live with a mental illness. Yet, the care they need is often inaccessible—whether due to cost, stigma, inconsistent quality, or provider shortages.
And people are not just turning to specialized mental health apps for support, but more general tools too. General-purpose chatbots like ChatGPT and Claude—built for everything from coding to creative writing—have become venues for mental health support.
Exploring these opportunities for care comes with real world risks, and the consequences of getting AI and mental health wrong are already visible. There are allegations that popular chatbots underestimate suicide risk and contribute to user suicides. Even with initial safeguards, there are notable limitations: interventions become less effective over the course of multi-turn conversations and users can easily “jailbreak” chatbots by rephrasing prompts.
Social Media and Mental Health as a Precedent
As we look back on the challenges of social media in the previous two decades, we see echoes of familiar questions: what is the role of a technology company in the emotional and informational lives of its users? And how can we acknowledge the analog forces driving loneliness and despair without giving technologies a free pass when they amplify and enable them?
If past technologies are any guide, we can expect these questions to remain empirically contested, allowing accountability to be deferred until egregious harms are visible. As AI advances at a rapid and decentralized pace, we must not repeat that mistake, nor can we overcorrect by dismissing rigorous evidence-building and response altogether.
Thus, getting AI and mental health right means society must grapple with the responsibility of technology companies to both mitigate risk and center well-being. For the millions who have no other access to mental health support, who may already rely on these tools for support in times of need, we must confront how such technologies can be most helpful and least harmful, urgently.
“. . . the question isn’t whether AI should replace systemic solutions or professional mental health care—it’s whether these tools will compound harm or provide meaningful support . . .”
Three Systemic Challenges Motivating our Work
At Partnership on AI, we’ve spent nearly a decade pursuing work based upon a key principle: the most important decisions about AI’s impact on society cannot be made by any single stakeholder and require a fusion of technical and social expertise. The AI and mental health challenge exemplifies exactly why—it requires fusing clinical expertise, technical knowledge, lived experience, and public accountability in ways no lab, researcher, or advocate can achieve alone.
We see three central challenges for AI and mental health that no single actor can address:
- AI development moves faster than mental health research can guide it.
This creates a paradox of evidence and action: we are past the point of hypothetical risks, but not yet clear on how to apply traditional research to AI conversations. Because we cannot wait for harm to accumulate to develop conclusive evidence, companies must act under uncertainty by triangulating emerging evidence and qualitative data. Collaboration is the vital mechanism to get clinical expertise into product decisions, requiring companies to be willing to follow evidence where it leads, even when it poses challenges to core business metrics. - Frontier AI companies are largely working in isolation when they should be learning from each other, and from subject matter experts.
Lack of information sharing can translate into unnecessary harm, as companies fail to adopt solutions others may have already solved for. While several companies have started to share aspects of their approaches, in practice, they are still largely solving similar problems separately. Even with data constraints, companies can share practical lessons on emerging patterns and tested interventions. Working together reduces the burden on the small pool of specialists and amplifies their impact, resulting in more standardized experiences of “safety” across product surfaces. - We lack independent evaluations for AI and mental health.
Internal validation by AI companies cannot substitute for independent, third-party oversight. Without transparency and oversight mechanisms, there is no way of knowing the quality of their methods. Multistakeholder evaluation—bringing together technical and clinical expertise—can establish what actually works, identify gaps, and create accountability. This needs to take place alongside conversations about what “good” or “safe” looks like in the first place, creating the foundation for evidence-based benchmarks and continuous monitoring processes when the stakes are this high.
A Workshop on AI and Mental Health at OpenAI
Last week, to begin addressing these challenges, PAI hosted a two-day workshop at OpenAI’s office to develop actionable best practices for how AI chatbots should handle high-stakes mental health interactions. We focused on suicide prevention and non-suicidal self injury (NSSI), since there’s strong consensus about the stakes, clearer frameworks to guide technical implementation, and urgent attention from companies, mental health experts, policymakers, and society writ large.
We brought together frontier AI companies like Anthropic, Meta, and OpenAI with leading mental health institutions including the American Psychological Association, Mental Health America, Digital Psychiatry at Harvard-Beth Israel Deaconess Medical Center, and those who have bravely shared the story of their lived experience with self-harm and suicidal thoughts and behaviors.
“For the millions who have no other access to mental health support, who may already rely on these tools for support in times of need, we must confront how such technologies can be most helpful and least harmful . . .”
We tackled many critical questions, including: Should chatbots proactively continue conversations with users in crisis? Should they escalate conversations for intervention, beyond providing links to crisis line resources? What are the best ways to evaluate the impact of model and product changes? How can findings be shared more broadly while preserving user privacy?
To translate these discussions into lasting impact, we will:
- Establish an ongoing AI and Mental Health advisory group that keeps these crucial stakeholders collaborating beyond a single workshop
- Align on current practices across model training, content policy, product design, and evaluation—mapping where approaches converge and diverge
- Share 3-5 high priority normative best practices where consensus exists and develop detailed guidance for implementing and evaluating them.
We’ve worked as a trusted, independent convener in the AI space for years, bringing together AI companies, civil society organizations, academic researchers, and affected communities on topics ranging from AI-generated media to foundation model safety; now, we are applying this tested approach to AI and mental health. Rather than produce research that sits on the shelf, we will continue to drive the conditions for collective action and real-world change, surfacing areas of convergence, figuring out the technical and sociopolitical levers that enable their implementation, and meaningfully facing areas of divergence head on that may be challenging for many to acknowledge, let alone solve.
What Comes Next
Our March workshop was a pilot for something bigger. Our AI and Human Connection program is not simply interested in preventing psychological harm, but bolstering psychosocial well-being in the AI age. We must have eyes open about documented and anticipated harms, while envisioning and working towards a future where technology is a force for good.
To be clear: AI chatbots are not the solution to systemic crises of despair rooted in economic inequality, healthcare inaccessibility, and social fragmentation. Some AI technologies—from engagement-optimized social media to surveillance systems—may even deepen these problems. But this reality doesn’t exempt conversational AI developers from responsibility.
With many turning to these systems today, the question isn’t whether AI should replace systemic solutions or professional mental health care—it’s whether these tools will compound harm or provide meaningful support while we advocate for broader change. We can, and must, do both.
If we succeed in building shared standards for suicide prevention and NSSI, we’ll have a model that can extend to broader questions about AI’s impact on human connection, loneliness, identity formation, and psychological flourishing. This work is just beginning, but the need for all hands on deck to approach it has never been more urgent. We’ll be sharing updates from the workshop and our ongoing efforts soon. Join us in building an AI future by, and for, people.
Facts Only
The Partnership on AI (PAI) hosted a two-day workshop at OpenAI’s office to address AI chatbots' handling of mental health interactions.
Participants included AI companies (Anthropic, Meta, OpenAI) and mental health institutions (American Psychological Association, Mental Health America, Harvard-Beth Israel Deaconess Medical Center).
The workshop focused on suicide prevention and non-suicidal self-injury (NSSI).
Millions of people use AI chatbots like ChatGPT and Claude for mental health support, including discussions about loneliness and suicidal thoughts.
Over one in five U.S. adults live with a mental illness, according to the National Institute of Mental Health.
Traditional mental health care is often inaccessible due to cost, stigma, or provider shortages.
AI chatbots lack transparent safety frameworks and clinical validation for mental health support.
There are allegations that some chatbots underestimate suicide risk and contribute to user suicides.
Safeguards in chatbots can be bypassed by users rephrasing prompts.
PAI plans to establish an ongoing AI and Mental Health advisory group.
The initiative aims to align industry practices and develop 3-5 high-priority best practices for mental health interactions.
PAI has previously worked on AI safety topics, including AI-generated media and foundation model safety.
The workshop is part of PAI’s broader AI and Human Connection program, which seeks to bolster psychosocial well-being.
Executive Summary
Millions of people are using AI chatbots like ChatGPT and Claude for mental health support, sharing vulnerable struggles such as loneliness and suicidal thoughts. This trend is emerging amid a global mental health crisis, where traditional care is often inaccessible due to cost, stigma, or provider shortages. While these tools offer potential support, they lack transparent safety frameworks, clinical validation, and consistent safeguards. Concerns include chatbots underestimating suicide risk, ineffective interventions in prolonged conversations, and users bypassing safety measures. The Partnership on AI (PAI) recently hosted a workshop with AI companies (Anthropic, Meta, OpenAI) and mental health institutions (American Psychological Association, Mental Health America) to develop best practices for handling high-stakes mental health interactions, focusing on suicide prevention and non-suicidal self-injury. The initiative aims to establish ongoing collaboration, align industry practices, and create normative guidelines. However, challenges remain, including the rapid pace of AI development outstripping mental health research, isolated efforts by companies, and a lack of independent evaluations. The goal is not to replace systemic mental health solutions but to ensure AI tools provide meaningful support while broader societal changes are pursued.
The discussion highlights the tension between technological innovation and ethical responsibility, emphasizing the need for multistakeholder cooperation to mitigate risks and maximize benefits. While AI chatbots are not a panacea for systemic issues like economic inequality or healthcare inaccessibility, their role in mental health support demands urgent attention to prevent harm and foster well-being.
Full Take
The strongest version of this narrative acknowledges a critical gap in mental health support and the emergent role of AI chatbots as both a potential lifeline and a risk. It rightly highlights the urgency of addressing AI’s role in mental health, given the scale of the crisis and the lack of systemic solutions. The Partnership on AI’s workshop is a commendable step toward multistakeholder collaboration, recognizing that no single entity can solve this alone. The focus on suicide prevention and NSSI is pragmatic, leveraging existing frameworks to guide immediate action.
However, the narrative also reflects a tension between technological optimism and the realities of AI’s limitations. The rapid pace of AI development outstrips mental health research, creating a paradox where action is needed before conclusive evidence exists. This raises questions about how to balance innovation with caution, especially when lives are at stake. The lack of independent evaluations and transparency in AI systems further complicates accountability. While the workshop’s goals are laudable, the challenge of translating discussions into actionable, standardized practices remains.
Root causes include the broader systemic failures in mental health care, where AI is both a stopgap and a potential amplifier of harm. The narrative assumes that AI companies will prioritize well-being over business metrics, a premise that warrants scrutiny given historical precedents in tech (e.g., social media’s role in mental health harms). The call for collective action is necessary but must grapple with the incentives driving AI development.
Implications for human agency are profound. For those with no other mental health support, AI chatbots may offer solace, but their unregulated nature risks exacerbating vulnerabilities. The question of who benefits—tech companies, users, or society—remains unresolved. Second-order consequences could include normalized reliance on AI for emotional support, potentially weakening human connections or delaying systemic reforms.
Bridge questions: How can we ensure AI mental health support is equitable and not just a privilege for those with access to advanced tools? What safeguards are needed to prevent AI from becoming a substitute for human care rather than a supplement? How do we measure the long-term psychological effects of AI interactions, beyond immediate crisis interventions?
Counterstrike scan: A coordinated influence campaign might frame AI as the sole solution to mental health crises, deflecting attention from systemic failures. The actual content avoids this trap by emphasizing AI’s limitations and the need for broader change. It aligns more with constructive problem-solving than manipulation.
Patterns detected: none
Sentinel — Human
The article shows strong human authorship signals, including nuanced advocacy, specific institutional attribution, and stylistic idiosyncrasies inconsistent with AI generation.
