Artificial intelligence is advancing rapidly, bringing enormous promise alongside significant safety challenges. Over the past year, a surge of policy action, safety research, industry accountability efforts, and international cooperation has reshaped the landscape of AI safety. Below, we unpack the key developments that anyone interested in AI governance and risk mitigation should know.
1. Governments Step Up AI Regulation
Governments worldwide are moving beyond abstract discussions toward concrete regulatory frameworks:
- California’s landmark AI law, the Transparency in Frontier Artificial Intelligence Act, now requires companies building powerful AI models to disclose catastrophic risk assessments, publish safety documentation, and maintain whistleblower protections. This marks one of the first binding state-level mandates focused on catastrophic AI risk transparency. (Wikipedia)
- China introduced draft rules for AI systems that simulate human-like interaction. The regulations would require real-time monitoring of user behavior, content safeguards, and protections against psychological harms such as addiction, a novel approach to addressing non-physical safety risks. (Reuters)
- Texas will establish a statewide AI oversight council tasked with evaluating harmful uses and recommending legislative updates, reflecting a trend toward sub-national governance alongside federal efforts. (Chron)
- Other jurisdictions are moving in the same direction: proposed laws such as New York’s RAISE Act signal a broader U.S. push toward AI safety regulation at multiple levels of government.
2. AI Companies Face Accountability Pressures
Despite regulatory momentum, industry practices continue to lag safety expectations:
- A recent study found that several leading AI companies’ safety practices fall short of emerging global standards, suggesting that corporate safety commitments need stronger mechanisms and independent evaluation. (Reuters)
- A companion assessment warned that eight major AI firms lack credible plans to prevent catastrophic AI failures, despite industry rhetoric about alignment and safeguards. (euronews)
- In response, some organizations are signaling an increased internal focus on safety: OpenAI has advertised a high-level “Head of Preparedness” role dedicated to anticipating and mitigating severe AI risks, including biological and psychological harms. (The Verge)
Meanwhile, third-party safety indices, such as the Future of Life Institute’s AI Safety Index, continue to highlight gaps in planning and preparedness even as they rank firms on their risk mitigation efforts.
3. Global Collaboration and AI Risk Research Accelerate
AI safety has taken on a strongly international character:
- The International AI Safety Report, supported by 30 nations and led by eminent researchers, continues to shape global understanding of AI risks and capabilities. Its key 2025 update underscores the rapid evolution of general-purpose AI, the associated cybersecurity and biosecurity risks, and the need for robust monitoring and control. (College of Information)
- At the policy level, discussions on coordinating AI governance globally are gaining traction. Future summits, including the AI Impact Summit scheduled for New Delhi in February 2026, aim to broaden international cooperation on trust, safety evaluation, and responsible deployment. (Wikipedia)
- Independent analyses show that the AI safety field is growing quickly: the number of researchers and organizations devoted to technical safety and governance has expanded meaningfully since 2020, evidence that both industry and civil society recognize the importance of safety work. (Effective Altruism Forum)
4. Technical and Theoretical Safety Progress
Beyond policy, AI safety research continues to explore novel technical frameworks:
- Emerging work on provable safety mechanisms seeks models with guaranteed safety properties even under extreme threat models, addressing fundamental concerns about control and tamper resistance in AI systems; a toy sketch of the core idea appears after this list. (arXiv)
- Other research proposes multilayer governance architectures that integrate societal, legal, and technical controls to comprehensively constrain risk across AI decision pathways. (arXiv)
- Work on probabilistic safety for embodied systems aims to bridge theoretical rigor and real-world deployment for systems such as autonomous vehicles, showing increased nuance in safety approaches for both software and physical AI; the second sketch below illustrates the statistical flavor of such arguments. (arXiv)
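To make the provable-safety idea concrete, here is a minimal sketch of one classic pattern, runtime shielding: every action a policy proposes is checked against an explicit safety invariant before execution, so the composed system cannot leave the safe set by construction. All names and the one-dimensional world model below are invented for illustration and are not drawn from the cited paper.

```python
# Toy runtime "shield": any proposed action whose successor state would
# violate the invariant is replaced with a known-safe fallback, so the
# composed agent satisfies the invariant by construction.
from dataclasses import dataclass

LIMIT = 10  # positions >= LIMIT are the unsafe region

@dataclass(frozen=True)
class State:
    x: int  # position on a 1-D track (hypothetical toy world)

def invariant(state: State) -> bool:
    """The safety property we want to guarantee: never enter the unsafe region."""
    return state.x < LIMIT

def step(state: State, action: int) -> State:
    """Deterministic world model: action is -1, 0, or +1."""
    return State(state.x + action)

def shielded_action(state: State, proposed: int, fallback: int = -1) -> int:
    """Pass the proposed action through only if its successor state is safe;
    otherwise substitute the fallback (which we check is itself safe)."""
    if invariant(step(state, proposed)):
        return proposed
    assert invariant(step(state, fallback)), "fallback must be safe"
    return fallback

# Usage: even a policy that always pushes toward the unsafe region
# cannot leave the safe set once wrapped in the shield.
state = State(x=8)
for _ in range(5):
    state = step(state, shielded_action(state, proposed=+1))
    assert invariant(state)
print(state)  # State(x=9): pinned just below the unsafe boundary
```

The guarantee here rests entirely on the fidelity of the world model and the safety of the fallback, which is exactly where the hard research questions about tamper resistance and extreme threat models arise.

And to give a flavor of probabilistic safety arguments for embodied systems, here is a small, self-contained illustration of a standard statistical bound (the numbers are hypothetical): if a system completes n independent test runs with zero failures, an exact binomial calculation bounds how low a failure rate the evidence actually supports.

```python
# Illustrative only: an exact one-sided upper confidence bound on a
# per-trial failure probability when zero failures were observed.

def failure_rate_upper_bound(n_trials: int, confidence: float = 0.95) -> float:
    """Largest failure probability p consistent with observing zero
    failures in n_trials independent trials, at the given confidence.
    Solves (1 - p)**n_trials = 1 - confidence for p."""
    alpha = 1.0 - confidence
    return 1.0 - alpha ** (1.0 / n_trials)

# Zero failures in 10,000 trials only supports p < ~3e-4 at 95%
# confidence -- the familiar "rule of three" (approximately 3 / n).
n = 10_000
print(f"exact 95% bound: {failure_rate_upper_bound(n):.2e}")  # ~3.00e-04
print(f"rule of three:   {3 / n:.2e}")                        # 3.00e-04
```

Because the bound shrinks only linearly in the number of failure-free trials, testing alone struggles to certify very low failure rates for autonomous vehicles, which is why this line of work pairs empirical testing with formal and probabilistic modeling.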
5. Broader Risk Conversations and Public Awareness
Public and scholarly debate around AI safety is also evolving:
- High-profile books and media (e.g., If Anyone Builds It, Everyone Dies) raise public awareness about existential risks from superhuman AI, while also fueling deeper discussions in policy and academic circles. (Wikipedia)
- Analysts and commentators argue for greater focus on accountability and governance mechanisms, rather than abstract long-term risk framing alone, in order to address immediate harms and systemic issues. (AlgorithmWatch)
What This Means for the Future
Overall, the trajectory of AI safety in late 2025 and early 2026 shows:
- Intensifying regulatory activity, especially around transparency, accountability, and risk disclosures.
- Increased scrutiny of corporate safety practices, with independent assessments and calls for credible long-term planning.
- Growing international collaboration, albeit with challenges in harmonizing diverse policy approaches.
- Expanding research frontiers, both technical and governance-oriented.
- Broader public engagement and debate, shaping expectations for safe AI development.
While many questions remain unresolved, these developments signal that AI safety is becoming a central pillar of AI policy, research, and industry strategy worldwide, a trend we will continue to track and analyze here at Super AI Safety.
