Featured
Ashwagandha
Ashwagandha Industry Calls for Science-Led Review to Protect Farmers, Ayurveda, and India’s Nutraceutical Growth
One Platform. Thirty-Five Countries. Zero Investors. The ExamOnline Story That Indian Business Media Has Ignored for 17 Years.-TBt
One Platform. Thirty-Five Countries. Zero Investors. The ExamOnline Story That Indian Business Media Has Ignored for 17 Years.
Indraprastha
Indraprastha Institute of Information Technology, Delhi, launches Post Graduate Diploma Programme in Interaction Design & UX
May 22, 2026
The Blunt Times The Blunt Times
  • National
  • City Events
  • Business Vibes
  • Education
  • Entertainment
  • Regional
    • Bharuch
    • Dang
    • Navsari
    • Surat
    • Valsad
    • Hindi
    • Gujarati
  • Health
  • Crime corner
  • Sports
  • Spotlight
Search the Site
Popular Searches:
Chatgpt Nasa Halloween
Recent Posts
Ravi Raj Desai SGCCI vice president election, the blunt times
Ravi Raj Desai Pitches Youth-Driven Vision for SGCCI Vice President Post
May 22, 2026
Daman airport inauguration June 2026, the blunt times
Daman Set for Major Infrastructure Leap as PM Modi Likely to Inaugurate Airport and Rs.250 Crore Hospital
May 22, 2026
Cockroach Janta Party Malware APK Targets Android Users Through WhatsApp and Telegram, Warns TraceX Labs
May 22, 2026
The Blunt Times The Blunt Times
  • National
  • City Events
  • Business Vibes
  • Education
  • Entertainment
  • Regional
    • Bharuch
    • Dang
    • Navsari
    • Surat
    • Valsad
    • Hindi
    • Gujarati
  • Health
  • Crime corner
  • Sports
  • Spotlight
Follow us
Home/Business/Guardrails for the Frontier: How AI Safety is Actually Being Built
Business

Guardrails for the Frontier: How AI Safety is Actually Being Built

In January 2026, Dario Amodei wrote a 20,000 word essay that made waves across the internet. The CEO of Anthropic, one of the tech giants and leaders in AI, has been openly talking about safety...

TBT NEWS SERVICE
May 2, 2026 3 Min Read

In January 2026, Dario Amodei wrote a 20,000 word essay that made waves across the internet. The CEO of Anthropic, one of the tech giants and leaders in AI, has been openly talking about safety issues and risks associated with this emerging technology. In his essay, The Adolescence of Technology, Dario wrote in depth about various anticipated risks and the need for private organizations and governments to work together in forming policies, laws, and systems to mitigate these risks. He also took a dialectical approach, arguing that the positive impact of AI could far outweigh the risks associated with it.

In the past two years, with the speed of AI development, we have seen governments and private organizations take action through reforms and internal processes to control both current and anticipated threats. The most common approach is to identify, evaluate, and mitigate these risks.

How we categorize and assess AI risks

The most comprehensible threats associated with AI today are Biological/Chemical, Cybersecurity, Manipulation, and Model Autonomy.

● Biological and Chemical: This includes nuclear and radiological hazards where AI enables a non-expert to develop known or unknown biological threats.
● Cybersecurity: Offense can be done by enhancing the effectiveness of human attackers or through autonomous cyberattacks executed by AI systems from start to finish.
● Manipulation: This refers to the ability of AI systems to influence human beliefs, potentially influencing individuals to act against their own interests which could implicate democratic processes, social stability, and information integrity.
● Model Autonomy: The ability of AI systems to act independently, where they could replicate, survive, or conduct research to improve their own capabilities.

Then there are unknown risks that we may not comprehend or anticipate right now but may arise in the future. These threats are achievable when adversaries (individuals or well-resourced organizations) get unauthorized access to model weights or misuse the technology to exploit vulnerabilities.

Currently, most of the leading developers are setting multiple security thresholds depending on the model, an alarm is raised if a threshold is exceeded. Anthropic introduced ASL (AI Safety Levels), where each level requires specific safeguards. Google DeepMind uses Critical Capability Levels (CCLs) to represent points where AI may pose heightened risks. OpenAI tracks risks through defined categories with a gradation scale ranging from low to critical.

How we move from risk to regulation

The EU was the first to introduce the “EU AI Act”, governing AI development based on risk categories: unacceptable, high, limited, and minimal. In the US, the New York legislation introduced the Responsible AI Safety and Education Act (RAISE Act) to govern “frontier models.” California also introduced the Transparency in Frontier Artificial Intelligence Act (TFAIA) in Sep’25, targeting large developers for accountability. Both provide whistleblower protections and include significant financial penalties for non-submission of reports or disclosure of risks. These frameworks focuses majorly on frontier models that are more prone to systemic risks.

In the private sector, tech giants have taken regulation into their own hands. Google DeepMind has the Frontier Safety Framework, Anthropic regularly updates its Responsible Scaling Policy (RSP), and OpenAI has its Preparedness Framework. While distinct, they all share common steps: identifying, evaluating, mitigating, and governing risks. Currently, companies use methods like red teaming to stress-test models at different levels of development and deployment.

Securing access to model weights is one of the most critical safety norms among AI developers. Other key policies include the reporting of risks, rigorous third-party audits, and the tracking of incidents and mitigation for future reference. These frameworks ensure that crossing risk thresholds triggers immediate, non-discretionary actions such as halting deployment or hardening physical security. However, these regulations need to move from unilateral, company-led measures toward a coordinated multilateral ecosystem, where transparency and shared information flow among all stakeholders ensures that AI progress does not outpace our collective ability to control it.

About Author SHRUTI RAJVANSHI, Associate Director | Market Xcel

Shruti Rajvanshi is an Associate Director at Market Xcel and a postgraduate student at the Georgia Institute of Technology (MS, Human-Computer

Interaction). She holds a Bachelor’s degree in Computer Science from the University of Delhi and an MBA in Marketing and Finance. With over a decade of experience across analytics and business strategy, she has independently built AI-driven applications end-to-end using large language models, and has worked across emerging technology ecosystems including cryptocurrencies and NFTs. Her work reflects a strong, hands-on engagement with how technology is being built and deployed in real-world systems.

Share Article

Dr. Dhaval Naik of Marengo CIMS Hospital Honoured with Gujarat Garima Award for Excellence in Cardiac Care -PNn
Previous Post

Dr. Dhaval Naik of Marengo CIMS Hospital Honoured with Gujarat Garima Award for Excellence in Cardiac Care

Shahid Smriti Van-PNn
Next Post

‘Shahid Smriti Van’ Validated as Key Pollution Mitigator in National Study

Picked
TradeFlock Unveils 10 Best HR Leaders in India 2026, Recognising People-Centric Transformation-TBT
TradeFlock Unveils 10 Best HR Leaders in India 2026, Recognising People-Centric Transformation
Ashwagandha
Ashwagandha Industry Calls for Science-Led Review to Protect Farmers, Ayurveda, and India’s Nutraceutical Growth
One Platform. Thirty-Five Countries. Zero Investors. The ExamOnline Story That Indian Business Media Has Ignored for 17 Years.-TBt
One Platform. Thirty-Five Countries. Zero Investors. The ExamOnline Story That Indian Business Media Has Ignored for 17 Years.
Indraprastha
Indraprastha Institute of Information Technology, Delhi, launches Post Graduate Diploma Programme in Interaction Design & UX
Capital India
Capital India Finance AUM Grows 22% to Rs 1,227 Crore in FY26; PAT Rises 243%
Gujarat smart meter electricity discount 2026, the blunt times
Gujarat smart meter consumers save crores under new electricity discount
Popular Posts
Capital India
Capital India Finance AUM Grows 22% to Rs 1,227 Crore in FY26; PAT Rises 243%
By TBT Online Desk
Gujarat smart meter electricity discount 2026, the blunt times
Gujarat smart meter consumers save crores under new electricity discount
By Times News Network
Ahmedabad airport ganja seizure 20 kg drug bust, the blunt times
Rs.20 Crore Ganja Seized at Ahmedabad Airport in Major Smuggling Crackdown
By Times News Network
Shiprocket
Shiprocket Launches Appointment-Based Delivery for Quick Commerce with 98% On-Time Adherence
By TBT Online Desk
Jitendra Vaswani
“The Window to Adapt Is Closing”: Jitendra Vaswani on AI and the Future of Jobs
By TBT Online Desk
Brokerages See Up To 48% Upside On PNC Infratech Post Q4FY26; Execution Recovery, Order Book Visibility Key Triggers; Stock Surges 34% Since FY27-PNn
Brokerages See Up To 48% Upside On PNC Infratech Post Q4FY26; Execution Recovery, Order Book Visibility Key Triggers; Stock Surges 34% Since FY27
By TBT Online Desk

Read Next

VYNA Electric Scales B2B Distribution Network to 100+ Partners in Six Months, Accelerating Expansion in India's Consumer Electrical Market-TBT
Business
VYNA Electric Scales B2B Distribution Network to 100+ Partners in Six Months, Accelerating Expansion in India’s Consumer Electrical Market
May 22, 2026
2 Min Read
TradeFlock Unveils 10 Best HR Leaders in India 2026, Recognising People-Centric Transformation-TBT
Business
TradeFlock Unveils 10 Best HR Leaders in India 2026, Recognising People-Centric Transformation
May 22, 2026
3 Min Read
One Platform. Thirty-Five Countries. Zero Investors. The ExamOnline Story That Indian Business Media Has Ignored for 17 Years.-TBt
Business
One Platform. Thirty-Five Countries. Zero Investors. The ExamOnline Story That Indian Business Media Has Ignored for 17 Years.
May 22, 2026
3 Min Read
Capital India
Business
Capital India Finance AUM Grows 22% to Rs 1,227 Crore in FY26; PAT Rises 243%
May 22, 2026
2 Min Read
The Blunt Times

The Blunt Times is a 24-hour news portal from Surat and south Gujarat. It was launched by senior journalist Melvyn Thomas, who has over 21 years of experience working with the top news organizations such as The Indian Express, The Times of India, and The Economic Times.

Popular
TradeFlock Unveils 10 Best HR Leaders in India 2026, Recognising People-Centric Transformation
May 22, 2026
Ashwagandha Industry Calls for Science-Led Review to Protect Farmers, Ayurveda, and India’s Nutraceutical Growth
May 22, 2026
One Platform. Thirty-Five Countries. Zero Investors. The ExamOnline Story That Indian Business Media Has Ignored for 17 Years.
May 22, 2026
Indraprastha Institute of Information Technology, Delhi, launches Post Graduate Diploma Programme in Interaction Design & UX
May 22, 2026
Categories
City Events
National
Business Vibes
Lifestyle
Spotlight
Regional
Education
Entertainment
Health
Press Release
Trending
Sports

© 2026 All Rights Reserved, The Blunt Times

  • Terms of Service
  • Privacy Policy