Back to home

Tag

AI security

AI security covers the risks around models, apps, and infrastructure: jailbreaks, prompt injection, data leakage, and automated vulnerability testing. For developers, it matters because deployment now depends on clear evaluation, permission boundaries, and attack-surface control.

19 articles

IBM, Red Hat pledge $5B for open source AI security
Industry News/Jun 1

IBM, Red Hat pledge $5B for open source AI security

IBM and Red Hat are launching Project Lightwell, a $5 billion push to secure open source software with AI and 20,000 engineers.

How to Secure AI Assistants End to End
AI Agent/May 28

How to Secure AI Assistants End to End

Set up data-layer controls, encryption, and audit logs to reduce AI assistant security risk.

IBM adds Anthropic-backed AI security push
Tools & Apps/May 22

IBM adds Anthropic-backed AI security push

IBM expanded AI security services and teamed with Anthropic on Project Glasswing to help secure open-source software in critical infrastructure.

Agentic AI turns autonomy into a security problem
AI Agent/May 19

Agentic AI turns autonomy into a security problem

A developer’s breakdown of Forbes’ agentic AI hub, with a copy-ready governance template for agents, drift, and authority control.

Microsoft’s MDASH finds 16 Windows flaws
Research/May 18

Microsoft’s MDASH finds 16 Windows flaws

Microsoft’s MDASH AI found 16 Windows flaws, including four critical RCEs, and will enter private preview for enterprises in June.

Yakovenko Warns AI Could Crack PQC Wallets
Blockchain & Web3/May 8

Yakovenko Warns AI Could Crack PQC Wallets

Solana co-founder Anatoly Yakovenko says AI may break post-quantum signature schemes before blockchains finish migrating.

MCP flaw may expose 150 million downloads
Research/May 6

MCP flaw may expose 150 million downloads

Ox Security says an MCP design flaw could expose 150 million downloads and up to 200,000 vulnerable instances.

AI Finds Nine-Year Linux Kernel Zero-Day
Research/May 5

AI Finds Nine-Year Linux Kernel Zero-Day

A researcher used AI tooling to find Copy Fail, a Linux kernel zero-day present since 2017 and rated CVSS 7.8.

Anthropic’s Mythos Model Triggers Security Panic
Model Releases/Apr 24

Anthropic’s Mythos Model Triggers Security Panic

Anthropic’s Mythos reportedly finds software flaws fast enough to worry governments, banks, and grid operators worldwide.

AVISE tests AI security with modular jailbreak evals
Research/Apr 23

AVISE tests AI security with modular jailbreak evals

AVISE is an open-source framework for finding AI vulnerabilities, with a 25-case jailbreak test that flagged all nine models as vulnerable.

Mythos, Anthropic’s unreleased AI model, explained
Research/Apr 21

Mythos, Anthropic’s unreleased AI model, explained

Anthropic says Mythos is too dangerous to ship. Here’s what its 73% hacking score, 31-point math gain, and limited rollout mean.

Altman Attack Suspect Named Other AI Leaders
Industry News/Apr 18

Altman Attack Suspect Named Other AI Leaders

Federal filings say the suspect carried an anti-AI note naming CEOs and investors after the Molotov attack on Sam Altman’s home.

Anthropic’s Mythos Preview Raises the Cyber Stakes
Industry News/Apr 14

Anthropic’s Mythos Preview Raises the Cyber Stakes

Anthropic’s new Mythos Preview is being tested with Apple, Google, Microsoft, and 45+ firms to probe AI’s cyber risks.

UK regulators assess Anthropic model risks
Industry News/Apr 14

UK regulators assess Anthropic model risks

UK regulators are reviewing Anthropic’s latest model with the NCSC after FT reporting raised concerns about critical IT system vulnerabilities.

Anthropic Accidentally Exposes Claude Agent Code
Tools & Apps/Apr 2

Anthropic Accidentally Exposes Claude Agent Code

Anthropic accidentally exposed internal code for Claude’s coding assistant, raising fresh questions about how the company protects its own tools.

Openclaw Flaw Exposes AI Admin Hijack Risk
Blockchain & Web3/Apr 1

Openclaw Flaw Exposes AI Admin Hijack Risk

Certik says Openclaw’s flaws expose 135,000+ instances, token theft, and admin takeover risk, with CVE-2026-25253 leading the list.

Anthropic Leak Exposes Mythos Model Details
Model Releases/Mar 29

Anthropic Leak Exposes Mythos Model Details

Anthropic exposed draft assets and Mythos model details in a public cache, showing how one CMS setting can spill thousands of files.

AI in 2026: Trends Poised to Transform Industries
Industry News/Mar 26

AI in 2026: Trends Poised to Transform Industries

By 2026, AI will actively join discovery processes in physics, chemistry, and biology, moving beyond summarizing papers and answering questions.

SurePath AI's New MCP Policy Controls Enhance AI Security
Tools & Apps/Mar 26

SurePath AI's New MCP Policy Controls Enhance AI Security

SurePath AI introduces MCP Policy Controls, providing real-time governance over AI interactions to enhance security and oversight.