Technology

VentureBeat

venturebeat.com

Transformative tech coverage that matters

Articles100

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%
DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

The attack that hijacked Claude Code came through Sentry. Datadog, PagerDuty, and Jira have the same exposure.
The attack that hijacked Claude Code came through Sentry. Datadog, PagerDuty, and Jira have the same exposure.

Prompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers
Prompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers

Claude Code turned every engineer into three. Now companies need more product thinkers
Claude Code turned every engineer into three. Now companies need more product thinkers

New agentic memory framework uses 118K tokens per query. LangMem burns through 3.26M.
New agentic memory framework uses 118K tokens per query. LangMem burns through 3.26M.

Autonomous security agents need complete data. Here's how to check if yours is ready.
Autonomous security agents need complete data. Here's how to check if yours is ready.

OpenAI unveils GPT-5.6 Sol, Terra and Luna models — but only accessible to limited preview partners for now, per US Gov
OpenAI unveils GPT-5.6 Sol, Terra and Luna models — but only accessible to limited preview partners for now, per US Gov

Most companies think they're building a software factory. They're actually just shipping bugs faster.
Most companies think they're building a software factory. They're actually just shipping bugs faster.

Liquid AI's smallest model yet LFM2.5-230M beats models 4X its size at data extraction, can run 'anywhere'
Liquid AI's smallest model yet LFM2.5-230M beats models 4X its size at data extraction, can run 'anywhere'

OpenAI's updated GPT-5.5 Instant is better at shopping, complex constraints, and understanding user intent  — and it's already in the API
OpenAI's updated GPT-5.5 Instant is better at shopping, complex constraints, and understanding user intent  — and it's already in the API

Your enterprise AI agents should automatically remember which model is right for which task. Mindstone built the capability with Rebel
Your enterprise AI agents should automatically remember which model is right for which task. Mindstone built the capability with Rebel

Mistral launches OCR 4, turning document extraction into a full enterprise AI play
Mistral launches OCR 4, turning document extraction into a full enterprise AI play

Alibaba's model never trained as an agent — and improved agent performance across seven benchmarks
Alibaba's model never trained as an agent — and improved agent performance across seven benchmarks

Xiaomi's HarnessX rewrites its own AI scaffolding mid-task — and smaller models gain the most
Xiaomi's HarnessX rewrites its own AI scaffolding mid-task — and smaller models gain the most

Stanford researchers will discuss their agentic 'scientists' that are on course to reshape drug discovery at VB Transform 2026
Stanford researchers will discuss their agentic 'scientists' that are on course to reshape drug discovery at VB Transform 2026

How Shopify built an AI stack that doesn't care which models survive
How Shopify built an AI stack that doesn't care which models survive

Amazon will present its framework for engineering trustworthy AI agents at VB Transform 2026
Amazon will present its framework for engineering trustworthy AI agents at VB Transform 2026

Intuit will show off how it rebuilt its AI infrastructure to support fast and complex tasks at VB Transform 2026
Intuit will show off how it rebuilt its AI infrastructure to support fast and complex tasks at VB Transform 2026

OpenAI unveils first custom AI inference chip, Jalapeño, with Broadcom — and its development was sped-up with OpenAI's own models
OpenAI unveils first custom AI inference chip, Jalapeño, with Broadcom — and its development was sped-up with OpenAI's own models

Enterprise-grade AI image generation in 2 seconds is here: Krea 2 Raw and Turbo available as open weights under custom license
Enterprise-grade AI image generation in 2 seconds is here: Krea 2 Raw and Turbo available as open weights under custom license

Anthropic launches Claude Tag, replacing its Slack app with a persistent AI teammate that learns, monitors and works autonomously
Anthropic launches Claude Tag, replacing its Slack app with a persistent AI teammate that learns, monitors and works autonomously

A proof of concept forgives a fragile data path. Operational AI does not.
A proof of concept forgives a fragile data path. Operational AI does not.

Alibaba's AI video model rises to No. 2 in global rankings, as OpenAI's Sora and ByteDance's Seedance fall away
Alibaba's AI video model rises to No. 2 in global rankings, as OpenAI's Sora and ByteDance's Seedance fall away

No Claude Fable 5? No problem: Sakana achieves frontier performance with new Fugu multi-model, auto synthesis system
No Claude Fable 5? No problem: Sakana achieves frontier performance with new Fugu multi-model, auto synthesis system

Why agentic enterprises need to become learning systems
Why agentic enterprises need to become learning systems

Researchers introduce Self-Harness, a framework that lets AI agents rewrite their own rules, boosting performance up to 60%
Researchers introduce Self-Harness, a framework that lets AI agents rewrite their own rules, boosting performance up to 60%

AI hit the memory wall — now it needs a new context tier
AI hit the memory wall — now it needs a new context tier

7,000 Langflow servers are under attack. LangGraph and LangChain have the same holes
7,000 Langflow servers are under attack. LangGraph and LangChain have the same holes

Fine-tuning forgets. RAG leaks context. Hypernetworks build the model your agent needs on demand.
Fine-tuning forgets. RAG leaks context. Hypernetworks build the model your agent needs on demand.

Anthropic's Claude Code Artifacts update brings live, shared dashboards and interactive workspaces to enterprises
Anthropic's Claude Code Artifacts update brings live, shared dashboards and interactive workspaces to enterprises

New AI optimization framework beats Claude Code and Codex by 2.5x on the same compute budget
New AI optimization framework beats Claude Code and Codex by 2.5x on the same compute budget

Copilot searched your mailbox. LiteLLM handed out admin keys. Run this 5-check audit before your stack is next
Copilot searched your mailbox. LiteLLM handed out admin keys. Run this 5-check audit before your stack is next

Adobe embeds agentic AI workflows across Creative Cloud, shifting from media generation to production orchestration
Adobe embeds agentic AI workflows across Creative Cloud, shifting from media generation to production orchestration

AWS enters the context layer race with a graph that learns from agents, not manual curation
AWS enters the context layer race with a graph that learns from agents, not manual curation

Anthropic ships major Claude Design overhaul with design system imports, code round-trips, and a fix for its token-burning problem
Anthropic ships major Claude Design overhaul with design system imports, code round-trips, and a fix for its token-burning problem

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again
Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost
Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

Databricks says it solved the decades-old data pipeline problem that's been slowing AI agents
Databricks says it solved the decades-old data pipeline problem that's been slowing AI agents

Stanford's DeLM cuts multi-agent task costs 50% — without a central orchestrator
Stanford's DeLM cuts multi-agent task costs 50% — without a central orchestrator

Satya Nadella warns that AI could hollow out entire industries, echoing the damage done by globalization
Satya Nadella warns that AI could hollow out entire industries, echoing the damage done by globalization

When deep research isn't enough for your business: Sakana AI launches 'ultra deep research' agent for 100+ page reports in 8 hours
When deep research isn't enough for your business: Sakana AI launches 'ultra deep research' agent for 100+ page reports in 8 hours

85% of IT teams claim every AI agent is under control. Only 42% actually know who owns them.
85% of IT teams claim every AI agent is under control. Only 42% actually know who owns them.

Vibe coding can build your pipeline. It can't explain it six months later
Vibe coding can build your pipeline. It can't explain it six months later

Attackers scale deception with AI. Defenders need truth at machine speed.
Attackers scale deception with AI. Defenders need truth at machine speed.

MCP solved tool calling. A2A solved coordination. What solves transport?
MCP solved tool calling. A2A solved coordination. What solves transport?

Anthropic blocks all public access to Claude Fable 5, Mythos 5 following US government order — what enterprises should do
Anthropic blocks all public access to Claude Fable 5, Mythos 5 following US government order — what enterprises should do

Kimi K2.7-Code cuts thinking tokens 30% — but practitioners say the benchmarks don't check out
Kimi K2.7-Code cuts thinking tokens 30% — but practitioners say the benchmarks don't check out

Google researchers introduce 'faithful uncertainty,' allowing LLMs to offer best guesses instead of hallucinations
Google researchers introduce 'faithful uncertainty,' allowing LLMs to offer best guesses instead of hallucinations

NanoClaw and JFrog launch 'immune system' to block AI agents from downloading malicious code
NanoClaw and JFrog launch 'immune system' to block AI agents from downloading malicious code

PixelRAG beats text parsers on accuracy and cuts AI agent token costs 10x
PixelRAG beats text parsers on accuracy and cuts AI agent token costs 10x

Microsoft’s open-source SkillOpt automatically upgrades AI agent skills without touching model weights
Microsoft’s open-source SkillOpt automatically upgrades AI agent skills without touching model weights

Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks
Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks

Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit
Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit

What AI benchmarks miss about real-world performance
What AI benchmarks miss about real-world performance

Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes
Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes

Why AI that works in the lab often fails in production — and what actually fixes it
Why AI that works in the lab often fails in production — and what actually fixes it

Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark
Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark

Researchers say they trained a foundation model from scratch for about $1,500
Researchers say they trained a foundation model from scratch for about $1,500

Anthropic CEO calls for FAA-style regulation of powerful AI models: what enterprises should know
Anthropic CEO calls for FAA-style regulation of powerful AI models: what enterprises should know

MassMutual's AI strategy: 12-month contracts, 30% productivity gains, zero lock-in
MassMutual's AI strategy: 12-month contracts, 30% productivity gains, zero lock-in

Apple’s new Siri AI is more than just a smarter assistant — it's a new enterprise app layer
Apple’s new Siri AI is more than just a smarter assistant — it's a new enterprise app layer

Cohere open-sources a coding agent that runs on a single H100
Cohere open-sources a coding agent that runs on a single H100

On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.
On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.

Anthropic brings Mythos to the masses with Claude Fable 5, its most powerful generally available model ever
Anthropic brings Mythos to the masses with Claude Fable 5, its most powerful generally available model ever

Every World Cup fan deserves a seat. Norton Neo says its free browser is the ticket
Every World Cup fan deserves a seat. Norton Neo says its free browser is the ticket

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information
Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

Agentic AI solved coding — and exposed every other problem in software engineering
Agentic AI solved coding — and exposed every other problem in software engineering

When Claude changed, everything changed: Managing AI blast radius in production
When Claude changed, everything changed: Managing AI blast radius in production

Microsoft AI chief says company was “set free” from OpenAI to pursue superintelligence
Microsoft AI chief says company was “set free” from OpenAI to pursue superintelligence

Microsoft's AI Futurist explains how he uses Copilot — and the real-world problems enterprises are solving with agents
Microsoft's AI Futurist explains how he uses Copilot — and the real-world problems enterprises are solving with agents

AI agents are learning on the job — just not for your whole team
AI agents are learning on the job — just not for your whole team

Meta's AI support agent bound recovery emails for anyone who asked. Your SOC never saw an alert.
Meta's AI support agent bound recovery emails for anyone who asked. Your SOC never saw an alert.

Anthropic says 80% of its new production code is now authored by Claude — how your enterprise can keep up
Anthropic says 80% of its new production code is now authored by Claude — how your enterprise can keep up

Google's new open source Gemma 4 12B analyzes audio, video — and runs entirely locally on a typical 16GB enterprise laptop
Google's new open source Gemma 4 12B analyzes audio, video — and runs entirely locally on a typical 16GB enterprise laptop

Alibaba's Qwen3.7-Plus supports text, video and imagery inputs at low cost of $0.4/$1.6 per 1M token — but it's proprietary
Alibaba's Qwen3.7-Plus supports text, video and imagery inputs at low cost of $0.4/$1.6 per 1M token — but it's proprietary

Perplexity AI unveils hybrid local-cloud inference system at Computex 2026
Perplexity AI unveils hybrid local-cloud inference system at Computex 2026

The Agentic Reckoning: Enterprise AI organizations have a runtime problem, not a model problem — and most are building the wrong solution
The Agentic Reckoning: Enterprise AI organizations have a runtime problem, not a model problem — and most are building the wrong solution

Enterprise AI agents keep creating data silos. Microsoft's Build answer is Microsoft IQ and Rayfin.
Enterprise AI agents keep creating data silos. Microsoft's Build answer is Microsoft IQ and Rayfin.

Microsoft debuts Surface RTX Spark Dev Box to run large AI models without cloud costs
Microsoft debuts Surface RTX Spark Dev Box to run large AI models without cloud costs

Microsoft launches MXC, an OS-level sandbox for AI agents, with OpenAI and Nvidia already on board
Microsoft launches MXC, an OS-level sandbox for AI agents, with OpenAI and Nvidia already on board

OpenAI's Codex update lets agents build interactive enterprise workspaces via Sites and role-specific plugins
OpenAI's Codex update lets agents build interactive enterprise workspaces via Sites and role-specific plugins

AI agents keep giving confident wrong answers. The context layer is enterprise AI's next production problem.
AI agents keep giving confident wrong answers. The context layer is enterprise AI's next production problem.

MiniMax-M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark performance for just 5-10% of the cost
MiniMax-M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark performance for just 5-10% of the cost

Anthropic’s browser agent got hijacked 31.5% of the time before safeguards engaged
Anthropic’s browser agent got hijacked 31.5% of the time before safeguards engaged

AI doesn't break security. Complexity does
AI doesn't break security. Complexity does

Claude Mythos exposed a hard truth: Your enterprise patching process is way too slow
Claude Mythos exposed a hard truth: Your enterprise patching process is way too slow

The AI agent bottleneck isn't model performance — it's permissions
The AI agent bottleneck isn't model performance — it's permissions

MeMo's memory model lets teams upgrade their LLM without retraining it — and performance jumps 26%
MeMo's memory model lets teams upgrade their LLM without retraining it — and performance jumps 26%

Pinterest cut AI costs 90% by gutting a frontier model's vision layer
Pinterest cut AI costs 90% by gutting a frontier model's vision layer

AI agents are entering their rebuild era as enterprises confront the reliability problem
AI agents are entering their rebuild era as enterprises confront the reliability problem

Researchers automated LLM reasoning strategy design and cut token usage by 69.5%
Researchers automated LLM reasoning strategy design and cut token usage by 69.5%

Mistral AI launches Vibe, expands into industrial AI and announces data center push to challenge OpenAI
Mistral AI launches Vibe, expands into industrial AI and announces data center push to challenge OpenAI

Anthropic's Claude Opus 4.8 is here with 3X cheaper fast mode and near-Mythos level alignment
Anthropic's Claude Opus 4.8 is here with 3X cheaper fast mode and near-Mythos level alignment

How DeepSeek’s radical architecture is shattering Silicon Valley's token moat
How DeepSeek’s radical architecture is shattering Silicon Valley's token moat

Are designers the new SWEs? Figma Make's new two-way GitHub integration turns designs into live, production code — with built-in governance
Are designers the new SWEs? Figma Make's new two-way GitHub integration turns designs into live, production code — with built-in governance

SQL query logs hold the context AI agents need to stop hallucinating joins
SQL query logs hold the context AI agents need to stop hallucinating joins

Control within connection: How data sovereignty is rewriting the rules of critical infrastructure
Control within connection: How data sovereignty is rewriting the rules of critical infrastructure

MiniMax teases upcoming M3 model with new sparse attention mechanism and 15.6X long-context response speed boost
MiniMax teases upcoming M3 model with new sparse attention mechanism and 15.6X long-context response speed boost

Merck and Mastercard are seeing real agentic AI results. Both say the plumbing came first.
Merck and Mastercard are seeing real agentic AI results. Both say the plumbing came first.

DataGrail report finds your vendor may be sending data to AI models you never approved
DataGrail report finds your vendor may be sending data to AI models you never approved