Anthropic PBC is doubling down on artificial intelligence safety with the release of a new open-source tool that uses AI agents to audit the behavior of large language models. It’s designed to ...
The company claims 6,000 internal developers within IBM have used Project Bob, achieving an average productivity gain of 45% and a 22-43% increase in code commits . There is no shortage of AI-powered ...
Manny Medina, previously best-known as the founder of sales automation startup Outreach ($4.4 billion valuation), has wowed investors with his young startup, Paid.
Claude Sonnet 4.5 achieved top scores on the SWE-bench Verified evaluation, which tests real-world software coding skills.
Google's Gemini 2.5 Computer Use model is a new AI agent that can autonomously browse the web and interact with UIs.
CodeMender aims to help developers keep pace with AI-powered vulnerability discovery by automatically patching security flaws.
Microsoft claims that Agent Mode will make M365 Copilot more reliable in Excel. In its tests, Agent Mode received a score of 57.2% in Spreadsheet Bench accuracy results — that compares with 71.3% for ...
Learn how OpenAI’s Agent Builder empowers users to design dynamic AI workflows with ease. No coding skills? No problem—start building today!
To try Agent Mode in Excel, you need to get the Excel Labs add-in and choose Agent Mode. In Word, you can just open Copilot and select Agent Mode from the menu below the prompt box. The feature will ...
Over the past year, companies have been tinkering with generative artificial intelligence to create customized agents that automate complex tasks, but they’ve gotten mixed results. Now, OpenAI is ...
AgentKit, announced during OpenAI’s DevDay in San Francisco, enables developers and enterprises to build agents and add chat capabilities in one place, potentially competing with platforms like Zapier ...
Office Agent in Copilot chat, powered by Anthropic models, brings presentation and document creation into a chat-first interface.