Are AI agents ready for the workplace? A new benchmark raises doubts.
New research looks at how leading AI models hold up doing actual white-collar work tasks, drawn from consulting, investment banking, and law. Most models failed.
As companies deploy AI-powered chatbots, agents, and copilots across their operations, they’re facing a new risk: how do you let employees and AI agents use powerful AI tools without accidentally leaking sensitive data, violating compliance rules, or opening the door to prompt-based injections? Witness AI just raised $58 million to find a solution, building what they call “the […]
Built into the Claude Desktop app, Cowork lets users designate a specific folder where Claude can read or modify files, with further instructions given through the standard chat interface.
Israeli startup Milestone raised a $10 million seed funding round to correlate AI tool usage with engineering metrics, including code quality.
Amanda Kahlow has quietly been back in sales and marketing tech for about a year with agentic startup 1Mind.
Serval is using agentic AI models to automate IT service management, but the company has a unique approach that takes advantage of agentic AI’s powers while avoiding many of its pitfalls.
The company is updating Seller Assistant, its AI tool for third-party sellers, to help handle tasks on the seller’s behalf.
Called the Agent Payments Protocol (AP2), the system is meant to be interoperable between AI platforms, payment systems and vendors.
Box released a new set of AI tools at its Boxworks conference on Thursday, filling out CEO Aaron Levie’s vision for an AI-led transformation of the enterprise.
The round, which pushes Databricks’ valuation to $100B, was co-led by Insight Partners and Thrive. CEO Ali Ghodsi says he’s found an enormous untapped AI agent market in which to spend the funds.