731 articles curated and counting

Your 360°
view of AI.

Curated from 30+ sources. Scored for relevance. Never algorithmic. Updated daily.

This week's digestSoon

May 18, 2026

Research

Hugging Face and IBM Launch Open Agent Leaderboard for AI Agents

The introduction of a public, standardized leaderboard for AI agents is critical for transparent evaluation and accelerating progress in agent development.

Hugging Face BlogMay 18

AI agents leaderboard benchmarking

April 15, 2026

Research

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

The VAKRA benchmark offers crucial insights into the current limitations of AI agents in reasoning and tool use, providing a clear roadmap for future research and development efforts.

Hugging Face BlogApr 15

VAKRA AI Agents Benchmarks

April 8, 2026

Research

IBM Research Introduces ALTK-Evolve for On-the-Job AI Agent Learning

ALTK-Evolve from IBM Research introduces a new paradigm for AI agents to learn and adapt continuously in real-time, significantly improving their autonomy and robustness.

Hugging Face BlogApr 8

AI Agents On-the-Job Learning IBM Research

March 13, 2026

Research

IBM & UC Berkeley Diagnose Enterprise Agent Failures with IT-Bench & MAST

The research provides critical insights into the vulnerabilities of enterprise AI agents, offering a path to more robust and reliable AI deployments through specialized diagnostic tools.

Hugging Face BlogMar 13

Enterprise AI AI Agents Failure Diagnosis

Your 360° view of AI.

May 18, 2026

Hugging Face and IBM Launch Open Agent Leaderboard for AI Agents

April 15, 2026

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

April 8, 2026

IBM Research Introduces ALTK-Evolve for On-the-Job AI Agent Learning

March 13, 2026

IBM & UC Berkeley Diagnose Enterprise Agent Failures with IT-Bench & MAST

May 18, 2026

Hugging Face and IBM Launch Open Agent Leaderboard for AI Agents

April 15, 2026

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

April 8, 2026

IBM Research Introduces ALTK-Evolve for On-the-Job AI Agent Learning

March 13, 2026

IBM & UC Berkeley Diagnose Enterprise Agent Failures with IT-Bench & MAST

Your 360°
view of AI.