Tracking Our Latest Research, Systems Work, And Real-World Performance Wins
Function calling is the foundation of real-world LLM agents—but most existing datasets are unreliable. Applied to benchmarks like BFCL, EigenData shows that 71.5% of samples contain critical errors affecting correctness or evaluation, and introduces a self-evolving system to generate, audit, and repair them.
Eigen AI and Nebius are partnering to bring faster, optimized open-source AI models to Token Factory, Nebius’s platform for running open models in production.
Eigen AI now holds the #1 output speed across 25 of the most widely used open-source models tracked by Artificial Analysis — more than doubling our footprint in just six weeks.
Interactive tool-agent post-training is unstable because user simulation noise corrupts delayed-reward learning. We propose EigenData + verifier-based RL, achieving strong improvements on τ²-bench (Airline 58.0% → 73.0%; Telecom 53.7% → 98.3%).
Eigen AI and Boson AI are co-hosting the Higgs Audio Hackathon at Circuit Launch in Mountain View, March 20–22, 2026. Build with Higgs Audio and Eigen AI's full model suite.
Introducing Mini Banana, a new approach to AI.
We are proud to announce that we have expanded this performance leadership across 11 of the most prominent models in the industry, consistently securing top-tier rankings for speed, latency, and efficiency.
Palo Alto, Calif., Jan 15, 2026 - The future of voice has arrived: Eigen AI and Boson AI join forces to power Higgs-Audio v2.5.
Eigen AI announces full open-source support for DFlash — covering both training and inference. DFlash uses block diffusion for up to 6.17× lossless acceleration.
ML-Master 2.0 reaches SOTA on OpenAI’s MLE-bench and advances ultra-long-horizon autonomy.