Lates News

date
28/09/2025
Recently, a team consisting of Tang Xiangru and Wang Yujie from Yale University, Xu Wanghan from Shanghai Jiao Tong University, Wan Guancheng from UCLA, Yin Zhenfei from Oxford University, Eigen AI Jin Di, Wang Hanrui, and others jointly developed the Eigen-1 multi-agent system, achieving a historic breakthrough. The accuracy of Pass@1 on the HLE Bio/Chem Gold test set reached 48.3%, and the accuracy of Pass@5 soared to 61.74%, surpassing the 60% threshold for the first time. This achievement far exceeds Google's Gemini 2.5 Pro (26.9%), OpenAI's GPT-5 (22.82%), and Grok 4 (30.2%). What is even more exciting is that this accomplishment is not dependent on closed-source large models, but is completely built on the open-source DeepSeek V3.1 platform. (Qbit)