Robotics News 魔法院子

Robotics & Automation News (RSS) Industry News 2026-04-01 00:00
Positronic Robotics has introduced a new benchmarking initiative aimed at evaluating how well AI-driven robots perform in real-world industrial tasks, as interest grows in so-called “physical AI” systems. The benchmark, called PhAIL (Physical AI Leaderboard), measures robotic performance using operational metrics such as units per hour and mean time between failures, rather than traditional academic […]
ScienceDaily Robotics (RSS) Research/Papers 2026-03-13 14:08
As AI systems began acing traditional tests, researchers realized those benchmarks were no longer tough enough. In response, nearly 1,000 experts created Humanity’s Last Exam, a massive 2,500-question challenge covering highly specialized topics across many fields. The exam was engineered so that any question solvable by current AI models was removed. Early results show even the most advanced systems still struggle — revealing a surprisingly large gap between AI performance and true expert-level knowledge.