HN
New
Show
Ask
Jobs
Built with Qwik
Evaluating frontier AI R&D capabilities of LLM agents against human experts
(metr.org)
1 points | by
tedsanders
18 hours ago ago
No comments yet.
No comments yet.