9 points | by hugetim 3 days ago
2 comments
Contrary to what the leaderboard lists as the human score, their technical paper implies a human baseline of ~48%.
[flagged]