4 points | by smusamashah 6 hours ago ago
2 comments
Such a great project that could automate a lot vibes testing hopefully! A pity that the dataset only contains 55 questions. I'd like to see this number in the thousands.
https://github.com/petergpt/bullshit-benchmark
Such a great project that could automate a lot vibes testing hopefully! A pity that the dataset only contains 55 questions. I'd like to see this number in the thousands.
https://github.com/petergpt/bullshit-benchmark