HN
Reinforcement Learning from Human Feedback (RLHF) in Notebooks
(github.com)
71 points | by
ash_at_hny
a day ago
3 comments
kcdom1000f
a day ago
Hl
careful_ai
a day ago
[dead]
bobvylan
a day ago
[dead]