LLMs: Solvers vs. Judges

(bensantora.com)

2 points | by truelinux1 5 hours ago ago

1 comments

  • truelinux1 5 hours ago

    I gave several LLMs a logic puzzle with an embedded contradiction. Some flagged it. Some quietly bent the rules to produce an answer anyway.

    Knowing which type of model you're using (a helpful solver or a strict judge) really matters.