Why is GPT-5.4 obsessed with Goblins?

11 points | by pants2 16 hours ago ago

7 comments

  • HPSimulator an hour ago

    One thing that might also be happening is that LLMs tend to converge on metaphors that compress complex ideas quickly.

    If you look at how engineers explain messy systems, they often reach for anthropomorphic metaphors — “gremlins in the machine”, “ghost in the system”, “yak shaving”, etc. They’re basically shorthand for “there’s hidden complexity here that behaves unpredictably”.

    For a model generating explanations, those metaphors are useful because they bundle a lot of meaning into one word. So even if the actual frequency in normal conversation is low, the model might still favor them because they’re efficient explanation tokens.

    In other words it might not just be training frequency — it could be the model learning that those metaphors are a compact way to communicate messy-system behavior.

  • ghostlyInc 9 hours ago

    LLMs tend to pick up recurring metaphors from training data and reinforcement tuning.

    Words like “goblin”, “gremlin”, “yak shaving”, etc. are common in engineering culture to describe hidden bugs or messy systems. If those appear often in the training corpus or get positively reinforced during alignment tuning, the model may overuse them as narrative shortcuts.

    It's basically a mild style artifact of the training distribution, not something intentionally programmed.

    • d--b 8 hours ago

      They seem a lot more common in OP's conversations than in any regular engineering conversation though. Like I've been an engineer for 20 years. I don't remember the phrase used in my work context, ever.

      • ghostlyInc 5 hours ago

        That's fair. It probably depends a lot on which corners of engineering culture the training data comes from. In some communities (older Unix culture, Hacker News, ops/debugging discussions) terms like “gremlins”, “yak shaving”, etc. pop up more often as humorous shorthand for messy problems.

        But you're right that in day-to-day professional environments they aren't used nearly as much. So it might also just be the model over-generalizing a small stylistic pattern it saw frequently in certain parts of the corpus.

  • kilianciuffolo 7 hours ago

    I am getting the world goblin and gremlin once every hour.

  • arthurcolle 16 hours ago

    why don't you ask the model?

    • Tarraq 15 hours ago

      Not to scare away the goblins!