Transformers Are Inherently Succinct

(arxiv.org)

16 points | by bearseascape 3 hours ago ago

2 comments

  • cadamsdotcom an hour ago

    Seems intuitively sound; a larger model would have the ability to differentiate among a larger variety of concepts, which translates to a larger vocabulary and greater ability to use expressive tools such as imagery, metaphor etc etc.

    I could go on, but brevity is virtuous.

  • pstuart an hour ago

    It makes sense that flowery language is more decorative than functional, but I wonder how much nuance can help shape reckoning, reasoning, and rendering -- if at all.

    Maybe RFC terms are all that's needed: https://datatracker.ietf.org/doc/html/rfc2119