1 comments

  • fblgit 4 hours ago

    one of a kind single-transformer block layer, high throughput. The new generation of transformer-based lightweight models for common NLP tasks?