Notes from the Mistral AI Now Summit in Paris

(koenvangilst.nl)

130 points | by vnglst 2 hours ago ago

19 comments

  • trouve_search 34 minutes ago

    OK, I'm 100% rooting for both Mistral and task focused small models.

    But Mistral has fall really far behind since 2025Q3. It seems they can't get good reasoning models working at even medium context sizes, which is necessary to be at the table right now.

    Gemma4 and Qwen3.6 are currently best in the small size; Mistral's "small" model has ~4x the parameter count at 120B and isn't even competing with models a quarter its size.

    Back one year ago with Mistral Small 3.1 they were keeping up, but they've fallen into irrelevancy right now.

    If Mistral seriously wants to play the on-prem and small task-specific model game, a decent proxy would be to build models that get the r/localLlama crowd excited

    • echelon 17 minutes ago

      Nobody trying to compete with Google, OpenAI, and Anthropic should be playing the small models / local models game.

      Foundation model labs should be building very large reasoning models, then leaving it to the community to distill them down.

      You can't scale a small model up, but you can scale a small model down.

      I'm convinced the only way we'll have a seat at the table in the future and avoid total runaway takeoff is if there are very large models within 80% of the capabilities of the frontier models. Tiny RTX models do diddly squat to remain competitive.

      Build open weights models for running on H200s. I'll spin them up on RunPod or Lambda.

    • lettergram 8 minutes ago

      We actually found the Mistral Small 4, quantized to 4bit was comparable to Qwen 3.6 27B and is roughly the same size. At least from our experience on our use cases, the quantization of the Mistral model worked far better than trying to quantize the Qwen family.

      Fully agree to your point though, Mistral in general is far behind where I'd expect and Qwen in particular is crushing it at the smaller sizes.

      Personally, I'd consider anything 20B params and above a "medium" model. Small being <20B and large >100B. I think obviously we can get to the huge 1-2T param models, but frankly the margin of accuracy improvement for the speed hit is kinda insane (1-2% for many metrics).

  • simonw an hour ago

    > BNP Paribas runs Mistral models on-prem for KYC in Belgium, with sensitive data staying within the bank's walls. Abanca is using agent orchestration to handle sensitive customer information at a huge scale (2 million customers in their app). For European companies in regulated industries, this is a good alternative to relying on US hyperscalers.

    Mistral leaning into on-prem and European-hosted models is very smart.

  • petcat an hour ago

    > Abanca is using agent orchestration to handle sensitive customer information at a huge scale (2 million customers in their app).

    Maybe my perspective is skewed on what "huge scale" means, but 2 million users? That's like a few hundred megabytes of data? Or a couple GBs if there's a lot of per-user data?

    • fidotron 24 minutes ago

      European consumer focused businesses do not scale easily the same way US ones do, which is a major contributor to their problems developing tech businesses generally.

      OTOH such things can be quite defensible, they just rarely become anything like as profitable.

    • vnglst 37 minutes ago

      Maybe, but using state-of-the-art large language models to solve customer support queries with agentic can quickly use a lot of tokens. What I understood from the talk is that they used agents with limited responsibility and (assumption from me) smaller models, to the make sure the answers were quick, reliable and not too costly.

      • hadlock 18 minutes ago

        There are several payments processing companies that are already largely using AI for customer support queries. They still have an escape hatch to a human but at least one of those companies (on the smaller side) is reporting a ~99% success rate, they are down to a handful of human customer service employees now for cases where the customer can't find/produce the transaction ID.

  • Eldodi 40 minutes ago

    I was at the event, and was impressed by the attendance, all the leaders from the major european listed companies were there.

    Also interesting to note the number of partners they invited. Going from Microsoft, Accenture and EY to startups like alpic.ai or lingo.dev . Seems like they are ramping up their M&A game too

  • LucidLynx an hour ago

    As an European: 100x YES!

    I really like the direction and the transparency of Mistral, among those players.

  • ogou 4 minutes ago

    I've said it before that Mistral is underrated. They are looking at real world use of LLMs and tooling. Bespoke models are very appealing to lots of non-tech centered companies and state agencies. Also, Mistral's actual platform is useful. While others are watching performance leaderboards like this is some eSports stream, they are building real world uses.