Notes from the Mistral AI Now Summit in Paris

(koenvangilst.nl)

130 points | by vnglst 2 hours ago

19 comments

OK, I'm 100% rooting for both Mistral and task-focused small models.

But Mistral has fallen really far behind since 2025Q3. It seems they can't get good reasoning models working at even medium context sizes, which is necessary to be at the table right now.

Gemma4 and Qwen3.6 are currently best in the small size; Mistral's "small" model has ~4x the parameter count at 120B and isn't even competing with models a quarter its size.

A year ago, with Mistral Small 3.1, they were keeping up, but they've fallen into irrelevance now.

If Mistral seriously wants to play the on-prem and small task-specific model game, a decent proxy would be to build models that get the r/localLlama crowd excited.

echelon 17 minutes ago

Nobody trying to compete with Google, OpenAI, and Anthropic should be playing the small models / local models game.

Foundation model labs should be building very large reasoning models, then leaving it to the community to distill them down.

You can't scale a small model up, but you can scale a large model down.

I'm convinced the only way we'll have a seat at the table in the future and avoid total runaway takeoff is if there are very large models within 80% of the capabilities of the frontier models. Tiny RTX models do diddly squat to remain competitive.

Build open weights models for running on H200s. I'll spin them up on RunPod or Lambda.

lettergram 8 minutes ago

We actually found that Mistral Small 4, quantized to 4-bit, was comparable to Qwen 3.6 27B and is roughly the same size. At least from our experience on our use cases, the quantization of the Mistral model worked far better than trying to quantize the Qwen family.

Fully agree with your point though: Mistral in general is far behind where I'd expect, and Qwen in particular is crushing it at the smaller sizes.

Personally, I'd consider anything 20B params and above a "medium" model. Small being <20B and large >100B. I think obviously we can get to the huge 1-2T param models, but frankly the margin of accuracy improvement for the speed hit is kinda insane (1-2% for many metrics).
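
For anyone wanting to sanity-check the "roughly the same size" claim above, here is a rough back-of-envelope sketch. It assumes the 120B parameter count quoted upthread for Mistral's "small" model and assumes the Qwen 27B model is held at unquantized 16-bit weights; the function name is made up for illustration. It counts weights only and ignores quantization overhead (scales, zero points), KV cache, and activations, so real memory use will be higher.

```python
def weight_footprint_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (10^9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


# 120B is the figure quoted in the thread, not an official spec.
mistral_small_4bit = weight_footprint_gb(120, 4)

# Assumption: the 27B model is compared at 16-bit precision.
qwen_27b_16bit = weight_footprint_gb(27, 16)

print(f"120B @ 4-bit : {mistral_small_4bit:.0f} GB")
print(f"27B  @ 16-bit: {qwen_27b_16bit:.0f} GB")
```

Under those assumptions the two land within about 10% of each other, which is consistent with "roughly the same size", but the comparison shifts if the 27B model is also quantized.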

simonw an hour ago

> BNP Paribas runs Mistral models on-prem for KYC in Belgium, with sensitive data staying within the bank's walls. Abanca is using agent orchestration to handle sensitive customer information at a huge scale (2 million customers in their app). For European companies in regulated industries, this is a good alternative to relying on US hyperscalers.

Mistral leaning into on-prem and European-hosted models is very smart.

bg24 36 minutes ago

Also, Mistral did just the right thing by acquiring Koyeb to beef up their deployment-at-scale expertise.

doctorpangloss 18 minutes ago

Yeah, but why use Mistral on premises instead of Qwen?

plaidthunder 15 minutes ago

Because the lab working on Mistral is in the European Union.

simonw 15 minutes ago

One reason might be that Mistral doesn't have a risk of weird training biases that were required by the Chinese government.

johnbarron 39 minutes ago

Let's hope the models can do better KYC than the humans have been doing... because the humans' failures are well known.

Or is this a case of the humans now preparing the excuse that it was an AI failure?

"BNP Paribas Sentenced for Conspiring to Violate the Trading with the Enemy Act" - https://www.justice.gov/archives/opa/pr/bnp-paribas-sentence...

"BNP Paribas caught up in French money laundering investigation" - https://www.reuters.com/business/finance/bnp-paribas-caught-...

"BNP Paribas faces $246m fine in currency scandal" - https://www.bbc.com/news/business-40635070

"BNP Paribas caught in a Cypriot money laundering investigation" - https://www.lemonde.fr/en/les-decodeurs/article/2023/12/26/b...

In money laundering, their track record is unmatched: https://violationtracker.goodjobsfirst.org/parent/bnp-pariba...

pavlov 18 minutes ago

When the humans have a track record of corruption, it might make sense for a company to seek parallel opinions from an LLM so they can at least flag suspicious human decisions.

Assuming BNP Paribas leadership wants to stop the corruption, of course.

johnbarron 11 minutes ago

They had years to fix it: https://violationtracker.goodjobsfirst.org/parent/bnp-pariba...

psychoslave 19 minutes ago

That's just one side of the story. I'm not following it in detail, but their own Le Chat explained to me that the company was a capitalist succubus starving to build a data center in some northern European country. Hilarious if you ask me.

petcat an hour ago

> Abanca is using agent orchestration to handle sensitive customer information at a huge scale (2 million customers in their app).

Maybe my perspective is skewed on what "huge scale" means, but 2 million users? That's like a few hundred megabytes of data? Or a couple GBs if there's a lot of per-user data?
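
A quick sketch of that arithmetic, for what it's worth. The per-user byte counts are pure assumptions picked to bracket the two cases (a lean profile versus a richer record), and the function name is made up for illustration; nothing here comes from Abanca.

```python
def dataset_size_gb(users: int, bytes_per_user: int) -> float:
    """Total raw data size in GB (10^9 bytes) for a flat per-user record size."""
    return users * bytes_per_user / 1e9


USERS = 2_000_000  # figure quoted in the article

# Assumption: ~150 bytes per user (name, IDs, a few flags).
lean = dataset_size_gb(USERS, 150)

# Assumption: ~1 KB per user (profile plus some history).
rich = dataset_size_gb(USERS, 1_000)

print(f"lean profile: {lean * 1000:.0f} MB")
print(f"rich profile: {rich:.0f} GB")
```

Either way the raw customer data fits comfortably in memory on a single machine, so whatever is "huge" here would have to be something other than storage volume.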

fidotron 24 minutes ago

European consumer-focused businesses do not scale as easily as US ones do, which is a major contributor to their problems developing tech businesses generally.

OTOH such things can be quite defensible, they just rarely become anything like as profitable.

vnglst 37 minutes ago

Maybe, but using state-of-the-art large language models to solve customer support queries with agentic workflows can quickly use a lot of tokens. What I understood from the talk is that they used agents with limited responsibility and (assumption from me) smaller models, to make sure the answers were quick, reliable and not too costly.

hadlock 18 minutes ago

There are several payments processing companies that are already largely using AI for customer support queries. They still have an escape hatch to a human, but at least one of those companies (on the smaller side) is reporting a ~99% success rate; they are down to a handful of human customer service employees now, for cases where the customer can't find/produce the transaction ID.

Eldodi 40 minutes ago

I was at the event and was impressed by the attendance: all the leaders from the major European listed companies were there.

Also interesting to note the number of partners they invited, ranging from Microsoft, Accenture and EY to startups like alpic.ai or lingo.dev. Seems like they are ramping up their M&A game too.

LucidLynx an hour ago

As a European: 100x YES!

I really like the direction and the transparency of Mistral, among those players.

ogou 4 minutes ago

I've said before that Mistral is underrated. They are looking at real-world use of LLMs and tooling. Bespoke models are very appealing to lots of non-tech-centered companies and state agencies. Also, Mistral's actual platform is useful. While others are watching performance leaderboards like this is some eSports stream, they are building real-world uses.