Expanding Project Glasswing

(anthropic.com)

50 points | by surprisetalk 2 hours ago ago

48 comments

jb_briant 31 minutes ago

Step 1: claim you created a tool so dangerous you can't release it

Step2: offer to test it, but only for the biggest companies in the world

Step 3: onboard those big players on your tooling and product

Step 4: profit

This is genius.

[-]

geodel 13 minutes ago

With trillion dollars at stake they can hire best of best in sales and marketing. And unlike some hardcore hackers who may have ethics that does not always move in direction of more money. Sales and marketing people are highly motivated for opportunities to make more money.

[-]

jb_briant 10 minutes ago

Our game is to craft shit, their game is to sell shit. You gotta respect the different tastes in the nature!

[-]

geodel 4 minutes ago

Yeah, Companies to buy shit and their employees to eat shit. Lion king would say it is great circle of life.

[-]

jb_briant 3 minutes ago

Here spoke the wise man!

estearum 16 minutes ago

And all you have to do is demonstrate unique value during the pilot phase!

Err... wait... that was already the hard part... hmm

[-]

jb_briant 12 minutes ago

Genius marketing move doesn't mean there is no value.

It means than even if the value you offer is similar as your competitors, you are the one conquering the market.

That's the only way to not becoming a commodity.

mekpro 11 minutes ago

It’s clear that Anthropic has run out of the compute capacity needed to serve Mythos publicly.

They’re using security concerns to mask their inability to deliver the model at scale, while still trying to maintain their lead over OpenAI. As a result, they’ve chosen to release it privately under the banner of an “ethical” rollout.

[-]

jb_briant 3 minutes ago

It is not "clear", as your comment suggests, it's hidden. Which is semantically the opposite of clear. Regarding your theory, might be true, might be false. But it's highly speculative.

simonw 9 minutes ago

They started Glasswing before they struck that $1.25B/month deal with xAI/SpaceX for their (notoriously dirty) Memphis data centers.

So they have a whole lot more compute now than they did last month.

[-]

nickthegreek 7 minutes ago

But that compute might not be available to then long term. Hard to make big moves with a contract like that.

cobolcomesback 7 minutes ago

So why is OpenAI also releasing 5.5-Cyber in a private manner? Are they also out of compute?

bushido an hour ago

This feels more and more like a marketing/scarcity play for the largest global corps.

Will likely give them time to expand capacity as well. And make them harder to dislodge in these orgs.

[-]

jb_briant 20 minutes ago

Urgency x scarcity, unbeatable marketing move.

[-]

bushido 13 minutes ago

It is really good. Will also cut through the common procurement, legal and change management processes seen at these orgs.

[-]

jb_briant 10 minutes ago

Genius^2

ianm218 20 minutes ago

In case the topic of memory safety is interesting to anyone I've been experimenting with using AI agents to port common web infra projects to safe/ performant Rust. Somewhat inspired by the Bun port - was thinking that at some point memory safety might be such a big deal that people just need drop in replacements.

- Valkey/ Redis port here https://github.com/ianm199/valdr (passes ~99% of single node test suite, real prod features like replication/ clustering/ HA early or not implemented) - Further along port of Lua 5.1-5.5 https://github.com/ianm199/lua-rs-port/tree/main - I have a less developed nginx version that would be the north star - These projects are very alpha at the moment

If anyone is interested in getting involved in this or has done similar experiments I'd love to collaborate! There is so much variation in how you can run these large scale agent fleets I don't think anyone has a perfect system yet.

mrbonner 9 minutes ago

Maybe it is just me: I feel Anthropic most recent product announcements resemble more and more like what IBM tactic was at its high. For instance, the Watson AI hype after it defeated Kasparov. The difference is IBM actually wanted and let businesses buy and use Watson as opposed to time released like what Anthropic does to even boost the hype higher.

827a an hour ago

GPT-5.5-Cyber has already at least hit if not surpassed Mythos capability in cyber tasks. The only reason they're holding back is because once its out everyone would realize that its capabilities were a step change in March, but are not anymore, yet it costs significantly more and is much slower.

[-]

john_strinlai an hour ago

how did you go about assessing this?

jansan an hour ago

So you believe one marketing department more than the other?

[-]

NitpickLawyer 43 minutes ago

The brits have a step-based benchmark that they use for this - https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5...

They seem pretty close, in both average and "best run" scores. And, in a highly verifiable domain, "best run" or pass@n is what you're looking for.

aspectop an hour ago

i think anthropic is being performative here, creating a hype for mythos and not releasing. i guess this is all a marketing thing to sell a security specialized AI to enterprise and startups at a way larger cost coz security market is deep in money.

mentalgear an hour ago

Here's my big fear: Even IF (and that's a BIG if) we get all critical vulnerabilities fixed in tech (before adversarial/state-actors turn up with open attack models) - we still have (in at least a year) models that will be so good in social engineering that they can still (given enough tokens) gain access to whatever system they want.

If society can't trust banks and other institutions to safely control their data, what follows ?

Do we we collectivelly switch off the internet?

[-]

colechristensen an hour ago

Social engineering as a problem goes away when anybody can get a model to do it for them for $5. It stops being possible, it's really the bank's problem when they can't have a minimum wage call center or a robot responsible for people's data.

[-]

p-e-w an hour ago

Yes. There will be a few high-profile incidents, and then institutions will be forced to stop performing administrative actions based on people’s word.

[-]

applfanboysbgon 44 minutes ago

This outcome is massively detrimental to humanity at large. By eliminating the human factor from support, you make it impossible to get support in edge cases that fall outside of the pre-planned bureacratic process. Everyone already hates that Google can arbitrarily ban anybody they please with no way to get in contact with a human, and you want to extend that to banks in control of people's life savings?

[-]

hallway_monitor 23 minutes ago

I don't think anyone is saying that. You will just need to be authenticated before giving any commands to the bank. Maybe some type of TOTP that you can use over the phone or in person.

[-]

applfanboysbgon 17 minutes ago

That is the exact problem. You have identification tied to your device. Your device is lost or stolen. Now you can't access your bank account. Human support can help you out by finding flexible ways to ascertain your identity. This is the angle social engineers exploit, tricking employees trying to be helpful to abuse that area of flexibility. You can take away human judgment and all flexibility in the system, and that will make the system more secure, but it also results in a deeply uncaring system that makes life harder for people. Rigid bureacracy doesn't do a good job of accounting for a house fire destroying everything you own or your e-mail provider shutting down; these are fringe cases but they do happen and there are positive resolutions available as long as human discretion is involved.

repeekad 26 minutes ago

> Everyone already hates that Google can arbitrarily ban people

Yet they’re still the predominate search engine, sadly the concerns of the few don’t interest monopolistic profit seekers without forced regulations, think how airlines are legally required to give refunds for delayed flights, there’s a reason it required legislation

aplthrowaway67 an hour ago

How "altruistic" of them. If only Anthropic extended this level of care to the environment or the economy.

tantalor an hour ago

They should share it with Meta.

https://www.0xsid.com/blog/meta-account-takeover-fiasco

iamniels 14 minutes ago

Whats currently an open source project which comes closest to Mythos capabilities?

[-]

adrian_b 5 minutes ago

No single open weights model comes close to either Mythos or GPT 5.5.

Nonetheless, running many of the open weights models over a codebase, with an appropriate harness, can provide about the same vulnerability coverage (i.e. each of the open weights models would find a subset of what Mythos or GPT 5.5 could find, but the subsets are not the same).

Despite needing more runs and more time, this may be significantly cheaper, especially if the models are self hosted.

yanis_t an hour ago

Is there any evidence Mythos is qualitatively better than the Opus 4.x?

I'm afraid that the usual mantra that "we just need more scale" that worked well for attracting investments, is not working anymore - bigger models provide marginal improvements while naturally get much more expensive to run.

Is this why both Anthropic and OpenAI are rushing for IPOs this year?

[-]

alasano 37 minutes ago

From what I've read so far it's less about Mythos being much better at tasks in isolation.

Security wise, it's about being able to find and chain multiple vulnerabilities to actually create viable exploits.

So I would imagine that if you were using it for regular software development you may not feel that it's that different unless used in a particular way?

andrewjneumann 27 minutes ago

They keep writing like they stand to profit from this or something. Too many “coulds” in there for me too, this could be an amazing advancement and it could be nothing… normally we look at data and last headline I saw was 25 “high” vulnerabilities at the cost of $1 million in tokens.

No comparison to human teams, and I’m sure that $1 million in tokens was used by humans, in a team. So like most AI, they’ve developed a tool that capable people can use to be better, but unlike most tools, they’re claiming this to be outright magic. The magic is the hype train.

frays an hour ago

> We intend to go much further: our longer-term aim is to support the industry in creating new initiatives, standards, and infrastructure for the era of powerful cyber models.

Another reason why I will be buying Anthropic shares after they IPO.

I don't see them as just an AI company, they are a cybersecurity powerhouse too.

[-]

Jtarii an hour ago

Thanks for your input Claude.

[-]

devmor an hour ago

I see that as not just a spam post, but a generated addition to the dead internet - a real win for us algorithms.

jwpapi 40 minutes ago

Ragebait god

fontain an hour ago

“Mythos Preview continues a long-term trend that we’ve been warning about for some time: within 6 to 12 months […]”

The only trend Mythos continues is Anthropic’s trend of warning that disaster is always 6 to 12 months away.

jofzar an hour ago

> The organizations in this new group are based in more than 15 countries

I mean most nasdaq tech companies would be in 13+ countries, why are they writing this like it's a big number, is hilariously small?

[-]

newtonsmethod 38 minutes ago

I assume they're using a more candid definition where they're not counting all the countries a company may be based, but rather the primary country they're based in.

I don't think they're trying to flex this as a large number. They don't want to give an exact number, as that may change etc / is fuzzy, but also want to give you an idea of the scale.

They say "In the future, we intend to expand our geographical reach much further". I imagine this commentary is somewhat related to the concerns that AI will create an even worse "global underclass". AI developments are first accessible to Americans, then allies, and then later the whole world.

SpicyLemonZest 23 minutes ago

They're writing it in contrast to the previous scope, which doesn't seem to have been available to any organizations based outside the US. (There was news a few weeks ago about how Japanese banks were going to gain access, but based on the timing I think this announcement is that access.)

cmxch an hour ago

That’s fine as long as I can identify and reject any Mythos derived patch as being irreproducible.

[-]

IanCal an hour ago

Why would it not be reproducible?

philipwhiuk an hour ago

It would have been nice to have a list of the 150, but I guess it would make them a hacking target?