I am so confused and angry about everything that's happening.
A few weeks ago it was Gemini. I got a subscription and kept up with things, and now if you go to the subreddit it's people saying how badly the quality has degraded.
The hype/bust cycle bullshit is starting to wear really thin given how long the tech has been out, and how much attention it's been given.
Recently, people from Anthropic have talked about how 90% of contributions to Claude were written by the agent, or how Anthropic's "Cowork" was built in a week and a half, entirely by Claude. So we should objectively be at the inflection point for a software singularity, where entire suites of software get automated in hours, because servers can scale with money instead of only with time the way people do.
Except where are all the things?
It's just so frustrating, because the tech is amazing and it feels like it might be technically possible, but then people keep hyping and lying, and stuff isn't making sense.
What do you mean where are all the things? How much software exactly do you think gets released per day? You got a live feed or something that you expect to see something profound in? People are building apps & tools to accelerate their own capabilities or solving for niche problem domains / tasks.
Opendev is better, and it's just so much easier convincing your boss to let you use it when all company data doesn't get sent to some American company.
While you notice the difference, it was very surprising to me that for almost all tasks, a worse model just means things take longer. Plus, both Claude and OpenAI's models lie when they don't know something. I actually have the impression Qwen coder 30B-A3B is a lot more forthcoming about not knowing something.
But the intelligence of the model doesn't actually make the difference between success and failure, and when it's exploring an actual problem, tokens/second makes more of a difference than intelligence. And, well, if it's hard enough, just take a break.
How much money do you think they're spending on these advertisement carpet bombing runs?
(instead of spending it on real things)
Most of it? ;)
By "them", I assume you mean billionaires who are jockeying to be trillionaires instead of dealing with real problems.
https://archive.is/bXjsr not much new compared to what has been posted on HN in the past few weeks.
What surprised me was scrolling through https://xcancel.com/search?f=tweets&q=claude+code , so many tweets every few seconds.
Randall, we'll need an update: https://xkcd.com/303/