
eldenring · 2026-05-24T19:08:02 1779649682

2-3x is completely dwarfed by the remaining improvements in training, which is still relatively in its infancy.

BearOso · 2026-05-24T19:17:53 1779650273

Unless there's a new paradigm, scaling up is all they can do to improve performance. They've shrunk down all the way to 1-bit models and all the low-hanging fruit is gone. There's no way for them to get much smaller, so they have to get bigger and faster to meet expectations.

intelkishan · 2026-05-25T02:56:21 1779677781

This hasn’t been true for the past 2 years

oblio · 2026-05-25T07:24:35 1779693875

Is this based on an assumption that Opus 4.7 & co are equivalent to or smaller than Opus 4.5 & co? I highly doubt the advanced models (Opus, Pro, etc) aren't bigger than the standard ones (Sonnet, Flash, etc), and I'm fairly sure newer models are bigger than older ones.

eldenring · 2026-05-24T19:49:03 1779652143

this is just not true at all, there are massive leaps from algorithms, data, etc. every year. scale is one axis of many and you need to get them all correct.

BearOso · 2026-05-25T18:52:30 1779735150

What novel data hasn't already been used in training? What new algorithms are there? Can you post some links so we can read about them?

gpm · 2026-05-24T19:18:18 1779650298

Probably, but at some point we're very likely to run out of significant training improvements and it's not clear that we'll see that point coming from a long way out.

Likewise it's probably dwarfed by improvements in how we make DRAM - continuing the roughly exponential (maybe a bit less recently) scaling of chips - but not necessarily.

The 2x from returning to previous costs is interesting because it's practically guaranteed, and it's on top of everything else. We're just currently "overpaying" (relative to the stable market price) for the manufacture of DRAM because of a sudden increase in demand.

eldenring · 2026-05-24T21:14:11 1779657251

my reply from the other thread fits here too:

> this is just not true at all, there are massive leaps from algorithms, data, etc. every year. scale is one axis of many and you need to get them all correct.

eldenring · 2026-05-21T05:24:37 1779341077

I'm guessing they had a significant revenue spike from GPT 5.4 and GPT 5.5 being so good at coding, and from hiccups at Anthropic making it easier for programmers to try the models.

eldenring · 2026-05-17T11:20:45 1779016845

It's just not a thing to consider and doesn't happen often.

eldenring · 2026-05-11T01:59:13 1778464753

This article makes 0 sense. It's not up to billing or computer systems or ease of use or anything else that matters. The question is whether the scaling laws, which in the asymptote are likely the laws of physics, will hold up in converting energy to smarter models. It's not really up to anyone, the labs or developers, to choose if local or remote models will be the norm.

eldenring · 2026-04-27T02:35:52 1777257352

CompactStr doesn't have any additional runtime overhead, iirc, right? So in theory you can drop it in everywhere, even when you expect > 25 chars. Maybe an extra branch in the > 25 char case?

kibwen · 2026-04-27T07:04:45 1777273485

SSO does have overhead. Firstly, on every access you have a branch. Secondly, and more severely, the "most general" umbrella type that all string methods are defined on is a string slice, and whereas conversion from `String` to `&str` is literally a no-op, SSO strings require work to be done to convert them to string slices. Furthermore, note that in the (surprisingly common) case where the string is zero-length, String already skips the allocation, same as an SSO string.
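
To make the branch concrete, here is a minimal sketch of a small-string type. `SsoString` and `INLINE_CAP` are made-up names for illustration, and the layout is an assumption, not how CompactStr or any real crate stores its data. The point is only that getting a `&str` out requires a `match` on the representation, whereas `String` can hand out its slice directly.

```rust
use std::str;

// Hypothetical inline capacity, chosen only for this example.
const INLINE_CAP: usize = 23;

// Illustrative small-string type: short strings live inline, long ones on the heap.
enum SsoString {
    Inline { len: u8, buf: [u8; INLINE_CAP] },
    Heap(String),
}

impl SsoString {
    fn new(s: &str) -> Self {
        if s.len() <= INLINE_CAP {
            // Copy the bytes into the inline buffer; no allocation happens here.
            let mut buf = [0u8; INLINE_CAP];
            buf[..s.len()].copy_from_slice(s.as_bytes());
            SsoString::Inline { len: s.len() as u8, buf }
        } else {
            SsoString::Heap(s.to_owned())
        }
    }

    // Every conversion to a string slice has to branch on the representation.
    fn as_str(&self) -> &str {
        match self {
            SsoString::Inline { len, buf } => {
                // The bytes were copied whole from a valid &str, so this cannot fail.
                str::from_utf8(&buf[..*len as usize]).expect("inline bytes are valid UTF-8")
            }
            SsoString::Heap(s) => s.as_str(),
        }
    }

    fn is_inline(&self) -> bool {
        matches!(self, SsoString::Inline { .. })
    }
}
```

For the zero-length case mentioned above, `String::new()` reports a capacity of 0, i.e. it has not allocated anything yet either.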

eldenring · 2026-04-20T00:22:50 1776644570

> Folks are now starting to ask difficult questions about their burn rate and revenue.

this view isn't updated correctly post-Claude Code and Codex. there will clearly be sufficient demand.

philistine · 2026-04-20T00:37:18 1776645438

Seriously? One release is all it took to turn the whole ship around?

eldenring · 2026-04-16T17:37:14 1776361034

I think the coding market will be much larger. Knowledge work is kind of like the leaf nodes of the economy where software is the branches. That's to say, making software easier and cheaper to write will cause more and more complexity and work to move into the software domain from the "real world", which is much messier and more complicated.

cjbarber · 2026-04-16T17:38:36 1776361116

Yes, and the same thing will happen in non-coding knowledge work too. Making knowledge work cheaper will cause complexity to increase, which means more knowledge work.

eldenring · 2026-04-16T17:56:28 1776362188

I don't think so. The whole point of writing software is that it is a great sink for complexity. Encoding a process or mechanism in a program makes it work (as defined) perfectly, forever.

An example here is in engineering. Building a simulator for some process makes computing it much safer and more consistent vs. having people redo the calculations themselves, even with AI assistance.

cjbarber · 2026-04-16T18:01:52 1776362512

The history of both knowledge work and software engineering seems to be one of increasing volume and complexity. Feels reasonable to me to bet on both of those trendlines continuing?

visarga · 2026-04-16T18:47:26 1776365246

Yes, I have a theory - that higher efficiency becomes structural necessity. We just can't revert to earlier inefficient ways. Like mitochondria merging with the primitive cell - now they can't be apart.

eldenring · 2026-04-08T17:14:34 1775668474

Because there's a realistic chance this is the only important software technology moving forward, and it commoditizes Meta's entire business, which is software.

dgellow · 2026-04-08T18:17:39 1775672259

Meta’s business is human attention, human connections, and all derived data. They can use AIs for their systems, but the question is why they feel the need to spend billions on training and running their own frontier model.

eldenring · 2026-04-08T00:09:13 1775606953

I don't see how it's possible to think this. AI coding assistants are some of the most useful technologies ever created, and model quality is by far the most important thing, so it doesn't make sense that local inference would be the path forward unless something fundamentally changes about hardware.

sunir · 2026-04-08T02:43:25 1775616205

The hardware will change. We know that.

eldenring · 2026-04-05T04:52:18 1775364738

How many docs do you put in the context? We maintain a lot of DSL code internally, and each file has a copy of the spec + guide as a comment at the top. It's about 50 LOC and the relevant models are great at writing it.

danpalmer · 2026-04-05T05:00:42 1775365242

Oh yeah, the models are great at writing the DSLs, there are enough examples to do that very effectively. It's the building of the DSL, which is implemented in the config language, which is tricky. i.e., writing a new A/B test in the language is trivial, writing an A/B testing config DSL in the language is hard.

The main problem is the dynamic scoping (as opposed to lexical scoping like most languages), and the fact that lots of things are untyped and implicitly referenced.
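
For anyone unfamiliar with the distinction, a rough sketch in Rust, emulating both behaviours with explicit environments. `Env`, `make_lexical`, and `dynamic` are made-up names for this example and have nothing to do with the config language itself. Under lexical scoping a name resolves against the environment where the function was defined; under dynamic scoping it resolves against whatever the caller happens to have bound at the time.

```rust
use std::collections::HashMap;

// Hypothetical environment type: variable names mapped to values.
type Env = HashMap<&'static str, i64>;

// Lexical scoping: the returned function keeps a copy of the environment
// it was defined in and ignores the caller's bindings.
fn make_lexical(def_env: &Env) -> impl Fn(&Env) -> i64 {
    let captured = def_env.clone();
    move |_call_env: &Env| captured["x"]
}

// Dynamic scoping: `x` is looked up in whatever environment is active
// at the call site, so the result depends on who is calling.
fn dynamic(call_env: &Env) -> i64 {
    call_env["x"]
}
```

With `x = 1` bound at the definition site and `x = 2` bound at the call site, the lexical version returns 1 and the dynamic version returns 2. That caller-dependence is what makes implicitly referenced names hard to follow in a large config.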