Post by Ted Underwood / Redsky

Ted Underwood • Feed

There's a version of that concern I find plausible. But we have to start by sweeping away the notion that it might really require "the resources of a nation-state to run and maintain" models, if we didn't have irrational investment. a) Inference is profitable.

aug 23, 2025, 4:21 pm • 6 0

Replies

This assumes that smaller orgs can get away with the copyright infringements that OpenAI and other better financed groups have avoided, and are starting to be called on. If second-generation operators have to license the content they need to stay updated, then that becomes a HUGE expense.

aug 23, 2025, 4:27 pm • 0 0 • view

Without judging the merits of that claim, I just want to note, it's a new topic. It is related, in the sense that it is also a theory about why the social changes of the last three years must in the end be reversed. But I'm going to limit myself to arguing about one theory of that kind per day.

aug 23, 2025, 4:31 pm • 3 0 • view

Let me know when you're ready to talk about the copyright stuff, and I'll be happy to jump in as a technologist AND book publisher AND general copyright nerd. ;-)

aug 23, 2025, 4:33 pm • 1 0 • view

b) "Maintenance" is not a significant cost. Things can be tuned and updated at low cost. c) The trajectory of model size suggests that they will get *more* accessible on personal devices, not less so. Once we firmly set aside the vision of the future where things people are already doing become +

aug 23, 2025, 4:23 pm • 4 0 • view

too expensive for anyone to do, I do agree with you that there are reasons to be concerned about oligopoly. Especially because the real value of these tools will emerge as they are integrated with things like operating systems and email systems, where oligopoly is already a problem.

aug 23, 2025, 4:24 pm • 3 0 • view

This is the key point.. Oligopoly will take ultimate control [in this current world of Oligarchy] of GenAI as features/functions plateau. Particularly because there isn't anything proprietary about these foundation models. Just like email, OS etc

aug 23, 2025, 4:52 pm • 2 0 • view

But I'm afraid the real appeal of the bubble theory is the tacit suggestion that "things individuals are currently doing are really unsustainable, and they will be forced to stop."

aug 23, 2025, 4:26 pm • 8 1 • view

We're ALREADY seeing instances of where individuals are being forced to stop doing things due to the sustainability issues, so I wouldn't be so quick to categorically reject that could happen, as it already is. Whether this is an isolated incident or a canary in the coal mine is left to be seen.

aug 23, 2025, 4:30 pm • 1 0 • view

You can vibe-code just fine with models that can be run locally for free on ~$1500 hardware. You can synthesize images and video similarly. So, uh, I don't see how to square this belief that its so unknown what would stick around or how costly it really is.

aug 23, 2025, 5:00 pm • 3 0 • view

You can vibe-code just fine with the existing models that have been trained on large collections of code from places like GitHub. When the webdev community moves to its next language or framework and you don't have a model trained for you on the new code, you're going to hit a wall pretty quickly.

aug 23, 2025, 5:17 pm • 1 0 • view

That's going to create some interesting pressures and barriers against movement, as well as amplify existing rich-get-richer tendencies in language development. Even before LLMs, lots of people still code in stuff like C or heck, even Fortran in the scientific community.

aug 23, 2025, 5:27 pm • 2 0 • view

Practically though, if you've got example code (and especially if the language isn't some totally alien thing - just different reserved words and boilerplate), updating by training a Lora or even putting documentation into the prompt isn't out of the question even for individuals.

aug 23, 2025, 5:30 pm • 3 0 • view

I'd LOVE to see some numbers on what the minimal viable training set was for a programming language. A few weeks back, I used Amazon's Q to navigate their CDK language - which is pretty niche, but Amazon has the benefit of seeing ALL CDK code that's actually deployed.

aug 23, 2025, 5:34 pm • 1 0 • view

our languages are all so similar that a lot of this transfers, i think. libraries tend to have ~~pretty similar calling patterns. i have had pretty good luck for "just give it docstrings and it can Do The Thing"

aug 23, 2025, 5:36 pm • 2 0 • view

Maybe you could get a scaling law for lines-of-code-between-bugs versus lines-of-example-code, but I think the language details will matter a lot. Using, say, tables of particular memory addresses for functions of a new system-on-chip is different than say 'Python but with {} instead of whitespace'

aug 23, 2025, 5:45 pm • 1 0 • view

Extending from one language to another is actually going to be very easy. It's not a back to the drawing board situation at all. arxiv.org/abs/2310.16937

aug 23, 2025, 5:32 pm • 4 0 • view

The issue isn't so much going back to the first step, it's avoiding going back far enough that one spends more time verifying LLM code than learning to write it the classical way. Thanks for sharing that paper - I'm going to spend some more time with it. However, in my initial review...

aug 23, 2025, 6:07 pm • 0 0 • view

I'm really waiting for a general theory of conceptual attachment from these things. I suspect there are pretty simple patterns for when a model keeps new things separate, versus when it abstracts them in a way that allows linkage to what it already can do.

aug 23, 2025, 5:34 pm • 2 0 • view

Also, we're getting dozens of foundation models a month right now. There are paths that lead from that world to one where we're darning our used clothing, saving bacon drippings, and only training one model every other year. But those paths involve thermonuclear war or smallpox. +

aug 23, 2025, 5:35 pm • 5 0 • view

If all AI companies shut down their APIs and closed their doors, maybe I couldn't have an LLM analyze an entire book series for me, but I could still certainly have one analyze 10 chapters. Maybe I couldn't let one work on a codebase but I could still use it to vectorize for loops into PyTorch, etc.

aug 23, 2025, 5:04 pm • 1 0 • view