Post by Aaron Reichlin-Melnick / Redsky

We burn the planet for this ish?

aug 8, 2025, 2:01 am • 1 0 • view

But has anybody asked the obvious follow up question?

aug 8, 2025, 5:56 am • 0 0 • view

We should definitely let AI do air traffic control!

aug 8, 2025, 1:56 am • 1 0 • view

So I personally am against LLMs. But I tested this and did not find the same result. Just for fact checking.

aug 8, 2025, 2:13 am • 1 0 • view

LLMs make a statistical guess each time. They don’t develop a correct answer and remember it for later use. You can literally ask it the same question ten times and get ten different answers.

aug 8, 2025, 11:23 am • 0 0 • view

Inconsistency is indeed the problem. I asked gpt 4o the same question and got a different response

aug 8, 2025, 3:31 am • 1 0 • view

😆

aug 8, 2025, 1:56 am • 0 0 • view

Which iteration of AI will figure out how to count to 2? Nothing a few billion lit on fire can’t figure out, ….maybe.

aug 8, 2025, 1:56 am • 1 0 • view

Last line doing a lot of work in the AI response, lol

aug 8, 2025, 4:15 pm • 0 0 • view

Complete Blubbery

aug 8, 2025, 2:17 am • 0 0 • view

v5 is a hybrid of multiple models, and will use the one that matches the perceived complexity of your question. So we're back to the "think very hard about this" world of prompt engineering.

> How many bs are in blueberry? The word

aug 8, 2025, 2:53 am • 3 0 • view

See, the words LLMs use have no meaning. Their place is determined by probabilities, not meaning. So LLMs have no idea what "middle" means in this context. Something makes it decide 3 (which has no meaning) would be right if you used "middle" (middle is often used to describe one element of three).

aug 8, 2025, 2:20 am • 1 0 • view

That's addressed to the initial attempt. But a similar analysis would apply to the word "last" and maybe other in the second one. All those words often go together but what so?

aug 8, 2025, 2:23 am • 0 0 • view

Claude got it!

aug 8, 2025, 1:58 am • 3 0 • view

The fact that it broke it down like that makes me think Anthropic specifically included training data to address this problem lol

aug 8, 2025, 2:01 am • 7 0 • view

MLLMLs

aug 8, 2025, 1:57 am • 0 0 • view

Or this is a viral marketing campaign to get people to go try ChatGPT for themselves.

aug 8, 2025, 2:03 am • 0 0 • view

ChatGPT breaks text into tokens—small words or parts of words—which it codes as numbers. So “blueberry” gets broken into “blue” and “berry” which are coded as 18789 and 19772. ChatGPT literally has no idea there are letters involved.

aug 8, 2025, 4:13 am • 1 0 • view

(There are ways around this problem, which is why some AI/LLMs can figure it out.)

aug 8, 2025, 4:15 am • 1 0 • view

I had a fun moment where my standard paid ChatGPT 4o didn’t even know that ChatGPT 5 exists and was just released. chatgpt.com/share/68956d...

aug 8, 2025, 3:26 am • 0 0 • view

I can see why replacing your customer service and advertising teams with AI is a good idea

aug 8, 2025, 2:02 am • 0 0 • view

I asked an AI knower about it if you’re curious

aug 8, 2025, 1:59 am • 4 0 • view

My wife, who is a data scientist, just gave me a very similar version of that talk!

aug 8, 2025, 2:02 am • 5 0 • view

It's like an aphasic patient "explaining"

aug 8, 2025, 3:38 am • 1 0 • view

Replies