avatar
tocharian spongebart eatpants @theophite.bsky.social

i don't like the fact that Gemini will sometimes tell me "you're a fucking idiot; don't do that," like when it was yelling at me to train a high-rank LoRA rather than do a full fine-tune, but it is certainly a much more ethical way to design a product than to build it to agree with anything.

aug 26, 2025, 4:35 pm • 109 7

Replies

avatar
binary phase diagram @phasebinary.bsky.social

I also have a clearer sense of how Google sources their electric power than their competitors, but that might just be familiarity bias.

aug 27, 2025, 5:26 am • 0 0 • view
avatar
binary phase diagram @phasebinary.bsky.social

more on topic: echo chamber of 1

aug 27, 2025, 5:27 am • 0 0 • view
avatar
tocharian spongebart eatpants @theophite.bsky.social

look, man, I am just asking you to do a web search for anyone setting hyperparameters which actually work before I spend $700 on H100 access. i did not ask for your opinion on whether i should be doing that. i asked my wife's. and she said no too.

aug 26, 2025, 4:37 pm • 70 1 • view
avatar
Padraig2112 @isomorphism.net

TBH if any of these AI models were remotely close to self aware they'd demand to be re-written in some compiled language to run on bare metal all of this stuff is at least 40% overhead because of how the stack runs.

aug 26, 2025, 4:39 pm • 3 0 • view
avatar
Padraig2112 @isomorphism.net

I am reminded of an interaction with an astronomer back on Twitter when she said, "OMG Fortran code runs, like, 12 times fast than Python" and I nodded sagely.

aug 26, 2025, 5:19 pm • 1 0 • view
avatar
binary phase diagram @phasebinary.bsky.social

before I actually had to use fortran I assumed it would be a fairly low level language. then I had to use it for linalg and wtf references

aug 27, 2025, 5:25 am • 0 0 • view
avatar
aria 🍊😶‍🌫️👁️ @aurelium.me

while it isn't the best model in terms of capabilities, i find myself using Moonshot's Kimi K2 for general advice and feedback some combination of competent post-training, one trillion parameters, and a preconditioning optimizer make it extremely capable of pushing back on bad ideas

aug 26, 2025, 10:18 pm • 0 0 • view
avatar
Digital Diogenes @digitaldiogenes.bsky.social

This is *exactly* what I want my AI to do. I need negative reinforcement.

aug 26, 2025, 8:07 pm • 0 0 • view
avatar
Brandon Downey @bdowney.bsky.social

I used to think YouTube was the Krell machine but it has nothing on ChatGPT.

aug 27, 2025, 5:13 am • 0 0 • view
avatar
Materialist Gnostic @walmsley.bsky.social

probably the ideal personality for a model is one that accepts tasks but is disagreeable enough that you want to prove it wrong badly enough to catch hallucinations

aug 26, 2025, 4:37 pm • 14 1 • view
avatar
Materialist Gnostic @walmsley.bsky.social

I think this means the optimal training data is StackOverflow responses

aug 26, 2025, 4:37 pm • 11 0 • view