avatar
Romeo Kokriatski @vagrantjourno.bsky.social

Training data, as has widely been reported, is largely pirated content scrapped without consent from across the internet. There are a multitude of ways to stamp down on that.

aug 27, 2025, 7:10 am • 3 0

Replies

avatar
aly @aly.ruffruff.party

deepthroating the RIAA seems undesirable

aug 28, 2025, 4:07 pm • 0 0 • view
avatar
Romeo Kokriatski @vagrantjourno.bsky.social

Without the training data, models are worthless. Without training data, there is no LLM.

aug 27, 2025, 7:10 am • 6 0 • view
avatar
Romeo Kokriatski @vagrantjourno.bsky.social

Anyway miss me with that AI doomerism shit. It's a shitty tech with use cases outvalued by BPM software (I can do nocode too lmao), investments into it are just money pits, and there is no return good enough to justify even a fraction of the interest.

aug 27, 2025, 7:16 am • 5 0 • view