avatar
Damien Stanton @damienstanton.bsky.social

No, a trace through the token prediction computation is a graph -- nothing re: the Transformer architecture or neural nets themselves. I'm telling you the way in which the foundation models self-reported the process I described; no different than Hinton's example of the prism/subjective experience.

jul 26, 2025, 2:09 pm • 2 0

Replies

avatar
Pekka Lund @pekka.bsky.social

Self-reported what process? Tracing their thoughts through the networks? Like you would expect people to describe their internal processes?

jul 26, 2025, 2:23 pm • 0 0 • view