Post by Damien Stanton / Redsky

No, a trace through the token prediction computation is a graph -- nothing re: the Transformer architecture or neural nets themselves. I'm telling you the way in which the foundation models self-reported the process I described; no different than Hinton's example of the prism/subjective experience.

jul 26, 2025, 2:09 pm • 2 0

Replies