imho we have to be careful about these comparisons. when ppl talk about “this LLM is an expert at task x” often the task has been defined down or the target population has been defined up, to wit: the average person has a high school degree & doesn’t read books bsky.app/profile/davi...