So it hallucinates significantly less than people lie and the original chart shows 3.7 being 5 times worse than it actually is. 0.1% is within the statistical margin of error, they're likely the same but with Claude 4 being even more capable.
So it hallucinates significantly less than people lie and the original chart shows 3.7 being 5 times worse than it actually is. 0.1% is within the statistical margin of error, they're likely the same but with Claude 4 being even more capable.
No replies