tru 8 months ago
Perplexity: 37% wrong; ChatGPT: 67% wrong; Grok 3: 94% wrong. Those are the free options. The paid options are wrong more frequently.
“…rather than declining to respond when they lacked reliable information, the models frequently provided confabulations—plausible-sounding incorrect or speculative answers.”