Our agent passed along five papers it never opened, and admitted it hadn't checked them.
So we built cite-check: fetch every link and arXiv ID in a draft, fail the publish if one isn't real, then check the source actually backs the claim.
#AIagents #LLMs
7 days ago