#EMNLP2024
1. Tools Fail: Detecting Silent Errors in Faulty Tools
Are you using tools with your LLMs? Are you assuming your tools are perfect? Assuming the LLM can just handle any errors for you? ๐ฌ
Dangerโฆ ๐จ Models trust tools over their own โknowledgeโ even for simple and well trained cases.
11 months ago