loren schmidt 2 months ago
i really want us, collectively, to understand that chat format LLMs cannot assess their own behavior. if you ask it what it did, it will construct a likely response to your query and that is all.
"why did you do this" is projection. even accusing them of "lying" is projection.
add a skeleton here at some point