Interesting argument that the architectural blocks of language models are almost always surjective, implying that any given output text has at least one corresponding input which can generate it
(They do note that *finding* that input isn't necessarily easy, and assume you control input embeddings)
add a skeleton here at some point
about 1 month ago