New working paper!
We develop an approach for safely delegating to strategically aware and potentially misaligned AI systems. The theoretical tool we use is sequential information design with imperfect recall.
A short thread on the key highlights.
add a skeleton here at some point
11 months ago