The many jailbreaks that get DeepSeek to eventually figure out that something happened in Tiananmen in 1989 are fascinating, and they hint at a Chinese AI censorship regime that is very specific and fragile. Do they just throw the most obvious queries at the model and call it a day?
8 months ago