This is a detailed article about the outage. I also like that they show what assumptions they had (even if they turned out to be wrong), because that's exactly what real-life troubleshooting is like. You get a lot of clues, and you usually don’t find the actual solution right away.
add a skeleton here at some point
18 days ago