The internal-systems team at Atlassian builds, well, our internal systems. This includes the order processing service for both our products and our partners; since 2008 these systems have processed over $700M in orders, but for most of that time the team consisted of only 2 (later 3) engineers, who performed all development and operations. This has driven us to (try to) build low-touch, ops-less systems. Partly this is talk about trying to extract reliability from unreliable systems in an unreliable world, but mostly it’s a list of the amusing screw-ups we made in the hope that others may avoid them. Along the way we’ll discuss
For the last 7 years Steve has worked for Atlassian as both as a sysadmin and a developer. Prior to that he worked on tanks and radars in the Outer Hebrides, telecoms systems in Hong Kong, and in startups in Australia. He now works out of Atlassian's Amsterdam offices, focusing on high-availability, continuous-deployment and monitoring issues.