Senior Site Reliability Engineer
Greater Boston Area
1 week ago
Managing global, large-scale, multi-datacenter production applications and infrastructure.
Developing services that automatically detect and reduce service disruptions, driving towards fluid elasticity across bare-metal, private cloud and commercial cloud environments.
Helping define Site Reliability policy, process, technology and best practices, and instilling a culture of reliability within the teams by advocating for infrastructure improvements.
Monitoring high-uptime, low-latency services to ensure the best experience for our customers.
Advanced understanding of general infrastructure operations, with the ability to lead a team through incident resolution and to troubleshoot and resolve issues in a timely manner.
Participating in the on-call rotation with the rest of the team.