Senior SRE Manager
Now, more than ever, the Toast team is committed to our customers. We’re taking steps to help restaurants navigate these unprecedented times with technology, resources, and community. Our focus is on building the restaurant platform that helps restaurants adapt, take control, and get back to what they do best: building the businesses they love. And because our technology is purpose-built for restaurants, by restaurant people, restaurants can trust that we’ll deliver on their needs for today while investing in experiences that will power their restaurant of the future.
Toast is scaling rapidly: We expect to double the traffic to hundreds of services we run before the end of the year with significant growth beyond that. Join us as the founding Manager of the Production Engineering team that enables Toast to provide uninterrupted service to our customers during this rapid growth.
The Production Engineering team (similar to the SRE role elsewhere) is responsible for running Toast’s production services with a commitment to quality, reliability, and low latency — without needing heroics. The team accomplishes this goal by:
- Building tooling to automate, monitor, and manage deployed services
- Helping author and enforce SLAs
- Partnering with product and infrastructure teams to ensure scalable designs, uniform monitoring and adequate capacity planning for each service
- Optimizing operation of individual services through creation of runbooks, participation in on-call rotations and helping development teams troubleshoot production problems
- Consciously improving the software and processes we run by facilitating blameless postmortems
About this roll* (Responsibilities)
As a Manager of the Production Engineering team you will be responsible for the continuous improvement of the operational metrics for Toast’s systems. You will have the following tools at your disposal:
- Driving day-to-day operations of the team and contributing to the development and prioritization of the Product Engineering roadmap for major initiatives
- Enabling and mentoring engineers on your team to do the best work they can and rewarding their performance
- Establishing strong working relationships with peer infrastructure and product teams
- Influencing architecture decisions in your team and for individual services to optimize resilience and scalability
- Growing the organization through hiring and creating professional growth opportunities for the members of your team
Do you have the right ingredients*? (Requirements)
- Experience as an Engineering Manager, including hiring and cross functional collaboration
- Experience in a role with operational or production responsibility
- Deep understanding of systems, networking and scaling issues
- Direct exposure to SaaS systems, ideally in a PE/SRE/DevOps context
- Hands-on coding/troubleshooting experience
*Bread puns encouraged but not required