The Site Reliability Engineer will ensure system reliability and performance, design scalable architectures, improve CI/CD pipelines, maintain infrastructure, and lead incident response efforts.
About PushPress
PushPress is building the Intelligent Industry Ledger for boutique fitness.
We’re transforming how boutique gyms operate — and how the entire $100B fitness industry connects, transacts, and grows. Trusted by 5,000+ gyms and 500,000+ members, PushPress processes over $500M annually and is backed by Altos Ventures and Mucker Capital.
We're evolving from a traditional business system of record into an AI-powered Industry Ledger — an intelligent infrastructure layer that brings order to a highly fragmented boutique fitness industry. By unifying disconnected operators, workflows, and data into a single platform, we’re enabling faster decisions, new business models, cross-gym collaboration, and network effects that increase the value of every studio in our client base.
We’re a global team of builders, operators, and fitness fanatics on a mission to level the playing field for fitness entrepreneurs. If you're ready to help reshape an industry — let’s talk.
About the Role
We're seeking a Site Reliability Engineer to own the reliability and performance of systems that power 5,000+ gyms daily, process a billion dollars in payments annually, and handle 5 million class check-ins every month. This is a critical role where you'll be responsible for infrastructure that directly impacts thousands of businesses and millions of their members. You'll work with modern technologies including AWS, Kubernetes, ArgoCD, GitHub Actions, and Terraform to build and maintain highly available, scalable systems. This is an opportunity to join during a high-growth phase where you'll have significant influence over our reliability practices, infrastructure architecture, and operational excellence standards. Our ideal candidate has a strong ownership mindset, is highly cross-functional and adaptable, and thinks beyond the conventional boundaries of traditional SRE work.
What You'll Do
- Ensure the reliability, performance, and availability of PushPress's production systems.
- Design and implement scalable, fault-tolerant, and efficient architectures on AWS using Kubernetes and Terraform.
- Own and continuously improve our CI/CD pipeline using GitHub Actions and ArgoCD with the goal of fast, reliable, and secure deployments.
- Maintain and optimize our developer and test infrastructure to enable efficient software development and testing processes.
- Develop comprehensive monitoring, logging, and alerting systems to proactively identify and resolve issues before they impact our customers.
- Lead incident response efforts and conduct thorough post-mortems to prevent future occurrences.
- Partner with engineering teams to build reliability into new features and services from day one.
- Continuously optimize our infrastructure costs while maintaining high performance and reliability at scale.
What You Need
- A minimum of 3 years of experience in Site Reliability Engineering, designing and managing large-scale, distributed systems on AWS.
- Proficiency in one or more programming languages, such as Python, Go, or JavaScript.
- Deep knowledge of Kubernetes, Terraform, and GitOps practices with ArgoCD.
- Experience building and maintaining CI/CD pipelines using GitHub Actions or similar tools.
- Strong infrastructure as code experience with Terraform in production environments.
- Experience with modern observability tools like Datadog, Prometheus, or similar monitoring platforms.
- Familiarity with containerization technologies like Docker and container orchestration at scale.
- Understanding of high-volume payment processing systems and their reliability requirements is a plus.
- Excellent problem-solving, communication, and collaboration skills with the ability to work effectively across teams.
PushPress is dedicated to fostering an inclusive and dynamic workplace. We’re all about leveling up, and that means we don’t tolerate any form of discrimination or harassment. We’re committed to providing equal opportunities, regardless of race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability, genetic information, veteran status, or any other legally protected characteristic.
At PushPress, we’re dedicated to helping both our technology and our team reach peak performance. Whether it’s with your proactive approach, eye for detail, or drive to make a meaningful impact, we’d love to hear from you. At PushPress, we’re all about pushing boundaries and achieving new personal bests—come join us and be part of our fitness-tech journey!
Top Skills
ArgoCD
AWS
Datadog
Docker
GitHub Actions
Go
JavaScript
Kubernetes
Prometheus
Python
Terraform



