Waystar

Director, Site Reliability Engineer

Posted Yesterday
In-Office
Lehi, UT
Senior level
The Director of SRE will oversee SRE teams, ensuring reliability and operational excellence of platforms, driving automation, and setting strategic direction while leading site reliability initiatives.

ABOUT THIS POSITION

We are seeking an experienced and strategic Director of Site Reliability Engineering (SRE) to lead and scale our SRE organization, overseeing four SRE teams responsible for the reliability, scalability, performance, and operational excellence of our most critical platforms and services.

This role is both highly technical and deeply people‑focused. It requires strong cloud expertise (GCP preferred, or an equivalent public cloud), hands‑on SRE experience, and a proven ability to set vision, standards, and direction across Site Reliability Engineering, Platform Engineering, and Infrastructure‑as‑Code (IaC) automation.

As a senior leader, the Director of SRE will partner closely with Engineering, Product, Architecture, Infrastructure, and Security leadership to embed reliability, automation, and resilience into every layer of the technology stack while enabling teams to move faster and more safely.

WHAT YOU'LL DO

Leadership & Strategy
  • Provide strategic leadership and oversight for four SRE teams, setting clear direction, priorities, and expectations aligned to business and engineering objectives.
  • Lead, mentor, and develop SRE managers and senior engineers, fostering a culture of accountability, operational ownership, innovation, and psychological safety.
  • Define and own the SRE and Platform Engineering strategy and roadmap, ensuring alignment with cloud transformation initiatives and long‑term organizational goals.
  • Serve as a key voice in architectural and platform decisions, influencing designs with a focus on scalability, reliability, automation, and operational efficiency.
  • Partner with executive leadership to communicate reliability posture, risks, and investment needs in clear business terms.
Reliability & Platform Engineering
  • Establish and continuously evolve SRE principles and best practices, including SLIs, SLOs, error budgets, toil management, and reliability‑driven prioritization.
  • Provide technical direction and governance across GCP (preferred) and AWS environments, ensuring consistent reliability and operational patterns.
  • Drive the evolution of Platform Engineering, enabling self‑service infrastructure and guard‑railed service delivery for application teams.
  • Own strategy and standards for Infrastructure‑as‑Code (IaC) and automation, leveraging tools such as Terraform or equivalent frameworks across cloud environments.
  • Ensure observability excellence through metrics, logging, tracing, alerting, and proactive capacity and performance management.
Incident Management & Operational Resilience
  • Provide executive leadership during large‑scale or high‑impact incidents, ensuring effective coordination, escalation, and stakeholder communication.
  • Define, refine, and scale incident management and on‑call practices, emphasizing resilience, sustainability, and rapid recovery.
  • Champion blameless postmortems, ensuring root causes are addressed and learnings are translated into systemic improvements.
  • Partner with Security and Compliance teams to ensure systems meet security, privacy, and regulatory requirements without compromising reliability.
Operational Excellence & Measurement
  • Own and report on reliability metrics, operational KPIs, and service health for leadership and executive stakeholders.
  • Drive continuous improvement through reliability reviews, retrospectives, and data‑driven decision‑making.
  • Balance reliability, velocity, and cost across platforms, applying error budgets and capacity planning to guide trade‑offs.

WHAT YOU'LL NEED

  • 10+ years of experience in SRE, infrastructure, platform, or systems engineering roles, with 5+ years leading managers and senior technical teams.
  • Direct, hands‑on experience in Site Reliability Engineering, including operating production systems at scale.
  • Strong experience with Google Cloud Platform (GCP) or equivalent public cloud (AWS or Azure), including distributed, cloud‑native architectures.
  • Proven expertise in Infrastructure‑as‑Code (IaC) and automation frameworks (e.g., Terraform or similar).
  • Deep understanding of observability ecosystems (metrics, logging, tracing), CI/CD pipelines, and DevOps/SRE tooling.
  • Ability to communicate complex technical concepts clearly to both technical and non‑technical stakeholders, influencing at all levels of the organization.
AI & Innovation Mindset
  • Leverage AI‑assisted tools and platforms to improve operational efficiency, incident response, reliability analysis, and engineering workflows.
  • Champion experimentation and continuous learning, applying emerging technologies to modernize reliability and platform practices.
  • Enable teams to responsibly adopt AI capabilities while maintaining reliability, security, and governance standards.
Preferred Qualifications
  • Experience with Kubernetes, microservices architectures, and service meshes.
  • Familiarity with chaos engineering, resilience testing, and failure injection methodologies.
  • Background in performance engineering, capacity planning, or large‑scale platform migrations.
  • Experience leading reliability or platform initiatives during major cloud or organizational transformations.

ABOUT WAYSTAR

Through a smart platform and better experience, Waystar helps providers simplify healthcare payments and yield powerful results throughout the complete revenue cycle.

Waystar’s healthcare payments platform combines innovative, cloud-based technology, robust data, and unparalleled client support to streamline workflows and improve financials so providers can focus on what matters most: their patients and communities. Waystar is trusted by 1M+ providers, 1K+ hospitals and health systems, and is connected to over 5K commercial and Medicaid/Medicare payers. We are deeply committed to living out our organizational values: honesty; kindness; passion; curiosity; fanatical focus; best work, always; making it happen; and joyful, optimistic & fun.

Waystar products have won multiple Best in KLAS® or Category Leader awards since 2010 and earned multiple #1 rankings from Black Book™ surveys since 2012. The Waystar platform supports more than 500,000 providers, 1,000 health systems and hospitals, and 5,000 payers and health plans. For more information, visit waystar.com or follow @Waystar on Twitter.

WAYSTAR PERKS

  • Competitive total rewards (base salary + bonus, if applicable)
  • Customizable benefits package (3 medical plans with Health Savings Account company match)
  • Generous paid time off for non-exempt team members, starting at 3 weeks + 13 paid holidays, including 2 personal floating holidays; flexible time off for exempt team members + 13 paid holidays
  • Paid parental leave (including maternity + paternity leave)
  • Education assistance opportunities and free LinkedIn Learning access
  • Free mental health and family planning programs, including adoption assistance and fertility support
  • 401(k) program with company match
  • Pet insurance
  • Employee resource groups

Waystar is proud to be an equal opportunity workplace. We celebrate, value, and support diversity and inclusion. Qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, marital status, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.

This applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.

Top Skills

AI
AWS
GCP
Kubernetes
Terraform
