CVS Health

Director of Engineering - SRE & Operations

Posted Yesterday
In-Office
Wellesley, MA, USA
144K-288K Annually
Senior level
The Director of Platform Engineering - SRE & Operations will oversee reliability and operational excellence, driving strategies for SRE, AIOps, and cloud reliability while leading high-performing teams.
The summary above was generated by AI

We’re building a world of health around every individual, shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold themselves accountable and prioritize safety and quality in everything they do. Join us and be part of something bigger – helping to simplify health care one person, one family and one community at a time.

Position Summary

As the Director of Platform Engineering - SRE & Operations, you will guide the strategy, implementation, and ongoing maturity of reliability, availability, and operational excellence across key platforms within the DDAT organization. You will oversee the reliability of web, mobile, API, platform, and AI‑enabled systems, ensuring they are resilient, scalable, secure, and cost‑efficient.

You will partner closely with other engineering teams across CVS Health to embed SRE best practices and strengthen the resiliency, observability, and performance of our digital ecosystem.

Responsibilities

SRE Strategy & Reliability Leadership

  • Contribute to and execute the SRE strategy, including definition and management of SLOs, SLIs, and error budgets.
  • Establish and operationalize reliability standards across web, mobile, backend services, and data workloads.
  • Champion a culture of reliability-by-design and continuous improvement within engineering teams.

AI‑Driven Operations (AIOps) & Automation

  • Drive adoption of AIOps capabilities for intelligent alerting, proactive issue detection, and predictive failure mitigation.
  • Implement AI-assisted automation for incident triage, runbooks, root-cause analysis, and self-healing workflows.
  • Collaborate with the AI Platform team to integrate LLMs and machine learning models into operational processes.

Observability & Monitoring

  • Lead the observability roadmap spanning metrics, logs, traces, and experience monitoring.
  • Define and standardize tooling and operational practices using Datadog, Splunk, Prometheus, Grafana, and OpenTelemetry.
  • Deliver actionable dashboards and reporting for availability, performance, latency, and error budget consumption.

DevOps, CI/CD & Release Reliability

  • Partner with the DevEx and Cloud Engineering teams to strengthen CI/CD reliability, safety, and automation.
  • Promote progressive delivery (canary, blue/green, feature flags) to reduce deployment risk.
  • Ensure quality gates, automated rollback, and deployment safeguards are consistently applied.

Incident Management & Operational Excellence

  • Lead major incident response and escalation processes for critical digital platforms.
  • Improve MTTD and MTTR, and reduce incident recurrence through preventive engineering and automation.
  • Maintain operational readiness through runbooks, on‑call processes, and post‑incident learning.

Cloud Reliability & FinOps

  • Ensure reliability and scalability across on-premises, Azure, and GCP environments.
  • Collaborate with Finance and Platform teams to support FinOps practices, cost optimization, and capacity planning.
  • Optimize performance and availability across high‑traffic, customer‑facing platforms.

Leadership & Talent Development

  • Lead and develop high-performing SRE teams, including managers, engineers, and technical specialists.
  • Support career pathways, skill frameworks, and upskilling initiatives aligned to SRE disciplines.
  • Foster a culture centered on ownership, accountability, curiosity, and continuous learning.

Required Qualifications

  • 10+ years of experience in software engineering, platform operations, or site reliability engineering.
  • 5+ years in leadership roles managing SRE, DevOps, or platform reliability teams at scale.

Preferred Qualifications

  • Experience using AI/ML capabilities in operations (anomaly detection, predictive alerting, log analysis, automated remediation).
  • Hands‑on knowledge of AIOps platforms (e.g., Datadog Watchdog, Dynatrace Davis, Splunk AI, or custom ML/LLM tooling).
  • Deep expertise in cloud infrastructure, distributed systems, and high‑availability architectures.
  • Strong understanding of SRE principles, DevOps practices, and modern reliability engineering.
  • Experience running mission‑critical digital systems with large-scale user traffic.
  • Effective communication and stakeholder influence skills, including with senior technology leaders.
  • Experience working in regulated industries (e.g., healthcare, financial services, insurance).
  • Demonstrated success collaborating with platform engineering, AI teams, architecture, and cross-functional technical organizations.

Education

Bachelor's degree required. Master's degree preferred.

Pay Range

The typical pay range for this role is:

$144,200.00 - $288,400.00


This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program.

Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve, and we are committed to fostering a workplace where every colleague feels valued and feels that they belong.

Great benefits for great people

We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families.

This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility.


Additional details about available benefits are provided during the application process and on Benefits Moments.

We anticipate the application window for this opening will close on: 05/22/2026

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

CVS Health Boston, Massachusetts, USA Office

Boston, Massachusetts, United States, 02114
