Cox employees working on campus
Cox Enterprises Logo

Cox Enterprises

Sr Lead Site Reliability & Systems Engineer

Posted 16 Hours Ago
Be an Early Applicant
Hybrid
Austin, TX
163K-272K Annually
Senior level
Hybrid
Austin, TX
163K-272K Annually
Senior level
Lead SRE and systems engineering for platform reliability and scalable infrastructure. Define SRE strategy, SLOs/SLIs, incident management, and architecture standards. Drive IaC (Terraform), cloud reliability, observability, CI/CD, chaos engineering, capacity planning, and automation. Mentor engineers, lead postmortems, and partner with product, security, and ops teams to reduce toil and improve system resilience.
The summary above was generated by AI
SENIOR LEAD SITE RELIABILITY & SYSTEMS ENGINEER
Platform Engineering | Infrastructure, Reliability & Systems Architecture
Location: Austin (candidates must be based in Austin or willing to relocate for this role)
ABOUT THE ROLE
We are seeking a Senior Lead Site Reliability & Systems Engineer - a versatile technical leader who combines deep SRE expertise with broad systems engineering capability. In this hybrid role you will drive platform reliability, operational excellence, and systems architecture across our infrastructure, ensuring our products are scalable, resilient, and delivered with high velocity. You will partner with engineering, product, and operations teams to embed reliability and sound systems design at every layer of the stack.
KEY RESPONSIBILITIES
Reliability Engineering & Incident Management
  • Define and drive the SRE strategy, roadmap, and standards across engineering teams
  • Establish and enforce SLOs, SLIs, and error budgets across all production services
  • Own the incident management lifecycle - detection, response, resolution, and prevention
  • Lead blameless postmortems and translate findings into lasting systemic improvements
  • Manage on-call rotations and aggressively reduce toil through automation

Systems Architecture & Design
  • Lead the design and evolution of large-scale, distributed systems and platform infrastructure
  • Define technical standards, architectural patterns, and engineering best practices org-wide
  • Evaluate and recommend technologies and tooling aligned to business and reliability requirements
  • Conduct architecture reviews and provide guidance on complex technical trade-offs
  • Lead capacity planning, performance engineering, and infrastructure scaling strategies

Platform & Infrastructure
  • Build and maintain highly available, fault-tolerant infrastructure on cloud platforms (AWS/GCP/Azure)
  • Drive infrastructure-as-code adoption (Terraform) and enforce best practices
  • Architect and implement observability platforms - metrics, logging, tracing, and alerting
  • Build and improve CI/CD pipelines, deployment automation, and release engineering workflows
  • Lead chaos engineering and game day exercises to validate system resilience
  • Champion automation across provisioning, testing, deployment, and monitoring workflows

Leadership, Mentorship & Collaboration
  • Mentor and grow a team of SREs, platform engineers, and systems engineers
  • Partner with DevOps, security, and product teams to align on shared platform goals
  • Serve as the technical escalation point for critical infrastructure incidents and outages
  • Communicate complex technical concepts clearly to non-technical stakeholders and leadership
  • Contribute to build vs. buy evaluations and drive strategic vendor assessments

REQUIRED QUALIFICATIONS
  • 8+ years of experience in SRE, systems engineering, platform engineering, or DevOps roles
  • 3+ years in a senior or lead capacity with ownership of large-scale, distributed systems
  • Deep expertise in at least one major cloud provider - AWS preferred
  • Strong proficiency in Python, Go, Bash, Java, or C++
  • Hands-on experience with Kubernetes, container orchestration, and service mesh technologies
  • Solid understanding of Linux/Unix internals, networking (TCP/IP, DNS, TLS/SSL, load balancing)
  • Proficiency with observability tooling: Datadog, Prometheus/Grafana, Splunk, or equivalent
  • Proven track record defining and operating against SLOs and error budgets
  • Experience with infrastructure-as-code tools - Terraform required
  • Strong understanding of distributed systems design, security fundamentals, and data governance

PREFERRED QUALIFICATIONS
  • Experience with service mesh (Istio, Linkerd) and API gateways (Kong, Apigee)
  • Background in systems integration across enterprise middleware, ERP, or CRM platforms
  • Familiarity with FinOps practices and cloud cost optimization
  • Experience in regulated industries: financial services, automotive, healthcare, or government
  • Familiarity with compliance frameworks: SOC 2, ISO 27001, or NIST
  • Track record of leading migrations - legacy-to-cloud or monolith-to-microservices
  • Relevant certifications: AWS Solutions Architect, CKA/CKAD, GCP Professional, or Red Hat RHCA

WHAT WE OFFER
Compensation & Benefits
  • Competitive base salary + annual bonus
  • Comprehensive health, dental, and vision coverage
  • 401(k) with company match
  • Generous PTO and paid parental leave

Culture & Growth
  • Flexible hybrid work model
  • Learning & development budget (conferences, certs, courses)
  • Engineering-first culture with direct product impact
  • Collaborative teams and transparent leadership

USD 163,400.00 - 272,300.00
Compensation:
Compensation includes a base salary in the range of $163,400.00 - $272,300.00. The base salary may vary within the anticipated base pay range based on factors such as the ultimate location of the position and the selected candidate's knowledge, skills, and abilities. Position may be eligible for additional compensation that may include an incentive program.
Benefits:
The Company offers eligible employees the flexibility to take as much vacation with pay as they deem consistent with their duties, the company's needs, and its obligations; seven paid holidays throughout the calendar year; and up to 160 hours of paid wellness annually for their own wellness or that of family members. Employees are also eligible for additional paid time off in the form of bereavement leave, time off to vote, jury duty leave, volunteer time off, military leave, and parental leave.
EOE, including disability/vets

Similar Jobs at Cox Enterprises

16 Hours Ago
Hybrid
135K-225K Annually
Senior level
135K-225K Annually
Senior level
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Lead technical direction for a full-stack product team: design architecture, build frontend and backend systems, own APIs and data modeling, drive performance and accessibility, set engineering standards, mentor engineers, run design/RFC processes, and partner with Product and Design to deliver high-quality software.
Top Skills: .NetA/B TestingAngularAWSAxeAzureC#ChromaticCi/CdDjangoDockerDynamoDBFastapiFeature FlaggingGCPGitflowGoGraphQLJavaKafkaKubernetesLighthouseMongoDBMySQLNode.jsOpentelemetryPostgresPythonRabbitMQReactRedisRestScreen Reader TestingSnsSpringSqsStorybookTrunk-Based DevelopmentTypescriptVue
16 Hours Ago
Remote or Hybrid
United States
112K-186K Annually
Senior level
112K-186K Annually
Senior level
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Lead translation of strategic direction into scalable sales and retention initiatives across Performance Management. Drive cross-business projects, provide analytical and project management support for account planning, product adoption, performance visibility, and revenue retention. Partner with enablement, training, finance, and analytics to execute initiatives, manage risks, and improve decision-making. Remote role requiring Eastern Time Zone residency and up to 25% travel.
Top Skills: Ai ToolsExcelPowerPointSalesforce
16 Hours Ago
Hybrid
112K-186K Annually
Senior level
112K-186K Annually
Senior level
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Lead reliability efforts for cloud-native production systems: design and operate infrastructure, define SLOs/SLIs, lead incident response, build IaC and CI/CD, improve observability and automate toil, and mentor SRE engineers.
Top Skills: AWSAzureCassandraCdnCloudFormationDnsEcsElkGCPGithub ActionsGitopsGoGrafanaJavaJenkinsKubernetesLinuxMySQLNewrelicOraclePagerdutyPostgresPrometheusPythonRedisSplunkTcp/IpTerraform

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account