SecurityScorecard

Senior Site Reliability Engineer

Posted 19 Days Ago
In-Office
Austin, TX
100K-150K
Senior level
As a Staff Site Reliability Engineer, you will design, optimize, and maintain Kubernetes infrastructure and CI/CD systems while collaborating with teams to enhance automation and reliability.

About SecurityScorecard:

SecurityScorecard is the global leader in cybersecurity ratings, with over 12 million companies continuously rated, operating in 64 countries. Founded in 2013 by security and risk experts Dr. Alex Yampolskiy and Sam Kassoumeh and funded by world-class investors, SecurityScorecard’s patented rating technology is used by over 25,000 organizations for self-monitoring, third-party risk management, board reporting, and cyber insurance underwriting, making all organizations more resilient by allowing them to easily find and fix cybersecurity risks across their digital footprint.

Headquartered in New York City, our culture has been recognized by Inc Magazine as a "Best Workplace," by Crain’s NY as a "Best Places to Work in NYC," and as one of the 10 hottest SaaS startups in New York for two years in a row. Most recently, SecurityScorecard was named to Fast Company’s annual list of the World’s Most Innovative Companies for 2023 and to the Achievers 50 Most Engaged Workplaces in 2023 award recognizing “forward-thinking employers for their unwavering commitment to employee engagement.” SecurityScorecard is proud to be funded by world-class investors including Silver Lake Waterman, Moody’s, Sequoia Capital, GV and Riverwood Capital.

Role Overview

As a Staff Site Reliability Engineer, you will be a key technical leader driving the design, implementation, and optimization of our Kubernetes-based infrastructure and CI/CD systems. You’ll work hands-on with engineering teams to accelerate delivery, ensure production reliability, and embed best practices for automation, observability, and resilience. This role requires both strong technical depth and the ability to collaborate across multiple teams to guide large-scale infrastructure and platform initiatives.

Key Responsibilities

  • Design, build, and scale Kubernetes infrastructure to support secure, multi-tenant, high-availability applications.
  • Lead efforts to optimize and maintain CI/CD pipelines, improving reliability, speed, and rollback safety for production deployments.
  • Collaborate with developers to implement progressive delivery strategies, including blue/green and canary deployments.
  • Improve Infrastructure as Code practices with tools like Terraform, Helm, and Argo CD, and help define reusable patterns for the broader org.
  • Operate and optimize data streaming and analytics infrastructure, including Kafka, Flink, and ClickHouse, to ensure reliable, scalable pipelines for real-time and batch workloads.
  • Build and enforce automated testing strategies (unit, integration, performance) within the CI/CD lifecycle.
  • Partner with development and platform teams to improve system observability, define SLOs, and establish meaningful alerts and dashboards.
  • Actively contribute to incident response efforts and postmortems, with a focus on root cause analysis and sustainable remediation.
  • Mentor engineers across teams, sharing deep knowledge of Kubernetes, CI/CD, and cloud infrastructure.

Qualifications

  • 6+ years of experience in SRE, DevOps, or Infrastructure roles, including significant experience in production Kubernetes environments.
  • Proven success building and maintaining CI/CD pipelines using tools such as GitHub Actions, Jenkins, GitLab CI, or Spinnaker.
  • Strong hands-on experience with Kubernetes internals (networking, scaling, RBAC, etc.) and cloud-managed services like EKS, GKE, or AKS.
  • Expertise with Infrastructure as Code (Terraform, Helm, Pulumi) and GitOps workflows.
  • Solid experience with test automation tools and integrating testing into the CI/CD lifecycle.
  • Proficient in scripting or programming languages such as Python, Bash, or Go.
  • Knowledge of monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, OpenTelemetry).
  • Practical experience with Kafka (event streaming), Flink (real-time stream processing), and ClickHouse (high-performance analytics database) in production environments.
  • Strong communication and collaboration skills to work effectively with product engineering, security, and platform teams.

Nice-to-Have

  • Experience operating multi-region or multi-cluster Kubernetes environments.
  • Exposure to chaos engineering, resilience testing, or traffic shaping strategies.
  • Familiarity with security scanning, compliance automation, or infrastructure policy-as-code.
  • Contributions to open-source Kubernetes tools or CI/CD platforms.
  • Familiarity with JVM and Node.js-based services.

Benefits:
Benefits vary by country and include a competitive salary, stock options, health benefits, unlimited PTO, parental leave, tuition reimbursement, and much more!

The estimated total compensation range for this position is $100,000 - $205,000 (base plus bonus). Actual compensation for the position is based on a variety of factors, including, but not limited to, affordability, skills, qualifications and experience, and may vary from the range. In addition to base salary, employees may also be eligible for annual performance-based incentive compensation awards and equity, among other company benefits.

SecurityScorecard is committed to Equal Employment Opportunity and embraces diversity. We believe that our team is strengthened through hiring and retaining employees with diverse backgrounds, skill sets, ideas, and perspectives. We make hiring decisions based on merit and do not discriminate based on race, color, religion, national origin, sex or gender (including pregnancy), gender identity or expression (including transgender status), sexual orientation, age, marital, veteran, or disability status, or any other protected category in accordance with applicable law.

We also consider qualified applicants regardless of criminal histories, in accordance with applicable law. We are committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures. If you need assistance or accommodation due to a disability, please contact [email protected].

Any information you submit to SecurityScorecard as part of your application will be processed in accordance with the Company’s privacy policy and applicable law. 

SecurityScorecard does not accept unsolicited resumes from employment agencies. Please note that we do not provide immigration sponsorship for this position.

Top Skills

Argo CD
Bash
CI/CD
ClickHouse
Datadog
Flink
GitHub Actions
GitLab CI
Go
Grafana
Helm
Jenkins
Kafka
Kubernetes
OpenTelemetry
Prometheus
Python
Spinnaker
Terraform
