Job Title, Company or Keyword

Maximum of 25 job preferences reached.

Top Remote Senior Site Reliability Engineer Jobs in Boston, MA

Nebius

Site Reliability Engineer

Reposted 16 Days AgoSaved

Remote

United States

100K-140K Annually

Mid level

100K-140K Annually

Mid level

Artificial Intelligence • Information Technology • Consulting

The Linux Systems Administrator will maintain and troubleshoot Linux systems, support network services, and work on systems integration while collaborating with infrastructure teams.

Top Skills: DhcpDnsLinuxNtpPython

Strike (simplistic.com)

Site Reliability Engineer

Reposted 16 Days AgoSaved

Remote

USA

Senior level

Information Technology • Cryptocurrency

The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.

Top Skills: ArgocdBashElk StackGCPGoGrafanaHelmKubernetesPrometheusPythonTerraform

Kraken Digital Asset Exchange

Site Reliability Engineer - AI Agents

17 Days AgoSaved

Remote

United States

96K-192K Annually

Senior level

96K-192K Annually

Senior level

Blockchain • Financial Services • Cryptocurrency • Web3

Design, build, and operate scalable, observable infrastructure for AI agent workflows. Build platform services, APIs, and SDKs; manage cloud, Kubernetes, and model-serving compute; implement IaC, CI/CD, monitoring, incident response, security controls, and runbooks; collaborate with AI and data teams to productionize agent prototypes.

Top Skills: AWSBashCi/CdDockerKubernetesPythonTerraform

PTC

Principal Software Engineer-SRE

Reposted 17 Days AgoSaved

Remote

USA

113K-175K Annually

Senior level

113K-175K Annually

Senior level

Information Technology • Internet of Things • Software • Virtual Reality

Lead reliability, availability, and resiliency strategies for large-scale systems, drive operational excellence, and provide technical mentorship across engineering teams.

Top Skills: AWSCi/CdJavaMongoDBRabbitMQZookeeper

Veeam

GOV Site Reliability Engineer

18 Days AgoSaved

Remote

United States

152K-253K Annually

Mid level

152K-253K Annually

Mid level

Cloud • Security • Software • Cybersecurity

Join the GOV/Sovereign Cloud SRE team to maintain and improve reliability for the Veeam Data Cloud. Responsibilities include incident response, SLIs/SLOs, observability (monitoring, alerting, dashboards), runbooks and documentation, IaC and CI/CD work in compliance-restricted environments, and participation in on-call rotations. Collaborate with engineering, security, and compliance teams to implement high availability and automation.

Top Skills: ArgocdAzureAzure DevopsAzure GovernmentC#Elk StackGithub ActionsGitlab CiGoGrafanaJavaJavaScriptKubernetesOpentelemetryPrometheusPulumiTerraformTerragruntTypescript

HiBob

Senior Site Reliability Engineer - Remote EST

Reposted 23 Days AgoSaved

Remote or Hybrid

United States

190K-235K Annually

Senior level

190K-235K Annually

Senior level

HR Tech • Information Technology • Professional Services • Sales • Software

Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.

Top Skills: Ai/LlmArgocdAWSCi/CdDatadogGithub ActionsGitopsGoKubernetesPython

Yahoo

Software Engineer , SRE Tooling & Reliability Platforms

Reposted 18 Days AgoSaved

Remote

United States of America

89K-184K Annually

Entry level

89K-184K Annually

Entry level

AdTech • Digital Media • Information Technology • Other

As a Software Engineer in the Tooling and Reliability Platforms team, you'll develop AI services, manage incident tools, and utilize Infrastructure as Code for high-availability systems. You'll focus on integrating AI workflows and improving operational resilience for Yahoo's brands.

Top Skills: AWSCloudFormationDockerGCPGoJavaKubernetesPythonTerraform

TERN Group

Site Reliability Engineer

19 Days AgoSaved

Remote

United States

175K-200K Annually

Senior level

175K-200K Annually

Senior level

Artificial Intelligence • Healthtech • HR Tech • Software

Own the Heroku-to-GCP migration, maintain Postgres and data pipelines, optimize high‑traffic code paths, build monitoring/alerting, lead incident response and post‑mortems, reduce costs and scale proactively, and coach other infrastructure engineers.

Top Skills: AppsignalBigQueryBugsnagCannyClaude CodeFivetranGoogle Cloud PlatformHerokuHexHotwireInfrastructure-As-CodePostgresRuby On Rails

Airbnb

Engineering Manager, Storage SRE

Reposted 19 Days AgoSaved

Remote

United States

212K-265K Annually

Expert/Leader

212K-265K Annually

Expert/Leader

Real Estate • Travel • PropTech

The Engineering Manager for Storage SRE will lead a team to ensure reliable database operations, improve developer experience, and expand tooling and operational models, focusing on mission-critical systems.

Top Skills: Cloud InfrastructureDatabasesSite Reliability EngineeringStorage Systems

Oscilar

Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)

Reposted 20 Days AgoSaved

Remote

USA

Senior level

Artificial Intelligence • Fintech • Software • Financial Services

The SRE will own reliability for a cloud-native platform, optimizing performance, availability, and observability, while mentoring engineering teams.

Top Skills: AWSClickhouseGoKafkaKubernetesPulumiPythonTerraform

Aalyria

Site Reliability Engineer

Reposted 21 Days AgoSaved

Remote

United States

115K-135K Annually

Mid level

115K-135K Annually

Mid level

Aerospace • Manufacturing

As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.

Top Skills: ArgocdAWSElkGCPGoGrafanaIstioJaegerKubernetesLinkerdLokiOpentelemetryPrometheusPythonTempoTerraform

Tekmetric

Site Reliability Engineer

Reposted 21 Days AgoSaved

Remote

United States

Senior level

Automotive

Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.

Top Skills: AWSBashCi/CdDockerElk StackGCPGrafanaKubernetesPrometheusPythonTerraform

New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free

Canonical

Site Reliability Engineer

Reposted 22 Days AgoSaved

In-Office or Remote

United States

200K-200K Annually

Mid level

200K-200K Annually

Mid level

Cloud • Software

The Site Reliability Engineer will ensure reliable cloud operations by applying Python for infrastructure automation, managing OpenStack and Kubernetes, and practicing devsecops in a fast-paced environment.

Top Skills: KubernetesLinuxOpenstackPython

Elastic

Site Reliability Engineer (Hosted Infra) - Platform

23 Days AgoSaved

Remote

United States

143K-175K Annually

Mid level

143K-175K Annually

Mid level

Cloud • Security • Software • Generative AI

Design, build, and automate large-scale multi-cloud infrastructure and internal SRE tools. Improve host lifecycle, observability, alerting, and reliability; operate containerized workloads; participate in on-call rotations, incident response, runbooks, postmortems, code reviews, and mentoring.

Top Skills: AnsibleArgo CdArgo WorkflowsCueDockerElastic StackGoGraphiteInfluxKubernetesLinuxPrometheusPuppetTerraformUbuntuUbuntu Live Patch

Yugabyte

Staff Site Reliability Engineer

Reposted 23 Days AgoSaved

Remote

United States

220K-250K Annually

Expert/Leader

220K-250K Annually

Expert/Leader

Cloud • Software • Database

Lead design, build, and operate the YugabyteDB DBaaS infrastructure. Drive architecture, automate lifecycle and maintenance, manage incidents and on-call rotations, implement security/encryption processes, and optimize reliability using SRE principles and observability.

Top Skills: AksAnsibleAWSAzureBashDockerEksGCPGitGithub ActionsGkeJavaKubernetesLinuxPostgresPrometheusPythonShellTerraform

Wikimedia Foundation

Senior Site Reliability Engineer, Wikimedia Enterprise

Reposted YesterdaySaved

Remote

USA

117K-181K Annually

Senior level

117K-181K Annually

Senior level

Other • Social Impact

As a Senior Site Reliability Engineer, you will design, develop, and maintain reliable infrastructure for Wikimedia's API services, ensuring performance and availability while driving reliability engineering practices and improving developer experience.

Top Skills: AnsibleArgocdAWSAzureGCPGitlabGoKubernetesOpentelemetryPrometheusPythonTerraform

Wikimedia Foundation

Senior Site Reliability Engineer, Data Persistence

Reposted YesterdaySaved

Remote

USA

113K-176K Annually

Senior level

113K-176K Annually

Senior level

Other • Social Impact

The Senior Site Reliability Engineer is responsible for maintaining Wikimedia's infrastructure, improving reliability, automating processes, and collaborating with teams. The role involves troubleshooting, managing deployments, and leading incident responses while working remotely.

Top Skills: AnsibleBashCassandraDebianGoGrafanaHhvmKubernetesMariadbMemcachedPHPPrometheusPuppetPythonRedisRubyShell

Alkami

Sr Site Reliability Engineer - Release

2 Days AgoSaved

Remote

110K-137K Annually

Senior level

110K-137K Annually

Senior level

Financial Services

Prototype, write, test, document, and deploy release automation across environments. Build and maintain pipelines, collaborate with engineers and product teams, troubleshoot issues, participate in on-call rotation, and improve software delivery, configuration, monitoring, and operations.

Top Skills: AnsibleBashDockerGitlabJenkinsKubernetesMssqlPostgresPowershellPythonRedisTeamcity

Socure

Senior Software Engineer - SRE

25 Days AgoSaved

Remote or Hybrid

160K-180K Annually

Senior level

160K-180K Annually

Senior level

Artificial Intelligence • Machine Learning • Software • Analytics

The role involves end-to-end ownership of AWS infrastructure, managing Kubernetes platforms, and ensuring system reliability through observability and automation. Responsibilities include incident response and maintaining CI/CD systems.

Top Skills: ArgocdAWSDatadogGitGoKubernetesPythonTerraform

Andromeda (andromeda.ai)

Site Reliability Engineer - AI Infrastructure

Reposted 25 Days AgoSaved

In-Office or Remote

United States

Senior level

Artificial Intelligence • Cloud • Information Technology • Software

The Site Reliability Engineer will provision and manage Kubernetes clusters, build automation tools, debug customer issues, and improve infrastructure reliability.

Top Skills: AnsibleBashDatadogGoGrafanaHelmKubernetesLokiPrometheusPythonTerraform

CargoSprint

Director of DevOps and Site Reliability Engineering (SRE)

Reposted 25 Days AgoSaved

Remote

United States

Senior level

Logistics • Software • Transportation

Lead and mentor teams in DevOps and SRE, architect scalable Azure Cloud infrastructure, implement CI/CD and IaC, ensure database reliability, and drive cross-functional collaboration.

Top Skills: Azure CloudAzure DevopsCi/CdCosmosdbDockerElkGrafanaKubernetesMySQLPostgresPrometheusRedisSQL ServerTerraform

Binance

Senior Site Reliability Engineer (Node.js & Javascript), Trading Technologies

3 Days AgoSaved

In-Office or Remote

United States

Senior level

Blockchain • Fintech • Software • Cryptocurrency • Metaverse

Design, build, and maintain internal monitoring and alerting for high-load real-time systems; automate production testing; troubleshoot and resolve performance issues; coordinate cross-team incident resolution; recommend architectural and process improvements; research vendor solutions and enforce security best practices.

Top Skills: AWSGCPJavaScriptLinuxNode.jsRest ApiWebsockets

CertifyOS

Senior Site Reliability Engineer

3 Days AgoSaved

Remote

Senior level

Healthtech • Social Impact • Software

Own the operational lifecycle of cloud-native data infrastructure: design and automate reliable deployments, observability, incident response, SLIs/SLOs, autoscaling and IaC, and improve platform efficiency and data freshness across GKE and Cloud Run.

Top Skills: BashBigQueryCloud BuildCloud MonitoringCloud RunDatadogDockerGCPGithub ActionsGkeGoGrafanaJIRAKubernetesPrometheusPulumiPythonSentrySlackSnykSonarqubeTerraform

RELX

Senior Site Reliability Engineer

3 Days AgoSaved

In-Office or Remote

5 Locations

105K-198K Annually

Senior level

105K-198K Annually

Senior level

Information Technology • Legal Tech • Analytics

Design, deploy, and maintain highly available Kubernetes clusters on AWS EKS; manage and optimize cloud infrastructure; develop IaC and automation; implement CI/CD (GitHub Actions); monitor multi-region systems, troubleshoot incidents, perform root cause analysis; document best practices; and mentor junior engineers.

Top Skills: AWSAws EksCi/CdContainersGithub ActionsInfrastructure As CodeKubernetesNewrelicPythonRbac

Loft Orbital

Senior Site Reliability Engineer

3 Days AgoSaved

Remote or Hybrid

180K-240K Annually

Senior level

180K-240K Annually

Senior level

Aerospace • Defense

Lead design, implementation, and operation of scalable, secure hybrid-cloud infrastructure for satellite ground systems. Improve developer experience, automate CI/CD and IaC, own observability, troubleshoot reliability issues, and collaborate with developers and satellite operators to advance SatDevOps practices.

Top Skills: C/C++Ci/CdGCPGoGrafanaInfrastructure As Code (Iac)JavaKubernetesLokiPrometheusPythonRustSoftware Defined Networking (Sdn)

Let Your Resume Do The Work

Upload your resume to be matched with jobs you're a great fit for.

All Filters

Early Applicant

JobType

New Jobs

Job Category

Experience

Industry

Company Name

Find Company

Company Size

Sign up now Access later

Create Free Account

Already have an account? Log In

Top Remote Senior Site Reliability Engineer Jobs in Boston, MA

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer - AI Agents

Principal Software Engineer-SRE

GOV Site Reliability Engineer

Senior Site Reliability Engineer - Remote EST

Software Engineer , SRE Tooling & Reliability Platforms

Site Reliability Engineer

Engineering Manager, Storage SRE

Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)

Site Reliability Engineer

Site Reliability Engineer

Cut your apply time in half.

Site Reliability Engineer

Site Reliability Engineer (Hosted Infra) - Platform

Staff Site Reliability Engineer

Senior Site Reliability Engineer, Wikimedia Enterprise

Senior Site Reliability Engineer, Data Persistence

Sr Site Reliability Engineer - Release

Senior Software Engineer - SRE

Site Reliability Engineer - AI Infrastructure

Director of DevOps and Site Reliability Engineering (SRE)

Senior Site Reliability Engineer (Node.js & Javascript), Trading Technologies

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Top Boston, MA Companies Hiring Remote Senior Site Reliability Engineers

Loft Orbital

Popular Job Searches

Total selected ()