Top Remote Senior Site Reliability Engineer Jobs in Boston, MA
Computer Vision • Machine Learning • Software
As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.
Top Skills:
AWS, Azure, C, Datadog, GCP, Go, Grafana, Helm, Java, Kubernetes, Prometheus, Rust, Terraform
Artificial Intelligence • Healthtech • Software
The Staff Site Reliability Engineer will lead the reliability of production systems by defining SRE practices, improving observability, and ensuring fault-tolerance in cloud environments.
Top Skills:
AWS, Go, Kubernetes, Postgres, Python, Terraform, Typescript
Digital Media • Social Media • Software • Sports
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
Top Skills:
Aws Eks, Datadog, Github Actions, Go, Istio, K6, Kubernetes, Node.js, Terraform
Software
As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.
Top Skills:
AWS, Datadog, Grafana, Kubernetes, Opentelemetry, Prometheus, Typescript
Edtech
The Lead Software Engineer will lead the SRE team, focusing on reliability, performance optimization, security, and mentoring developers, while improving overall platform resilience.
Top Skills:
Activejob, Ansible, AWS, Aws Cloudwatch, Ec2, Ecs, Elasticsearch, Git, GCP, Google Cloud Stackdriver, Jenkins, JIRA, Kubernetes, Memcached, MongoDB, New Relic, Node.js, Postgres, Redis, Ruby On Rails, Sidekiq, Spinnaker, Terraform, Terragrunt
Information Technology • Legal Tech
The role involves maintaining and improving Azure infrastructure, managing Infrastructure as Code with Terraform, enhancing security measures, and operating CI/CD pipelines.
Top Skills:
Azure, Azure Devops, Bash, CircleCI, Datadog, Efk, Elk, Github Actions, Powershell, Python, Terraform
News + Entertainment
As an Ads Reliability Engineer, you will ensure the reliability of Netflix's Ad Suite by designing scalable infrastructure, collaborating with teams, and implementing automation for monitoring and incident response.
Top Skills:
AWS, Azure, GCP, Go, Java, Kubernetes, Python, Terraform
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
The Senior Director of SRE leads and defines reliability and operational excellence across products, manages the SRE team, and scales reliability practices within the organization.
Top Skills:
AWS, Azure, Cloud-Native Networking, Distributed Systems, GCP, Kubernetes, Microservices, Site Reliability Engineering Principles
Information Technology • Software • Cryptocurrency • Web3
The Senior Site Reliability Engineer will design, build, and manage Azure infrastructure for HashSphere, ensuring secure and scalable deployments while enhancing system reliability and operational excellence in partnership with cross-functional teams.
Top Skills:
Argo, Azure, Go, Grafana, Kubernetes, Prometheus, Python, Spacelift, Terraform
Big Data • Information Technology • Security • Software
The Senior Developer will drive observability roadmaps using SRE Golden Signals, establish monitoring strategies, enhance system reliability, and act as an expert in New Relic technology for performance management.
Top Skills:
Bash, Cri-O, Csh, Kubernetes, New Relic, Perl, Windows Powershell
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves supporting network infrastructure, automating cloud infrastructure, managing CI/CD workflows, and ensuring operational excellence in IT support, including incident response and security practices.
Top Skills:
Ansible, AWS, Bash, Docker, Git, Kubernetes, Python, Ruby, Terraform
Security • Software
The Senior Site Reliability Engineer will manage AWS infrastructure, automate deployment, ensure architecture meets requirements, and develop tools for reliability.
Top Skills:
Ansible, AWS, C#, C++, CloudFormation, Cloudwatch, Datadog, Docker, Ec2, Eks, Elk, Grafana, Java, Python, S3, Terraform, Vpc
Software • Cryptocurrency
Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.
Top Skills:
Aws Ec2, Aws Eks, Datadog, Docker, Iam, Kubernetes, Opentelemetry, Pulumi, Rds, S3, Terraform
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, enhance monitoring and database infrastructure, and collaborate on scalable systems to maintain reliability as usage scales.
Top Skills:
AWS, Clickhouse, Kubernetes, MySQL, Postgres, Redis
Blockchain • Web3
As a Site Reliability Engineer, you'll enhance observability, logging, and tracing, collaborating with engineers to optimize performance and security of infrastructure.
Top Skills:
Ansible, AWS, Aws Cdk, GCP, Git, Go, Grafana, Kubernetes, Lgtm, Loki, Mimir, Opentelemetry, Prometheus, Rust, Sentry, Tempo, Terraform, Typescript, Webassembly
Cloud • Information Technology
As a Staff Site Reliability Engineer, you will enhance cloud product lines, ensuring real-time scalability, collaborating with teams, and automating builds.
Top Skills:
Ansible, AWS, Azure, Bash, Dns, Docker, Envoy, GCP, Git, Go, Grafana, Haproxy, HTTP, Jenkins, Kafka, Kubernetes, Linux, MySQL, Oci, Opentelemetry, Postgres, Prometheus, Puppet, Python, Redis, Tcp/Ip, Telegraf, Terraform, Tls
Cloud • Information Technology
As a Staff Platform Engineer, you'll develop and maintain infrastructure components using Go and Node.js, improve service reliability, mentor juniors, and manage data ecosystems.
Top Skills:
Envoy, Express, Go, Jenkins, Kafka, MySQL, Node.js, Postgres, Puppet, Python, React, Redis
Sports
Manage and improve the AWS infrastructure, deploy into new regions, monitor releases, and implement new technologies in a fast-paced environment.
Top Skills:
AWS, Docker, Grafana, Kubernetes, Prometheus, Python
Professional Services • Analytics
The Site Reliability Engineer will ensure the performance and availability of Crunchafi's cloud-based SaaS platform by building infrastructure, monitoring systems, and automating operational tasks. They will manage Azure services, CI/CD pipelines, and incident responses while collaborating with cross-functional teams to enhance reliability.
Top Skills:
App Services, Arm Templates, Azure Monitor, Azure Sql, Bash, Bicep, C#, Docker, Github Actions, Go, Kubernetes, Azure, Powershell, Python, Terraform, Virtual Networks
Security • Cybersecurity
The Staff Site Reliability Engineer will lead reliability strategy, architecture, and incident response while mentoring engineers and improving operational excellence.
Top Skills:
AWS, Ci/Cd, Github Actions, JavaScript, Python, Ruby, Terraform
Information Technology • Security • Cybersecurity
The Staff/Principal Site Reliability Engineer leads infrastructure initiatives, architects solutions for cloud and SaaS, and collaborates cross-functionally to enhance reliability and innovation.
Top Skills:
AWS, Bash, Bazel, Cuelang, Datadog, Gitops, Go, Grafana, Helm, Kubernetes, Linux, Prometheus, Python, Terraform
Blockchain • Software
As a Senior Engineer, SRE/DevOps, you will enhance blockchain infrastructure reliability, automate deployment, and collaborate on CI/CD practices while ensuring security and performance optimization.
Top Skills:
Ansible, AWS, Bash, Cloudtrail, Cloudwatch, Cosmos, Docker, Elk-Stack, Ethereum, GCP, K8S, Kubernetes, Opsgenie, Pingdom, Python, Terraform
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills:
AWS, Azure, CircleCI, GCP, Github Actions, Gitlab Ci, Go, Kubernetes, Terraform
Cloud
Join Arista Networks as a Site Reliability Engineer to manage CloudVision service reliability, scalability, and stability in a FedRAMP environment, focusing on areas like architecture, security, and performance optimization.
Top Skills:
Ansible, Bash, GCP, Gke, Go, Kubernetes, Pulumi, Python
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills:
AWS, Docker, GCP, Kubernetes