Get the job you really want.
Maximum of 25 job preferences reached.
Top Senior Site Reliability Engineer Jobs in Boston, MA
Artificial Intelligence • Enterprise Web • Information Technology • Machine Learning • Mobile • Software • Analytics
The Site Reliability Engineer will improve alert quality, maintain infrastructure, and enhance operational security while collaborating with teams.
Top Skills:
Cloud TechnologiesGkeKubernetes
Reposted YesterdaySaved
Easy Apply
Easy Apply
Big Data • Fintech • Mobile • Payments • Financial Services
This role involves setting technical strategies, collaborating across teams, managing operations and availability, and fostering a culture of quality and ownership within the Site Reliability Engineering team.
Top Skills:
AWSKotlinKubernetesMySQLPythonSpark
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance reliability and observability, automate processes, support engineering teams, and promote a culture of reliability at Coinbase.
Top Skills:
AWSAzureDockerEc2GCPGoKubernetesRubyTerraform
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The AWS Cloud Architect will design, build, and optimize cloud infrastructure, ensuring scalability and security while mentoring junior SREs and defining cloud strategy.
Top Skills:
AnsibleAws Api GatewayAws CloudfrontAws CloudtrailAws CloudwatchAws DocumentdbAws Ec2Aws EksAws LambdaAws RdsAws S3Aws Secrets ManagerAws SsmDockerGrafanaHashicorp ConsulHashicorp TerraformHashicorp VaultKubernetesNew RelicPrometheus
Reposted 13 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills:
AWSFluentbitGCPJaegerKubernetesAzureQuickwitSplunkVectorVictoriametrics
Reposted 14 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will empower developers by optimizing MongoDB Atlas, ensuring seamless performance across multiple cloud platforms while fostering a supportive culture.
Top Skills:
AWSGCPAzureMongoDB
Reposted 6 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills:
AnsibleAWSAzureCloudFormationGCPGoTerraform
Artificial Intelligence • Cloud • Consumer Web • eCommerce • Information Technology • Software
The Site Reliability Engineer will ensure application performance, architect monitoring tools, analyze systems, provide reliability recommendations, and support production.
Top Skills:
AnsibleCentosDatadogDockerLinuxMySQLNew RelicRhelSQL
Aerospace • Artificial Intelligence • Logistics • Machine Learning • Software • Transportation • Defense
Lead the deployment, scaling, and maintenance of the Flyways AI Platform in a secure cloud infrastructure, coding software solutions and managing complex systems.
Top Skills:
AWSCircleCIDockerGrafanaHelmJenkinsK8SPostgresPythonTerraform
Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
The Site Reliability Engineer will support engineering teams, enhance system resilience, and drive scalable infrastructure practices.
Top Skills:
Aws ServicesGrafanaHoneycombLinuxPythonTerraform
Sales • Software • Automation
Join the Infrastructure Team to build and maintain critical systems, automating database lifecycles and enhancing disaster recovery with a focus on resilience and simplicity.
Top Skills:
AnsibleArgocdAWSClickhouseDockerElasticsearchFlaskGithub ActionsGrafanaKubernetesMongoDBPostgresPythonRedisTerraform
Reposted 23 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will support, maintain and grow the Atlas platform, focusing on automating processes and running multi-cloud environments.
Top Skills:
AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will ensure systems run smoothly, work with automation tools, resolve issues, and drive operational improvements.
Top Skills:
AWSAzureCloudFormationDockerGCPGrafanaKubernetesMemcachedNew RelicOpentelemetryPostgresPrometheusPulumiRedisSentryTerraform
Fintech • Software
The SRE is responsible for building cloud-native platforms, improving application reliability, and fostering collaboration within teams.
Top Skills:
Ci/CdKubernetesOpenshiftOpenstackPrometheusSplunkVMware
Security • Software
We are looking for a Staff Site Reliability Engineer to manage AWS infrastructure, architect cloud solutions, and guide DevOps practices while ensuring system reliability and performance.
Top Skills:
AnsibleAWSCloudFormationCloudwatchDatadogDockerEc2EksElkHelmJavaKubernetesPythonS3SaltTerraformVpc
Cloud • Software
As an Associate Site Reliability Engineer, you will maintain service availability, manage incidents, and optimize performance while adhering to compliance policies.
Top Skills:
Aws,C2S,Python,Go,Unix,Red Hat Enterprise Linux,Linux,Solaris,Chef,Puppet,Jenkins,Bamboo,Spinnaker
Reposted 3 Days AgoSaved
Travel
The Senior Site Reliability Engineer will enhance platform tooling, drive automation of infrastructure components, and support teams by ensuring reliable and scalable cloud infrastructure on Google Cloud.
Top Skills:
BashDatadogGoogle Cloud PlatformHelmIstioKubernetesKustomizePythonTerraform
Cloud • Information Technology • Internet of Things • Software • Consulting • Infrastructure as a Service (IaaS) • Automation
Design, automate, and support OpenShift-based platforms, ensuring reliability and security while onboarding new managed services and handling incident responses.
Top Skills:
ArgoGoGrafanaJenkinsKubernetesLinuxOpenshiftPrometheusPythonTekton
On-Demand • Security • Software
The Site Reliability Engineer is responsible for maintaining server and network infrastructure health, monitoring operations, tracking assets, and collaborating with IT and Engineering teams.
Top Skills:
Asset Tracking SoftwareDastMonitoring ToolsSastSca
Enterprise Web • Hardware • Internet of Things • Software
The Senior Site Reliability Engineer will mentor teams on observability practices, architect systems for growth, automate developer tasks, and debug production issues.
Top Skills:
GoKubernetesLgtm StackOpentelemetryPrometheusTypescript
AdTech • eCommerce • Food • Marketing Tech • Retail
The Senior Site Reliability Engineer ensures system reliability and performance through automation and operational processes in a cloud-native environment, mentoring junior engineers and collaborating with cross-functional teams.
Top Skills:
AksArgocdAWSAzureBashDatadogDockerElkGCPGithub ActionsGoJavaKafkaKubernetesPrometheusPythonRedisSpring BootTerraformTomcat
Information Technology • Energy
As a Site Reliability Engineer, you will design high-availability systems, maintain security, troubleshoot production issues, and mentor the development team while ensuring best practices in infrastructure management.
Top Skills:
CloudInfrastructure-As-CodeProgrammingScripting
Artificial Intelligence • Cloud • Fintech • Machine Learning • Mobile • Software
The Staff Site Reliability Engineer will design, implement, and optimize infrastructure for AI services, ensure reliability and performance, and drive automation and observability excellence across engineering teams.
Top Skills:
AzureAzure DevopsDockerElk StackGithub ActionsGrafanaKubernetesMimirPostgresPrometheusSQL ServerTeamcityTerraform
Blockchain • Web3
As a Site Reliability Engineer, you'll enhance observability, logging, and tracing, collaborating with engineers to optimize performance and security of infrastructure.
Top Skills:
AnsibleAWSAws CdkGCPGitGoGrafanaKubernetesLgtmLokiMimirOpentelemetryPrometheusRustSentryTempoTerraformTypescriptWebassembly
Software • Cryptocurrency
Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.
Top Skills:
Aws (Ec2Aws EksDatadogDockerIam)KubernetesOpentelemetryPulumiRdsS3Terraform
Top Boston Companies Hiring Senior Site Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results


.png)





























