Top Senior Site Reliability Engineer Jobs in Boston, MA
Artificial Intelligence • Software
As a Senior Staff Site Reliability Engineer, you will enhance system reliability and performance, lead incident management, analyze capacity needs, and mentor junior engineers.
Top Skills:
Ansible, AWS, Azure, Bash, Chef, CircleCI, CloudFormation, Datadog, Docker, Elk, GCP, Gitlab, Go, Grafana, Jenkins, Kubernetes, Prometheus, Puppet, Python, Terraform
Artificial Intelligence • Fintech • Information Technology • Software • Data Privacy
The Senior Site Reliability Engineer ensures SaaS products are stable and optimized, focusing on automation, monitoring, and collaboration within teams to maintain high service quality.
Top Skills:
Aks, Ansible, Appdynamics, Azure Devops, Bash, C# .Net, Cosmos, Datadog, Dynatrace, Eks, Harness, Idera Sql Diagnostic Manager, Java, Jenkins, Kubernetes, New Relic, Powershell, Python, Redgate Sql Monitor, Solarwinds Database Performance Analyzer, SQL, Terraform
Blockchain • Software
The Senior Engineer, SRE/DevOps will ensure the reliability and security of blockchain infrastructure by automating processes and collaborating with teams.
Top Skills:
Ansible, AWS, Bash, Cloudtrail, Cloudwatch, Cosmos, Docker, Elasticsearch, Elk-Stack, Ethereum, GCP, K8S, MySQL, Opsgenie, Pagerduty, Pingdom, Python, Terraform
Information Technology • Cryptocurrency
The Head of SRE will lead the SRE team, defining strategy, ensuring system reliability, and driving operational excellence while mentoring staff.
Top Skills:
Argocd, Bash, Elk Stack, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Information Technology • Software
The Site Reliability Engineer will design and maintain resilient infrastructure for a SaaS platform, ensuring security and performance through AWS services and effective monitoring.
Top Skills:
Api Gateway, Aurora Serverless, AWS, Cloudwatch, Fusionauth, Grafana, Guardduty, Lambda, Opensearch Serverless, Prometheus, Secrets Manager, Shield, Terraform, Waf
Blockchain • Software
The Site Reliability Engineer Lead will oversee build and deployment cycles, improve automation tools, and ensure highly available systems, while leading a diverse team in a remote setting.
Top Skills:
Ansible, AWS, Azure, Bash, Buildkite, Chef, Datadog, Docker, GCP, Git, Github Actions, Gitlab Ci, Go, Grafana, Helm, Jenkins, Kubernetes, Linux, Loki, Opentelemetry, Prometheus, Pulumi, Python, Rust, Saltstack, Terraform
Consumer Web • Digital Media • Information Technology • News + Entertainment • Social Media
The Senior Site Reliability Engineer will enhance infrastructure resilience, optimize system performance, and improve both physical and cloud systems while collaborating with engineering teams.
Top Skills:
Ansible, C, C++, Docker, Go, Java, Kubernetes, Python, Terraform, Unix/Linux
Security • Software
Design and implement AWS infrastructure, manage automation with CloudFormation and Terraform, and ensure availability and reliability of cloud architectures. Support teams and advocate for improvements in architecture.
Top Skills:
Ansible, AWS, C#, C++, CloudFormation, Datadog, Docker, Elasticsearch, Grafana, Helm, Influxdb, Java, Kubernetes, Logstash, Python, Salt, Terraform
Featured Jobs
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The role involves leading technical roadmaps, managing CDN infrastructures, troubleshooting systems, mentoring engineers, and implementing AI technology in distributed systems.
Top Skills:
AWS, Azure, Cdn, Dns, GCP, Http/S, Linux, Python, Splunk, Tcp/Ip, Tls, Unix
Travel
Seeking a Lead Site Reliability Engineer with over 7 years of experience in Ops or DevOps. Responsibilities include architecting reliable systems, collaborating with teams, and ensuring system security and uptime.
Top Skills:
AWS, Backbone, Chef, Datadog, Git, Java, JavaScript, Jquery, MongoDB, NoSQL, Prometheus, React, Requirejs, Terraform
Software
The Principal Site Reliability Engineer will architect and maintain fault-tolerant systems in the Jama Cloud, focusing on automation and reliability practices while guiding teams in engineering processes.
Top Skills:
Arangodb, AWS, Bash, Datadog, Docker, J2Ee, Ms Sql, MySQL, Neo4J, Postgres, Python, Terraform
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will design, implement, and enhance systems for infrastructure development, focusing on automation, reliability, and developer experience.
Top Skills:
AWS, Azure, Bazel, Crossplane, GCP, Github Actions, Kubernetes, Terraform
Reposted 12 Days Ago
Artificial Intelligence • Marketing Tech • Sales • Software
The Site Reliability Engineer will enhance system performance, optimize data systems, manage infrastructure issues, and ensure efficient database operations.
Top Skills:
Clickhouse, Databases, Linux, Networking, SQL
Cloud • Information Technology
Lead the Infrastructure SRE team, implementing SRE best practices, contributing to automation efforts, and ensuring platform reliability. Mentor team members and foster stakeholder engagement.
Top Skills:
Ansible, AWS, Azure, Elk, Envoy, Express, GCP, Github Actions, Go, Grafana, Haproxy, JavaScript, Jenkins, Kafka, MySQL, Node.js, Nomad, Postgres, Puppet, Python, React, Redis, Terraform, Victoriametrics
12 Days Ago
Gaming • Software
The Site Reliability Engineer will enhance game delivery systems, ensure scalability and reliability, collaborate with teams on best practices, and maintain operational health while participating in on-call support.
Top Skills:
Ansible, Buildkite, C++, Ci/Cd, CloudFormation, Git, Gitlab, Gitops, Grafana, Helix Core, JavaScript, Kotlin, Loki, Perforce, Prometheus, Python, Terraform
Big Data • Cloud • Software • Database
Design and build infrastructure for cloud services; improve resilience, automation, and monitoring; participate in on-call rotation.
Top Skills:
Amazon Web Services, Ci/Cd, GCP, Kubernetes, Linux, Azure, MongoDB
Legal Tech • Software
As a Senior Site Reliability Engineer, you will enhance monitoring systems, mentor engineers, and contribute to production resiliency in legal technology.
Top Skills:
Appdynamics, Azure Application Insights, Docker, Elk, Kubernetes, New Relic, Powershell, Python
Other • Social Impact
Design and maintain ML infrastructure, improve reliability, collaborate with teams, monitor performance, provide guidance, mentor others, and optimize ML systems.
Top Skills:
Ansible, Argo Cd, Distributed Training Systems, Docker, Elk Stack, Gpu Acceleration, Grafana, Helm, Kubernetes, Prometheus, PyTorch, Scikit-Learn, TensorFlow, Terraform
Other • Social Impact
Design, develop, and maintain machine learning infrastructure for efficient workflows. Collaborate with teams, ensure high reliability, and optimize system performance.
Top Skills:
Ansible, Argo Cd, Docker, Elk Stack, Gpu Acceleration, Grafana, Helm, Kubernetes, Machine Learning, Prometheus, Python, PyTorch, Scikit-Learn, TensorFlow, Terraform
Other • Social Impact
Design and scale ML infrastructure, improve reliability of systems, mentor team members, and collaborate to optimize machine learning workflows.
Top Skills:
Ansible, Argo Cd, Docker, Elk Stack, Gpu Acceleration, Grafana, Helm, Kubernetes, Prometheus, PyTorch, Scikit-Learn, TensorFlow, Terraform
Other
As a Senior Site Reliability Engineer, you'll ensure the reliability of infrastructure, build automated systems, and support SaaS applications, requiring 7 years of engineering experience and 5 years in SRE roles.
Top Skills:
Ansible, Argocd, AWS, Bash, CloudFormation, Grafana, Java, Kubernetes, Linux, New Relic, Python, Terraform
Semiconductor
Lead the Site Reliability Engineering team to ensure the reliability and performance of cloud and on-premise applications, implementing automation and managing infrastructure in a hybrid environment.
Top Skills:
Ansible, AWS, Azure, CloudFormation, Datadog, Docker, Elk Stack, GCP, Gitlab Ci, Go, Grafana, Jenkins, Kubernetes, Prometheus, Python, Ruby, Shell, Terraform
Kids + Family • Mobile
The Senior Site Reliability Engineer will build and maintain infrastructure platforms, solve complex problems, and lead engineering teams, focusing on scalable solutions.
Top Skills:
Ansible, AWS, Chef, CloudFormation, Java, Kafka, Kubernetes, Python, Terraform
Blockchain • Information Technology • Internet of Things
As a Site Reliability Engineer, you'll enhance the reliability of backend systems through DevOps best practices, automating processes, and ensuring high availability in distributed systems.
Top Skills:
AWS, Azure, Bash, Blockchain, GCP, Go, Kubernetes, Python, Rust, Terraform
Software • Cryptocurrency
As a Staff SRE Engineer, you'll manage Kubernetes infrastructure, optimize system performance, and ensure reliability for a crypto wallet platform.
Top Skills:
Aws Ec2, Aws Eks, Aws Iam, Aws Rds, Aws S3, Datadog, Docker, Kubernetes, Opentelemetry, Pulumi, Terraform