Top Senior Site Reliability Engineer Jobs in Boston, MA
Reposted 8 Days Ago
Information Technology
Lead Observability Engineer responsible for defining and implementing observability strategies, tools, and patterns to ensure reliable performance across various systems at Vivun.
Top Skills:
Celery, Datadog, Grafana, Honeycomb, LangChain, Node.js, Observe, OpenAI APIs, OpenTelemetry, Prometheus, Python
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Site Reliability Engineer, ensure system reliability and scalability, lead incident management, develop automation tools, and mentor team members.
Top Skills:
Ansible, AWS, Bash, Docker, Git, Go, Hibernate, Java, Kubernetes, Linux, Maven, MySQL, Python, Ruby, Shell, Solr, Spring, Tomcat, Vagrant
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Site Reliability Engineer at Veeva, you will enhance the reliability and scalability of applications, lead incident management, and mentor team members while working with modern technologies.
Top Skills:
Ansible, AWS, Bash, Docker, Git, Go, Hibernate, Java, Kubernetes, Linux, Maven, MySQL, Python, Ruby, Shell, Solr, Spring, Tomcat, Vagrant
Healthtech • Insurance
The Senior Software Engineer will lead technical projects, mentor engineers, and build resilient cloud infrastructures focusing on SRE best practices.
Top Skills:
AWS, CI/CD, GCP, GitHub Actions, Grafana, Kubernetes, Prometheus, Terraform
Information Technology • Internet of Things • Software • Virtual Reality
Lead reliability, availability, and resiliency strategies for large-scale systems, drive operational excellence, and provide technical mentorship across engineering teams.
Top Skills:
AWS, CI/CD, Java, MongoDB, RabbitMQ, ZooKeeper
Reposted 6 Days Ago
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills:
Ansible, AWS, Azure, C#, Cloud Identity Providers, Docker, GCP, Go, Infrastructure as Code, Java, Kubernetes, Python, Ruby, Terraform
Artificial Intelligence • Machine Learning • Robotics • Automation
The role involves leading root cause analysis, troubleshooting production systems, improving system reliability, and collaborating across engineering and operations teams.
Top Skills:
Elastic, GitLab, Grafana, ITSM Tools, JIRA, Kubernetes, Logic Monitor, Power BI, Prometheus, Tableau, VMware
Cloud • Information Technology • Internet of Things • Software • Consulting • Infrastructure as a Service (IaaS) • Automation
Design, automate, and support OpenShift-based platforms, ensuring reliability and security while onboarding new managed services and handling incident responses.
Top Skills:
Argo, Go, Grafana, Jenkins, Kubernetes, Linux, OpenShift, Prometheus, Python, Tekton
Artificial Intelligence • Fintech • Software • Financial Services
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
Top Skills:
AWS, ClickHouse, Go, Java, Kafka, Kubernetes, Pulumi, Terraform
Fitness
The Staff Site Reliability Engineer will establish SRE best practices, drive observability strategy, implement software solutions, and mentor engineers. Responsibilities include improving platform resilience, managing risks, and participating in incident response processes.
Top Skills:
Ansible, AWS, Azure, Bash, CloudFormation, GCP, Go, Kubernetes, Pulumi, Python, Terraform
Software • Analytics
This SRE role involves deep ownership of production systems, focusing on improving AWS infrastructure, operational tooling, and automation for scaling ClickHouse installations at petabyte scale.
Top Skills:
Ansible, AWS, ClickHouse, EC2, Linux, Terraform
Information Technology • Legal Tech
The Senior Technology Site Reliability Engineer is responsible for maintaining and optimizing infrastructure and applications, ensuring reliability and performance while automating processes and collaborating with teams.
Top Skills:
AWS, Chef, Datadog, Go, Grafana, Java, Prometheus, Puppet, Python, Salt, Terraform
Cloud • Information Technology • Biotech
The Site Reliability Engineer will build and deploy Linux servers, research technologies, monitor system performance, and resolve technical incidents.
Top Skills:
Infrastructure-as-Code, Linux, Networking, Virtualization
Artificial Intelligence • Big Data • Machine Learning • Software
The Site Reliability Engineer will develop and maintain platform services using Go and Python, improve CI/CD pipelines, and manage applications on Kubernetes while collaborating on infrastructure automation and troubleshooting services.
Top Skills:
AWS, Azure, Docker, GCP, Git, Go, Jenkins, Kubernetes, Postgres, Python
Healthtech
The Site Reliability Engineer will ensure system reliability, collaborate with support teams, automate processes, and handle incident responses, with a strong focus on customer engagement and communication.
Top Skills:
Ansible, AWS, Azure, Bash, CI/CD, Docker, GCP, Go, Grafana, Kubernetes, Python
Fintech • Analytics • Financial Services
The Site Reliability Engineer will enhance system reliability, implement observability tools, and collaborate with teams to improve SaaS applications.
Top Skills:
AWS, Azure, Azure DevOps, Bash, Datadog, Go, New Relic, PowerShell, Prometheus, Python, Terraform
Aerospace • Artificial Intelligence • Logistics • Machine Learning • Software • Transportation • Defense
Lead efforts to deliver the Flyways AI Platform, deploying and maintaining secure cloud services, coding software solutions, and collaborating with teams.
Top Skills:
AWS, Docker, Grafana, Helm, K8s, Postgres, Python, Terraform
Artificial Intelligence • Information Technology • Software • Generative AI
The Site Reliability Engineer will ensure the reliability and performance of SaaS production systems, manage deployments and incident responses, and improve operational processes within a dynamic AI environment.
Top Skills:
AWS, Azure, Bash, Docker, ELK, GCP, Git, Go, Grafana, Kubernetes, Prometheus, Pulumi, Python, Terraform
3D Printing • Artificial Intelligence • Software • Design
The role involves building reliable platforms for 3D/4D content delivery to AR/VR devices, monitoring system health, and improving operational practices in collaboration with the engineering team.
Top Skills:
AWS Fargate, CoreWeave, Grafana, Kubernetes, Prometheus, Terraform
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills:
Argo CD, Bash, ELK Stack, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Gaming • Mobile • Software
As an SRE Manager, you will lead a team to enhance infrastructure services, manage incidents, and contribute to technical decisions while ensuring high availability and scalability of systems.
Top Skills:
Amazon AWS, Ansible, Artifactory, Crossplane, Datadog, Elasticsearch, GitLab, Go, GCP, Jaeger, Jenkins, Kubernetes, Azure, MongoDB, Packer, Postgres, Python, Redis, Terraform, Vault
Blockchain • Software
As a Site Reliability Engineer at Offchain Labs, you will manage infrastructure in cloud environments, design CI/CD workflows, and enhance system reliability with a focus on blockchain technology.
Top Skills:
Argo CD, AWS, Azure, CodeBuild, GCP, GitHub Actions, Go, Grafana, Kubernetes, Loki, Prometheus, Python, Terraform
Cloud • Information Technology
The Site Reliability Engineer will support IaaS services, monitor infrastructure health, perform root cause analysis, automate processes, and collaborate with teams for service reliability.
Top Skills:
Ansible, AWS, Azure, Bash, GitLab CI, Jenkins, Kubernetes, Linux, OpenShift, Python, Terraform, VMware vSphere
Fintech
As a Site Reliability Engineer, you will enhance system reliability through scalable infrastructure, observability practices, automation, and collaboration with engineering teams.
Top Skills:
AWS, Datadog, Go, Grafana, Java, Kubernetes, Node.js, Prometheus, Pulumi, Python, Terraform
5 Days Ago
Analytics
The Site Reliability Engineer will ensure the reliability and performance of IaaS services, perform incident resolution, and enhance system reliability through automation while supporting mobility across hybrid infrastructures and collaborating extensively with various teams.
Top Skills:
Ansible, AWS, Azure, Bash, GitLab CI, Jenkins, Kubernetes, Linux, OpenShift, Python, Terraform, VMware vSphere