Get the job you really want.
Maximum of 25 job preferences reached.
Top Senior Site Reliability Engineer Jobs in Boston, MA
Reposted 22 Hours AgoSaved
Easy Apply
Easy Apply
Big Data • Fintech • Mobile • Payments • Financial Services
The Staff Software Engineer in SRE is responsible for setting technical strategy, ensuring system availability, guiding incident management, and fostering talent within the team to enhance overall system reliability.
Top Skills:
AWSBashKotlinKubernetesMySQLPythonSpark
Reposted 10 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will empower developers by optimizing MongoDB Atlas, ensuring seamless performance across multiple cloud platforms while fostering a supportive culture.
Top Skills:
AWSGCPAzureMongoDB
Artificial Intelligence • Computer Vision • Greentech • Machine Learning • Robotics • Industrial • Automation
The Site Reliability Engineer at AMP will support the technology infrastructure, focusing on ticket management and software observability while developing tools to enhance operational efficiency in waste sortation facilities.
Top Skills:
AnsibleDockerGrafanaJenkinsLinuxPrometheus
Fintech • Software
The Principal Site Reliability Engineer is responsible for maintaining and optimizing network infrastructures in SaaS products, ensuring performance, security, and reliability through automation and collaboration with the engineering team.
Top Skills:
AzureBashKubernetesPalo AltoPowershellPythonSilverpeak SdwanTerraformWireshark
Fintech • Software
The Principal Site Reliability Engineer is responsible for maintaining cloud infrastructure, ensuring application performance, and implementing automated solutions in a SaaS environment, while collaborating with security and software engineering teams.
Top Skills:
.NetAnsibleAppdynamicsAWSAzureAzure DevopsC#DatadogDynatraceHarnessJavaJenkinsKubernetesNew RelicTerraform
Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Lead Site Reliability Engineers to build and manage a high-performance cloud platform, ensuring compliance and reliability in services, especially for the US Public Sector.
Top Skills:
AnsibleAWSAzureBashCloudFormationCrossplaneDockerGCPGitGitlabGoIds/IpsJenkinsKubernetesPythonSIEMTerraform
Reposted 14 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills:
AWSFluentbitGCPJaegerKubernetesAzureQuickwitSplunkVectorVictoriametrics
Enterprise Web • Hardware • Internet of Things • Software
The Senior Site Reliability Engineer will mentor teams on observability practices, architect systems for growth, automate developer tasks, and debug production issues.
Top Skills:
GoKubernetesLgtm StackOpentelemetryPrometheusTypescript
Cloud • Information Technology • Security • Software • Cybersecurity
This internship role focuses on SRE skills, requiring collaboration and problem-solving in dynamic environments for Zscaler's Zero Trust Exchange team.
Top Skills:
AnsibleAws EcsKubernetesLinuxPythonTerraform
Information Technology • Web3
The Site Reliability Engineer manages AWS Kubernetes infrastructure, ensuring operational excellence, security, and scalability, while implementing reliability improvements and best practices.
Top Skills:
ArgocdAWSBashDatadogEksGoKafkaKubernetesPostgresPythonSysdigTerraform
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
The Senior Site Reliability Engineer manages production infrastructure, ensuring performance and reliability using AI tools, Kubernetes, and CI/CD pipelines while mentoring teams.
Top Skills:
Apache AirflowAWSAws LambdaAzureChatgptCi/CdCrossplaneGCPGeminiGithub CopilotGoKubernetesOpensearchPostgresPythonRedisSnowflakeTerraform
Reposted 8 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills:
AnsibleAWSAzureCloudFormationGCPGoTerraform
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Big Data • Healthtech • HR Tech • Machine Learning • Software • Telehealth • Big Data Analytics
The Staff Site Reliability Engineer will architect, operate, and improve the platform while ensuring security compliance and enhancing development processes.
Top Skills:
AWSElasticsearchIstioKubernetesNatsNode.jsPostgresPythonReactTerraformTypescript
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves improving software reliability, automating processes, collaborating with teams on system optimization, and mentoring engineers to establish reliability as a core value.
Top Skills:
AWSAzureDatadogDockerEc2GCPGoKibanaKubernetesRubyTerraform
Reposted 24 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will support, maintain and grow the Atlas platform, focusing on automating processes and running multi-cloud environments.
Top Skills:
AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will ensure systems run smoothly, work with automation tools, resolve issues, and drive operational improvements.
Top Skills:
AWSAzureCloudFormationDockerGCPGrafanaKubernetesMemcachedNew RelicOpentelemetryPostgresPrometheusPulumiRedisSentryTerraform
Security • Software
Design, implement, and architect AWS cloud infrastructure and automation for SaaS reliability. Lead SRE/DevOps practices, configuration management, observability, and recovery planning while mentoring engineers and driving platform improvements.
Top Skills:
Aws,Vpc,Ec2,Eks,S3,Cloudformation,Cicd,Docker,Kubernetes,Helm,Terraform,Salt,Ansible,Datadog,Logz.Io,Influxdb,Cloudwatch,Catchpoint,Elk,Grafana,Logstash,Elasticsearch,Python,C#,C++,Java
Security • Software
Work with Cloud Engineering to improve availability, performance, security, and scalability of CyberArk SaaS. Monitor, triage, and automate remediation of production incidents, enhance monitoring and dashboards, participate in on-call rotation, and influence system design to prevent failures. Focus on automation (Ansible, scripting), IaC, cloud platforms, and secure operations.
Top Skills:
Linux,Unix,Windows,Ansible,Puppet,Chef,Python,Ruby,Bash,Powershell,Terraform,Cloudformation,Aws,Azure,Gcp
Security • Software
The Site Reliability Engineer will enhance production services' reliability and security, automate tasks, monitor systems, and manage incidents.
Top Skills:
AnsibleAWSAzureBashCloudFormationGCPLinux/UnixPowershellPythonRubyTerraformWindows Os
Information Technology • Internet of Things • Software • Virtual Reality
Lead reliability, availability, and resiliency strategies for large-scale systems, drive operational excellence, and provide technical mentorship across engineering teams.
Top Skills:
AWSCi/CdJavaMongoDBRabbitMQZookeeper
Machine Learning • Software
The Site Reliability Engineer will enhance cloud infrastructure, improve system reliability, manage FedRAMP compliance, and build internal tools. Responsibilities include operational duties and leading product development.
Top Skills:
Argo CdAWSCoralogixCrossplaneGitGoIstioKafkaKopsKubernetesPrometheusTerraformThanos
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Design, scale, and manage AWS services for IoT devices. Collaborate on infrastructure, optimize performance, and ensure high availability of services.
Top Skills:
AWSBashGoHelmKubernetesPythonRubyTerraform
Fintech • Software
The SRE is responsible for building cloud-native platforms, improving application reliability, and fostering collaboration within teams.
Top Skills:
Ci/CdKubernetesOpenshiftOpenstackPrometheusSplunkVMware
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance CI/CD frameworks, automate cloud infrastructure, manage Kubernetes and AWS services, and ensure operational excellence.
Top Skills:
AnsibleAWSBashChefCi/CdDockerGitKubernetesPuppetPythonRubySaltTerraform
Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
The Lead Site Reliability Engineer will oversee the reliability and scalability of the infrastructure, lead a team in operational execution, ensure best practices in SRE, and mentor senior engineers.
Top Skills:
Ci/CdDockerGitopsGoKubernetesLinuxPythonTerraform
Popular Job Searches
All Filters
Total selected ()
No Results
No Results









.png)
.png)






.png)











