Get the job you really want.

Top Senior Site Reliability Engineer Jobs in Boston, MA

12 Days AgoSaved
Remote
Boston, MA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
Design and maintain ML infrastructure, ensure reliability and scalability, collaborate with teams, optimize system performance, and mentor others.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesPrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
12 Days AgoSaved
Remote
Boston, MA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
As a Staff Site Reliability Engineer, you will design and maintain ML infrastructure, optimize performance, and guide teams in operational excellence.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesPrometheusPyTorchScikit-LearnTensorFlowTerraform
12 Days AgoSaved
Remote
Boston, MA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
Design and maintain machine learning infrastructure, ensuring reliability and scalability while mentoring team members and collaborating with various teams.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesMachine LearningPrometheusPyTorchScikit-LearnTensorFlowTerraform
12 Days AgoSaved
Remote
Boston, MA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
The Staff SRE will design, maintain, and scale ML infrastructure, improve reliability, and collaborate with teams to optimize ML workflows.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesPrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
Reposted 20 Days AgoSaved
Remote
Boston, MA
Senior level
Senior level
Artificial Intelligence • Marketing Tech • Sales • Software
The Site Reliability Engineer will enhance system performance, optimize data systems, manage infrastructure issues, and ensure efficient database operations.
Top Skills: ClickhouseDatabasesLinuxNetworkingSQL
14 Days AgoSaved
Remote
Boston, MA
88K-131K Annually
Senior level
88K-131K Annually
Senior level
Fintech • Financial Services
The Senior Site Reliability Engineer will ensure system reliability, automate deployments, govern monitoring infrastructure, and enhance software delivery with cross-functional collaboration.
Top Skills: AWSAzureBashGCPGroovyMonitoring ToolsNoSQLObservability ToolsPowershellSQL
15 Days AgoSaved
Remote
Boston, MA
Senior level
Senior level
Information Technology • Software
You will manage and improve the technology infrastructure, ensuring its efficiency and security, while mentoring junior team members and driving projects autonomously.
Top Skills: CloudflareCloudflare WorkersGCPGoPostgresRedis
Reposted 20 Days AgoSaved
Remote
Boston, MA
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
As a Senior Site Reliability Engineer, you will manage IAM systems, implement cloud-native applications, and enhance automation and security in operations, ensuring peak uptime and performance.
Top Skills: AnsibleAWSAzureAzure AdC#DockerDuoGCPGoGoogle WorkspaceJavaKubernetesOktaPingPythonRubyTerraform
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 17 Days AgoSaved
Remote
Boston, MA
221K-299K
Senior level
221K-299K
Senior level
Cloud • Information Technology
Lead and expand a Production SRE team, enhance infrastructure reliability, implement network automation, and shape SRE practices within the organization.
Top Skills: AnsibleEnvoyExpressGitGoHaproxyJavaScriptJenkinsKafkaMySQLNapalmNode.jsPostgresPythonReactRedisSaltstack
Reposted 19 Days AgoSaved
Remote
Boston, MA
185K-250K
Senior level
185K-250K
Senior level
Cloud • Information Technology
As a Staff Site Reliability Engineer, you will manage core infrastructure, improve reliability, automate operations, and support engineering teams in a remote environment.
Top Skills: ElkEnvoyGoGrafanaGrpcHaproxyHashicorp NomadHoneycombJenkinsKafkaLinuxMySQLNode.jsPostgresPuppetRedis
19 Days AgoSaved
Remote
Boston, MA
130K-140K
Senior level
130K-140K
Senior level
Real Estate • Software • PropTech
As a Site Reliability Engineer, you will ensure system reliability and stability, troubleshoot issues, and optimize operational processes within Qualia's technology systems, focusing on Resware applications.
Top Skills: .NetAzureIisPowershellSQL ServerTerraformWindows Server
20 Days AgoSaved
Remote
Boston, MA
100K-160K
Senior level
100K-160K
Senior level
Information Technology • Security • Cybersecurity
As a Staff Site Reliability Engineer, you will design and optimize Kubernetes infrastructure, maintain CI/CD pipelines, and mentor engineers, ensuring system reliability and automation practices.
Top Skills: Argo CdBashCi/CdDatadogGithub ActionsGitlab CiGoGrafanaHelmJenkinsKubernetesOpentelemetryPrometheusPythonSpinnakerTerraform
Reposted 6 Days AgoSaved
In-Office
Boston, MA
184K-426K
Senior level
184K-426K
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design and implement GPU compute clusters, optimize operations for efficiency, troubleshoot and maintain large-scale infrastructure, and enhance researcher productivity.
Top Skills: BashDockerEnrootGpfsKubernetesLustreMySQLPythonPyTorchSlurmTensorFlowTerraform
Reposted 14 Hours AgoSaved
Remote
Boston, MA
118K-173K Annually
Senior level
118K-173K Annually
Senior level
Financial Services
As a Senior Site Reliability Engineer, you will ensure system reliability and performance while collaborating on system design, coding, and incident response.
Top Skills: AWSAzureDockerGCPJavaKubernetesOpentelemetryPrometheusSpring Boot
20 Days AgoSaved
Remote
Boston, MA
Senior level
Senior level
Blockchain • Information Technology
Lead the design and management of infrastructure for reliability, security, and scalability. Build developer tools, automate deployments, and ensure system performance in a blockchain environment.
Top Skills: AnsibleAWSAzureGCPGoKubernetesPythonRustTerraform
20 Days AgoSaved
Remote
Boston, MA
Senior level
Senior level
Database
Manage infrastructure for Postgres databases, improve system architecture, enhance observability, implement CI/CD, and resolve support issues.
Top Skills: AWSCdkGoInfrastructure As CodePulumiTerraformTypescript
Reposted 7 Days AgoSaved
Remote or Hybrid
Boston, MA
155K-207K Annually
Senior level
155K-207K Annually
Senior level
Artificial Intelligence • Automotive • Machine Learning • Transportation
The Senior Site Reliability Engineer will enhance system reliability and performance, lead incident response, mentor junior members, and manage cloud infrastructure costs.
Top Skills: AWSBashC++CloudFormationCloudwatchDatadogDockerGitlab CiGrafanaJavaJenkinsKubernetesPrometheusPythonTerraform
21 Days AgoSaved
Remote
Boston, MA
Senior level
Senior level
Cloud • Software
The Senior Site Reliability Engineer will deploy and maintain observability infrastructure, manage Kubernetes platforms, and enhance security for DoD networks.
Top Skills: ArgocdAWSAzureBashFluxGCPGoGrafanaHelmIstioKeycloakKubernetesMimirPrometheusPulumiYaml
Reposted 21 Days AgoSaved
Remote or Hybrid
Boston, MA
250K-250K
Senior level
250K-250K
Senior level
Cloud • Greentech • Other • Energy
In this role, you'll support virtualization and kernel performance, develop automation tools, optimize compute platforms for AI, and collaborate with hardware teams.
Top Skills: CGoKvmLinuxQemuRustSmartnics
Reposted 8 Days AgoSaved
Remote or Hybrid
Boston, MA
Senior level
Senior level
Artificial Intelligence • Healthtech • Machine Learning • Software • Biotech
Responsible for designing, building, and operating hybrid cloud and on-prem infrastructure, implementing SRE best practices, and automation.
Top Skills: AnsibleAWSCloudFormationDatadogEksGoGrafanaKubeadmKvmPrometheusPythonTerraform
Reposted 14 Hours AgoSaved
Remote
Boston, MA
Senior level
Senior level
Automotive • Software
The Senior Site Reliability Engineer will optimize platform reliability, manage Kubernetes infrastructure, deploy monitoring solutions, and collaborate on system performance.
Top Skills: AndroidArgocdAWSCircleCICrossplaneDockerGCPGitGoGrafanaKafkaKubernetesLokiNew RelicObjective-COpentelemetryPostgresPrometheusPythonReactRedisRedshiftReduxRuby On RailsSentrySwiftTerraformThanos
Reposted YesterdaySaved
In-Office or Remote
Boston, MA
Senior level
Senior level
Artificial Intelligence • Information Technology • Consulting
In this role, you'll ensure the reliability and performance of the AI Studio inference platform, involving extensive work with telemetry pipelines, Kubernetes, and resilience in infrastructure design.
Top Skills: BashGrafanaKubernetesMlopsPrometheusPythonTerraform
Reposted YesterdaySaved
Remote
Boston, MA
170K-210K Annually
Senior level
170K-210K Annually
Senior level
Software
The Senior Site Reliability Engineer will enhance system reliability, automate deployments, and mentor teams while managing AWS infrastructure and incident responses.
Top Skills: AWSBuildkiteCloudflareDatadogEcsFargateGitKafkaMikro-OrmMongoDBNestjsNode.jsPostgresReactReact NativeTerraformTypescriptVue
6 Days AgoSaved
Remote
Boston, MA
137K-172K Annually
Senior level
137K-172K Annually
Senior level
Biotech
The Senior Site Reliability Engineer will architect and automate AWS and Kubernetes platforms, ensuring operational excellence for bioinformatics workflows.
Top Skills: AWSAws CdkAws LambdaAws Secrets ManagerBashDockerEksGrafanaHelKubernetesPrometheusPythonTerraform
Reposted 6 Days AgoSaved
In-Office or Remote
Boston, MA
146K-181K Annually
Senior level
146K-181K Annually
Senior level
Big Data • Cloud • Marketing Tech • Social Impact • Software
The Senior SRE will manage global product deployments, provide engineering support, enhance CI/CD and monitoring, and maintain operational documentation.
Top Skills: AWSCircleCIGCPGoJenkinsKubernetesPythonTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account