Get the job you really want.

Top Tech Jobs & Startup Jobs in Boston, MA

Reposted 19 Hours AgoSaved
Remote
USA
109K-169K
Senior level
109K-169K
Senior level
Other • Social Impact
Responsible for designing and maintaining infrastructure for Wikimedia projects, including incident response, automation, and collaborating with global teams.
Top Skills: AnsibleDockerGerritGitlabGoKubernetesMediawikiPhabricatorPuppetPythonRubySpicerackTerraform
Reposted 19 Hours AgoSaved
Remote
USA
109K-169K
Senior level
109K-169K
Senior level
Other • Social Impact
The Senior Site Reliability Engineer will design and maintain infrastructure, ensure system reliability, participate in on-call rotations, and mentor peers in a collaborative remote environment.
Top Skills: AnsibleDockerGerritGitlabKubernetesMediawikiPuppetPythonSpicerackTerraform
Reposted 19 Hours AgoSaved
Remote
USA
109K-169K
Senior level
109K-169K
Senior level
Other • Social Impact
The Senior Site Reliability Engineer will design and maintain infrastructure, automate processes, lead incident responses, and mentor team members while working with a globally distributed team.
Top Skills: AnsibleDockerGerritGitlabKubernetesPhabricatorPuppetPythonSpicerackTerraform
YesterdaySaved
Remote
USA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
Design, develop, and maintain ML infrastructure for training and deploying models. Improve reliability and scalability while mentoring team members and collaborating with ML engineers.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesMachine Learning InfrastructurePrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
YesterdaySaved
Remote
USA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
The Staff Site Reliability Engineer will design, develop, and maintain machine learning infrastructure, ensuring reliability and performance while mentoring teams and optimizing operational processes.
Top Skills: AnsibleArgo CdDockerElk StackGrafanaHelmKubernetesPrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
YesterdaySaved
Remote
USA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
The Staff Site Reliability Engineer will design and manage the ML infrastructure for production-grade machine learning models, ensuring reliability, scalability, and optimal performance while collaborating with various teams.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesMachine LearningPrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
YesterdaySaved
Remote
USA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
Responsible for designing, developing, and maintaining Machine Learning infrastructure, ensuring reliability and scalability, mentoring team members, and optimizing performance.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesPrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
YesterdaySaved
Remote
USA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
Design, develop, maintain, and scale ML infrastructure for training and deploying machine learning models, ensuring reliability and performance while collaborating with teams.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesMachine Learning InfrastructurePrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
YesterdaySaved
Remote
USA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
Design, develop, maintain, and scale ML infrastructure for Wikimedia, ensuring efficient training, deployment, and monitoring of machine learning models in production.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesMachine Learning InfrastructurePrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
YesterdaySaved
Remote
USA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
The Staff Site Reliability Engineer will design and maintain ML infrastructure, ensure system reliability, and mentor team members while collaborating with various stakeholders.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesMachine LearningPrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account