IDT

Senior DevOps / ML Infrastructure Engineer - AI Lab

Posted 4 Days Ago
In-Office or Remote
5 Locations
Senior level
As a Senior DevOps/ML Infrastructure Engineer, you'll manage infrastructure, support ML model integration, and build automated MLOps pipelines in a collaborative setting.
Secure Global Money Transfers with Cutting-Edge Technology. 

Join our mission to protect cross-border transactions, helping customers send money safely worldwide.

As a Senior DevOps / ML Infrastructure Engineer in our AI Lab, you'll maintain and scale our infrastructure while enabling seamless ML model integration into production workflows.

You'll work alongside our Senior MLOps Architect to build a comprehensive ML platform that serves multiple teams across the organization.

What You'll Do:

  • Manage multiple orchestration platforms: Kubernetes in AWS (CloudFormation) and on-prem Kubernetes clusters
  • Maintain Apache Flink infrastructure (managed in AWS or self-hosted in on-prem Kubernetes)
  • Handle production support, incident response, and on-call rotations
  • Perform regular patching activities and security vulnerability remediation
  • Support and maintain workflow engine infrastructure
  • Improve observability using Prometheus, Grafana, Splunk, and Slack alerts

MLOps & Platform Development:

  • Collaborate with the Senior MLOps Architect to build and maintain ML infrastructure
  • Set up and configure MLflow for experiment tracking and model registry
  • Build automated MLOps pipelines for model training, experimentation, and deployment (Champion-Challenger, shadow mode)
  • Support feature calculation pipelines and ETL processes
  • Enable model serving infrastructure for Python-based ML services

We're Looking For:

  • 3-5+ years of professional experience in DevOps or infrastructure engineering
  • Strong hands-on experience with AWS services (EKS, ECR, SQS, S3, Managed Kafka, Managed Prometheus)
  • Deep experience with Kubernetes in production environments (multi-cluster management is a plus)
  • Proficiency with infrastructure as code: AWS CloudFormation and CDK (AWS Cloud Development Kit)
  • Experience with containerization (Docker) and container orchestration
  • Knowledge of setting up and maintaining CI/CD pipelines (GitHub Actions, ArgoCD, Jenkins, etc.)
  • Hands-on experience with observability tools: Prometheus, Grafana, Splunk
  • Experience with production support, incident response, and on-call rotations
  • Strong communication skills (English B2+)
  • Ability to work collaboratively with cross-functional teams (MLOps engineers, data scientists, software engineers)

It would be a plus:

  • Experience with Apache Flink, Kafka, or other stream processing frameworks
  • Understanding of ML lifecycle: model training, evaluation, deployment patterns
  • Experience with workflow engines or rule engines
  • Knowledge of fraud prevention, fintech, or compliance domains
  • Understanding of feature stores, ETL pipelines, and data engineering concepts

What We Offer:

  • Remote work flexibility – work from anywhere
  • B2B contract with competitive gross compensation in USD
  • Top-tier hardware to support your productivity
  • A challenging role in a team of skilled professionals, with the opportunity to grow into an MLOps specialization
  • Direct collaboration with the Senior MLOps Architect to learn and contribute to ML platform development
  • Continuous learning and career growth opportunities
  • Coverage for professional development: training, seminars, and conferences
  • Access to high-quality English lessons
  • Impact: Your work will directly prevent fraud while enabling secure financial access globally

Why This Role:

This position offers a unique opportunity to work at the intersection of traditional DevOps and MLOps. You'll maintain critical infrastructure while building expertise in ML infrastructure, model deployment, and workflow integration. You'll complement our MLOps Architect by handling general infrastructure needs while growing your ML platform skills, ultimately enabling faster delivery of ML capabilities across the organization.

Top Skills

Apache Flink
ArgoCD
AWS
Docker
GitHub Actions
Grafana
Jenkins
Kubernetes
MLflow
Prometheus
Splunk


