Eka Robotics

Machine Learning / Reinforcement Learning Infrastructure Engineer

Posted Yesterday
In-Office
Boston, MA, USA
Mid level
Design, implement, and maintain large-scale ML/RL training infrastructure: job orchestration, scheduling, checkpointing, experiment tracking, developer tooling, distributed training, and resource management for cloud compute.

Eka Robotics

Eka Robotics is on a mission to build intelligence for the physical world: robots that are fast, general, and reliable. Our approach, grounded in physics, unlocks superhuman capabilities. We are defining the frontier of robotics research and deployment.

Our team consists of pioneers in robotics and machine learning. We are now hiring to scale our R&D effort. We are looking for hands-on individuals who are excited to help shape the future of robotics.

The Role

We are looking for a Machine Learning / Reinforcement Learning Infrastructure Engineer to shape our training infrastructure. In this role, you will design, implement, and maintain the large-scale model training systems that power our next generation of robot learning.

We believe that world-class infrastructure is the foundation for moving research into production. You will focus on building an exceptional developer experience, creating intuitive and efficient tooling that our engineers and scientists love to use. Your work will directly accelerate our research cycles, making it effortless to test new ideas and scale successful experiments into production training runs. You will work closely with researchers to ensure our infrastructure scales seamlessly from prototyping to large-scale distributed training.

This is a hands-on, high-impact role at the intersection of machine learning, software engineering, and scalable infrastructure.

Responsibilities

  • Own Training Infrastructure: Design, implement, and maintain robust systems for large-scale model training, including job orchestration, scheduling, checkpointing, and experiment tracking.

  • Developer Experience & Tooling: Build streamlined, intuitive abstractions for launching, monitoring, debugging, and reproducing experiments, minimizing friction and maximizing productivity for our research teams.

  • Scale Distributed Training: Work closely with researchers to reliably scale reinforcement learning and machine learning pipelines across compute clusters.

  • Resource Management: Ensure efficient allocation and utilization of cloud-based compute resources while building the foundational systems needed for future scaling.

  • Collaborate with Researchers: Partner with the research team to understand their needs, build infrastructure that supports cutting-edge methods, guide best practices for training at scale, and contribute to core JAX model and training code.

Minimum Qualifications

  • Education: BS, MS or higher in Computer Science, Computer Engineering, Machine Learning or a related technical field.

  • Software Engineering: Strong software engineering fundamentals with a proven track record of building ML training infrastructure, internal developer platforms, or scalable systems.

  • Deep Learning Frameworks: Hands-on experience with large-scale training using JAX (preferred), PyTorch, or TensorFlow.

  • Distributed Systems: Familiarity with distributed training, multi-host setups, data pipelines, and managing workloads on cloud platforms or orchestration systems (e.g., Kubernetes, SLURM, GCP, AWS).

  • Communication & Ownership: Strong cross-functional communication skills, a deep ownership mindset, and a passion for building tools that improve the developer experience.

  • Infrastructure & DevOps: Experience building automated testing pipelines, CI/CD for ML workflows, and custom logging/telemetry stacks.

Preferred Qualifications

  • Domain Experience: Background in robotics, reinforcement learning or other machine learning systems.

  • Systems Design: Experience designing abstractions that balance researcher flexibility with system reliability.
