Yotta Labs Logo

Yotta Labs

Research Engineer - AI Systems

Posted 19 Hours Ago
In-Office or Remote
Hiring Remotely in United States
Mid level
In-Office or Remote
Hiring Remotely in United States
Mid level
Design and optimize high-performance kernels and custom operators for attention, MoE, GEMM, quantization and collective communication across NVIDIA, AMD and AWS Trainium. Improve LLM inference runtimes, develop distributed training/inference solutions at scale, use compilers and SDKs (Neuron, Torch Dynamo, PyTorch/XLA), contribute to open-source, and publish technical findings.
The summary above was generated by AI

Location: Remote (Global)

Type: Full-time

Company: Yotta Labs

Apply: [email protected]

🧠 About Yotta Labs

Yotta Labs is building the next generation multi-silicon AI cloud and runtime platform to power the world’s most demanding AI workloads. We enable training and inference across NVIDIA GPUs, AMD GPUs, and AWS Trainium, helping AI companies achieve the best performance and economics across heterogeneous hardware. Our mission is to provide high-performance AI computing and Model API services, enabling AI companies, research labs, and enterprises to train, deploy and integrate cutting-edge models at scale.

🛠️ Role Overview

We are seeking a highly motivated AI Systems Research Engineer specializing in Trainium, GPU kernels, and LLM systems optimization. You will work at the intersection of AI Systems, Compiler and Runtime Optimization, Distributed Training & Inference, GPU/Accelerator Kernel Development, and Large Language Model Infrastructure. Your work will directly impact the scalability and performance of AI applications deployed on our platform.

🎯 Responsibilities

  • Design and implement high-performance kernels for Attention, MoE, GEMM, collective communication, and quantization.

  • Optimize kernels for NVIDIA, AMD, and AWS Trainium.

  • Develop custom operators and graph optimizations using Neuron SDK, PyTorch/XLA, Torch Dynamo, and Neuron Compiler.

  • Improve performance of vLLM, SGLang, TensorRT-LLM, and custom inference runtimes.

  • Design scalable distributed training and inference solutions across thousands of accelerators.

  • Contribute to open-source projects, publish technical findings and engage with the developer community.

Qualifications

  • Proficiency in AI programming languages such as Python and C++.

  • Deep understanding of GPU architecture and performance optimization.

  • Experience with CUDA, Triton, ROCm/HIP, or AWS Neuron.

  • Strong understanding of AI frameworks (e.g., PyTorch, Dynamo, LMCache), model architectures and profiling tools (e.g. Nsight, ROCm Profiler, or Neuron Profiler).

  • Strong problem-solving skills and the ability to work in a collaborative, remote environment.

  • A background in computer science, engineering, or a related field is preferred.

🌟 Preferred Experience

  • Contributions to open-source AI infra projects like vLLM, SGLang, PyTorch, or Triton.

  • Experience with with FlashAttention, PagedAttention, MoE, RLHF, or distributed AI systems.

  • Publications in top-tier conferences like MLSys, OSDI, SOSP, NSDI, SC, HPCA, or ISCA

🌐 Why Join Yotta Labs?

  • Be part of a visionary team aiming to redefine AI infrastructure and influence the future of multi-silicon AI computing.

  • Work on cutting-edge technologies that solves frontier AI infrastructure problems.

  • Collaborate with experts from leading institutions and tech companies.

  • Competitive compensation with equity. Enjoy a flexible, remote work environment that values innovation and autonomy.

📩 How to Apply

Interested candidates should apply directly or send their resume and a brief cover letter to [email protected]. Please include links to any relevant projects or contributions.

Similar Jobs

48 Minutes Ago
Easy Apply
Remote
Easy Apply
112K-172K Annually
Senior level
112K-172K Annually
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
Manage end-to-end compensation systems and semiannual compensation processes. Lead market benchmarking and survey submissions, conduct ad hoc compensation and pay equity analyses, own multi‑geography pay reporting, and support compensation training and communications to ensure competitive, compliant programs.
Top Skills: Claude CodeExcelGoogle SheetsPaveRadfordSigma Computing
2 Hours Ago
Easy Apply
Remote or Hybrid
Easy Apply
154K-220K Annually
Senior level
154K-220K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
Lead and build a high-performing regional sales team in Western Canada, hire and coach reps, develop GTM and partner plans, forecast revenue, drive pipeline and close deals to accelerate customers' cloud security transformation.
3 Hours Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
227K-273K Annually
Expert/Leader
227K-273K Annually
Expert/Leader
eCommerce • Healthtech • Kids + Family • Retail • Social Media
Lead cross-pod technical direction and system design for high-scale backend services, embed AI-native development practices, collaborate with product and design, mentor engineers, and drive ambiguous initiatives from design through production.
Top Skills: AndroidAWSChatgptClaudeDjangoiOSMySQLNode.jsPythonReactRedisRuby On RailsSidekiq

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account