Prime Intellect

Applied Research - RL & Agents

Reposted 12 Days Ago

Be an Early Applicant

In-Office or Remote

2 Locations

100K-150K Annually

Senior level

In-Office or Remote

2 Locations

100K-150K Annually

Senior level

Develop and implement advanced reinforcement learning methods and distributed systems for AI agents. Collaborate with customers to understand needs and create tailored solutions.

The summary above was generated by AI

Building Open Superintelligence Infrastructure
Prime Intellect is building the open superintelligence stack - from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full rl post-training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

Role Impact

This is a customer facing role at the intersection of cutting-edge RL/post-training methods and applied agent systems. You’ll have a direct impact on shaping how advanced models are aligned, deployed, and used in the real world by:

Advancing Agent Capabilities: Designing and iterating on next-generation AI agents that tackle real workloads—workflow automation, reasoning-intensive tasks, and decision-making at scale.
Building Robust Infrastructure: Developing the distributed systems and coordination frameworks that enable these agents to operate reliably, efficiently, and at massive scale.
Bridge Between Customers & Research: Translate customer needs into clear technical requirements that guide product and research priorities.
Prototype in the Field: Rapidly design and deploy agents, evals, and harnesses alongside customers to validate solutions.

Customer-Facing Engineering

Work side-by-side with customers to deeply understand workflows and bottlenecks.
Prototype agents and eval harnesses tailored to real use cases, then hand off hardened systems to core teams.
Translate customer insights into roadmap and research direction.

Post-training & Reinforcement Learning

Design and implement novel RL and post-training methods (RLHF, RLVR, GRPO, etc.) to align large models with domain-specific tasks.
Build evaluation harnesses and verifiers to measure reasoning, robustness, and agentic behavior in real-world workflows.
Prototype multi-agent and memory-augmented systems to expand capabilities for customer-facing solutions.

Agent Development & Infrastructure

Rapidly prototype and iterate on AI agents for automation, workflow orchestration, and decision-making.
Extend and integrate with agent frameworks to support evolving feature requests and performance requirements.
Architect and maintain distributed training/inference pipelines, ensuring scalability and cost efficiency.
Develop observability and monitoring (Prometheus, Grafana, tracing) to ensure reliability and performance in production deployments..

Requirements

Strong background in machine learning engineering, with experience in post-training, RL, or large-scale model alignment.
Deep expertise in distributed training/inference frameworks (e.g., vLLM, sglang, Ray, Accelerate).
Experience deploying containerized systems at scale (Docker, Kubernetes, Terraform).
Track record of research contributions (publications, open-source contributions, benchmarks) in ML/RL.
Passion for advancing the state-of-the-art in reasoning and building practical, agentic AI systems.

What We Offer

Competitive Compensation + equity incentives
Flexible Work (remote or San Francisco)
Visa Sponsorship & relocation support
Professional Development budget
Team Off-sites & conference attendance

Growth Opportunity

You’ll join a mission-driven team working at the frontier of open, superintelligence infra. In this role, you’ll have the opportunity to:

Shape the evolution of agent-driven solutions—from research breakthroughs to production systems used by real customers.
Collaborate with leading researchers, engineers, and partners pushing the boundaries of RL and post-training.
Grow with a fast-moving organization where your contributions directly influence both the technical direction and the broader AI ecosystem.

If you’re excited to move fast, build boldly, and help define how agentic AI is developed and deployed, we’d love to hear from you.

Ready to build the open superintelligence infrastructure of tomorrow?
Apply now to help us make powerful, open AGI accessible to everyone.

Top Skills

Accelerate

Distributed Training

Docker

Grafana

Grpo

Kubernetes

Machine Learning

Prometheus

Ray

Reinforcement Learning

Rlhf

Rlvr

Sglang

Terraform

Vllm

Similar Jobs

Genius Sports

Sports Tracking Operator (Civil contract)

17 Minutes Ago

Easy Apply

Remote or Hybrid

USA

Easy Apply

Entry level

AdTech • Artificial Intelligence • Machine Learning • Marketing Tech • Software • Sports • Big Data Analytics

As a Sports Tracking Operator, you will monitor and collect live data on teams, players, and lineups while ensuring data accuracy and collaborating with game supervisors.

Top Skills: In-House Software

BrainPOP

Program Manager

33 Minutes Ago

Easy Apply

Remote or Hybrid

Easy Apply

115K-135K Annually

Senior level

115K-135K Annually

Senior level

Edtech • Kids + Family • Social Impact • Software

The Revenue Operations Program Manager leads Cross-functional integration projects within the revenue technology stack, ensuring alignment between various departments and technical implementation for improved revenue processes.

Top Skills: Business Intelligence ToolsCRMOutreachPardotRev.IoSalesforceSalesloftZoominfo

Jasper

Enterprise Account Executive

4 Hours Ago

Remote

United States

290K-290K Annually

Senior level

290K-290K Annually

Senior level

Artificial Intelligence • Marketing Tech • Software • Generative AI • Automation

As an Enterprise Account Executive, you will manage Enterprise accounts, leading sales cycles, demonstrations, and stakeholder engagements to drive successful closures.

Top Skills: B2B SaasMarketing AutomationRoi Planning

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories