Careerflow.ai

AI/ML Software Engineer (RL Environments) (Contract)

Posted 4 Hours Ago
Remote
Hiring Remotely in United States
Mid level
Design and build reinforcement-learning training environments and diverse tasks to evaluate and improve LLM agents; iterate rapidly on task designs from customer feedback, deliver high-quality outputs with minimal supervision, and maintain PST overlap for collaboration.
The summary above was generated by AI
About the Role

We're seeking experienced Machine Learning Engineers and Software Engineers with ML experience to design and build high-quality RL training environments for LLM agents. As an RL Environment Engineer, you'll create diverse machine learning tasks that challenge and improve language models, working with minimal supervision to deliver consistent, quality outputs.

What You'll Do
  • Design and build tasks for machine learning domains that target specific language models and difficulty distributions

  • Iterate rapidly on task designs based on customer feedback, with 24-hour turnaround times

  • Create diverse, challenging scenarios that test language model capabilities and expose their limitations

  • Hit the ground running with minimal onboarding time

What We're Looking For
  • Strong machine learning background through coursework, previous work experience, or personal projects

  • Python fluency: you write clean, efficient Python code regularly

  • Heavy LLM user who understands current model capabilities and failure modes through daily hands-on experience

  • Self-directed and creative. You can generate novel ML task ideas in your domain without constant guidance

  • High responsibility and integrity. You deliver quality work consistently and meet deadlines

  • Availability overlap with PST 9am-5pm (minimum 3 hours required)

Work Details
  • Location: Remote

  • Type: Contractor

  • Time Commitment: 40 hours per week, with at least 3 hours of overlap with PST business hours (9am-5pm)

Selection Process:
  1. Screening

  2. HackerRank assessment

  3. One-week paid task

  4. Full-time engagement


