Bugcrowd Logo

Bugcrowd

Reinforcement Learning Infrastructure (Cybersecurity)

Posted 3 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United States
176K-243K Annually
Senior level
Remote
Hiring Remotely in United States
176K-243K Annually
Senior level
Build scalable infrastructure and tooling that converts real-world vulnerability research into thousands of reproducible reinforcement learning environments. Design ingestion and pipeline systems, integrate Bugcrowd Mayhem analyses, maintain Linux ML container environments, and enable RL training workflows for frontier AI labs. Apply low-level security and systems expertise to automate environment generation and support large-scale model training.
The summary above was generated by AI

We are Bugcrowd. Since 2012, we’ve been empowering organizations to take back control and stay ahead of threat actors by uniting the collective ingenuity and expertise of our customers and trusted alliance of elite hackers, with our patented data and AI-powered Security Knowledge Platform™. Our network of hackers brings diverse expertise to uncover hidden weaknesses, adapting swiftly to evolving threats, even against zero-day exploits. With unmatched scalability and adaptability, our data and AI-driven CrowdMatch™ technology in our platform finds the perfect talent for your unique fight. We aim to create a new era of modern crowdsourced security that outpaces threat actors. Unleash the ingenuity of the hacker community with Bugcrowd, visit www.bugcrowd.com. Based in San Francisco and New Hampshire, Bugcrowd is supported by General Catalyst, Rally Ventures, Costanoa Ventures, and others.

Job Summary

The Bugcrowd RL and Reasoning Team focuses on pushing the boundaries of autonomous cybersecurity by building authentic reinforcement learning environments for foundational model companies. As a Staff Engineer, you will advance the frontier of AI Reinforcement Learning development and delivery.  You will build the infrastructure and tooling that transforms real-world vulnerability research into large-scale reinforcement learning environments used to train next-generation AI systems.

This role is unique. You will help create the training environments that teach AI systems how to hack and defend software. Your work will directly influence the capabilities of the next generation of AI models. Instead of building a single application, you will build the infrastructure that generates thousands of environments used to train frontier AI systems.

Our team works at the intersection of AI, security research, and systems engineering, building environments that allow models to learn skills such as vulnerability discovery, exploitation, and remediation. 

Essential Duties and Responsibilities 

If you enjoy building high-performance systems that power cutting-edge AI research, this role is for you.

This role focuses on building the systems that generate RL environments, not just the environments themselves. You will design pipelines that ingest software projects, analyze them with Bugcrowd’s Mayhem platform, and automatically construct training environments used by frontier AI labs including Anthropic, OpenAI, and Cohere.

The ideal candidate is a strong systems engineer who understands:

  • Reinforcement learning workflows
  • Building clean, reproducible Linux ML environments (containers, MCP, etc)
  • System security background in binary exploitation, such as buffer overflows, fuzzing, exploitation, and x86/64.
  • Experience developing applications in Python and C, with Rust a plus. 

Education, Experience, Knowledge, Skills, and Abilities

Understanding of RL training workflows used by modern LLM systems

  • Experience with DevOps pipelines (e.g., github actions), reproducible builds (docker, buildkit, nix).
  • Proficiency in Python and C. Other languages (especially Rust) are a plus.
  • Understanding of software vulnerabilities, fuzzing, or program analysis
  • Experience with build systems and large open-source codebases
  • Comfort working with Linux systems and low-level debugging
  • Experience working with benchmark environments (CTFs, SWE-bench, security challenges, etc.) is a plus

Working Conditions and Physical Requirements

The ideal candidate must be able to complete all physical requirements of the job with or without reasonable accommodation.

Sitting and / or standing - Must be able to remain in a stationary position 50% of the time

Carrying and / or lifting - Must be able to carry / move laptop as needed throughout the work day.

Environment - remote, work-from-home 100% of the time.

Pay Range Disclosure

At Bugcrowd, we strive for fairness, equality and to create an environment that allows our people to perform at their very best. Our compensation philosophy is to foster a collaborative community that rewards, attracts and retains the best possible talent. The provided salary details are based on US national averages and we retain the flexibility to tailor to the needs of the business.

The national estimate for the current base range for the position of $176,400 - $242,550.

This position may also be eligible to participate in a discretionary bonus program or commission plan, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.

Culture

  • At Bugcrowd, we understand that diversity in the workplace is vital to a company’s success and growth. We strive to make sure that people are included and have a sense of being part of making Bugcrowd not only a great product but a great place to work.
  • We regularly hear from both customers and researchers that Bugcrowd feels like a family, and we strive to maintain that internally as well.
  • Our team consists of a broad range of people: musicians, adventure sports junkies, nature lovers, parents, cereal enthusiasts, night owls, cyclists, artists—you get the point.

At Bugcrowd, we are solving security threats and vulnerabilities that are relevant to everyone, therefore we believe solving these problems takes all kinds of backgrounds. We value the perspectives and experiences people from underrepresented backgrounds bring.

Disclaimer

This position has access to highly confidential, sensitive information relating to the technologies of Bugcrowd. It is essential that the applicant possess the requisite integrity to maintain the information in the strictest confidence.

The company is authorized to obtain background checks for employment purposes under state and federal law. Background checks will be conducted for positions that involve access to confidential or proprietary information (including trade secrets).

Background checks may include Social Security verification, prior employment verification, personal and professional references, educational verification, and criminal history. Applicants with conviction histories will not be excluded from consideration to the extent required by law.

Any personal data you submit in connection with your application will be processed in compliance with Bugcrowd's Privacy Policy, which you may review here: https://www.bugcrowd.com/privacy.


Equal Employment Opportunity:

Bugcrowd is EOE, Disability/Age Employer. 

Individuals seeking employment at Bugcrowd are considered without regards to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation. 

Bugcrowd is committed to the full inclusion of all qualified individuals. In keeping with our commitment, Bugcrowd will take the steps to assure that people with disabilities are provided reasonable accommodations. Accordingly, if reasonable accommodation is required to fully participate in the job application or interview process, to perform the essential functions of the position, and/or to receive all other benefits and privileges of employment, please contact HR at ADA at bugcrowd.com.

Apply at: https://www.bugcrowd.com/about/careers/

Bugcrowd Boston, Massachusetts, USA Office

745 Atlantic Ave (8th Floor), Boston, United States, 02111

Similar Jobs

30 Minutes Ago
Remote or Hybrid
Pearl Harbor, HI, USA
76K-122K Annually
Senior level
76K-122K Annually
Senior level
Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
Perform maintenance, troubleshooting, repairs, and testing of submarine VLS and VPT mechanical systems, TCPs, TCCs, and WCCs. Execute component-level maintenance using drawings and test procedures, analyze system monitoring data, read wiring schematics, and provide waterfront support, training, and mentoring to ship and NUWC personnel while collaborating with Navy commands and maintenance organizations.
Top Skills: 688 Class Vertical Launch System (Vls)774 Class Virginia Payload Tube (Vpt)Capsule Launch System (Cls)ExcelMicrosoft WordTube Control Cables (Tcc)Tube Control Panels (Tcp)Weapon Control Cables (Wcc)
37 Minutes Ago
Remote
United States
Senior level
Senior level
Edtech • Fintech • Payments • Social Impact • Financial Services • Big Data Analytics
Owner of end-to-end ecommerce features: design, implement, and maintain scalable backend services and performant React/Next.js frontends. Integrate payment gateways and procurement systems (PunchOut), mentor engineers, uphold code quality, and improve platform stability across Node.js, MySQL, and MongoDB.
Top Skills: CoupaCxmlGraphQLJaggaerJavaScriptMongoDBMySQLNest.JsNext.JsNistNode.jsOciOracle ProcurementPci-DssPunchoutReactRestSap AribaSoc 2Typescript
41 Minutes Ago
Easy Apply
In-Office or Remote
2 Locations
Easy Apply
130K-180K Annually
Mid level
130K-180K Annually
Mid level
Healthtech • Software
Serve as the internal expert on AcuityMD healthcare data products, support commercial teams on complex data questions, and partner with Engineering to build an AI context layer and evaluation framework to measure agentic solution quality and data explainability.
Top Skills: AIExcelSQL

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account