Reinforcement Learning Infrastructure (Cybersecurity)

1 hour, 16 minutes ago
Full-time
Lead
Artificial Intelligence and Machine Learning
Bugcrowd

Bugcrowd provides a crowdsourced cybersecurity platform that connects organizations with elite security researchers to enhance security measures through managed bug bounty programs, penetration testing, and vulnerability disclosure initiatives.

Internet Software & Services
1K-5K
Founded 2012
$79M raised

Description

  • Design pipelines that ingest software projects and automatically construct reinforcement learning training environments.
  • Build infrastructure and tooling for authentic cybersecurity RL environments used by foundation model companies.
  • Work at the intersection of AI, security research, and systems engineering to support vulnerability discovery, exploitation, and remediation training.
  • Integrate Bugcrowd’s Mayhem platform into environment-generation workflows.
  • Develop high-performance, reproducible Linux-based machine learning environments and related tooling.
  • Support the delivery of training environments used by frontier AI labs.
  • Contribute to systems that generate environments at scale rather than building a single application.
  • Collaborate on infrastructure that advances autonomous cybersecurity research and AI model training.

Requirements

  • Experience with reinforcement learning workflows used by modern LLM systems.
  • Strong proficiency in Python and C; Rust is a plus.
  • Experience with DevOps pipelines such as GitHub Actions.
  • Experience with reproducible build and container tools such as Docker, BuildKit, and Nix.
  • Knowledge of software vulnerabilities, fuzzing, or program analysis.
  • Background in binary exploitation on x86 and x86-64, including buffer overflows and fuzzing.
  • Comfort working with Linux systems and low-level debugging.
  • Experience with build systems and large open-source codebases.
  • Experience building clean, reproducible Linux ML environments, including containers and MCP.
  • Experience with benchmark environments such as CTFs, SWE-bench, or security challenges is a plus.

Benefits

  • Base salary range of $176,400 to $242,550.
  • Eligibility for a discretionary bonus program or commission plan.
  • 100% remote, work-from-home arrangement.
  • Flexible compensation approach tailored to business needs.
  • Reasonable accommodations for applicants and employees with disabilities.
  • Commitment to inclusion and equal employment opportunity.
