Buildkite

Buildkite

Buildkite: Powerful CI/CD platform for fast, secure, and scalable pipelines on your own infrastructure, offering flexibility, security, and data-driven insights.

Commercial Services & Supplies
51-250
Founded 2014
$41M raised

Description

  • Define and lead the ML strategy for predictive test selection from experimentation through production deployment.
  • Lead technical investigations into a generalized test selection model and adapt the approach based on data findings.
  • Design the end-to-end ML architecture, including feature engineering, model training, evaluation, serving infrastructure, and feedback loops.
  • Make key operational decisions around latency, accuracy trade-offs, and graceful degradation when confidence is low.
  • Integrate ML capabilities with Test Engine’s existing data infrastructure and test-to-code mapping systems.
  • Build and scale the ML platform layer so models can be shipped to production quickly and repeatably.
  • Design, build, and maintain data pipelines that connect code change signals with test execution history at scale.
  • Train, evaluate, deploy, monitor, and retrain ML models in production.
  • Instrument production models with observability metrics such as accuracy, latency, coverage, false negative rates, and drift detection.
  • Investigate and resolve complex performance and reliability issues across the data and ML stack.
  • Share engineering best practices through documentation, mentorship, and pairing.
  • Support cross-team tooling, infrastructure, and frameworks while aligning stakeholders on technical decisions.
  • Work closely with customers to understand their workflows and ensure test selection delivers real impact.

Requirements

  • Deep proficiency in Python and strong experience building production ML systems end-to-end.
  • Proven experience designing and operating ML infrastructure at scale, such as model registries, feature stores, serving layers, or experiment tracking.
  • Strong experience with large-scale data processing using batch or streaming frameworks such as Spark or Flink.
  • Deep proficiency in SQL.
  • Comfort working in AWS and with containerized workloads using Docker and Kubernetes.
  • Hands-on experience training, evaluating, and deploying ML models in production.
  • Experience with classification, ranking, or prediction problems with challenging signal-to-noise ratios.
  • Track record of building repeatable ML capabilities that scale beyond a single use case.
  • Experience with feature engineering from structured and semi-structured data such as code diffs, execution logs, or dependency graphs.
  • Experience instrumenting production models with observability metrics such as accuracy, latency, coverage, and drift.
  • Excellent written and verbal communication skills, especially in a remote-first environment.
  • Ability to explain complex technical concepts clearly to diverse audiences.
  • Collaborative, pragmatic mindset with the ability to balance technical quality and business context.
  • Comfort mentoring engineers and leading technical discussions across teams.
  • Proven ability to build alignment and influence technical direction without authority.
  • Experience with code analysis, static analysis tools, or features built from source code structure, preferred.
  • Familiarity with CI/CD systems, developer tooling, or test infrastructure, preferred.
  • Experience with Ruby on Rails, React, GraphQL, or Go, preferred.
  • Background in search ranking, recommendation systems, or similar sparse-signal domains, preferred.
  • Experience working with test frameworks or test execution data, preferred.

Benefits

  • Competitive compensation including salary, equity, and a benefits package.
  • Flexible, remote-first culture.
  • Remote work available in the ANZ and PST regions.
  • Opportunities for professional growth and technical leadership.
  • Cross-team influence on meaningful technical challenges at scale.
  • Collaborative, inclusive, and innovative culture.
  • Support for reasonable accommodations during the recruitment process.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Engineer/Senior Engineer, Devops (AWS)

BOLD 251-1K Internet Software & Services

Bold is hiring an AWS DevOps/Sr. MLOps engineer to own production ML environments and platform operations across infrastructure and data science teams.

AWS Bash CDN CI/CD Cloudflare Docker DynamoDB FastAPI GitHub GitHub Actions Gradle Grafana Groovy Java Jenkins Kubernetes Maven Microservices New Relic Nginx OpenSearch Prometheus Python SageMaker Solr Spinnaker Splunk Spring Boot Terraform WAF
5 minutes ago

Machine Learning Engineer, Robot Learning

Path Robotics 51-250 Automotive

Path Robotics is hiring a Machine Learning Engineer to develop robot learning systems for industrial robotics, with an emphasis on manipulation, control, and deployment in real-world environments.

C++ Machine Learning Python Reinforcement Learning
20 minutes ago

AI Tech Lead - Staff Machine Learning Engineer

Sumo Logic 251-1K Internet Software & Services

Sumo Logic is hiring a Staff Machine Learning Engineer – AI Tech Lead to lead the design and production delivery of agentic AI systems for Security Operations Center use cases at global scale.

Apache Airflow AWS Azure Docker GCP Kubernetes LLM Machine Learning MLflow Python PyTorch System Design Vertex AI
20 minutes ago

Principal Technical Staff

STR 251-1K Aerospace & Defense

STR is hiring a Principal Technical Staff member to lead research, development, and customer-facing execution of advanced algorithm and software programs supporting national security missions for the Intelligence Community and Department of Defense.

Cybersecurity Machine Learning Statistics
20 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers