Intuition Machines

Intuition Machines is a leading company in privacy-preserving AI/ML. It specializes in turning AI/ML research into platforms and services that prioritize user privacy. Its products, including hCaptcha.com, are widely used and have a sig...

Life Sciences Tools & Services
51-250

Description

  • Work with large-scale systems handling millions of requests per second across multiple cloud providers.
  • Develop solutions that improve performance, availability, security, and cost-effectiveness.
  • Maintain system uptime and speed while keeping development teams productive.
  • Ensure peer releases improve quality, security, uptime, delivery speed, threat detection, and customer engagement.
  • Source improvement ideas and priorities from customers, internal teams, and system metrics.
  • Make rapid decisions to support a small-team, fast-iteration environment.
  • Work across infrastructure, data, and application logic layers to build system-level solutions.
  • Collaborate with customer teams and internal stakeholders in a flat organization.

Requirements

  • Expert-level experience with Kubernetes.
  • Expert-level experience monitoring applications, infrastructure, and networks.
  • Software engineering background with backend development experience in Kubernetes-based systems.
  • Strong programming skills in Python, JavaScript, Go, C++, or Rust.
  • Strong understanding of networking, proxies, and content delivery networks, including Cloudflare.
  • Experience with multi-cloud environments, including virtual networking, load balancing, and web application firewalls.
  • Strong experience with CI/CD.
  • Hands-on experience in high-scale, high-uptime, and high-reliability environments.
  • Minimum of 6 years of hands-on experience in engineering, DevOps, or SRE roles.
  • Familiarity with distributed systems, including queue-first architectures and sharding.
  • Demonstrated ability to gather requirements, solve problems, and make recommendations.
  • Preferred: familiarity with security frameworks, attack vectors, botnets, and impact analysis.
  • Must pass pre-employment screening, including third-party verification of work history, education, and identity, plus a final in-person interview and identity verification in the country of residence.

Benefits

  • Fully remote position with flexible working hours.
  • A global team of colleagues distributed around the world.
  • Modern development and deployment workflows with an emphasis on shipping early and often.
  • High-impact work with lots of users, happy customers, and high growth.
  • Direct interaction with customer teams in a flat organization.
  • Commitment to equality of opportunity and an inclusive work environment.
