Intuition Machines

Intuition Machines

Intuition Machines is a leading company in the field of Privacy Preserving AI/ML. They specialize in turning AI/ML research into platforms and services that prioritize user privacy. Their products, including hCaptcha.com, are widely used and have a sig...

Life Sciences Tools & Services
51-250

Description

  • Work with large-scale systems handling millions of requests per second across multiple cloud providers.
  • Develop solutions that improve performance, availability, security, and cost-effectiveness.
  • Maintain system uptime and speed while keeping development teams productive.
  • Ensure peer releases improve quality, security, uptime, delivery speed, threat detection, and customer engagement.
  • Source improvement ideas and priorities from customers, internal teams, and system metrics.
  • Make rapid decisions to support a small-team, fast-iteration environment.
  • Work across infrastructure, data, and application logic layers to build system-level solutions.
  • Collaborate with customer teams and internal stakeholders in a flat organization.

Requirements

  • Expert-level experience with Kubernetes.
  • Expert-level experience monitoring applications, infrastructure, and networks.
  • Software engineering background with backend development experience in Kubernetes-based systems.
  • Strong programming skills in Python, JavaScript, Go, C++, or Rust.
  • Strong understanding of networking, proxies, and content delivery networks, including Cloudflare.
  • Experience with multi-cloud environments, including virtual networking, load balancing, and web application firewalls.
  • Strong experience with CI/CD.
  • Hands-on experience in high-scale, high-uptime, and high-reliability environments.
  • Minimum of 6 years of hands-on experience in engineering, DevOps, or SRE roles.
  • Familiarity with distributed systems, including queue-first architectures and sharding.
  • Demonstrated ability to gather requirements, solve problems, and make recommendations.
  • Preferred: familiarity with security frameworks, attack vectors, botnets, and impact analysis.
  • Must pass pre-employment screening, including third-party verification of work history, education, and identity, plus a final in-person interview and identity verification in the country of residence.

Benefits

  • Fully remote position with flexible working hours.
  • A global team of colleagues distributed around the world.
  • Modern development and deployment workflows with an emphasis on shipping early and often.
  • High-impact work with lots of users, happy customers, and high growth.
  • Direct interaction with customer teams in a flat organization.
  • Commitment to equality of opportunity and an inclusive work environment.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer

Alpaca 51-250 Capital Markets

Alpaca is hiring a Site Reliability Engineer to keep its brokerage platform reliable and operable across cloud, Kubernetes, observability, messaging, and database systems, with a strong focus on PostgreSQL reliability on the trading-critical path.

DNS GitOps Go Kafka Kubernetes Linux Load Balancing PostgreSQL Python RabbitMQ Secrets Management TLS
2 hours, 30 minutes ago

Site Reliability Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Site Reliability Engineer to own the reliability, automation, and production stability of its AWS-based services used by thousands of MSPs worldwide.

Ansible AWS Chef CloudFormation Datadog DevSecOps Elasticsearch Kibana Kubernetes MySQL PostgreSQL Puppet Secrets Management Serverless Terraform
6 hours, 30 minutes ago

SRE - DevOps Engineer - Argentina

Coderio 51-250 Internet Software & Services

Coderio is hiring a remote DevOps/SRE Engineer in Argentina to ensure the stability, scalability, and efficient operation of the infrastructure that supports its global digital solutions.

Argo CD CI/CD Flux GitHub Actions GitOps Helm Jenkins Kubernetes OpenShift Terraform
10 hours, 9 minutes ago

Senior Site Reliability Engineer

Cribl 251-1K IT Services

Cribl is hiring a Senior Site Reliability Engineer in Poland to help build and operate the telemetry infrastructure and observability platform that supports its cloud products and enterprise customers.

Ansible AWS Azure CI/CD Grafana JavaScript Kibana Linux New Relic Node.js PagerDuty Prometheus Splunk Terraform TypeScript
17 hours, 42 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers