Software Engineer, Resilience and Chaos Engineering

3 days, 13 hours ago
Mid Level
Software Development
Exa

Exa

Exa provides a real-time AI search engine and web crawling API that enables users to search and extract structured content from websites, offering deep research tools and a comprehensive suite of functionalities across multiple endpoints.

Internet Software & Services
1-10
Founded 2016

Description

  • Design and execute automated resilience tests across service boundaries and hybrid on-prem and cloud environments.
  • Improve the stability of end-user tools and frontend systems under latency and service interruptions.
  • Build scenarios that simulate AI inference timeouts, high network latency, data pipeline congestion, and malformed input.
  • Develop platform-wide guidelines that support graceful degradation during adverse conditions.
  • Create and improve observability and monitoring tools to assess overall system health.
  • Work with a distributed team across Singapore, Mountain View, and Munich on software reliability initiatives.
  • Contribute to automated frameworks that replicate real-world disruptions for robotics developers.

Requirements

  • Bachelor’s degree in Computer Science or equivalent professional experience.
  • At least 3 years of software engineering experience.
  • Experience with cloud computing.
  • Experience with Go, Python, or C++.
  • Experience with testing strategies, including integration testing, fuzzing, and property-based testing.
  • Strong communication skills.
  • Experience with systems involving physical hardware such as robots, machinery, transportation, or logistics (preferred).
  • Experience with build tools, especially Bazel, and CI/CD systems such as Jenkins, Travis, CircleCI, Spinnaker, or Terraform (preferred).
  • Experience with GitHub-based custom infrastructure and workflows; monorepo experience is a plus (preferred).

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Intern, Forward Deployed Engineering

Workato 251-1K IT Services

Workato is hiring a Forward Deployed Engineering intern to support AI-driven automation initiatives by helping build intelligent agents and enterprise workflow integrations on its Agentic AI platform.

JavaScript JSON LLM Python REST API Salesforce
13 hours, 42 minutes ago

Software Engineer 3

Black Duck Inn 1K-5K Internet Software & Services

Black Duck Software is seeking a License Developer to evolve legacy licensing systems and build reliable, production-ready services for secure 24/7 customer use.

CI/CD DevSecOps Java Kubernetes Linux REST API Ruby on Rails
13 hours, 42 minutes ago

Statistical Programmer Sr

eClinical Solutions 251-1K Professional Services

Experienced Statistical Programmer role at a clinical research organization focused on delivering compliant statistical programming outputs for multiple clinical studies and regulatory submissions.

Git GitHub GitLab R SAP Shell Scripting
13 hours, 42 minutes ago

Data Conversion Software Engineer

Career TEAM 251-1K Professional Services

Career Team is hiring a Data Conversion Software Engineer to build data transformation and integration software for government-funded workforce development programs across the United States.

Agile Angular CI/CD Docker Express.js JavaScript JSON MongoDB NestJS Next.js Node.js React Scrum TypeScript XML
13 hours, 57 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers