Coupa Software

Coupa Software

Coupa Software is the premier cloud-based finance platform, empowering companies worldwide to optimize spend, boost profits, and reduce costs with a comprehensive suite of modules.

Internet Software & Services
1K-5K
Founded 2006

Description

  • Build, deploy, and troubleshoot microservices in Kubernetes and Amazon EKS.
  • Design secure, highly available web applications with strong capacity planning and performance optimization.
  • Deploy and manage the lifecycle of LLMs and embedding models.
  • Define KPIs to measure and improve AI application performance.
  • Evaluate and integrate emerging technologies such as RAG systems, MCP servers, AI Agents, and agentic workflows.
  • Manage AWS and GenAI services using infrastructure-as-code tools such as Terraform and Chef.
  • Maintain observability and incident awareness using tools such as New Relic and PagerDuty.
  • Collaborate with product, platform, and engineering teams on architecture, security patching, incident response, and release management.
  • Support the reliability of ML and GenAI infrastructure across the platform.

Requirements

  • Bachelor’s degree required.
  • 8+ years of experience managing large-scale cloud applications.
  • Strong background in Linux administration and troubleshooting.
  • 5+ years of hands-on experience managing cloud infrastructure across AWS, GCP, and Azure.
  • Practical experience with LLMs and embedding models such as OpenAI, AWS Bedrock, and SageMaker.
  • Familiarity with vector databases such as LanceDB is a plus.
  • Strong scripting skills in Bash or Python.
  • Experience with container orchestration platforms such as Amazon EKS or Azure AKS.
  • Proficiency with DevOps and automation tools such as Chef, GitHub Actions, Rundeck, Terraform, Spacelift, and Helm.
  • Working knowledge of DNS, load balancers, MySQL, and Git branching strategies.
  • Excellent communication skills and a collaborative mindset.
  • Ability to take ownership, drive solutions, and deliver results independently.

Benefits

  • Remote work environment.
  • Inclusive and welcoming workplace.
  • Equal employment opportunities and fair hiring practices.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Machine Learning Engineer

Samsara 1K-5K IT Services

Samsara is hiring a Staff Machine Learning Engineer to develop end-to-end AI solutions and core ML infrastructure for physical operations customers using large-scale sensor, video, diagnostic, and text data.

Apache Spark C++ Computer Vision Machine Learning Python Rust
1 day, 2 hours ago

Senior Intelligent Process Automation Engineer (IPA)

GlobalDev Tech 51-250 Internet Software & Services

Senior Intelligent Process Automation Engineer at a transportation and logistics company, responsible for designing integration-first automation solutions that connect multiple systems into end-to-end workflows and support intelligent document processing.

Docker Kubernetes Machine Learning Microservices NLP REST API
1 day, 2 hours ago

Principal Machine Learning Engineer

Qodea is seeking a Principal Machine Learning Engineer to lead the architecture and evolution of large-scale data and ML systems that improve data quality, enrichment, and intelligent product linking within its Knowledge domain.

CI/CD Docker GCP Go GraphQL Kubernetes LLM Machine Learning NLP Node.js Python Redis REST API Scala SQL
1 day, 2 hours ago

Senior MLOps Engineer

Prolific 51-250 Professional Services

Prolific is hiring a Senior MLOps Engineer to build and operate the cloud and machine learning infrastructure that takes AI research into production across use cases like fraud detection and RAG-based search.

AWS CI/CD GCP GitHub Actions Kubernetes Machine Learning MLflow MLOps Serverless Terraform
1 day, 2 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers