Sumo Logic

Sumo Logic

Sumo Logic offers top-tier cloud monitoring, log management, and Cloud SIEM tools for web and SaaS apps, empowering businesses with real-time insights and high-quality software delivery.

Internet Software & Services
251-1K
Founded 2010

Description

  • Lead technical evaluation and adoption of agentic AI platforms and frameworks, including Anthropic, LangChain/LangGraph, AWS Bedrock, and emerging tools.
  • Architect, prototype, and productionize multi-agent AI systems for detection, triage, investigation, and response workflows.
  • Own core agent architecture components such as planning, execution, tool orchestration, memory, context engineering, and long-running workflows.
  • Lead offline and online evaluation pipelines for AI agents, including golden datasets, synthetic data generation, and human/LLM-based judging.
  • Drive LLM fine-tuning and alignment efforts to improve domain-specific reasoning, accuracy, and reliability.
  • Design scalable LLMOps and AI infrastructure for inference routing, latency optimization, cost control, and production observability.
  • Partner with product, security, and data platform teams to deliver end-to-end AI capabilities from prototype to customer-facing systems.
  • Provide technical direction and mentorship to AI engineers working on agentic AI and LLM systems.
  • Define and implement best practices for AI safety, reliability, evaluation, and monitoring in production.
  • Set technical direction for ambiguous problems and drive delivery across teams.

Requirements

  • B.Tech, M.Tech, or Ph.D. in Computer Science, Machine Learning, Data Science, or a related technical field.
  • 5+ years of hands-on industry experience building, operating, and leading production ML/AI systems.
  • Strong foundation in machine learning, distributed systems, data pipelines, and large-scale system design.
  • Deep understanding of LLMs, prompt engineering, context engineering, agentic AI design patterns, and reasoning workflows.
  • Strong proficiency in Python and modern ML/AI ecosystems.
  • Experience designing and operating evaluation frameworks for ML/LLM systems, including offline and online evaluation.
  • Proven ability to lead complex technical initiatives across teams and influence architecture decisions.
  • Excellent communication skills and ability to translate complex AI systems into business impact.
  • Hands-on experience building and scaling agentic AI systems or multi-agent architectures in production (preferred).
  • Experience with modern agent frameworks such as LangGraph, LangChain, CrewAI, or similar (preferred).
  • Experience with foundation model platforms such as Anthropic, OpenAI, AWS Bedrock, or Vertex AI (preferred).
  • Experience with LLM fine-tuning pipelines such as SFT, RLHF/RLAIF, preference learning, or domain adaptation (preferred).
  • Strong background in LLMOps, including inference optimization, latency/cost management, observability, and production monitoring (preferred).
  • Experience with tools such as PyTorch, MLflow, Airflow, Docker, Kubernetes, and cloud platforms like AWS, GCP, or Azure (preferred).
  • Experience applying AI/ML to security, observability, or large-scale log/telemetry data is a strong plus.
  • Must be authorized to work in the United States at the time of hire and for the duration of employment.
  • No non-immigrant visa sponsorship is available for this position.

Benefits

  • Eligible roles may participate in bonus or commission plans.
  • Eligible roles may receive equity awards.
  • Benefits offerings are available as part of the compensation package.
  • Compensation is variable based on role level, skills, qualifications, location, and experience.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

AI Native Engineer

CookUnity 251-1K Hotels, Restaurants & Leisure

CookUnity is hiring an AI Engineer to redesign and automate high-value internal workflows by building, shipping, and operating production AI tools that improve how teams work.

AWS dbt Git JIRA Kotlin Linear LLM NetSuite Notion PostgreSQL Python Snowflake SQL TypeScript Vercel
5 hours, 52 minutes ago

Cision, Senior Software Developer, Software Engineer, AMER, Canada

Cision 5K-10K Professional Services

Cision is hiring a software engineer to work with product, design, and data science teams on spec-driven development of AI-enabled .NET applications that turn high-level specs into production-ready software.

C# Git Kubernetes MySQL PostgreSQL React SQL VS Code
6 hours, 7 minutes ago

AI App Engineer (FastAPI / React / EKS)

Vecten Internet Software & Services

AI App Engineer role at a Warsaw-based AI-native data and technology partner for private capital and healthcare, focused on taking internal AI applications from working prototypes to secure, production-ready systems on AWS EKS.

AWS FastAPI JavaScript Kubernetes OWASP Python React Terraform
6 hours, 22 minutes ago

Software Engineer II, Backend (ML Training & Serving)

Affirm 1K-5K Diversified Financial Services

Affirm is hiring a Software Engineer II for its ML Training & Serving engineering team to build the infrastructure that trains and serves machine learning models across the company.

AWS Kotlin Kubernetes Machine Learning MySQL Python
6 hours, 22 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers