Sumo Logic

Sumo Logic

Sumo Logic offers top-tier cloud monitoring, log management, and Cloud SIEM tools for web and SaaS apps, empowering businesses with real-time insights and high-quality software delivery.

Internet Software & Services
251-1K
Founded 2010

Description

  • Lead technical evaluation and adoption of agentic AI platforms and frameworks, including Anthropic, LangChain/LangGraph, AWS Bedrock, and emerging tools.
  • Architect, prototype, and productionize multi-agent AI systems for detection, triage, investigation, and response workflows.
  • Own core agent architecture components such as planning, execution, tool orchestration, memory, context engineering, and long-running workflows.
  • Lead offline and online evaluation pipelines for AI agents, including golden datasets, synthetic data generation, and human/LLM-based judging.
  • Drive LLM fine-tuning and alignment efforts to improve domain-specific reasoning, accuracy, and reliability.
  • Design scalable LLMOps and AI infrastructure for inference routing, latency optimization, cost control, and production observability.
  • Partner with product, security, and data platform teams to deliver end-to-end AI capabilities from prototype to customer-facing systems.
  • Provide technical direction and mentorship to AI engineers working on agentic AI and LLM systems.
  • Define and implement best practices for AI safety, reliability, evaluation, and monitoring in production.
  • Set technical direction for ambiguous problems and drive delivery across teams.

Requirements

  • B.Tech, M.Tech, or Ph.D. in Computer Science, Machine Learning, Data Science, or a related technical field.
  • 5+ years of hands-on industry experience building, operating, and leading production ML/AI systems.
  • Strong foundation in machine learning, distributed systems, data pipelines, and large-scale system design.
  • Deep understanding of LLMs, prompt engineering, context engineering, agentic AI design patterns, and reasoning workflows.
  • Strong proficiency in Python and modern ML/AI ecosystems.
  • Experience designing and operating evaluation frameworks for ML/LLM systems, including offline and online evaluation.
  • Proven ability to lead complex technical initiatives across teams and influence architecture decisions.
  • Excellent communication skills and ability to translate complex AI systems into business impact.
  • Hands-on experience building and scaling agentic AI systems or multi-agent architectures in production (preferred).
  • Experience with modern agent frameworks such as LangGraph, LangChain, CrewAI, or similar (preferred).
  • Experience with foundation model platforms such as Anthropic, OpenAI, AWS Bedrock, or Vertex AI (preferred).
  • Experience with LLM fine-tuning pipelines such as SFT, RLHF/RLAIF, preference learning, or domain adaptation (preferred).
  • Strong background in LLMOps, including inference optimization, latency/cost management, observability, and production monitoring (preferred).
  • Experience with tools such as PyTorch, MLflow, Airflow, Docker, Kubernetes, and cloud platforms like AWS, GCP, or Azure (preferred).
  • Experience applying AI/ML to security, observability, or large-scale log/telemetry data is a strong plus.
  • Must be authorized to work in the United States at the time of hire and for the duration of employment.
  • No non-immigrant visa sponsorship is available for this position.

Benefits

  • Eligible roles may participate in bonus or commission plans.
  • Eligible roles may receive equity awards.
  • Benefits offerings are available as part of the compensation package.
  • Compensation is variable based on role level, skills, qualifications, location, and experience.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Software Engineer - AI Engineering

RTB House 251-1K Media

RTB House is hiring a hands-on technical leader to lead its AI Engineering Lab, building internal tools and autonomous agents that improve engineering productivity through Agentic AI.

AWS Azure Deep Learning Docker GCP Go Java Kubernetes LLM Microservices Python Scala System Design TypeScript
1 hour, 42 minutes ago

AI Team Lead

Cato Networks 251-1K Diversified Telecommunication Services

Cato Networks is hiring a Hands-On Team Lead – Agentic Engineering to lead a new team building AI agent and workflow solutions that improve internal business processes across the organization.

LLM Python
3 hours, 49 minutes ago

Director of Product, Lab

Fundraise Up 51-250 Capital Markets

Fundraise Up is hiring a Lab leader in Serbia to build and run a 0→1 product exploration function that tests high-risk opportunities and decides which ideas should scale, pivot, or be killed.

Google Tag Manager LLM Prototyping
4 hours, 4 minutes ago

Arquitecto de Automatización de Servicios y Habilitación de IA

NEORIS 5K-10K Internet Software & Services

NEORIS is seeking an Automation Services and AI Enablement Architect to modernize end-user support platforms and lead the delivery of scalable, secure AIOps solutions across internal teams and selected vendors.

Azure Generative AI
4 hours, 44 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers