Senior LLM Engineer: Text & Reasoning LLM / NLU

1 month, 3 weeks ago
Full-time
Lead
Software Development
Omilia

Omilia

Omilia is a global leader in Conversational AI, offering AI-based self-service solutions for enhanced customer care fulfillment and success.

IT Services
251-1K
Founded 2002
$20M raised

Description

  • Lead research and experimentation on new model architectures, training strategies, and evaluation methods for LLM/NLU services.
  • Design, develop, fine-tune, and evaluate specialized LLMs for Concierge and Task Agents.
  • Develop and optimize ML pipelines for training, evaluation, and deployment on AWS SageMaker.
  • Architect and maintain inference servers with a focus on low latency and high reliability.
  • Implement and evolve closed-loop self-learning systems for continuous model improvement.
  • Drive benchmarking, experiment reproducibility, and high-quality documentation.
  • Ensure compliance with data privacy standards throughout the ML lifecycle, including PCI/PII requirements.
  • Provide technical mentorship through code reviews, pairing, knowledge sharing, and team guidance.
  • Align Product, Architecture, and Engineering on technical decisions and delivery plans.

Requirements

  • 5+ years of experience in applied LLM, ML, NLU, or NLP with ownership of production ML systems at scale.
  • Strong hands-on experience with Python, PyTorch, and HuggingFace Transformers.
  • Deep experience with LLM fine-tuning, distillation, prompt engineering, evaluation, and deployment, especially for small or efficient models.
  • Solid foundation in NLU concepts such as intent classification and entity extraction.
  • Experience with model serving infrastructure such as Triton Inference Server, vLLM, TGI, or FastAPI.
  • Experience with cloud ML infrastructure such as AWS SageMaker, Bedrock, or equivalent platforms.
  • Proven architectural decision-making and technical ownership across services or products.
  • Ability to break down ambiguous problems and drive actionable plans independently.
  • Excellent communication skills for both technical and non-technical audiences.
  • Preferred: experience with agentic system design, including tool use, reasoning chains, and multi-step planning.
  • Preferred: experience with self-learning or continuous-improvement ML systems.
  • Preferred: multilingual NLU or cross-lingual transfer experience.
  • Preferred: familiarity with PCI/PII compliance in ML workflows.
  • Preferred: experience with experiment tracking tools such as Weights & Biases or MLflow.
  • Preferred: open-source ML/NLP contributions, publications at top venues, or experience with speech or multimodal LLMs.

Benefits

  • Fixed compensation.
  • Long-term employment with vacation days.
  • Professional development support, including courses and training.
  • Opportunity to work on cutting-edge technology products with global impact.
  • Collaborative team of skilled and approachable colleagues.
  • Apple gear provided.
  • Equal opportunity employment in a diverse and inclusive workplace.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

AI Native Engineer

CookUnity 251-1K Hotels, Restaurants & Leisure

CookUnity is hiring an AI Engineer to redesign and automate high-value internal workflows by building, shipping, and operating production AI tools that improve how teams work.

AWS dbt Git JIRA Kotlin Linear LLM NetSuite Notion PostgreSQL Python Snowflake SQL TypeScript Vercel
1 hour, 23 minutes ago

Cision, Senior Software Developer, Software Engineer, AMER, Canada

Cision 5K-10K Professional Services

Cision is hiring a software engineer to work with product, design, and data science teams on spec-driven development of AI-enabled .NET applications that turn high-level specs into production-ready software.

C# Git Kubernetes MySQL PostgreSQL React SQL VS Code
1 hour, 38 minutes ago

AI App Engineer (FastAPI / React / EKS)

Vecten Internet Software & Services

AI App Engineer role at a Warsaw-based AI-native data and technology partner for private capital and healthcare, focused on taking internal AI applications from working prototypes to secure, production-ready systems on AWS EKS.

AWS FastAPI JavaScript Kubernetes OWASP Python React Terraform
1 hour, 53 minutes ago

Principal/Senior Mobile Engineer, CEX

OKX 1K-5K Diversified Financial Services

OKX is hiring a CEX Trading Mobile Team lead in Hong Kong to own the OKX App’s trading market data experience and drive product, engineering, and AI innovation for millions of users.

Android Development Flutter LLM React Native
2 hours, 8 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers