Senior LLM Engineer: Text & Reasoning LLM / NLU

10 hours, 57 minutes ago
Full-time
Lead
Software Development
Omilia

Omilia

Omilia is a global leader in Conversational AI, offering AI-based self-service solutions for enhanced customer care fulfillment and success.

IT Services
251-1K
Founded 2002
$20M raised

Description

  • Lead research and experimentation on new model architectures, training strategies, and evaluation methods for LLM/NLU services.
  • Design, develop, fine-tune, and evaluate specialized LLMs for Concierge and Task Agents.
  • Develop and optimize ML pipelines for training, evaluation, and deployment on AWS SageMaker.
  • Architect and maintain inference servers with a focus on low latency and high reliability.
  • Implement and evolve closed-loop self-learning systems for continuous model improvement.
  • Drive benchmarking, experiment reproducibility, and high-quality documentation.
  • Ensure compliance with data privacy standards throughout the ML lifecycle, including PCI/PII requirements.
  • Provide technical mentorship through code reviews, pairing, knowledge sharing, and team guidance.
  • Align Product, Architecture, and Engineering on technical decisions and delivery plans.

Requirements

  • 5+ years of experience in applied LLM, ML, NLU, or NLP with ownership of production ML systems at scale.
  • Strong hands-on experience with Python, PyTorch, and HuggingFace Transformers.
  • Deep experience with LLM fine-tuning, distillation, prompt engineering, evaluation, and deployment, especially for small or efficient models.
  • Solid foundation in NLU concepts such as intent classification and entity extraction.
  • Experience with model serving infrastructure such as Triton Inference Server, vLLM, TGI, or FastAPI.
  • Experience with cloud ML infrastructure such as AWS SageMaker, Bedrock, or equivalent platforms.
  • Proven architectural decision-making and technical ownership across services or products.
  • Ability to break down ambiguous problems and drive actionable plans independently.
  • Excellent communication skills for both technical and non-technical audiences.
  • Preferred: experience with agentic system design, including tool use, reasoning chains, and multi-step planning.
  • Preferred: experience with self-learning or continuous-improvement ML systems.
  • Preferred: multilingual NLU or cross-lingual transfer experience.
  • Preferred: familiarity with PCI/PII compliance in ML workflows.
  • Preferred: experience with experiment tracking tools such as Weights & Biases or MLflow.
  • Preferred: open-source ML/NLP contributions, publications at top venues, or experience with speech or multimodal LLMs.

Benefits

  • Fixed compensation.
  • Long-term employment with vacation days.
  • Professional development support, including courses and training.
  • Opportunity to work on cutting-edge technology products with global impact.
  • Collaborative team of skilled and approachable colleagues.
  • Apple gear provided.
  • Equal opportunity employment in a diverse and inclusive workplace.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Binance Accelerator Program - Data Scientist (LLM & Trading)

Binance 5K-10K Capital Markets

Binance is hiring a Binance Accelerator Program Data Scientist in Hong Kong/Taipei to help develop and deploy AI-powered financial trading systems built on Web3 data and large language models.

Blockchain LLM PyTorch Reinforcement Learning TensorFlow
10 hours, 57 minutes ago

Project Lion - Lead Prompt Engineer - United States (Remote, Part-Time)

Welo Global Professional Services

Welo Data is hiring a remote, part-time Lead Prompt Engineer in the United States to lead the migration of prompt templates to LLM autoraters and improve AI model performance across client systems.

LLM SQL
10 hours, 57 minutes ago

[Job-28773] AI Reverse-Engineering Specialist / Software Architect, Remote, Brazil

CI&T 5K-10K Internet Software & Services

CI&T is seeking a remote AI Reverse-Engineering Specialist / Software Architect in Brazil to analyze a legacy RPG codebase, extract critical business logic, and guide its migration to a modern, scalable architecture.

LLM
11 hours, 12 minutes ago

Lead Full Stack Engineer (Node.js, React/Vue & AI Solutions)

CoverGo 51-250 Insurance

CoverGo is hiring a Lead Full Stack Engineer to lead development of its enterprise insurance SaaS platform, guiding architecture, mentoring engineers, and advancing AI-enabled capabilities in a fully remote international team.

AWS Azure Docker GCP GraphQL Hugging Face Machine Learning MongoDB NestJS NLP Node.js React TypeScript Vue.js
11 hours, 12 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers