Senior LLM Engineer: Text & Reasoning LLM / NLU

1 month ago
Full-time
Lead
Software Development
Omilia

Omilia is a global leader in Conversational AI, offering AI-based self-service solutions for enhanced customer care fulfillment and success.

IT Services
251-1K employees
Founded 2002
$20M raised

Description

  • Lead research and experimentation on new model architectures, training strategies, and evaluation methods for LLM/NLU services.
  • Design, develop, fine-tune, and evaluate specialized LLMs for Concierge and Task Agents.
  • Develop and optimize ML pipelines for training, evaluation, and deployment on AWS SageMaker.
  • Architect and maintain inference servers with a focus on low latency and high reliability.
  • Implement and evolve closed-loop self-learning systems for continuous model improvement.
  • Drive benchmarking, experiment reproducibility, and high-quality documentation.
  • Ensure compliance with data privacy standards throughout the ML lifecycle, including PCI/PII requirements.
  • Provide technical mentorship through code reviews, pairing, knowledge sharing, and team guidance.
  • Align Product, Architecture, and Engineering on technical decisions and delivery plans.

Requirements

  • 5+ years of experience in applied LLM, ML, NLU, or NLP with ownership of production ML systems at scale.
  • Strong hands-on experience with Python, PyTorch, and HuggingFace Transformers.
  • Deep experience with LLM fine-tuning, distillation, prompt engineering, evaluation, and deployment, especially for small or efficient models.
  • Solid foundation in NLU concepts such as intent classification and entity extraction.
  • Experience with model serving infrastructure such as Triton Inference Server, vLLM, TGI, or FastAPI.
  • Experience with cloud ML infrastructure such as AWS SageMaker, Bedrock, or equivalent platforms.
  • Proven architectural decision-making and technical ownership across services or products.
  • Ability to break down ambiguous problems and drive actionable plans independently.
  • Excellent communication skills for both technical and non-technical audiences.
  • Preferred: experience with agentic system design, including tool use, reasoning chains, and multi-step planning.
  • Preferred: experience with self-learning or continuous-improvement ML systems.
  • Preferred: multilingual NLU or cross-lingual transfer experience.
  • Preferred: familiarity with PCI/PII compliance in ML workflows.
  • Preferred: experience with experiment tracking tools such as Weights & Biases or MLflow.
  • Preferred: open-source ML/NLP contributions, publications at top venues, or experience with speech or multimodal LLMs.

Benefits

  • Fixed compensation.
  • Long-term employment with vacation days.
  • Professional development support, including courses and training.
  • Opportunity to work on cutting-edge technology products with global impact.
  • Collaborative team of skilled and approachable colleagues.
  • Apple gear provided.
  • Equal opportunity employment in a diverse and inclusive workplace.
