Rockstar

Rockstar is a platform that connects top talent with companies, helping individuals land roles that accelerate their careers. They focus on roles that offer competitive cash and the opportunity to impact the trajectory of the business.

Professional Services
1-10

Description

  • Design, build, deploy, and maintain production GenAI systems, including LLM applications, agentic workflows, RAG pipelines, and AI-powered search capabilities.
  • Architect scalable AI services using ML frameworks, model-serving tools, APIs, Docker, Kubernetes, and CI/CD pipelines.
  • Develop and optimize retrieval systems using embeddings, vector databases, semantic search, reranking, and structured data sources.
  • Fine-tune, adapt, and evaluate LLMs for domain-specific use cases using prompt engineering, supervised fine-tuning, LoRA, or QLoRA.
  • Build automated evaluation frameworks to measure model quality, prompt performance, retrieval accuracy, reasoning reliability, latency, and cost.
  • Implement observability for AI systems, including tracing, logging, performance monitoring, drift detection, and output-quality review.
  • Translate prototypes and research concepts into reliable product features that can scale in production.
  • Partner with product managers, data engineers, backend engineers, analysts, and business stakeholders to define AI capabilities and technical tradeoffs.
  • Review architecture, provide technical guidance, mentor junior engineers, and promote strong engineering practices.
  • Create technical documentation, implementation plans, runbooks, and model lifecycle documentation.

Requirements

  • 5+ years of experience in machine learning engineering, AI engineering, data science engineering, or a related technical role.
  • 2+ years of experience building or shipping production GenAI, LLM, or AI-powered systems.
  • Advanced Python programming skills and experience building maintainable production software.
  • Hands-on experience with PyTorch, TensorFlow, Hugging Face Transformers, scikit-learn, or similar ML frameworks.
  • Experience with LLM applications, RAG systems, embeddings, vector databases, prompt engineering, and model evaluation.
  • Experience deploying AI / ML services using Docker, Kubernetes, CI/CD workflows, APIs, and cloud-native infrastructure.
  • Strong understanding of classical machine learning, deep learning, NLP, information retrieval, and model validation.
  • Ability to communicate complex AI concepts clearly to technical and non-technical stakeholders.
  • Experience mentoring engineers, reviewing technical designs, or leading complex AI engineering initiatives.
  • Advanced degree in Computer Science, Machine Learning, Artificial Intelligence, Data Science, or a related field (preferred).
  • Experience with agent frameworks such as LangGraph, AutoGen, CrewAI, or similar tools (preferred).
  • Experience with model-serving platforms such as vLLM, BentoML, Triton, Ray Serve, or similar systems (preferred).
  • Familiarity with ML observability, experiment tracking, model monitoring, and prompt/version management tools (preferred).
  • Experience with graph-based retrieval, knowledge graphs, multimodal models, large-scale data processing, or security-focused data products (preferred).
  • Experience with infrastructure-as-code, workflow orchestration, model routing, caching, batching, or quantization (preferred).

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

AI Native Engineer

CookUnity 251-1K Hotels, Restaurants & Leisure

CookUnity is hiring an AI Engineer to redesign and automate high-value internal workflows by building, shipping, and operating production AI tools that improve how teams work.

AWS dbt Git JIRA Kotlin Linear LLM NetSuite Notion PostgreSQL Python Snowflake SQL TypeScript Vercel
4 hours, 59 minutes ago

Cision, Senior Software Developer, Software Engineer, AMER, Canada

Cision 5K-10K Professional Services

Cision is hiring a software engineer to work with product, design, and data science teams on spec-driven development of AI-enabled .NET applications that turn high-level specs into production-ready software.

C# Git Kubernetes MySQL PostgreSQL React SQL VS Code
5 hours, 14 minutes ago

AI App Engineer (FastAPI / React / EKS)

Vecten Internet Software & Services

AI App Engineer role at a Warsaw-based AI-native data and technology partner for private capital and healthcare, focused on taking internal AI applications from working prototypes to secure, production-ready systems on AWS EKS.

AWS FastAPI JavaScript Kubernetes OWASP Python React Terraform
5 hours, 29 minutes ago

Principal/Senior Mobile Engineer, CEX

OKX 1K-5K Diversified Financial Services

OKX is hiring a CEX Trading Mobile Team lead in Hong Kong to own the OKX App’s trading market data experience and drive product, engineering, and AI innovation for millions of users.

Android Development Flutter LLM React Native
5 hours, 44 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers