Rockstar

Rockstar is a platform that connects top talent with companies, helping individuals land roles that accelerate their careers. It focuses on roles that offer competitive cash compensation and the opportunity to influence the trajectory of the business.

Industry: Professional Services
Company size: 1-10 employees

Description

  • Design, build, deploy, and maintain production GenAI systems, including LLM applications, agentic workflows, RAG pipelines, and AI-powered search capabilities.
  • Architect scalable AI services using ML frameworks, model-serving tools, APIs, Docker, Kubernetes, and CI/CD pipelines.
  • Develop and optimize retrieval systems using embeddings, vector databases, semantic search, reranking, and structured data sources.
  • Fine-tune, adapt, and evaluate LLMs for domain-specific use cases using prompt engineering, supervised fine-tuning, LoRA, or QLoRA.
  • Build automated evaluation frameworks to measure model quality, prompt performance, retrieval accuracy, reasoning reliability, latency, and cost.
  • Implement observability for AI systems, including tracing, logging, performance monitoring, drift detection, and output-quality review.
  • Translate prototypes and research concepts into reliable product features that can scale in production.
  • Partner with product managers, data engineers, backend engineers, analysts, and business stakeholders to define AI capabilities and technical tradeoffs.
  • Review architecture, provide technical guidance, mentor junior engineers, and promote strong engineering practices.
  • Create technical documentation, implementation plans, runbooks, and model lifecycle documentation.

Requirements

  • 5+ years of experience in machine learning engineering, AI engineering, data science engineering, or a related technical role.
  • 2+ years of experience building or shipping production GenAI, LLM, or AI-powered systems.
  • Advanced Python programming skills and experience building maintainable production software.
  • Hands-on experience with PyTorch, TensorFlow, Hugging Face Transformers, scikit-learn, or similar ML frameworks.
  • Experience with LLM applications, RAG systems, embeddings, vector databases, prompt engineering, and model evaluation.
  • Experience deploying AI/ML services using Docker, Kubernetes, CI/CD workflows, APIs, and cloud-native infrastructure.
  • Strong understanding of classical machine learning, deep learning, NLP, information retrieval, and model validation.
  • Ability to communicate complex AI concepts clearly to technical and non-technical stakeholders.
  • Experience mentoring engineers, reviewing technical designs, or leading complex AI engineering initiatives.
  • Advanced degree in Computer Science, Machine Learning, Artificial Intelligence, Data Science, or a related field (preferred).
  • Experience with agent frameworks such as LangGraph, AutoGen, CrewAI, or similar tools (preferred).
  • Experience with model-serving platforms such as vLLM, BentoML, Triton, Ray Serve, or similar systems (preferred).
  • Familiarity with ML observability, experiment tracking, model monitoring, and prompt/version management tools (preferred).
  • Experience with graph-based retrieval, knowledge graphs, multimodal models, large-scale data processing, or security-focused data products (preferred).
  • Experience with infrastructure-as-code, workflow orchestration, model routing, caching, batching, or quantization (preferred).
