Rockstar

Rockstar is a platform that connects top talent with companies, helping individuals land roles that accelerate their careers. They focus on roles that offer competitive cash and the opportunity to impact the trajectory of the business.

Professional Services
1-10

Description

  • Design, build, deploy, and maintain production GenAI systems, including LLM applications, agentic workflows, RAG pipelines, and AI-powered search capabilities.
  • Architect scalable AI services using ML frameworks, model-serving tools, APIs, Docker, Kubernetes, and CI/CD pipelines.
  • Develop and optimize retrieval systems using embeddings, vector databases, semantic search, reranking, and structured data sources.
  • Fine-tune, adapt, and evaluate LLMs for domain-specific use cases using prompt engineering, supervised fine-tuning, LoRA, or QLoRA.
  • Build automated evaluation frameworks to measure model quality, prompt performance, retrieval accuracy, reasoning reliability, latency, and cost.
  • Implement observability for AI systems, including tracing, logging, performance monitoring, drift detection, and output-quality review.
  • Translate prototypes and research concepts into reliable product features that can scale in production.
  • Partner with product managers, data engineers, backend engineers, analysts, and business stakeholders to define AI capabilities and technical tradeoffs.
  • Review architecture, provide technical guidance, mentor junior engineers, and promote strong engineering practices.
  • Create technical documentation, implementation plans, runbooks, and model lifecycle documentation.

Requirements

  • 5+ years of experience in machine learning engineering, AI engineering, data science engineering, or a related technical role.
  • 2+ years of experience building or shipping production GenAI, LLM, or AI-powered systems.
  • Advanced Python programming skills and experience building maintainable production software.
  • Hands-on experience with PyTorch, TensorFlow, Hugging Face Transformers, scikit-learn, or similar ML frameworks.
  • Experience with LLM applications, RAG systems, embeddings, vector databases, prompt engineering, and model evaluation.
  • Experience deploying AI / ML services using Docker, Kubernetes, CI/CD workflows, APIs, and cloud-native infrastructure.
  • Strong understanding of classical machine learning, deep learning, NLP, information retrieval, and model validation.
  • Ability to communicate complex AI concepts clearly to technical and non-technical stakeholders.
  • Experience mentoring engineers, reviewing technical designs, or leading complex AI engineering initiatives.
  • Advanced degree in Computer Science, Machine Learning, Artificial Intelligence, Data Science, or a related field (preferred).
  • Experience with agent frameworks such as LangGraph, AutoGen, CrewAI, or similar tools (preferred).
  • Experience with model-serving platforms such as vLLM, BentoML, Triton, Ray Serve, or similar systems (preferred).
  • Familiarity with ML observability, experiment tracking, model monitoring, and prompt/version management tools (preferred).
  • Experience with graph-based retrieval, knowledge graphs, multimodal models, large-scale data processing, or security-focused data products (preferred).
  • Experience with infrastructure-as-code, workflow orchestration, model routing, caching, batching, or quantization (preferred).

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Intern, Forward Deployed Engineering

Workato 251-1K IT Services

Workato is hiring a Forward Deployed Engineering intern to support AI-driven automation initiatives by helping build intelligent agents and enterprise workflow integrations on its Agentic AI platform.

JavaScript JSON LLM Python REST API Salesforce
13 hours, 10 minutes ago

Downeast Cider - AI Full Stack Developer

Jobrack 11-50 Professional Services

Downeast Cider is hiring an AI Full Stack Developer to become its first technical employee and build production-ready internal tools that improve operations across the business.

CRM GCP JavaScript NetSuite Python Shopify Snowflake SQL TypeScript
13 hours, 25 minutes ago

Freelance Chatbot Developer (WhatsApp / Telegram / Discord)

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is hiring a part-time freelance Bot Developer for the Tendem project to build and refine conversational bots and messaging-platform integrations in a hybrid AI and human workflow.

Docker Node.js OAuth Python REST API Serverless
13 hours, 40 minutes ago

AI Automations Specialist (Systems & Agents Focus)

Assistantly 51-250 Professional Services

Assistantly is hiring an AI Automations Specialist to design and optimize AI-driven workflows and agents for growth-stage client teams, with the goal of reducing manual work and improving business efficiency.

JavaScript Python
13 hours, 40 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers