Orcrist Technologies

Orcrist Technologies specializes in providing advanced technology solutions, including data analytics, AI applications, and cybersecurity, aimed at empowering businesses to innovate and transform through the use of artificial intelligence and data-driv...

Internet Software & Services

Description

  • Package and deploy ASR, translation, OCR, NER, and summarization models on Kubernetes using Triton and KServe.
  • Build evaluation pipelines for model quality, latency, and cost, and automate release-gating decisions.
  • Operate streaming and batch inference workflows using Kafka, Temporal, and backfill tooling.
  • Monitor model drift and quality using Prometheus, Grafana, and Evidently.
  • Optimize inference performance and cost across production environments.
  • Collaborate with TypeScript teams on payload schemas, API contracts, and human-in-the-loop feedback workflows.
  • Partner with research and product teams to deliver reliable model enrichment for the platform.

Requirements

  • 4–8+ years of ML engineering or MLOps experience, including shipping models to production.
  • Strong Python skills and hands-on experience with PyTorch and Transformers.
  • Experience with Triton, KServe, or similar model serving platforms.
  • Comfortable working with Kubernetes, GitOps, CI/CD, and GPU workload operations.
  • Knowledge of model evaluation metrics, monitoring, and annotation workflows.
  • Eligible to work in Germany.
  • Export-control screening is required for certain programs.
  • Temporal, Beam/Flink, or Ray Serve experience is a plus.
  • ONNX or TensorRT optimization experience is a plus.
  • German language skills at B1+ and familiarity with defense or public safety datasets are a plus.
  • Experience with WhisperX, DeepStream/GStreamer, or vector search integrations is a plus.

Benefits

  • Remote-first work environment in Germany.
  • Regular Berlin meetups.
  • 30 days of vacation.
  • Equipment budget.
  • Learning budget.
  • Modern MLOps stack including Triton, Temporal, Kafka, MLflow/Weights & Biases, Evidently, and Kubernetes.
