Orcrist Technologies

Orcrist Technologies

Orcrist Technologies specializes in providing advanced technology solutions, including data analytics, AI applications, and cybersecurity, aimed at empowering businesses to innovate and transform through the use of artificial intelligence and data-driv...

Internet Software & Services

Description

  • Package and deploy ASR, translation, OCR, NER, and summarization models on Kubernetes using Triton and KServe.
  • Build evaluation pipelines for model quality, latency, and cost, and automate release-gating decisions.
  • Operate streaming and batch inference workflows using Kafka, Temporal, and backfill tooling.
  • Monitor model drift and quality using Prometheus, Grafana, and Evidently.
  • Optimize inference performance and cost across production environments.
  • Collaborate with TypeScript teams on payload schemas, API contracts, and human-in-the-loop feedback loops.
  • Partner with Research and product teams to deliver reliable model enrichment for the platform.

Requirements

  • 4–8+ years of experience in ML engineering or MLOps shipping models to production.
  • Strong Python experience and hands-on experience with PyTorch and Transformers.
  • Experience with Triton, KServe, or similar model serving platforms.
  • Comfortable working with Kubernetes, GitOps, CI/CD, and GPU workload operations.
  • Knowledge of model evaluation metrics, monitoring, and annotation workflows.
  • Eligible to work in Germany.
  • Export-control screening is required for certain programs.
  • Temporal, Beam/Flink, or Ray Serve experience is a plus.
  • ONNX or TensorRT optimization experience is a plus.
  • German language skills at B1+ and familiarity with defense or public safety datasets are a plus.
  • Experience with WhisperX, DeepStream/GStreamer, or vector search integrations is a plus.

Benefits

  • Remote-first work environment in Germany.
  • Regular Berlin meetups.
  • 30 days of vacation.
  • Equipment budget.
  • Learning budget.
  • Modern MLOps stack including Triton, Temporal, Kafka, MLflow/Weights & Biases, Evidently, and Kubernetes.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

ML Infrastructure Engineer

Later 51-250 Media

Later is hiring a Machine Learning Infrastructure Engineer to build and operate the ML systems that support experimentation, training, deployment, and monitoring across its AI-powered product portfolio.

AWS CI/CD CloudFormation Datadog Docker Flask GCP Generative AI GitHub Actions GitLab CI Go Grafana Java Kubernetes LLM Machine Learning MLflow MLOps Prometheus Python Scala Terraform
1 hour, 14 minutes ago

C++ Mission Software Engineer, Mission Autonomy

Anduril Industries 1K-5K Aerospace & Defense

Anduril Industries is hiring a software engineer for its Air Dominance & Strike team to help develop and deploy autonomy software for aerial and multi-domain robotic systems, including products like Fury and Barracuda.

C++ Computer Vision Linux Rust
4 hours, 16 minutes ago

ML Infrastructure Engineer

Later 51-250 Media

Later is hiring a Machine Learning Infrastructure Engineer to build the ML platform and operating foundation that supports model experimentation, deployment, monitoring, and scaling across its product portfolio.

AWS CI/CD CloudFormation Datadog Docker Flask GCP Generative AI GitHub Actions GitLab CI Go Grafana Java Kubernetes LLM MLflow Prometheus Python Scala Terraform
5 hours, 18 minutes ago

Senior Machine Learning Engineer, Data & Intelligence Products

AcuityMD 51-250 Health Care Providers & Services

AcuityMD is hiring a Senior Machine Learning Engineer to turn messy healthcare data into predictive models and intelligence products that help MedTech companies understand product usage and improve patient access.

Feature Engineering Machine Learning Python SQL
9 hours, 26 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers