Orcrist Technologies

Orcrist Technologies

Orcrist Technologies specializes in providing advanced technology solutions, including data analytics, AI applications, and cybersecurity, aimed at empowering businesses to innovate and transform through the use of artificial intelligence and data-driv...

Internet Software & Services

Description

  • Package and deploy ASR, translation, OCR, NER, and summarization models on Kubernetes using Triton and KServe.
  • Build evaluation pipelines for model quality, latency, and cost, and automate release-gating decisions.
  • Operate streaming and batch inference workflows using Kafka, Temporal, and backfill tooling.
  • Monitor model drift and quality using Prometheus, Grafana, and Evidently.
  • Optimize inference performance and cost across production environments.
  • Collaborate with TypeScript teams on payload schemas, API contracts, and human-in-the-loop feedback loops.
  • Partner with Research and product teams to deliver reliable model enrichment for the platform.

Requirements

  • 4–8+ years of experience in ML engineering or MLOps shipping models to production.
  • Strong Python experience and hands-on experience with PyTorch and Transformers.
  • Experience with Triton, KServe, or similar model serving platforms.
  • Comfortable working with Kubernetes, GitOps, CI/CD, and GPU workload operations.
  • Knowledge of model evaluation metrics, monitoring, and annotation workflows.
  • Eligible to work in Germany.
  • Export-control screening is required for certain programs.
  • Temporal, Beam/Flink, or Ray Serve experience is a plus.
  • ONNX or TensorRT optimization experience is a plus.
  • German language skills at B1+ and familiarity with defense or public safety datasets are a plus.
  • Experience with WhisperX, DeepStream/GStreamer, or vector search integrations is a plus.

Benefits

  • Remote-first work environment in Germany.
  • Regular Berlin meetups.
  • 30 days of vacation.
  • Equipment budget.
  • Learning budget.
  • Modern MLOps stack including Triton, Temporal, Kafka, MLflow/Weights & Biases, Evidently, and Kubernetes.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Machine Learning Infrastructure Engineer

Unity 5K-10K Internet Software & Services

Unity is hiring a Senior Machine Learning Infrastructure Engineer for its Vector Ads team to build and operate the real-time infrastructure that powers ML-driven advertising at global, high-scale, low-latency performance.

Go Grafana Kubernetes Machine Learning OpenTelemetry Prometheus Python Terraform
24 minutes ago

Senior AI Platform Engineer

Wellhub 1-10 Gas Utilities

Wellhub is hiring a Senior AI Platform Engineer in Brazil to help build and evolve the cloud-native ML development platform that enables engineers and data scientists to develop and deploy AI at scale.

Apache Spark AWS CI/CD Kubeflow Kubernetes MLOps Python Terraform
3 hours, 43 minutes ago

Senior Software Engineer (Typescript / FrontEnd) - AI/ML

ClickHouse 51-250 IT Services

ClickHouse is hiring a Senior Software Engineer to build AI/ML-powered features for ClickHouse Cloud, connecting its high-performance database platform with end-to-end AI integrations and user-facing experiences.

AWS Azure ClickHouse GCP JavaScript Python React TypeScript
5 hours, 32 minutes ago

Senior Machine Learning Engineer, Ads Experimentation & Measurements

Unity 5K-10K Internet Software & Services

Unity’s Ads Experimentation Platform team is hiring a senior machine learning engineer to lead experimentation and evaluation for its global advertising ecosystem.

Apache Spark GCP Machine Learning MLOps Python Scala Snowflake Statistics
18 hours, 1 minute ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers