GlobalDev Tech

GlobalDev Tech

Custom software engineering & staff augmentation company Globaldev drives businesses to maximize their potential with our custom software development and staff augmentation services. Globaldev Group supports the most exciting technology companies, by p...

Internet Software & Services
51-250
Founded 2021

Description

  • Own and lead the design and development of low-latency inference services that handle billions of requests per day.
  • Build and scale real-time decision-making engines that combine machine learning models with business logic under strict SLAs.
  • Collaborate with data scientists to deploy models seamlessly and reliably into production.
  • Design runtime systems for model versioning, shadowing, and A/B testing.
  • Ensure high availability, scalability, and observability of production systems.
  • Continuously optimize latency, throughput, and cost efficiency using modern tooling and techniques.
  • Work independently while coordinating with cross-functional stakeholders across Algo, Infra, Product, Engineering, BA, and Business.
  • Support production-grade ML services with strong monitoring, diagnostics, and operational reliability.

Requirements

  • B.Sc. or M.Sc. in Computer Science, Software Engineering, or a related technical discipline.
  • 5+ years of experience building high-performance backend or ML inference systems.
  • Deep expertise in Python.
  • Experience with low-latency APIs and real-time serving frameworks such as FastAPI, Triton Inference Server, TorchServe, or BentoML.
  • Experience with scalable service architecture, message queues such as Kafka or Pub/Sub, and async processing.
  • Strong understanding of model deployment practices, online/offline feature parity, and real-time monitoring.
  • Experience in cloud environments such as AWS, GCP, or OCI and container orchestration with Kubernetes.
  • Experience with in-memory and NoSQL databases such as Aerospike, Redis, or Bigtable.
  • Familiarity with observability tools such as Prometheus, Grafana, and OpenTelemetry.
  • Strong sense of ownership and the ability to drive solutions end to end.
  • Passion for performance, clean architecture, and impactful systems.

Benefits

  • Polish public holidays.
  • 20 working days per year as Non-Operational Allowance for personal recreation, compensated in full.
  • Health insurance.
  • Gym subscription (Multisport).
  • Opportunity to lead a mission-critical inference engine at the core of the product.
  • Fast-paced, collaborative, and empowered culture with full ownership of the domain.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Machine Learning Engineer

airSlate 251-1K Professional Services

airSlate is seeking a Senior Machine Learning Engineer to develop and deploy ML and AI solutions that support high-impact marketing, SEO, and customer value initiatives at global scale.

AWS BERT Deep Learning Feature Engineering GPT LLM Machine Learning Python Reinforcement Learning SageMaker SEO
3 hours, 58 minutes ago

Senior Engineering Manager - Accelerated Compute Memory Systems

Pryon 51-250 Internet Software & Services

Pryon is seeking a Senior Engineering Manager to lead its Super Compute Memory team building cloud-native ingestion, retrieval, and inference infrastructure for large-scale AI memory workloads across commercial and federal deployments.

Apache Airflow AWS Azure C++ CloudFormation Datadog GCP Go Grafana Java Kafka Kubeflow Kubernetes Machine Learning NLP Prometheus Pulumi Python PyTorch RabbitMQ Rust TensorFlow Terraform
4 hours, 13 minutes ago

Senior Machine Learning Engineer

Spotify Media

Spotify’s Personalization team is hiring a Senior Machine Learning Engineer to help develop and improve recommendation systems that keep millions of listeners engaged across the main homepage and other personalized experiences.

Agile Apache Spark AWS GCP Java Machine Learning Python PyTorch Scala Scikit-learn Statistics TensorFlow
4 hours, 28 minutes ago

Machine Learning Engineer Lead

MUTT DATA 51-250 Internet Software & Services

Mutt Data is hiring a remote Machine Learning Engineer Lead in Argentina to lead data and ML projects that build scalable forecasting, recommendation, and AI systems for clients.

Apache Airflow AWS Azure Databricks dbt Deep Learning Docker Feature Engineering GCP Generative AI Jupyter Keras Machine Learning MLflow MLOps NumPy Pandas Plotly Python PyTorch Scikit-learn SQL TensorFlow XGBoost
4 hours, 28 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers