BOLD

BOLD

BOLD is a tech company dedicated to transforming work lives by providing online products, tools, guidance, and support for job seekers, employers, and businesses. Founded by passionate entrepreneurs, BOLD empowers individuals to reach their professiona...

Internet Software & Services
251-1K
Founded 2004

Description

  • Design and maintain end-to-end MLOps pipelines for data ingestion, feature engineering, model training, deployment, and monitoring.
  • Productionize ML and GenAI services in collaboration with data scientists on model serving and workflow optimization.
  • Implement monitoring, alerting, and observability to reduce MTTR and improve production reliability.
  • Manage data stores, feature stores, and search infrastructure for scalable ML inference.
  • Automate CI/CD for ML models and infrastructure with governance and security compliance.
  • Handle security patching, cost optimization, and 24x7 on-call support for critical services.
  • Coordinate with development, QA, operations, and data teams to improve build and deployment processes.
  • Provide day-to-day support, ad hoc requests, and cross-team project execution for production ML environments.

Requirements

  • 4.5+ years of experience for Sr. Engineer level or 7+ years for Module Lead level in AWS MLOps and DevOps.
  • Hands-on experience with AWS SageMaker, including Pipelines, Model Registry, and Studio.
  • Experience with EMR and OpenSearch, including kNN/vector search.
  • Strong Python and Bash scripting skills for CI/CD, provisioning, and monitoring.
  • Experience supporting FastAPI or Spring Boot web services and Linux servers, including Solr/OpenSearch.
  • Experience with AWS services such as S3, DynamoDB, Lambda, and Step Functions.
  • Experience with cost control and reporting for AWS infrastructure.
  • Experience with databases such as MySQL and MongoDB.
  • Strong Linux and networking fundamentals.
  • Hands-on expertise with ML tools such as MLflow, Airflow, Metaflow, and ONNX.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

DevOps Engineer

Apptronik 51-250 Aerospace & Defense

Apptronik is seeking a Senior DevOps Engineer to own the infrastructure, CI/CD, and developer platforms that help software teams ship Apollo robot software quickly and reliably across on-prem and cloud environments.

AWS Azure Bash Buildkite C++ CircleCI Embedded Systems GCP GitHub Actions GitLab CI GitOps Grafana Helm Jenkins Kubernetes Linux OpenTelemetry Prometheus Python Terraform
28 minutes ago

Senior Software Engineer (Typescript / FrontEnd) - AI/ML

ClickHouse 51-250 IT Services

ClickHouse is hiring a Senior Software Engineer to build AI/ML-powered features for ClickHouse Cloud, connecting its high-performance database platform with production AI capabilities across the backend and user interface.

AWS Azure ClickHouse GCP JavaScript Python React TypeScript
1 hour, 3 minutes ago

Software Engineer II - Model Platform

Abnormal AI Internet Software & Services

Abnormal AI is hiring a Software Backend Engineer II for its Detection Team to build the model platform and infrastructure that powers email and cloud attack detection at scale.

AWS Azure Django GCP Go Kubernetes PostgreSQL Python
1 hour, 3 minutes ago

DevOps - SRE Engineer - Argentina

Coderio 51-250 Internet Software & Services

Coderio is hiring a remote DevOps/SRE Engineer in Argentina to ensure the stability, scalability, and efficient operation of the infrastructure supporting its digital platforms.

Argo CD Flux GitHub Actions Helm Jenkins Kubernetes OpenShift Terraform
1 hour, 23 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers