BOLD

BOLD

BOLD is a tech company dedicated to transforming work lives by providing online products, tools, guidance, and support for job seekers, employers, and businesses. Founded by passionate entrepreneurs, BOLD empowers individuals to reach their professiona...

Internet Software & Services
251-1K
Founded 2004

Description

  • Design and maintain end-to-end MLOps pipelines for data ingestion, feature engineering, model training, deployment, and monitoring.
  • Productionize ML and GenAI services in collaboration with data scientists on model serving and workflow optimization.
  • Implement monitoring, alerting, and observability to reduce MTTR and improve production reliability.
  • Manage data stores, feature stores, and search infrastructure for scalable ML inference.
  • Automate CI/CD for ML models and infrastructure with governance and security compliance.
  • Handle security patching, cost optimization, and 24x7 on-call support for critical services.
  • Coordinate with development, QA, operations, and data teams to improve build and deployment processes.
  • Provide day-to-day support, ad hoc requests, and cross-team project execution for production ML environments.

Requirements

  • 4.5+ years of experience for Sr. Engineer level or 7+ years for Module Lead level in AWS MLOps and DevOps.
  • Hands-on experience with AWS SageMaker, including Pipelines, Model Registry, and Studio.
  • Experience with EMR and OpenSearch, including kNN/vector search.
  • Strong Python and Bash scripting skills for CI/CD, provisioning, and monitoring.
  • Experience supporting FastAPI or Spring Boot web services and Linux servers, including Solr/OpenSearch.
  • Experience with AWS services such as S3, DynamoDB, Lambda, and Step Functions.
  • Experience with cost control and reporting for AWS infrastructure.
  • Experience with databases such as MySQL and MongoDB.
  • Strong Linux and networking fundamentals.
  • Hands-on expertise with ML tools such as MLflow, Airflow, Metaflow, and ONNX.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Machine Learning Engineer, Offline Infrastructure

Unity 5K-10K Internet Software & Services

Unity Vector is hiring a Staff ML Engineer to evolve its offline ML platform that supports large-scale data pipelines, training datasets, orchestration, and distributed model training across the company.

Apache Airflow Apache Spark Feature Engineering Flink Machine Learning MLOps Python PyTorch
5 minutes ago

DevOps Engineer

Tactacam 51-250 Household Durables

Tactacam is seeking a DevOps Engineer to support and improve the reliability, scalability, and security of its AWS- and Lambda-based data infrastructure and development workflows.

Android AWS AWS CDK Bash Datadog Elasticsearch GitHub Actions iOS JavaScript Kubernetes OpenSearch Python Serverless Shell Scripting Terraform TypeScript
30 minutes ago

Staff Development Experience Engineer

Galaxy 251-1K Capital Markets

Galaxy is seeking a hands-on Technical Lead to improve developer experience and platform delivery across its digital assets and data center infrastructure environment.

AWS Azure CI/CD Flux GCP GitHub GitHub Actions GitOps Go HashiCorp Vault Helm Jenkins Kubernetes Python Rancher Terraform TypeScript
45 minutes ago

Contract: Senior Software Engineer, Back End

Newsela 251-1K Diversified Consumer Services

Newsela’s EveryDay Labs team is hiring a Sr. Software Engineer to build and improve the systems behind attendance analytics, process management, and family communication for K–12 schools.

AWS AWS SES Terraform
1 hour, 15 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers