FBS Sr Database Operations Engineer (AWS Bedrock experience)

18 hours, 2 minutes ago
Full-time
Lead
DevOps and Infrastructure
Capgemini

Capgemini

Capgemini is a global leader in consulting, technology services, and digital transformation, empowering businesses with innovative solutions and expertise to thrive in a rapidly evolving market.

Internet Software & Services
100K+
Founded 1967
$93M raised

Description

  • Ensure the availability, scalability, reliability, and security of cloud platforms and services.
  • Design, deploy, and govern AI-powered agents for autonomous self-healing and automated resource management.
  • Handle operational requests involving Terraform changes, S3 updates, user access management, and managed cloud services.
  • Supervise and refine AI-generated Infrastructure-as-Code and maintain complex automation pipelines using Terraform, Ansible, and Jenkins/CloudBees.
  • Implement AI-based automation to monitor cloud performance, respond to incidents, and manage operational issues autonomously.
  • Use GenAI tools to perform real-time root cause analysis, correlate logs and metrics, and generate runbooks and incident summaries.
  • Develop predictive ML models to forecast outages or bottlenecks and configure proactive monitoring and alerting.
  • Manage security and compliance by detecting configuration drift, remediating vulnerabilities, and supporting audits and governance standards.
  • Collaborate with application, architecture, AIOps, FinOps, and security teams to ensure production readiness and resolve incidents.
  • Review architectural designs, support migrations and deployments, troubleshoot middleware and production issues, and drive cloud cost optimization.

Requirements

  • Experience ensuring availability, scalability, reliability, and security of cloud platforms and services.
  • Hands-on experience with Terraform for EC2 changes, S3 updates, user access management, and cloud infrastructure operations.
  • Experience with managed cloud services such as SageMaker, Bedrock, Storage Gateway, RDS, and Transfer Family.
  • Experience supervising or refining AI-generated Infrastructure-as-Code using Terraform, Ansible, and Jenkins/CloudBees.
  • Experience implementing AI-driven automation for cloud operations, incident response, and autonomous monitoring.
  • Experience with GenAI tools for root cause analysis, log and metric correlation, and incident summarization.
  • Experience developing and training predictive ML models for telemetry analysis and proactive alerting.
  • Knowledge of security and compliance controls, including IAM, network, and security policy management.
  • Hands-on experience deploying applications, workloads, and data to cloud environments, including migration from on-premises or other cloud providers.
  • Advanced experience working with finance and procurement teams to implement cloud cost optimization strategies.
  • Ability to support production environments, troubleshoot complex scenarios, and work with vendors and application teams.
  • Preferred familiarity with middleware components, architecture review, cloud releases, and new cloud services/features.

Benefits

  • Competitive compensation and benefits package.
  • Competitive salary with performance-based bonuses.
  • Comprehensive benefits package.
  • Flexible work arrangements, including remote and/or office-based options.
  • Private health insurance.
  • Paid time off.
  • Training and development opportunities in partnership with renowned companies.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Machine Learning Engineer

airSlate 251-1K Professional Services

airSlate is seeking a Senior Machine Learning Engineer to develop and deploy ML and AI solutions that support high-impact marketing, SEO, and customer value initiatives at global scale.

AWS BERT Deep Learning Feature Engineering GPT LLM Machine Learning Python Reinforcement Learning SageMaker SEO
3 hours, 2 minutes ago

Senior Engineering Manager - Accelerated Compute Memory Systems

Pryon 51-250 Internet Software & Services

Pryon is seeking a Senior Engineering Manager to lead its Super Compute Memory team building cloud-native ingestion, retrieval, and inference infrastructure for large-scale AI memory workloads across commercial and federal deployments.

Apache Airflow AWS Azure C++ CloudFormation Datadog GCP Go Grafana Java Kafka Kubeflow Kubernetes Machine Learning NLP Prometheus Pulumi Python PyTorch RabbitMQ Rust TensorFlow Terraform
3 hours, 17 minutes ago

Senior Machine Learning Engineer

Spotify Media

Spotify’s Personalization team is hiring a Senior Machine Learning Engineer to help develop and improve recommendation systems that keep millions of listeners engaged across the main homepage and other personalized experiences.

Agile Apache Spark AWS GCP Java Machine Learning Python PyTorch Scala Scikit-learn Statistics TensorFlow
3 hours, 32 minutes ago

Machine Learning Engineer Lead

MUTT DATA 51-250 Internet Software & Services

Mutt Data is hiring a remote Machine Learning Engineer Lead in Argentina to lead data and ML projects that build scalable forecasting, recommendation, and AI systems for clients.

Apache Airflow AWS Azure Databricks dbt Deep Learning Docker Feature Engineering GCP Generative AI Jupyter Keras Machine Learning MLflow MLOps NumPy Pandas Plotly Python PyTorch Scikit-learn SQL TensorFlow XGBoost
3 hours, 32 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers