Production Reliability Engineer

1 hour, 53 minutes ago
Full-time
Mid Level
Software Development
Unity

Unity is the top platform for real-time 3D content creation, empowering creators across industries to bring their ideas to life with interactive 2D and 3D content.

Internet Software & Services
5K-10K employees
Founded 2004

Description

  • Tackle complex multi-tenant infrastructure challenges, including tenant isolation, policy enforcement, cost allocation, SOX compliance, and scaling shared Kubernetes clusters.
  • Contribute to the direction of cloud infrastructure standardization at Unity.
  • Work with engineering and Site Reliability Engineering teams to improve service applications, infrastructure, and deployment practices.
  • Help strengthen platform capabilities such as secrets management, policy enforcement, deployment pipelines, cost attribution, and security hardening.
  • Write proposals, contribute to technical discussions, and help align engineering peers through clear communication and technical quality.
  • Partner with development teams to improve service resiliency by sharing best practices and applying reliability engineering principles.
  • Design repeatable, observable, and well-documented systems with a strong bias toward automation.
  • Collaborate across a global team and work effectively across time zones.

Requirements

  • Proven experience as an engineer with strong expertise in Kubernetes, cloud-native architecture, and infrastructure-as-code.
  • Experience building and operating production infrastructure, ideally on multi-tenant or shared platforms.
  • Strong experience with GCP and exposure to other cloud providers such as AWS and Azure.
  • Ability to evaluate trade-offs in cloud services and make sound architectural decisions.
  • Track record of delivering platform capabilities in areas such as secrets management, policy enforcement, deployment pipelines, cost attribution, or security hardening.
  • Ability to influence through technical work and clear communication.
  • Experience partnering with development teams to improve service resiliency and apply reliability engineering practices.
  • Comfort working on a global team across multiple time zones.
  • Nice to have: Golang, Python, or Node.js.
  • Nice to have: Kubernetes, Helm, Kustomize, and ArgoCD.
  • Nice to have: Terraform, Vault, and infrastructure-as-code best practices.
  • Nice to have: GKE, Cloud SQL, IAM, networking, and BigQuery.
  • Nice to have: Docker and containerization best practices.
  • Nice to have: GitHub Actions and CI/CD pipeline design.
  • Nice to have: cloud networking, DNS, and TLS certificate management.
  • Nice to have: multi-cloud infrastructure patterns for AWS and Azure.
  • Nice to have: FinOps practices and cloud cost optimization.
  • Ability to communicate professionally in English, written and spoken.
  • No relocation support available.
  • No work visa or immigration sponsorship available.

Benefits

  • Comprehensive health, life, and disability insurance.
  • Commute subsidy.
  • Employee stock ownership.
  • Competitive retirement and pension plans.
  • Generous vacation and personal days.
  • Support for new parents through leave and family-care programs.
  • Mental health and wellbeing programs and support.
  • Training and development programs.
  • Employee Resource Groups and a Global Employee Assistance Program.
  • Office food and snacks.
  • Volunteering and donation matching program.
