Senior/Staff Platform Engineer

2 weeks ago
Full-time
Lead
DevOps and Infrastructure
VRChat

VRChat is a platform where users create and explore immersive virtual reality experiences, with social interaction and community-driven content creation supported through its Unity SDK.

Internet Software & Services
51-250 employees
Founded 2014
$95M raised

Description

  • Operate and improve production infrastructure with a focus on reliability, security, performance, and cost efficiency.
  • Define, measure, and improve reliability using SLIs, SLOs, SLAs, error budgets, and DORA metrics.
  • Build and improve monitoring, alerting, dashboards, logging, and incident response processes.
  • Participate in incident management, root cause analysis, postmortems, and follow-up remediation.
  • Automate infrastructure and operational workflows using infrastructure-as-code and scripting tools.
  • Work closely with engineering teams to improve service reliability, deployment quality, and operational readiness.
  • Turn ambiguous infrastructure, reliability, and operational problems into clear, scalable, and measurable solutions.
  • Engage with backend codebases through code reviews, pull requests, and occasional feature or tooling work.
  • Plan and deploy infrastructure in collaboration with IT, Engineering, and functional leaders.

Requirements

  • 8+ years of experience in SRE, DevOps, Platform Engineering, or Infrastructure Engineering.
  • Strong experience operating high-availability production systems.
  • Experience with cloud or hybrid cloud environments and infrastructure-as-code tools such as Terraform or OpenTofu.
  • Strong knowledge of Linux, networking, automation, observability, and incident management.
  • Strong communication skills and ability to work with technical and non-technical stakeholders.
  • Operational knowledge of databases such as MongoDB, Elasticsearch, or Redis.
  • Experience with AWS, including core infrastructure services, cost optimization, and multi-account architecture (preferred).
  • Experience with Kubernetes, including networking, service discovery, ingress, and workload reliability (preferred).
  • Experience with Cilium or other Kubernetes networking/security solutions (preferred).
  • Experience supporting large-scale storage systems or working with CDNs, caching, distributed systems, or real-time platforms (preferred).

Benefits

  • 100% remote work from anywhere.
  • Health benefits.
  • 401(k) for US employees and RRSP for Canadian employees.
  • Stock options.
  • Generous paid holiday schedule.
  • Unlimited/flexible vacation time.
  • Paid parental leave benefits.
