Flip App

Flip App

Flip is the employee app reshaping workplace communication by empowering every employee with a digital workspace for effective communication and workflow management.

Internet Software & Services
51-250
Founded 2018

Description

  • Expand and optimize cloud infrastructure on Azure and Kubernetes to support global growth.
  • Design and implement zero-downtime deployments, rollback mechanisms, and disaster-recovery strategies.
  • Evolve the observability stack using Loki, Grafana, Tempo, and Mimir and help define and optimize SLOs.
  • Design, develop, and optimize infrastructure as code with Pulumi in Go to reduce toil and enable self-service.
  • Promote CI/CD best practices, incident management, post-mortems, and developer experience across engineering.
  • Collaborate with the squad and engineering leadership on platform direction, including scalability, cost optimization, security, and compliance.
  • Support platform reliability through on-call participation and operational ownership.

Requirements

  • 1–3 years of hands-on experience as an SRE, Platform Engineer, DevOps Engineer, Infrastructure Engineer, Cloud Engineer, or Backend Engineer with an infrastructure focus.
  • Experience operating and scaling cloud infrastructure on Azure, GCP, or AWS.
  • Deep knowledge of Kubernetes and container orchestration in production environments.
  • Hands-on experience with observability tools such as Prometheus, Mimir, Loki, or ELK, including SLOs and error budgets.
  • Solid software development skills in Go, Python, or Kotlin; Go is preferred.
  • Experience with infrastructure as code tools such as Pulumi, OpenTofu, or Terraform.
  • Experience with configuration management tools such as Ansible or Chef.
  • Collaborative mindset, strong communication skills, and business-fluent English.
  • Willingness to participate in on-call rotations.
  • Preferred experience building and operating high-throughput, highly available systems in production.
  • Preferred experience with Azure Kubernetes Service (AKS).
  • Preferred experience with Kubernetes Gateway API and Envoy Gateway.
  • Preferred familiarity with GitOps workflows and CI/CD pipeline design.
  • Preferred knowledge of service mesh technologies such as Linkerd or Istio.
  • Preferred experience with Kubernetes Operators such as Strimzi or CNPG.
  • Preferred experience operating highly available PostgreSQL.

Benefits

  • Remote-first work with flexibility to work from home.
  • Occasional in-person collaboration in the Berlin or Stuttgart offices with advance notice.
  • Covered E-Gym-Wellpass membership and job bike leasing.
  • Relaxed working atmosphere with motivated and committed colleagues.
  • Regular team events and culture days.
  • Opportunity to shape the company and grow with a fast-growing tech organization.
  • Ability to work abroad within the European Union, subject to discussion.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Operations Engineer

Mozilla 251-1K Internet Software & Services

Mozilla is hiring a Staff Operations Engineer to lead the design, reliability, and evolution of hybrid-cloud and workplace infrastructure across teams.

Ansible DNS Linux Puppet Python TCP/IP Unix
5 hours, 25 minutes ago

Principal Site Reliability Engineer (SRE)

Symmetrio Professional Services

Symmetrio is recruiting a Principal Site Reliability Engineer for a rapidly growing healthcare technology company to own the reliability, scalability, security, and performance of a mission-critical SaaS platform used by healthcare providers across the United States.

Active Directory AWS CI/CD Datadog Django Grafana Kubernetes Python Terraform Windows Server
5 hours, 40 minutes ago

Performance Test Engineer Lead

PartnerOne 51-250 Media

An enterprise performance engineering role at a cloud-focused organization, responsible for validating the scalability, stability, and production readiness of distributed systems across Azure and hybrid environments.

Azure CI/CD Kubernetes PowerShell
5 hours, 55 minutes ago

Site Reliability Engineer

MLabs 11-50 Internet Software & Services

Remote UK-hours Site Reliability Engineering role at a financial technology company, focused on automating and operating the infrastructure that supports global integration services for financial institutions.

Active Directory Ansible AWS CI/CD GCP OAuth PostgreSQL SAML
6 hours, 10 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers