Flip App

Flip App

Flip is the employee app reshaping workplace communication by empowering every employee with a digital workspace for effective communication and workflow management.

Internet Software & Services
51-250
Founded 2018

Description

  • Expand and optimize cloud infrastructure on Azure and Kubernetes to support global growth.
  • Design and implement zero-downtime deployments, rollback mechanisms, and disaster-recovery strategies.
  • Evolve the observability stack using Loki, Grafana, Tempo, and Mimir and help define and optimize SLOs.
  • Design, develop, and optimize infrastructure as code with Pulumi in Go to reduce toil and enable self-service.
  • Promote CI/CD best practices, incident management, post-mortems, and developer experience across engineering.
  • Collaborate with the squad and engineering leadership on platform direction, including scalability, cost optimization, security, and compliance.
  • Support platform reliability through on-call participation and operational ownership.

Requirements

  • 1–3 years of hands-on experience as an SRE, Platform Engineer, DevOps Engineer, Infrastructure Engineer, Cloud Engineer, or Backend Engineer with an infrastructure focus.
  • Experience operating and scaling cloud infrastructure on Azure, GCP, or AWS.
  • Deep knowledge of Kubernetes and container orchestration in production environments.
  • Hands-on experience with observability tools such as Prometheus, Mimir, Loki, or ELK, including SLOs and error budgets.
  • Solid software development skills in Go, Python, or Kotlin; Go is preferred.
  • Experience with infrastructure as code tools such as Pulumi, OpenTofu, or Terraform.
  • Experience with configuration management tools such as Ansible or Chef.
  • Collaborative mindset, strong communication skills, and business-fluent English.
  • Willingness to participate in on-call rotations.
  • Preferred experience building and operating high-throughput, highly available systems in production.
  • Preferred experience with Azure Kubernetes Service (AKS).
  • Preferred experience with Kubernetes Gateway API and Envoy Gateway.
  • Preferred familiarity with GitOps workflows and CI/CD pipeline design.
  • Preferred knowledge of service mesh technologies such as Linkerd or Istio.
  • Preferred experience with Kubernetes Operators such as Strimzi or CNPG.
  • Preferred experience operating highly available PostgreSQL.

Benefits

  • Remote-first work with flexibility to work from home.
  • Occasional in-person collaboration in the Berlin or Stuttgart offices with advance notice.
  • Covered E-Gym-Wellpass membership and job bike leasing.
  • Relaxed working atmosphere with motivated and committed colleagues.
  • Regular team events and culture days.
  • Opportunity to shape the company and grow with a fast-growing tech organization.
  • Ability to work abroad within the European Union, subject to discussion.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Database Reliability Engineer (DBRE) (worldwide remote)

CloudLinux 51-250 IT Services

CloudLinux / TuxCare is hiring a Senior Database Reliability Engineer to own and improve the reliability, automation, and incident response of its production PostgreSQL and broader database infrastructure.

Ansible ClickHouse DNS GitLab Grafana JIRA Linux MongoDB OpsGenie PostgreSQL Redis Terraform TLS
52 minutes ago

Associate SRE

66degrees 251-1K IT Services

66degrees is hiring a Site Reliability Engineer to support enterprise Google Cloud environments through reliability engineering, automation, and incident response for client workloads.

Agile Datadog GCP Kanban Kubernetes Linux Prometheus Python Scrum Terraform
3 hours, 35 minutes ago

Senior Site Reliability Engineer - AWS

Filevine 251-1K Specialized Consumer Services

Filevine is hiring a Senior Site Reliability Engineer to embed with cross-functional teams and improve the reliability, automation, and scalability of its AWS-based legal technology platform.

AWS Bash CI/CD EC2 Kubernetes PowerShell Python
13 hours, 31 minutes ago

Staff Site Reliability Engineer

Puck 1-10 Internet Software & Services

Domino is hiring a senior Site Reliability Engineer to build AI-assisted reliability systems and strengthen the operational resilience of its cloud-based data science platform.

Go Kubernetes Linux LLM Python
14 hours, 33 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers