Lucidya

Lucidya

Lucidya provides a leading platform for customer experience management in the Arab World, utilizing AI-driven social media analytics and monitoring tools to enhance strategic decision-making and improve brand performance across various social channels.

Media
51-250
Founded 2016
$7M raised

Description

  • Design and maintain highly available, fault-tolerant, and scalable infrastructure.
  • Proactively identify and eliminate single points of failure before they cause incidents.
  • Manage and continuously improve cloud workloads across AWS, GCP, or Azure.
  • Use Infrastructure as Code, such as Terraform, to standardize and scale infrastructure.
  • Operate, troubleshoot, and scale Kubernetes clusters in production.
  • Implement and refine monitoring and alerting systems using tools such as Prometheus, Grafana, Datadog, or ELK.
  • Respond to incidents, lead root cause analysis, and drive follow-up improvements.
  • Write scripts and build tooling to automate repetitive operational work.
  • Collaborate with DevOps and engineering teams to resolve performance bottlenecks and improve CI/CD reliability.
  • Help define and promote reliability best practices across the organization.

Requirements

  • ~3 years of experience in SRE, DevOps, or infrastructure engineering.
  • Hands-on experience with cloud environments such as AWS, GCP, or Azure.
  • Production experience with Kubernetes and the ability to troubleshoot cluster issues.
  • Experience using Terraform or similar Infrastructure as Code tools.
  • Strong working knowledge of Docker and containerized workloads.
  • Ability to write automation scripts in Python, Bash, or similar languages.
  • Understanding of CI/CD pipelines such as Jenkins, GitHub Actions, or Bitbucket.
  • Solid grasp of networking, load balancing, and high-availability design.
  • Experience implementing observability tools such as Prometheus, Grafana, Datadog, or ELK.
  • Ability to distinguish meaningful alerts from noise and focus on actionable signals.
  • Experience with RabbitMQ or Redis in production is a plus.
  • Familiarity with Ansible or AWX is a plus.
  • Exposure to multi-cloud or hybrid environments is a plus.
  • Cloud certifications in AWS or GCP, or Linux certifications, are a plus.
  • Background from ITI (Information Technology Institute) is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Platform Site Reliability Specialist (Observability & Kubernetes) (copy)

Everbridge 1K-5K Internet Software & Services

Everbridge is hiring a Staff Platform Site Reliability Specialist to own and evolve its enterprise observability platform and Kubernetes environment across a large-scale cloud-native AWS and GCP infrastructure.

AWS GCP Grafana Kubernetes Terraform
27 minutes ago

Senior SRE Engineer

Stellar Cyber 51-250 Professional Services

Stellar Cyber is hiring a Senior Site Reliability Engineer to strengthen the reliability, scalability, and operational excellence of its cloud-based cybersecurity platform.

Apache Spark Argo CD AWS Azure Bash Bitbucket CI/CD Elasticsearch GCP GitHub Actions Grafana Helm Kafka Kubernetes MongoDB Prometheus Python Redis Terraform
57 minutes ago

Senior Site Reliability Engineer

Airalo 51-250 Airlines

Airalo is hiring a Senior Site Reliability Engineer in its fully remote Engineering team to help scale and improve the reliability of the global eSIM platform used by millions of travellers.

Agile AWS Datadog GitHub Actions Go Java Kubernetes OpenTelemetry Prometheus Python Scrum Terraform
57 minutes ago

Senior SRE Engineer

Stellar Cyber 51-250 Professional Services

Stellar Cyber is seeking a Senior Site Reliability Engineer to strengthen the reliability, scalability, and operational excellence of its cloud-native security platforms used by enterprises, government agencies, and MSSPs.

Apache Spark Argo CD AWS Azure Bash Bitbucket CI/CD Elasticsearch GCP GitHub Actions Grafana Helm Kafka Kubernetes MongoDB Prometheus Python Redis Terraform
1 hour, 12 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers