Stellar Cyber

Stellar Cyber

Stellar Cyber provides Next Gen SIEM Security, Network Detection, and Response platforms with AI-driven threat analysis, empowering lean security teams to secure environments effectively.

Professional Services
51-250
Founded 2017
$80M raised

Description

  • Administer and maintain container orchestration platforms and containerized workloads.
  • Monitor and troubleshoot production systems, including participation in on-call rotations.
  • Improve observability across systems and data platforms by enhancing monitoring, logging, and alerting.
  • Administer and optimize cloud-based environments across multiple providers.
  • Manage and support distributed data platforms and real-time processing systems.
  • Develop and maintain continuous integration and delivery pipelines for reliable deployments.
  • Own and implement Infrastructure as Code practices for consistency and scalability.
  • Automate and orchestrate infrastructure using programming and scripting languages.
  • Perform system administration and networking tasks for internal and external environments.
  • Collaborate with engineers and stakeholders across multiple time zones.

Requirements

  • 5+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles.
  • Proven success leading large-scale production systems in cloud environments such as AWS, GCP, Azure, or OCI.
  • Demonstrated leadership in incident response, on-call best practices, and reliability-focused culture.
  • Strong experience with production on-call operations and incident management.
  • Advanced proficiency in Kubernetes administration and troubleshooting.
  • Hands-on experience with observability tools including Prometheus, Grafana, Loki, and Alertmanager.
  • Knowledge of chat-based operations interfaces and/or auto-remediation controllers using an AI agentic framework.
  • Understanding of AI agents for auto-triaging alerts, correlating signals, and suggesting root-cause hypotheses.
  • Experience operating data platforms such as Elasticsearch, MongoDB, Spark, Kafka, and Redis.
  • Proficiency with public cloud services, Python, Bash, Terraform, Helm, and CI/CD tools such as GitHub Actions, Bitbucket, or ArgoCD.
  • Strong technical background in distributed systems, databases, networking, and Linux administration.
  • Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • Certifications in AWS, GCP, Observability, Linux, or Kubernetes are a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Platform Site Reliability Specialist (Observability & Kubernetes) (copy)

Everbridge 1K-5K Internet Software & Services

Everbridge is hiring a Staff Platform Site Reliability Specialist to own and evolve its enterprise observability platform and Kubernetes environment across a large-scale cloud-native AWS and GCP infrastructure.

AWS GCP Grafana Kubernetes Terraform
26 minutes ago

Senior Site Reliability Engineer

Airalo 51-250 Airlines

Airalo is hiring a Senior Site Reliability Engineer in its fully remote Engineering team to help scale and improve the reliability of the global eSIM platform used by millions of travellers.

Agile AWS Datadog GitHub Actions Go Java Kubernetes OpenTelemetry Prometheus Python Scrum Terraform
56 minutes ago

Senior Site Reliability Engineer (EST)

Teikametrics 251-1K Media

Teikametrics is hiring a Senior Site Reliability Engineer in Bengaluru/remote to manage and improve the cloud infrastructure, deployment systems, and operational reliability of its AI-driven marketplace platform.

AWS Bash CI/CD CircleCI Databricks Datadog Docker GCP Java JavaScript Kafka Kubernetes OpenSearch PostgreSQL Python Terraform
1 hour, 11 minutes ago

Site Reliability Engineer II

Cority 251-1K Chemicals

Cority is hiring a remote Site Reliability Engineer II to support the reliability, performance, and scalability of its cloud-hosted services and database platforms.

Bash CI/CD GitLab Jenkins Kubernetes Linux Oracle PostgreSQL PowerShell Python Redis SQL Server
1 hour, 11 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers