Stellar Cyber

Stellar Cyber

Stellar Cyber provides Next Gen SIEM Security, Network Detection, and Response platforms with AI-driven threat analysis, empowering lean security teams to secure environments effectively.

Professional Services
51-250
Founded 2017
$80M raised

Description

  • Administer and maintain container orchestration platforms and containerized workloads.
  • Monitor and troubleshoot production systems and participate in on-call rotations to ensure reliability.
  • Improve observability across systems and data platforms by enhancing monitoring, logging, and alerting.
  • Administer and optimize cloud-based environments across multiple providers.
  • Manage and support distributed data platforms and real-time processing systems.
  • Develop and maintain CI/CD pipelines for efficient and reliable deployments.
  • Own and implement Infrastructure as Code practices to improve consistency and scalability.
  • Automate and orchestrate infrastructure using programming and scripting languages.
  • Perform system administration and networking tasks for internal and external environments.
  • Collaborate with engineers and stakeholders across different time zones.

Requirements

  • 5+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles.
  • Proven success leading large-scale production systems in cloud environments such as AWS, GCP, Azure, or OCI.
  • Experience driving incident response, on-call best practices, and a reliability-focused culture.
  • Strong experience with production on-call operations and incident management.
  • Advanced proficiency in Kubernetes administration and troubleshooting.
  • Hands-on experience with observability tools such as Prometheus, Grafana, Loki, and Alertmanager.
  • Knowledge of chat-based operations interfaces and/or auto-remediation controllers using an AI agentic framework.
  • Understanding of AI agents for auto-triaging alerts, correlating signals, and suggesting root-cause hypotheses.
  • Experience operating data platforms such as Elasticsearch, MongoDB, Spark, Kafka, and Redis.
  • Proficiency with public cloud services such as AWS, Azure, GCP, or OCI.
  • Strong programming and automation skills in Python and Bash.
  • Deep understanding of Infrastructure as Code tools such as Terraform and Helm.
  • Experience with CI/CD pipelines and tools such as GitHub Actions, Bitbucket, and ArgoCD.
  • Strong technical background in distributed systems, databases, networking, and Linux administration.
  • Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • Certifications in AWS, GCP, Observability, Linux, or Kubernetes are a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

SRE - DevOps Engineer - Argentina

Coderio 51-250 Internet Software & Services

Coderio is hiring a remote DevOps/SRE Engineer in Argentina to ensure the stability, scalability, and efficient operation of the infrastructure that supports its global digital solutions.

Argo CD CI/CD Flux GitHub Actions GitOps Helm Jenkins Kubernetes OpenShift Terraform
2 hours, 23 minutes ago

Site Reliability Engineer

Recorded Future 251-1K Professional Services

Recorded Future is hiring a Site Reliability Engineer to strengthen the reliability, scalability, and performance of its critical cloud systems in close partnership with engineering teams.

AWS Chef Elasticsearch ELK Stack Grafana Kafka Kibana Kubernetes Linux Logstash Microservices MongoDB OpenTelemetry Prometheus RabbitMQ Terraform
3 hours, 40 minutes ago

Site Reliability Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Site Reliability Engineer to own the reliability, automation, and production stability of its AWS-based services used by thousands of MSPs worldwide.

Ansible AWS Chef CloudFormation Datadog DevSecOps Elasticsearch Kibana Kubernetes MySQL PostgreSQL Puppet Secrets Management Serverless Terraform
5 hours, 3 minutes ago

Senior Site Reliability Engineer

Cribl 251-1K IT Services

Cribl is hiring a Senior Site Reliability Engineer in Poland to help build and operate the telemetry infrastructure and observability platform that supports its cloud products and enterprise customers.

Ansible AWS Azure CI/CD Grafana JavaScript Kibana Linux New Relic Node.js PagerDuty Prometheus Splunk Terraform TypeScript
19 hours, 46 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers