Creative Chaos

Creative Chaos

Creative Chaos is an integrated technology innovation firm that helps startups and enterprises bring their ideas to life through web, mobile, and IoT solutions.

Internet Software & Services
251-1K
Founded 2000

Description

  • Design and implement cloud landing zones with hub-and-spoke networking and policy guardrails across Azure and AWS.
  • Build and maintain Terraform modules, workspaces, remote state, and automated environment provisioning from development through production.
  • Operate and harden AKS and EKS clusters, including node pools, autoscaling, ingress, image scanning/signing, and zero-downtime upgrades.
  • Implement and improve CI/CD pipelines for build, test, security scanning, deployment, and gated promotions.
  • Enable platform services such as API Management/API Gateway, serverless compute, and messaging integrations.
  • Own observability across logs, metrics, tracing, alerting, runbooks, SLIs/SLOs, and on-call response.
  • Drive FinOps practices including tagging, cost allocation, rightsizing, savings plans/reserved instances, and egress optimization.
  • Onboard logs and telemetry into the SIEM and maintain security guardrails using cloud-native governance tools.
  • Enforce least-privilege access across Entra ID and AWS IAM, including managed identities and workload identity federation.
  • Lead incident investigations, perform root cause analysis, and implement preventative controls through policies, pipelines, and guardrails.

Requirements

  • Bachelor’s degree in IT, Computer Science, or a related field.
  • Minimum 5 years of related experience.
  • Hands-on production experience with both Azure and AWS.
  • Deep expertise in Terraform, including modules, workspaces, state management, and policy as code.
  • Strong Kubernetes operations experience with AKS/EKS, Helm, ingress controllers, and ACR/ECR.
  • Solid networking knowledge covering VNets/VPCs, routing, VPNs, Private Link/Endpoints, ExpressRoute/Direct Connect, load balancers, WAF, and DNS.
  • Strong identity and access management skills with Entra ID, AWS IAM, SSO/OIDC, and secrets management.
  • CI/CD implementation experience with GitHub Actions, Azure DevOps, or Jenkins, including security gates and artifact repositories.
  • Observability and SRE experience across metrics, logs, tracing, alerting, incident response, and post-mortems.
  • Strong scripting skills in PowerShell and Bash, with OS-level expertise across Linux and Windows.
  • Experience with disaster recovery patterns, high availability architectures, and RTO/RPO planning.
  • Preferred experience with M365 Conditional Access, AWS landing zone tooling, CloudFormation or Bicep, web hosting, data platforms, Kubernetes supply-chain security, and relevant certifications.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Operations Engineer

Mozilla 251-1K Internet Software & Services

Mozilla is hiring a Staff Operations Engineer to lead the design, reliability, and evolution of hybrid-cloud and workplace infrastructure across teams.

Ansible DNS Linux Puppet Python TCP/IP Unix
6 hours, 17 minutes ago

Senior Infrastructure Security Engineer

Dropbox 1K-5K Internet Software & Services

Dropbox is hiring a Security Engineer to secure its AI and agentic infrastructure while helping protect products and users across cloud and on-prem environments.

Bash CI/CD CrowdStrike Go Java Kubernetes Linux LLM Node.js OAuth OpenID Connect OWASP Python Ruby Rust SIEM
6 hours, 33 minutes ago

Cloud Infrastructure Administrator II

Jenzabar 251-1K Internet Software & Services

Jenzabar is hiring a Cloud Infrastructure Administrator II to support cloud security operations, vulnerability remediation, and compliance efforts across its cloud environment.

AWS Azure Cloudflare CrowdStrike Cybersecurity GCP Kubernetes SIEM Terraform
6 hours, 47 minutes ago

Staff Operations Engineer

Mozilla 251-1K Internet Software & Services

Mozilla is hiring a Staff Operations Engineer to lead the architecture, reliability, and evolution of hybrid-cloud and workplace infrastructure across multiple teams.

Ansible DNS Linux Puppet Python TCP/IP Unix
6 hours, 47 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers