Creative Chaos

Creative Chaos

Creative Chaos is an integrated technology innovation firm that helps startups and enterprises bring their ideas to life through web, mobile, and IoT solutions.

Internet Software & Services
251-1K
Founded 2000

Description

  • Design and implement cloud landing zones with hub-and-spoke networking and policy guardrails across Azure and AWS.
  • Build and maintain Terraform modules, workspaces, remote state, and automated environment provisioning from development through production.
  • Operate and harden AKS and EKS clusters, including node pools, autoscaling, ingress, image scanning/signing, and zero-downtime upgrades.
  • Implement and improve CI/CD pipelines for build, test, security scanning, deployment, and gated promotions.
  • Enable platform services such as API Management/API Gateway, serverless compute, and messaging integrations.
  • Own observability across logs, metrics, tracing, alerting, runbooks, SLIs/SLOs, and on-call response.
  • Drive FinOps practices including tagging, cost allocation, rightsizing, savings plans/reserved instances, and egress optimization.
  • Onboard logs and telemetry into the SIEM and maintain security guardrails using cloud-native governance tools.
  • Enforce least-privilege access across Entra ID and AWS IAM, including managed identities and workload identity federation.
  • Lead incident investigations, perform root cause analysis, and implement preventative controls through policies, pipelines, and guardrails.

Requirements

  • Bachelor’s degree in IT, Computer Science, or a related field.
  • Minimum 5 years of related experience.
  • Hands-on production experience with both Azure and AWS.
  • Deep expertise in Terraform, including modules, workspaces, state management, and policy as code.
  • Strong Kubernetes operations experience with AKS/EKS, Helm, ingress controllers, and ACR/ECR.
  • Solid networking knowledge covering VNets/VPCs, routing, VPNs, Private Link/Endpoints, ExpressRoute/Direct Connect, load balancers, WAF, and DNS.
  • Strong identity and access management skills with Entra ID, AWS IAM, SSO/OIDC, and secrets management.
  • CI/CD implementation experience with GitHub Actions, Azure DevOps, or Jenkins, including security gates and artifact repositories.
  • Observability and SRE experience across metrics, logs, tracing, alerting, incident response, and post-mortems.
  • Strong scripting skills in PowerShell and Bash, with OS-level expertise across Linux and Windows.
  • Experience with disaster recovery patterns, high availability architectures, and RTO/RPO planning.
  • Preferred experience with M365 Conditional Access, AWS landing zone tooling, CloudFormation or Bicep, web hosting, data platforms, Kubernetes supply-chain security, and relevant certifications.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Infrastructure Software Engineer

Dropbox 1K-5K Internet Software & Services

Dropbox is hiring an Infrastructure Software Engineer to build and operate the core systems that support flagship products, large-scale data infrastructure, and future engineering initiatives.

C++ Go Java Python
3 hours, 13 minutes ago

Infrastructure Software Engineer

Dropbox 1K-5K Internet Software & Services

Dropbox is hiring an Infrastructure Software Engineer to build and improve the core systems that power its flagship products, support massive-scale data and connectivity, and enable a more reliable platform for millions of users.

C++ Go Java Python
5 hours, 5 minutes ago

IT Infrastructure Engineer

Terabase Energy 51-250 Renewable Electricity

Terabase Energy is seeking a Senior Infrastructure Engineer to own the virtualization, server, storage, backup, and network infrastructure that supports critical SCADA and engineering systems in a high-availability industrial environment.

Ansible PowerShell Terraform
5 hours, 9 minutes ago

Senior Lead Software Engineer - Developer Infrastructure

Klaviyo 1K-5K IT Services

Klaviyo is hiring a Senior Lead Software Engineer to own backend developer infrastructure architecture and drive platform reliability, dependency management, and engineering velocity across the company.

Apache Airflow Apache Spark AWS Azure Buildkite ClickHouse Django Docker FastAPI GCP Go Jest Kafka Kubernetes Microservices MySQL PostgreSQL Python RabbitMQ React Redis Terraform TypeScript
10 hours, 5 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers