AHEAD

AHEAD

AHEAD accelerates the impact of technology on clients by engineering customized data, developer, and infrastructure platforms that improve IT operations. By weaving together cloud infrastructure, intelligent operations, and modern applications, we help...

IT Services
1K-5K
$43M raised

Description

  • Lead day-to-day operations, administration, monitoring, support, and continuous improvement of customer cloud environments.
  • Own complex incidents, escalations, and problem investigations through advanced troubleshooting and durable resolution.
  • Plan and execute operational changes and recurring activities such as provisioning, access changes, maintenance, patching, and backup validation.
  • Serve as a senior escalation point for major incidents, high-impact issues, and after-hours change activity.
  • Maintain and reinforce ITSM processes for incident, request, change, problem, escalation, documentation, and customer communication.
  • Develop and maintain runbooks, SOPs, standards, knowledge articles, and technical documentation.
  • Mentor other Cloud Engineers and review work for quality, completeness, and operational best practices.
  • Improve monitoring, alerting, logging, tagging, policy, compliance, and cost visibility across managed cloud environments.
  • Operate and support production Red Hat OpenShift clusters, including health, upgrades, scaling, and lifecycle management.
  • Troubleshoot Kubernetes/OpenShift issues and coordinate remediation with application, network, and security teams.
  • Implement and validate backup, restore, and disaster recovery procedures for OpenShift and associated data.
  • Support automation and standardization efforts using infrastructure as code and GitOps practices.
  • Define and improve observability for cloud and OpenShift platforms and contribute to availability, performance, and capacity planning.
  • Participate in customer meetings, service reviews, and advisory discussions, and communicate technical risk and improvement opportunities clearly.

Requirements

  • 5+ years of customer-facing IT infrastructure, cloud operations, systems administration, or managed services support experience in production environments.
  • Strong operational expertise in at least one major cloud platform, with the ability to lead complex support and administration activities in Azure.
  • Experience with other clouds such as GCP, AWS, and OCI is strongly preferred.
  • Minimum 3+ years supporting a production OpenShift environment, including on-premises, ROSA, or ARO deployments.
  • Experience leading complex incidents, escalations, change execution, and problem investigations in production environments.
  • Experience with Windows and/or Linux server operations, networking fundamentals, identity and access management, monitoring, governance, and operational documentation.
  • Experience in a managed services, consulting, or multi-customer support environment, ideally with complex enterprise customers, is preferred.
  • Strong working knowledge of PowerShell, Python, Bash, infrastructure as code, automation, CI/CD, or related platform tooling is preferred.
  • Relevant advanced cloud, operations, or platform certifications are a plus.
  • Preferred certifications include Red Hat Certified Specialist in OpenShift Administration or Red Hat Certified OpenShift Administrator, RHCSA, CKA/CKAD, and AWS/Azure certifications.

Benefits

  • $140,000 - $160,000 annual OTE, including base salary and applicable target bonus.
  • Medical, dental, and vision insurance.
  • 401(k) retirement plan.
  • Paid company holidays.
  • Paid time off.
  • Paid parental and caregiver leave.
  • Cross-department training and development support, including sponsored certifications and credentials.
  • Remote full-time work arrangement with some travel and occasional after-hours/weekend support as needed.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Physical Security Systems Technician - IT & Infrastructure

Nebius 51-250 Internet Software & Services

Nebius is hiring a hands-on Physical Security Systems Technician to support the deployment, validation, and maintenance of CCTV and access control infrastructure across European sites.

Linux
1 hour, 43 minutes ago

Machine Learning Systems Engineer

Motional 1K-5K Automotive

Motional is hiring a Machine Learning Systems Engineer for its ML Acceleration team to improve large-scale model training systems for speed, cost, reliability, and throughput.

Machine Learning Python PyTorch
1 hour, 46 minutes ago

Licensed Civil Engineer - Data Center

Olsson 1K-5K Construction & Engineering

Olsson is hiring a Licensed Civil Engineer to support its Data Center Civil team on large hyperscale and colocation data center projects across the U.S., with a focus on designing critical infrastructure for complex engineering-driven developments.

3 hours, 8 minutes ago

IT Infra Lead

Weekday 11-50 Construction & Engineering

IT Infra Lead for a UK-based life sciences technology company, responsible for owning and evolving the infrastructure supporting India and UK operations for a regulatory compliance platform.

Azure CI/CD Cisco CRM DHCP DNS Fortinet JIRA macOS Palo Alto PowerShell Python SIEM
3 hours, 36 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers