AHEAD

AHEAD

AHEAD accelerates the impact of technology on clients by engineering customized data, developer, and infrastructure platforms that improve IT operations. By weaving together cloud infrastructure, intelligent operations, and modern applications, we help...

IT Services
1K-5K
$43M raised

Description

  • Lead client discovery sessions, assessments, and workshops focused on observability, telemetry, reliability, and operational maturity.
  • Define target-state observability architectures and roadmaps aligned to cloud, platform engineering, SRE, and AIOps initiatives.
  • Design and guide implementation across metrics, logs, traces, dashboards, alerting, and incident workflows.
  • Help clients adopt modern telemetry practices using OpenTelemetry and related open-source, cloud-native, and enterprise toolsets.
  • Build or refine dashboards, alerts, service views, and operational integrations to improve visibility, context, and signal quality.
  • Improve alert quality, incident triage, escalation paths, runbooks, and post-incident feedback loops.
  • Partner with engineering, platform, operations, and leadership stakeholders to align observability investments to business priorities.
  • Translate observability capabilities into measurable outcomes such as improved reliability, faster incident resolution, reduced alert noise, and better user experience.
  • Advise on telemetry governance, data quality, retention, cardinality, access, and cost optimization.
  • Contribute to implementation planning, solution governance, documentation, enablement, and operational handoff.
  • Support integration patterns across observability, ITSM, incident management, collaboration, workflow, and AI-assisted operations platforms.
  • Provide hands-on technical leadership, including configuration guidance, implementation oversight, design validation, troubleshooting support, and quality review.
  • Mentor delivery teams and help build reusable patterns, accelerators, and best practices within the practice.

Requirements

  • 6+ years of experience in consulting, engineering, SRE, platform engineering, or operations with strong observability responsibility.
  • Hands-on experience with observability concepts across metrics, logs, traces, alerting, service health, and incident response.
  • Experience with OpenTelemetry and familiarity with telemetry pipeline design, instrumentation patterns, and collection architecture.
  • Working knowledge of open-source observability tools such as Prometheus, Grafana, Loki, Tempo, Mimir, Jaeger, or Elastic.
  • Experience with one or more enterprise observability platforms such as Datadog, Dynatrace, Splunk, New Relic, Elastic, LogicMonitor, Honeycomb, or Chronosphere.
  • Strong understanding of Kubernetes, containers, cloud-native architectures, and at least one major public cloud platform.
  • Experience with Terraform, Helm, CI/CD pipelines, Ansible, or related automation and platform tooling.
  • Solid understanding of distributed systems, modern application architectures, and operational best practices across SRE, DevOps, or ITSM environments.
  • Experience defining service health models, SLIs, SLOs, alerting strategies, or reliability measurement frameworks.
  • Ability to lead client-facing conversations, structure ambiguous problems, and translate strategy into executable workstreams.
  • Demonstrated ability to operate in ambiguous client environments and communicate tradeoffs clearly across engineering, operations, and leadership audiences.
  • Strong written and verbal communication skills with both technical and executive audiences.
  • A continuous learning mindset and a collaborative approach to delivery and practice building.
  • Experience operationalizing error budgets, burn-rate alerting, production readiness reviews, or reliability governance practices.
  • Experience integrating observability with ServiceNow, incident management platforms, CMDB, or collaboration tools.
  • Experience with telemetry cost optimization, sampling strategies, retention policies, tagging standards, cardinality management, or observability governance.
  • Familiarity with platform engineering, internal developer platforms, service catalogs, golden paths, or self-service observability patterns.
  • Exposure to AIOps, anomaly detection, event correlation, automated remediation workflows, or MCP-enabled integration patterns for AI-assisted operations.
  • Familiarity with microservices, service mesh technologies, end-user monitoring, or digital experience monitoring concepts.
  • Generalist coding or scripting experience in Python, Java, Go, JavaScript, or .NET.

Benefits

  • Comprehensive health insurance coverage for employees, with options to extend coverage to dependents.
  • Paid time off and company holidays, along with additional leave benefits as per policy.
  • Flexible work arrangements supporting work-life balance.
  • Learning and development opportunities to support continuous growth and upskilling.
  • Employee wellness initiatives and programs focused on physical and mental well-being.
  • Retirement and statutory benefits in line with India regulations.
  • Sponsorship for certifications and credentials for continued learning.
  • Inclusive, people-first culture with a strong focus on collaboration and ownership.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Solutions Consultant

Emburse 251-1K Diversified Financial Services

Emburse is hiring a remote Solutions Consultant I to support its Pre-Sales team by demonstrating expense management and accounts payable software to SMB and Velocity prospects and helping advance sales opportunities.

2 hours, 28 minutes ago

Business Technology Specialist

Inspiroz 51-250 Internet Software & Services

Inspiroz is hiring a Business Technology Specialist to support a fast-growing commercial property management client by coordinating and executing technology integrations for newly acquired offices nationwide.

3 hours, 9 minutes ago

Global Technical Account Management Lead, Cash App Pay, Afterpay & Clearpay

Block 10K-50K Capital Markets

Block is hiring a leader for its global Technical Account Management team supporting Cash App Pay, Afterpay, and Clearpay merchants, to drive post-sales technical partnership, merchant health, and commercial growth.

E-commerce Google Tag Manager
4 hours, 11 minutes ago

Epic MyChart Analyst

Prominence 51-250 Professional Services

Prominence Advisors is hiring an Epic MyChart Analyst to support healthcare organizations with Epic consulting and implementation work that improves clinical and operational outcomes.

7 hours, 10 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers