AHEAD

AHEAD

AHEAD accelerates the impact of technology on clients by engineering customized data, developer, and infrastructure platforms that improve IT operations. By weaving together cloud infrastructure, intelligent operations, and modern applications, we help...

IT Services
1K-5K
$43M raised

Description

  • Lead client discovery sessions, assessments, and workshops focused on observability, telemetry, reliability, and operational maturity.
  • Define target-state observability architectures and roadmaps aligned to cloud, platform engineering, SRE, and AIOps initiatives.
  • Design and guide implementation across metrics, logs, traces, dashboards, alerting, and incident workflows.
  • Help clients adopt modern telemetry practices using OpenTelemetry and related open-source, cloud-native, and enterprise toolsets.
  • Build or refine dashboards, alerts, service views, and operational integrations to improve visibility, context, and signal quality.
  • Improve alert quality, incident triage, escalation paths, runbooks, and post-incident feedback loops.
  • Partner with engineering, platform, operations, and leadership stakeholders to align observability investments to business priorities.
  • Translate observability capabilities into measurable outcomes such as improved reliability, faster incident resolution, reduced alert noise, and better user experience.
  • Advise on telemetry governance, data quality, retention, cardinality, access, and cost optimization.
  • Contribute to implementation planning, solution governance, documentation, enablement, and operational handoff.
  • Support integration patterns across observability, ITSM, incident management, collaboration, workflow, and AI-assisted operations platforms.
  • Provide hands-on technical leadership, including configuration guidance, implementation oversight, design validation, troubleshooting support, and quality review.
  • Mentor delivery teams and help build reusable patterns, accelerators, and best practices within the practice.

Requirements

  • 6+ years of experience in consulting, engineering, SRE, platform engineering, or operations with strong observability responsibility.
  • Hands-on experience with observability concepts across metrics, logs, traces, alerting, service health, and incident response.
  • Experience with OpenTelemetry and familiarity with telemetry pipeline design, instrumentation patterns, and collection architecture.
  • Working knowledge of open-source observability tools such as Prometheus, Grafana, Loki, Tempo, Mimir, Jaeger, or Elastic.
  • Experience with one or more enterprise observability platforms such as Datadog, Dynatrace, Splunk, New Relic, Elastic, LogicMonitor, Honeycomb, or Chronosphere.
  • Strong understanding of Kubernetes, containers, cloud-native architectures, and at least one major public cloud platform.
  • Experience with Terraform, Helm, CI/CD pipelines, Ansible, or related automation and platform tooling.
  • Solid understanding of distributed systems, modern application architectures, and operational best practices across SRE, DevOps, or ITSM environments.
  • Experience defining service health models, SLIs, SLOs, alerting strategies, or reliability measurement frameworks.
  • Ability to lead client-facing conversations, structure ambiguous problems, and translate strategy into executable workstreams.
  • Demonstrated ability to operate in ambiguous client environments and communicate tradeoffs clearly across engineering, operations, and leadership audiences.
  • Strong written and verbal communication skills with both technical and executive audiences.
  • A continuous learning mindset and a collaborative approach to delivery and practice building.
  • Experience operationalizing error budgets, burn-rate alerting, production readiness reviews, or reliability governance practices.
  • Experience integrating observability with ServiceNow, incident management platforms, CMDB, or collaboration tools.
  • Experience with telemetry cost optimization, sampling strategies, retention policies, tagging standards, cardinality management, or observability governance.
  • Familiarity with platform engineering, internal developer platforms, service catalogs, golden paths, or self-service observability patterns.
  • Exposure to AIOps, anomaly detection, event correlation, automated remediation workflows, or MCP-enabled integration patterns for AI-assisted operations.
  • Familiarity with microservices, service mesh technologies, end-user monitoring, or digital experience monitoring concepts.
  • Generalist coding or scripting experience in Python, Java, Go, JavaScript, or .NET.

Benefits

  • Comprehensive health insurance coverage for employees, with options to extend coverage to dependents.
  • Paid time off and company holidays, along with additional leave benefits as per policy.
  • Flexible work arrangements supporting work-life balance.
  • Learning and development opportunities to support continuous growth and upskilling.
  • Employee wellness initiatives and programs focused on physical and mental well-being.
  • Retirement and statutory benefits in line with India regulations.
  • Sponsorship for certifications and credentials for continued learning.
  • Inclusive, people-first culture with a strong focus on collaboration and ownership.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Solutions Architect | Spain | Remote

Grafana 1K-5K IT Services

Grafana Labs is hiring a Senior Solutions Architect for Professional Services to help customers implement and expand Grafana-based observability solutions through hands-on technical consulting, training, and engagement delivery.

AWS Azure GCP Grafana Helm Jaeger Kubectl Kubernetes Prometheus
4 hours, 24 minutes ago

Implementation Consultant I - US Remote

PerfectServe 251-1K Internet Software & Services

PerfectServe is hiring an Implementation Consultant I to support large-scale enterprise deployments of its healthcare communication platform across complex health systems and physician groups.

4 hours, 24 minutes ago

Technical Customer Success Manager (Healthcare SaaS)

Symmetrio Professional Services

Symmetrio is recruiting a Customer Success Manager for a rapidly growing healthcare technology organization, where the role supports users of a healthcare software platform and serves as the link between customers and internal technical teams.

Active Directory OpenID Connect SAML
4 hours, 54 minutes ago

Epic Referrals/MyChart Analyst

Prominence 51-250 Professional Services

Prominence Advisors is hiring an Epic Referrals/MyChart Analyst to support healthcare organizations with process improvement, complex project work, and Epic-related consulting in healthcare IT.

4 hours, 54 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers