Airtable

Airtable

Airtable is a low-code platform for building next-gen apps, organizing data, and streamlining workflows with AI. It combines the speed of a spreadsheet with the power of a database, offering collaborative features and templates for various needs.

IT Services
1K-5K
Founded 2012
$1400M raised

Description

  • Lead the design and evolution of logging, metrics, and tracing pipelines to handle massive data volumes.
  • Evaluate and integrate observability technologies such as OpenTelemetry, ClickHouse, and the ELK stack.
  • Mentor a growing team of infrastructure engineers and share best practices in tracing, monitoring, and logging.
  • Define and uphold coding standards and operational excellence across the organization.
  • Partner with Deploy Infrastructure, Service Orchestration, and Product teams to embed observability throughout the development lifecycle.
  • Align infrastructure decisions with business goals to detect issues before they impact customers.
  • Own end-to-end reliability for observability tools and establish SLAs, SLOs, and error budgets.
  • Optimize the performance and cost of large-scale data pipelines and storage.
  • Shape the observability roadmap, including tracing coverage, monitoring dashboards, and logging pipeline improvements.
  • Instrument prompts, model calls, and RAG pipelines to capture latency, reliability, cost, and safety signals.
  • Design online and offline evaluation loops for LLM quality, including canary analysis and drift detection.
  • Build dashboards and alerts for token usage, error rates, guardrail triggers, and model performance, and connect them to tracing for prompt lineage.
  • Partner with AI and Product teams to define SLOs for AI features and improve models and prompts based on incidents.

Requirements

  • 6+ years of software engineering experience, with 3+ years focused on observability or infrastructure at scale.
  • Demonstrated success implementing and running production-grade logging, metrics, or tracing systems.
  • Proficiency in distributed systems concepts, data streaming pipelines, and Kubernetes.
  • Deep hands-on experience with tools such as Prometheus, Grafana, Datadog, OpenTelemetry, ELK Stack, Loki, or ClickHouse.
  • Comfort with at least one programming language such as Go, Python, or Java.
  • Experience mentoring engineers and collaborating across multiple teams.
  • Strong communication skills for presenting technical trade-offs and architectural plans.
  • Ability to own high-impact initiatives from design through production and maintenance.
  • Ability to balance short-term fixes with long-term strategic vision.
  • Experience with observability at scale is preferred.
  • Familiarity with LLM observability, prompt lineage, or AI feature monitoring is preferred.

Benefits

  • Base salary range of $187,000 to $260,000 USD for San Francisco Bay Area, Seattle, New York City, and Los Angeles locations.
  • Total compensation package includes benefits.
  • Total compensation package may include restricted stock units.
  • Total compensation package may include incentive compensation.
  • Equal opportunity employer with accommodations available for candidates who need assistance during the application or interview process.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Software Engineer

Unity 5K-10K Internet Software & Services

Senior Software Engineer at Unity to lead the design and implementation of a business-critical data platform that supports real business and customer outcomes across the company’s global products.

Flink JIRA Kafka
50 minutes ago

Engineer, Software

Calabrio 251-1K Professional Services

Verint is hiring a Software Engineer for its QM and PM engineering team to design and deliver end-to-end full-stack features for enterprise customer experience products in a global, collaborative Agile environment.

Agile AWS Azure BDD C# Confluence CSS Cypress Datadog Docker Elasticsearch GitHub Actions GitLab CI GitOps Grafana GraphQL Hibernate HTML Java JavaScript Jenkins Jest JIRA JUnit JWT Kanban Kubernetes Microservices MongoDB .NET Oracle Playwright PostgreSQL Prometheus React Redis REST API Scrum Selenium Spring Boot SQL TDD TestNG TypeScript
2 hours, 11 minutes ago

Senior Software Engineer, Full Stack

SmarterDx 11-50 Professional Services

SmarterDx is hiring a fully remote Senior Software Engineer in the US to build and improve full-stack clinical AI products used by hospital systems to capture clinical context and support revenue and quality outcomes.

Angular AWS Elasticsearch Java Kubernetes Next.js PostgreSQL Python React TypeScript
2 hours, 12 minutes ago

Junior Software Developer, Backend

Hootsuite 10K-50K Media

Hootsuite is hiring a Junior Software Developer to join a small agile team in Luxembourg that delivers and improves production-ready software and features for customers.

Agile AWS CI/CD Docker EC2 Go Grafana Kafka Kubernetes Microservices Prometheus Scala
2 hours, 17 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers