GitLab

GitLab: The comprehensive DevOps platform revolutionizing software development with automation, AI workflows, and essential tools for efficient collaboration.

Internet Software & Services

Information Technology

1K-5K (1630)

Founded 2014

157 open positions

Intermediate Site Reliability Engineer, Cloud Cost Utilization

4 weeks, 1 day ago

United Kingdom

Full-time

Mid Level

Site Reliability Engineer (SRE)

DevOps and Infrastructure

Ansible AWS GCP Grafana Prometheus Terraform

Description

Design and maintain cloud resource tagging and labeling strategies across GCP and AWS to support accurate cost attribution.
Develop tooling and pipelines to ingest, normalize, and report on cloud billing data using the FOCUS specification.
Automate cost anomaly detection, forecasting, and alerting for infrastructure spend.
Contribute to observability and monitoring stacks, including Prometheus, LGTM, and ELK, to surface cost efficiency signals.
Partner with Finance and Engineering leadership to support cloud cost forecasting for planning and budget discussions.
Act as a subject matter expert for cloud cost attribution, tagging strategy, and FOCUS adoption across GitLab Infrastructure.
Collaborate with Finance and Compliance teams on audits, certifications, and financial reporting needs related to cloud infrastructure usage.
Contribute to infrastructure-as-code efforts using Terraform and Ansible to embed cost controls and tagging requirements into provisioning workflows.
Improve cloud billing data quality and develop standards and workflows that help teams understand the real cost of the services they run.
Work through technical and organizational ambiguity and connect infrastructure data with business context to help teams act on cost signals.

Requirements

Hands-on experience with cloud cost management in GCP and/or AWS, including billing data, pricing models, and optimization approaches.
Familiarity with, or interest in adopting, the FinOps FOCUS specification for multi-cloud cost analysis.
Experience designing or implementing cloud resource tagging and labeling strategies and improving adoption across teams.
Comfort working across technical and business functions, including Engineering, Finance, and other stakeholders.
Experience with infrastructure as code, including Terraform and Ansible.
Familiarity with observability tooling, including Grafana, and an understanding of how reliability and cost signals can be connected.
Ability to explain technical cost data clearly to non-engineering audiences and support informed decision-making.
A self-directed approach to work, with comfort operating in a fully remote and asynchronous environment.
All team members are expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact.
Candidates with varying levels of experience are welcome, and applicants are encouraged to apply even if they do not meet every requirement.

Benefits

Benefits to support health, finances, and well-being.
Flexible Paid Time Off.
Team Member Resource Groups.
Equity Compensation and Employee Stock Purchase Plan.
Growth and Development Fund.
Parental leave.
Home office support.

Similar Roles

Site Reliability Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Site Reliability Engineer to own the reliability, automation, and production stability of its AWS-based services used by thousands of MSPs worldwide.

Canada Full-time Mid Level Site Reliability Engineer (SRE)

$85k-$96k

Ansible AWS Chef CloudFormation Datadog DevSecOps Elasticsearch Kibana Kubernetes MySQL PostgreSQL Puppet Secrets Management Serverless Terraform

3 hours, 45 minutes ago

Senior Site Reliability Engineer

Cribl 251-1K IT Services

Cribl is hiring a Senior Site Reliability Engineer in Poland to help build and operate the telemetry infrastructure and observability platform that supports its cloud products and enterprise customers.

Poland Full-time Senior Site Reliability Engineer (SRE)

Ansible AWS Azure CI/CD Grafana JavaScript Kibana Linux New Relic Node.js PagerDuty Prometheus Splunk Terraform TypeScript

16 hours, 10 minutes ago

Senior Site Reliability Engineer

Block 10K-50K Capital Markets

Block is hiring a Site Reliability Engineer to improve the reliability of its platform and critical infrastructure supporting Tier 0 services and safe product development.

Australia Full-time Senior Site Reliability Engineer (SRE)

AWS CI/CD Datadog DynamoDB Envoy gRPC HTTP Java JSON Kotlin Kubernetes Microservices MySQL Terraform

1 day, 4 hours ago

Site Reliability Engineer

Recorded Future 251-1K Professional Services

Recorded Future is hiring a Site Reliability Engineer to strengthen the reliability, scalability, and performance of its critical cloud systems in close partnership with engineering teams.

Sweden Full-time Mid Level Site Reliability Engineer (SRE)

AWS Chef Elasticsearch ELK Stack Grafana Kafka Kibana Kubernetes Linux Logstash Microservices MongoDB OpenTelemetry Prometheus RabbitMQ Terraform

1 day, 17 hours ago
