Mitratech

Mitratech

Mitratech specializes in automating processes through innovative solutions and services in compliance, risk management, enterprise HR, and legal operations, empowering organizations to effectively manage risks and scale for future growth.

Professional Services
1K-5K
Founded 1987
$3M raised

Description

  • Lead, mentor, and develop a team of DevOps engineers and SREs.
  • Build a collaborative, inclusive team culture focused on high-quality service delivery.
  • Set and track team goals aligned with business objectives.
  • Design, implement, and manage highly available, scalable cloud infrastructure.
  • Oversee Infrastructure as Code implementation to automate provisioning and configuration.
  • Identify and resolve bottlenecks in deployment pipelines and infrastructure performance.
  • Define and maintain SLOs, SLIs, and error budgets.
  • Drive incident management, post-mortem analysis, and continuous improvement efforts.
  • Improve monitoring, logging, and alerting systems.
  • Establish and refine CI/CD pipelines for smooth releases with minimal or zero downtime.
  • Collaborate with development teams on DevOps best practices, code quality, security, and performance.
  • Implement security best practices, including secrets management, vulnerability scanning, and automated patching.
  • Ensure compliance with standards such as SOC 2 and ISO 27001.
  • Work cross-functionally with product, engineering, and operations teams.
  • Provide stakeholders with regular updates on system health, incidents, and improvement initiatives.
  • Analyze cloud and infrastructure spend and implement cost optimization strategies.
  • Manage budgets and vendor relationships for tools and services.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field; a Master’s degree is a plus.
  • Proven experience managing DevOps or SRE teams in fast-paced environments.
  • Hands-on experience with cloud platforms such as AWS, Azure, OCI, or GCP.
  • OCI experience is preferred.
  • Experience with containerization technologies such as Docker and Kubernetes.
  • Deep understanding of the software development lifecycle (SDLC) and Agile practices.
  • Track record of driving operational efficiency, incident resolution, and automation.
  • Expertise with CI/CD tools such as Jenkins, CircleCI, or Azure DevOps.
  • Experience operating Kubernetes platforms such as AKS, EKS, or similar.
  • Experience using managed languages such as Python, Go, C#, Java, or similar.
  • Experience designing tooling to simplify operational management of SaaS/PaaS systems.
  • Experience with monitoring and observability tools such as Prometheus, Splunk, New Relic, Datadog, or ELK Stack.
  • Strong knowledge of infrastructure-as-code tools such as Terraform, Bicep, or CloudFormation.
  • Excellent leadership and people management abilities.
  • Strong problem-solving skills and attention to detail.
  • Exceptional communication skills for cross-functional collaboration and stakeholder management.

Benefits

  • Equal-opportunity employer with a strong commitment to diversity and inclusion.
  • Opportunity to work with a globally dispersed team.
  • Culture that supports learning opportunities and professional growth.
  • Entrepreneurial environment with enterprise-level investment.
  • Chance to work with complex, leading-edge technologies.
  • Fast-paced environment with broad impact across globally used products.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Manager, Software Engineering - Storage Platform

Figma 1K-5K Internet Software & Services

Figma is hiring an Engineering Manager to lead its Databases team, which owns the core data layer behind the company’s product and platform as it scales.

LLM MySQL PostgreSQL
13 hours, 11 minutes ago

Site Reliability Engineer

Stack AV 201-500 information technology & services

Stack AV is hiring a Site Reliability Engineer to keep its compute platform for large-scale autonomous systems development reliable, scalable, and ready to support engineering and research workloads.

CI/CD Kubernetes Linux OpenTelemetry Prometheus
13 hours, 26 minutes ago

Senior Site Reliability Engineer

Stack AV 201-500 information technology & services

Stack AV is hiring a Site Reliability Engineer to support the reliability, scalability, and uptime of its production infrastructure for autonomous trucking systems.

AWS Bash CloudFormation GCP Kubernetes Linux OpenTelemetry Prometheus Python TCP/IP Terraform
13 hours, 41 minutes ago

Manager of Monitoring Operations

Ensono 1K-5K IT Services

BMC is hiring a Manager – Monitoring Operations to lead enterprise monitoring for IT infrastructure and applications across on-prem OpenShift, network, and OS monitoring platforms.

Grafana Kubernetes Linux Prometheus
1 day, 12 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers