Senior Cloud Application Support Engineer (Remote - LATAM)

3 hours, 42 minutes ago
Contract
Senior
DevOps and Infrastructure
Atmosera

Atmosera

Atmosera is a trusted global cloud partner offering Azure managed cloud services with a focus on security and compliance for critical business applications worldwide.

IT Services
51-250
Founded 1995

Description

  • Perform real-time monitoring and incident dispositioning for critical client applications using Dynatrace and Azure Insights.
  • Correlate metrics, traces, and logs to conduct root cause analysis and identify performance bottlenecks in distributed environments.
  • Lead triage of complex alerting environments to reduce noise and ensure high-priority incidents are handled effectively.
  • Analyze metrics and daily reports to detect early signs of instability and prevent service disruptions.
  • Evaluate runbooks and establish new standards for operating procedures, governance, and client environment management.
  • Serve as the primary technical point of contact for P1 incidents and coordinate communication across technical and business stakeholders.
  • Automate manual reporting processes to improve operational efficiency and reporting accuracy.
  • Enforce SRE best practices and SLA compliance, including guidance on incident handling and problem record creation.
  • Mentor junior team members on complex procedures and APM telemetry interpretation.
  • Collaborate on product strategy and best practices to improve the performance and stability of client environments.

Requirements

  • Bachelor’s degree in computer science or a related technical field, or equivalent professional experience.
  • 5+ years of technical experience in managed service providers or cloud hosting environments, with a senior systems administration background.
  • Bilingual proficiency is required.
  • Expert-level proficiency in Dynatrace and Azure Insights, including advanced configuration and environment optimization.
  • Advanced technical expertise in correlating metrics, traces, and logs for root cause analysis.
  • Deep understanding of SRE principles and experience managing critical P1 incidents under strict SLAs.
  • Strong leadership and communication skills for handling P1/P2 tickets and stakeholder coordination.
  • Experience evaluating support documentation and establishing governance and operating procedures.
  • Experience automating manual reporting processes and translating telemetry into actionable business insights.
  • Microsoft Azure certifications are required within 90 days of employment, based on current certifications and skill level.
  • Advanced certifications in Dynatrace or other APM platforms are highly preferred.
  • Technical certifications in Azure, Windows, O365, SQL, Linux, VMware, Cisco, Palo Alto, AWS, GCP, Terraform, Dynatrace, or DevOps are a plus.

Benefits

  • Remote work within LATAM.
  • Contract position.
  • Opportunity to work with a Microsoft Partner with multiple specializations and a strong cloud/AI/security focus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer - Canada Wide - Remote

Newton 51-250 Capital Markets

Newton is hiring a remote Site Reliability Engineer across Canada to improve the reliability, resilience, and operational readiness of its crypto trading platform.

AWS Java JavaScript Python
3 hours, 12 minutes ago

Site Reliability Engineer - India

Zimperium 251-1K Professional Services

Zimperium is hiring a Senior Site Reliability Engineer in India to improve the reliability, automation, and scalability of its mobile security production systems and applications.

CI/CD Datadog Docker Java Kubernetes Linux Python Unix
3 hours, 27 minutes ago

Senior Site Reliability Engineer

Block 10K-50K Capital Markets

Block is hiring an SRE to improve the reliability of its platform and critical infrastructure for Tier 0 services, with a focus on safe, scalable operations and system-wide incident reduction.

AWS CI/CD Datadog DynamoDB Envoy gRPC HTTP Java JSON Kotlin Kubernetes MySQL Terraform
3 hours, 27 minutes ago

Senior Site Reliability Engineer

Block 10K-50K Capital Markets

Block is hiring a Site Reliability Engineer to improve the reliability of its platform and critical infrastructure, with a focus on scalable distributed systems, incident response, and system-wide operational resilience.

AWS CI/CD Datadog DynamoDB Envoy gRPC HTTP Java JSON Kotlin Kubernetes MySQL Terraform
3 hours, 27 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers