Dev.Pro

Dev.Pro is a globally distributed software development partner that specializes in custom outsourced software development, helping innovative technology companies scale their businesses efficiently.

Internet Software & Services

Information Technology

251-1K (900)

Founded 2011

29 open positions

Intermediate Site Reliability Engineer - OP02079

1 month, 1 week ago

Chile

Full-time

Mid Level

Site Reliability Engineer (SRE)

Software Development

AWS, Azure, Datadog, Docker, ELK Stack, GCP, GitLab CI, GraphQL, Jenkins, Kubernetes, Microservices, Node.js, REST API, TypeScript, WebSockets

Description

Provide first-line operational support for a cloud-based production environment and respond to incidents promptly.
Monitor systems, troubleshoot production issues, and apply corrective actions to restore service.
Work with engineering teams on bug fixes, hotfixes, and escalations.
Administer mobile device management (MDM) solutions and support remote software deployments.
Implement automated monitoring and alerting to improve incident detection and response.
Document operational processes, maintain knowledge bases, and create incident runbooks.
Participate in an on-call rotation to provide 24/7 critical incident coverage.
Contribute to post-incident reviews and improvements to monitoring, response, and resolution processes.
Build Node.js/TypeScript utilities to automate workflows, parse logs and JSON, and validate API payloads.
Troubleshoot REST/GraphQL integrations, analyze request/response traces, and support third-party API integrations.
Analyze system and application logs and telemetry to resolve issues.
Manage and administer system access.

Requirements

Bachelor’s degree in Computer Science, Engineering, or a related field.
3+ years of experience supporting production systems, with a focus on incident response and resolution.
Strong experience in operational support or SRE roles in cloud environments.
Proficiency in Node.js, including debugging, error handling, and performance troubleshooting.
Experience with AWS, Azure, or GCP and monitoring/troubleshooting cloud-native applications.
Experience working with APIs and integrations.
Familiarity with logging and monitoring tools such as Winston, Bunyan, Datadog, ELK Stack, and CloudWatch.
Experience with CI/CD pipelines and automated deployments using Jenkins, GitLab CI, or AWS CodePipeline.
Strong problem-solving skills in high-pressure, time-sensitive situations.
Strong communication skills for structured incident reporting and documentation.
Effective cross-functional collaboration with development, DevOps, and product teams.
Upper-Intermediate+ English level.
Desirable: experience with containerization tools such as Docker and Kubernetes.
Desirable: knowledge of REST APIs, WebSockets, and microservices architecture.
Desirable: familiarity with incident management frameworks such as ITIL, and with SRE practices.
Desirable: understanding of cloud security best practices.
Desirable: experience with mobile POS platforms or mobile application environments.
Desirable: familiarity with mobile device management (MDM) solutions.

Benefits

99.9% remote work with the ability to work from anywhere in the world.
30 paid days off per year for vacations, holidays, or personal time.
5 paid sick days, up to 60 days of medical leave, and up to 6 paid days off for major family events.
Partially covered health insurance after the probation period.
Wellness bonus for gym memberships, sports nutrition, and similar needs after 6 months.
Salary paid in U.S. dollars.
Approved overtime fully covered.
Access to English lessons, Dev.Pro University programs, and online team-building activities.
