Nova: Onshore and Nearshore Engineering Solutions

Nova: Onshore and Nearshore Engineering Solutions provides onshore and nearshore software development services, focusing on secure, scalable, and intelligent engineering solutions in areas such as AWS, cloud engineering, and ...

Internet Software & Services

Description

  • Design, build, maintain, and scale production services and server farms across multiple data centers for complex cloud services.
  • Improve software architecture to increase scalability, service reliability, capacity, and performance.
  • Write automation code for provisioning and operating infrastructure at massive scale.
  • Collaborate with development teams to ensure applications are designed from the ground up to fit the infrastructure, scale, and remain reliable.
  • Work with QA to build pipelines and automation for deploying applications to production.
  • Troubleshoot incidents, test hypotheses, and identify root causes for system failures and service issues.
  • Write postmortem reviews and remediation recommendations after incidents.
  • Monitor system alerts, identify bad trends early, and respond to incidents to restore normal operations.
  • Author and maintain high-quality documentation for specifications, systems, and procedures.
  • Support and comply with the company’s Quality Management System policies and procedures.

Requirements

  • Bachelor’s degree, or equivalent, in computer science or a related discipline.
  • Knowledge of infrastructure-as-code tools such as Terraform, Ansible, Puppet, or Chef.
  • Experience with Kubernetes for cluster creation and management.
  • Knowledge of cloud platforms and services including Microsoft Azure, AWS, and Google Cloud.
  • Understanding of Azure services, virtual machines in Azure, and virtual network configuration.
  • Knowledge of cloud architecture and service models such as IaaS, PaaS, and SaaS.
  • Knowledge of CI/CD practices and scripting.
  • Scripting knowledge with PowerShell.
  • Ability to program in one or more high-level languages such as Python, Java, C/C++, Ruby, or JavaScript.
  • Experience with distributed storage technologies such as NFS, HDFS, Ceph, or Amazon S3, and dynamic resource management frameworks such as Apache Mesos, Kubernetes, or YARN.
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.

Benefits

  • Base salary and permanent contract directly with the company.
  • Continuous training plan with paid certifications.
  • Career plan aligned with your development and knowledge.
  • Benefits above the legal minimum, including 12 days of paid time off.
  • Christmas bonus equivalent to 30 days of pay.
  • Medical insurance, life insurance, and savings fund.
  • Grocery allowance and quarterly performance bonus.
  • Computer equipment provided for work.
  • Option to work 100% remotely.
