Nova: Onshore and Nearshore Engineering Solutions

Nova: Onshore and Nearshore Engineering Solutions specializes in providing onshore and nearshore software development services, focusing on delivering secure, scalable, and intelligent engineering solutions in areas such as AWS, cloud engineering, and ...

Internet Software & Services

Description

  • Design, build, maintain, and scale production services and server farms across multiple data centers for complex cloud services.
  • Improve software architecture to increase scalability, service reliability, capacity, and performance.
  • Write automation code for provisioning and operating infrastructure at massive scale.
  • Collaborate with development teams to ensure applications are designed for infrastructure fit, scalability, and reliability from the ground up.
  • Work with QA to build pipelines and automation for deploying applications to production.
  • Troubleshoot incidents, test hypotheses, and identify root causes for system failures and service issues.
  • Write postmortem reviews and remediation recommendations after incidents.
  • Monitor system alerts, identify bad trends early, and respond to incidents to restore normal operations.
  • Author and maintain high-quality documentation for specifications, systems, and procedures.
  • Support and comply with the company’s Quality Management System policies and procedures.

Requirements

  • Bachelor’s degree, or equivalent, in computer science or a related discipline.
  • Knowledge of infrastructure-as-code tools such as Terraform, Ansible, Puppet, or Chef.
  • Experience with Kubernetes for cluster creation and management.
  • Knowledge of cloud platforms and services including Microsoft Azure, AWS, and Google Cloud.
  • Understanding of Azure services, virtual machines in Azure, and virtual network configuration.
  • Knowledge of cloud architecture and service models such as IaaS, PaaS, and SaaS.
  • Knowledge of CI/CD practices and scripting.
  • Scripting knowledge with PowerShell.
  • Ability to program in one or more high-level languages such as Python, Java, C/C++, Ruby, or JavaScript.
  • Experience with distributed storage technologies such as NFS, HDFS, Ceph, or Amazon S3, and dynamic resource management frameworks such as Apache Mesos, Kubernetes, or YARN.
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.

Benefits

  • Base salary and permanent contract directly with the company.
  • Continuous training plan with paid certifications.
  • Career plan aligned with your development and knowledge.
  • Benefits above the legal minimum, including 12 days of paid time off.
  • Christmas bonus equivalent to 30 days of salary.
  • Medical insurance, life insurance, and savings fund.
  • Grocery bonus and quarterly performance bonus.
  • Computer equipment provided for work.
  • Optional 100% home office arrangement.
