Margo Bank

Unlock excellence with MARGO Consulting: where ambition, expertise, and innovation drive the most complex tech challenges.

Professional Services

Industrials

$8M raised

19 open positions

Network Reliability Engineer

1 month, 1 week ago

Poland

Contract

Mid Level

Site Reliability Engineer (SRE)

DevOps and Infrastructure

Ansible Bash CI/CD Debian DNS Elasticsearch GitLab Go Grafana Linux Load Balancing MariaDB Prometheus Python SaltStack TCP/IP Ubuntu

Description

Build and support large-scale AI infrastructure, including monitoring, diagnosing, and remediating production incidents.
Troubleshoot high-impact production issues in collaboration with other engineering teams.
Participate in an on-call rotation to handle incidents and ensure service continuity.
Implement and maintain observability solutions to monitor AI infrastructure and application health.
Contribute to AI infrastructure lifecycle management across different environments and countries.
Promote and apply best practices for stability, resiliency, scalability, and security.
Maintain clear technical documentation for tools and procedures.
Contribute to the evolution of systems and tools based on production feedback.
Collaborate closely with development teams to ensure infrastructure readiness.
Participate in team rituals and knowledge-sharing initiatives.

Requirements

Experience with Go or Python.
Strong scripting skills in Bash and Python.
Hands-on experience with Linux systems, especially Ubuntu/Debian.
Hands-on experience with GPU and HPC infrastructure (preferred).
Knowledge of networking concepts such as VLAN/LAN, TCP/IP, DNS, BGP, load-balancing, and IPv6.
Familiarity with monitoring and logging tools such as Prometheus, Grafana, and Elastic.
Comfort with Infrastructure-as-Code tools such as Ansible, Salt, and AWX.
Experience managing relational databases, especially MariaDB.
Understanding of CI/CD pipelines, especially GitLab.
Comfortable communicating in English, both written and spoken.
Proactive and solution-oriented mindset.
Passion for automation and continuous improvement.
Strong collaboration and communication skills.
Ability to work independently and as part of a team.
Willingness to mentor others and share knowledge.

Benefits

Remote work arrangement.
Permanent contract or B2B contract option.
Hourly rate of 200 zł - 250 zł.

Similar Roles

Senior Site Reliability Engineer

Counterpart Health 51-200 hospital & health care

Counterpart Health is hiring a Senior Site Reliability and Infrastructure Engineer to support and evolve the technology platform behind its primary care tool and maintain reliable infrastructure for domestic and international workloads.

United States Full-time Senior Site Reliability Engineer (SRE)

$160k-$208k

AWS Azure CI/CD Containerd DNS Docker GCP Go gRPC Helm Kubernetes Linux Load Balancing Prometheus Python Shell Scripting TCP/IP

14 hours, 51 minutes ago

Senior Test Platform & Reliability Engineer - Star Trek Fleet Command

Scopely 1K-5K Internet Software & Services

Scopely is hiring a Senior Test Platform & Reliability Engineer in Ireland to build validation, reliability, and developer enablement platforms for Star Trek Fleet Command’s large-scale live-service backend systems.

Ireland Full-time Senior SDET (Software Development Engineer in Test) Site Reliability Engineer (SRE)

AWS Bash CI/CD Docker GitLab Go Python Terraform

15 hours, 6 minutes ago

Senior Software Engineer - Databases, SRE | Canada | Remote

Grafana 1K-5K IT Services

Grafana Labs is hiring a Senior Software Engineer for its remote SRE team to improve reliability and operability of Grafana Cloud database services for high-SLA customers across AWS, GCP, and Azure.

Canada Full-time Senior Site Reliability Engineer (SRE) Software Engineer

$108k-$130k

AWS Azure GCP Go Helm Java Kubernetes Linux Microservices Python Terraform

1 day, 14 hours ago

Senior Site Reliability Engineer

Semios 51-250 Food Products

Semios Group is hiring a Senior Site Reliability Engineer to help scale, secure, and improve the reliability of its global agricultural technology platform.

Canada Full-time Senior Site Reliability Engineer (SRE)

$140k-$160k

AWS Azure Bash Buildkite CI/CD Datadog Docker Envoy GCP Git GitHub GitHub Actions GitLab Go Jenkins Kubernetes Linux NATS New Relic Prometheus Python Ruby Splunk Terraform

1 day, 15 hours ago
