Python Developer (Alerting & Monitoring)

2 hours, 36 minutes ago
Contract
Senior
Software Development
Xenon7

Xenon7 provides advanced AI solutions and consultancy services, leveraging a team of highly qualified experts and a strong emphasis on research and innovation to address complex industry challenges and enhance operational efficiency.

Internet Software & Services
Founded 2014

Description

  • Design and implement automated health checks for AWS resources and applications.
  • Build performance monitoring dashboards and scripts to track system health and SLAs.
  • Develop and configure alerting mechanisms and alarms using AWS services (CloudWatch metrics/logs/alarms, SNS, EventBridge).
  • Create Python-based automations to validate and enforce configuration consistency across multiple AWS accounts.
  • Develop scripts to detect anomalies, misconfigurations, and compliance gaps and trigger automated responses.
  • Implement automated service request workflows to engage engineering, DevOps, and support teams.
  • Integrate alerting and automation workflows with ticketing and communication systems (e.g., Slack, email, Jira).
  • Design scalable monitoring workflows that deliver timely, automated notifications to the relevant teams.

Requirements

  • 6+ years of experience in Python development and AWS automation.
  • Immediately available to join.
  • Strong hands-on experience with Python for automation, monitoring, and scripting.
  • Solid understanding of, and hands-on experience with, AWS services including Lambda, CloudWatch, SNS, EventBridge, and X-Ray.
  • Experience building monitoring dashboards, alerts, and automated health checks.
  • High attention to detail with strong analytical, debugging, and optimization skills.
  • Ability to work independently in a remote environment.
  • Preferred: experience with enterprise monitoring tools such as Splunk, AppDynamics, Datadog, New Relic, or similar.
  • Preferred: exposure to CI/CD pipelines and DevOps practices.

Benefits

  • Flexible, remote work with outcome-focused expectations and autonomy.
  • Ecosystem of opportunity including client engagements, research collaborations, and mentorship paths.
  • Collaborative environment emphasizing continuous learning and engineering excellence.
  • Opportunities to lead projects, co-develop tools, and shape AI/ML initiatives through the company's Innovation Community.
  • Pathways for thought leadership, professional growth, and community-driven impact.
