Accela

Accela

Accela is the leading provider of productivity and civic engagement software solutions for local, state, and federal government agencies. Their platform offers cloud-based solutions for permitting, licensing, code enforcement, and service request manag...

Internet Software & Services
251-1K
Founded 1999
$234M raised

Description

  • Serve as a technical leader for reliability engineering, operational excellence, and platform modernization across the Civic Platform.
  • Drive the evolution from VM-based architectures to containerized and cloud-native services in partnership with cross-functional engineering teams.
  • Improve and sustain platform availability, performance, scalability, security, and cost efficiency.
  • Define, implement, and operate SLOs, SLAs, and error budgets for critical platform services.
  • Lead observability initiatives across metrics, distributed tracing, logging, and monitoring to improve system visibility and incident response.
  • Drive root cause analysis for complex production incidents and facilitate blameless postmortems with tracked corrective actions.
  • Design, develop, and maintain automation, tooling, and software that improve reliability, efficiency, scalability, and developer productivity.
  • Serve as a senior escalation point during production incidents and platform changes affecting availability, performance, security, or compliance.
  • Partner with Security and Compliance teams to support regulatory and compliance requirements such as SOC 2, HIPAA, FedRAMP, StateRAMP, and PCI-DSS.
  • Translate operational metrics and platform health data into actionable insights for engineering leadership and executive stakeholders.
  • Mentor engineers across the Cloud Engineering organization and influence engineering best practices.

Requirements

  • 8+ years of experience in Site Reliability Engineering, Software Engineering, Cloud Infrastructure, or related disciplines within a SaaS environment.
  • Experience leading complex technical initiatives.
  • Demonstrated technical leadership in containerized and orchestrated environments, including Kubernetes or equivalent technologies.
  • Hands-on experience operating and supporting large-scale SaaS platforms on Microsoft Azure.
  • Experience developing automation and operational tooling using Python, PowerShell, Bash, or similar scripting languages.
  • Deep expertise designing, operating, analyzing, and troubleshooting complex distributed systems across application, infrastructure, networking, and operating system layers.
  • Strong experience with modern observability platforms, including monitoring, logging, metrics, and distributed tracing.
  • Demonstrated success leading incident response, Root Cause Analysis, and continuous improvement initiatives.
  • Experience establishing and maturing Incident, Problem, and Change Management practices.
  • Strong written and verbal communication skills for technical and executive audiences.
  • Experience using Git and GitHub-based development workflows.
  • Experience with Infrastructure-as-Code practices and tooling, particularly Terraform.
  • Experience with configuration management platforms such as Ansible.
  • Experience supporting SaaS platforms subject to public-sector compliance frameworks, including SOC 2, HIPAA, FedRAMP, StateRAMP, and PCI-DSS.
  • Experience implementing GitOps deployment methodologies using tools such as Argo CD or Flux.
  • Experience implementing and operating OpenTelemetry-based observability solutions.
  • Cloud FinOps experience, including cost optimization and resource efficiency initiatives within Microsoft Azure environments.
  • Strong Linux systems administration experience alongside Microsoft Windows expertise.
  • Experience leveraging AI-assisted engineering tools such as GitHub Copilot, Claude Code, or similar technologies to improve productivity and operational efficiency.
  • Up to 10% travel for team collaboration, planning sessions, conferences, and critical business initiatives.

Benefits

  • Annual base salary range of $160,000-$185,000.
  • Eligibility for an annual discretionary bonus target based on company and individual goal achievement.
  • Flexible time off.
  • Comprehensive medical, dental, and vision plans.
  • Family planning benefits.
  • 401(k) retirement savings plan with company match.
  • Health savings account with company contributions.
  • Flexible spending account.
  • Life, accident, and disability coverage.
  • Business travel insurance.
  • Employee assistance programs and other well-being benefits.
  • Remote-friendly work indicated by #LI-Remote.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Reliability Engineer

Barbaricum 251-1K Professional Services

Barbaricum is hiring a Senior Site Reliability Engineer to support MC&FP’s MODES contract by improving the reliability, scalability, resilience, and operational performance of IT and cloud systems in a federal mission environment.

Ansible AWS Azure Chef Cybersecurity DevSecOps GCP PowerShell Puppet Python
12 hours, 55 minutes ago

Sr. Site Reliability Engineer (Starshield)

SpaceX 10K-50K Aerospace & Defense

SpaceX is hiring a Senior Site Reliability Engineer for Starshield to build and operate reliable infrastructure and automation supporting secure government satellite systems.

Ansible Bash CI/CD Kubernetes Linux Python TCP/IP Terraform
1 day, 12 hours ago

Sr. Site Reliability Engineer (Starshield)

SpaceX 10K-50K Aerospace & Defense

SpaceX is hiring a Senior Site Reliability Engineer for Starshield to build and operate reliable infrastructure supporting government-focused satellite systems and national security missions.

Ansible Bash CI/CD Kubernetes Linux Python TCP/IP Terraform
1 day, 13 hours ago

Senior Site Reliability Engineer

DexCare 51-250 Health Care Providers & Services

DexCare is hiring a Senior Site Reliability Engineer to help operate and improve its AWS-based healthcare infrastructure that supports digital care access and reliable patient service delivery.

Agile AWS Azure CI/CD Datadog EC2 GitHub Actions Helm HIPAA JIRA Kubernetes Python Scrum Serverless Terraform
1 day, 13 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers