Bounteous

Bounteous collaborates with ambitious brands across various industries, including finance and healthcare, to drive digital transformation by creating innovative and seamless customer experiences through advanced technologies such as AI, cloud, and data...

Internet Software & Services
1K-5K
Founded 2003

Description

  • Architect, build, and maintain scalable ETL/ELT pipelines on the Databricks Lakehouse Platform using PySpark, Spark SQL, and Delta Lake.
  • Design and implement medallion architecture data layers and enforce data quality, governance, and lineage standards.
  • Optimize Spark jobs and cluster configurations for performance and cost through partitioning, caching, and autoscaling strategies.
  • Implement and manage Unity Catalog for access control, governance, and cross-workspace asset sharing.
  • Build and orchestrate workflows using Databricks Workflows, Delta Live Tables, and CI/CD pipelines.
  • Collaborate with data scientists, analysts, and business stakeholders to translate requirements into reliable data products.
  • Establish engineering best practices, conduct code reviews, and mentor junior data engineers.
  • Monitor production pipelines, troubleshoot failures, and drive root-cause analysis and continuous improvement.
  • Promote and enforce information security practices, including secure use of information assets and password and malware protection protocols.
  • Assess and report security risks and ensure compliance with data privacy and protection standards such as GDPR and CCPA.

Requirements

  • 5+ years of data engineering experience, including 3+ years building production solutions on Databricks and Apache Spark.
  • Expert proficiency in Python (PySpark) and advanced SQL.
  • Deep hands-on experience with Delta Lake, Unity Catalog, and the medallion architecture pattern.
  • Strong experience with at least one major cloud platform: AWS, Azure, or GCP.
  • Proven track record optimizing Spark performance and managing cluster cost.
  • Experience with data modeling, warehousing concepts, and building dimensional or analytics-ready datasets.
  • Proficiency with Git-based version control, CI/CD, and infrastructure-as-code.
  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
  • Databricks certification (Data Engineer Associate/Professional) is preferred.
  • Experience with Delta Live Tables, structured streaming, and real-time data processing is preferred.
  • Familiarity with MLflow and production machine learning workflows is preferred.
  • Experience with orchestration and transformation tools such as Airflow or dbt, and with data observability platforms, is preferred.
  • Exposure to data governance, security, and compliance frameworks such as GDPR, HIPAA, or SOC 2 is preferred.
  • Hands-on experience with AI coding assistants such as Claude Code, GitHub Copilot, or Cursor is preferred.
  • Familiarity with LLM APIs and SDKs such as Anthropic Claude or OpenAI, prompt engineering, RAG, and vector search is preferred.

Benefits

  • Bounteous is willing to sponsor eligible candidates for employment visas.
  • Remote work is available, as indicated by the #LI-Remote designation.
  • Equal opportunity employer with non-discrimination protections across protected characteristics.
  • Accommodation support is provided for candidates in Canada throughout the hiring process.
  • Candidates can subscribe to monthly job-opening alerts to stay connected with the company.
