AB InBev

AB InBev

AB InBev is the world's largest brewer, committed to bringing people together for a better world. With a diverse portfolio of over 500 beer brands, including global favorites like Budweiser, Corona, and Stella Artois, the company focuses on brewing the...

Beverages
100K+
Founded 2008

Description

  • Support the development and maintenance of data pipelines, ingestion processes, and data transformations.
  • Create and maintain SQL queries, Python scripts, and Spark-based workloads for data processing and analytics.
  • Troubleshoot pipeline failures, data quality issues, and operational incidents.
  • Work with senior engineers to implement schema mappings, transformation logic, and data validation rules.
  • Ensure datasets meet expected schemas, data contracts, and quality standards.
  • Support metadata management, dataset documentation, lineage activities, and data classification efforts.
  • Automate repetitive operational and data management tasks to improve efficiency and reliability.
  • Contribute to monitoring, alerting, and operational support for data pipelines and workflows.
  • Participate in testing activities, including unit tests, transformation validation, and data quality checks.
  • Collaborate with Data Governance, Security, and Compliance teams and contribute to continuous improvement initiatives.

Requirements

  • Bachelor's degree in Computer Science, Computer Engineering, Information Systems, Data Science, Software Engineering, or a related field.
  • Basic to intermediate English proficiency.
  • Up to 2 years of experience in Data Engineering, Software Engineering, Data Analytics, or a related area.
  • Knowledge of SQL and Python.
  • Understanding of ETL/ELT concepts and data transformation processes.
  • Familiarity with relational databases and data warehousing concepts.
  • Basic knowledge of Spark, Databricks, or distributed data processing frameworks.
  • Familiarity with Git and version control workflows.
  • Basic understanding of cloud platforms such as AWS, Azure, or Google Cloud.
  • Knowledge of automation concepts and scripting for operational efficiency.
  • Basic understanding of data quality concepts and validation practices.
  • Familiarity with data governance principles, including metadata, ownership, stewardship, and documentation.
  • Basic knowledge of data classification concepts such as Public, Internal, Confidential, and Restricted.
  • Understanding of data lineage and traceability concepts.
  • Awareness of security best practices, including access management, secrets management, and least-privilege principles.
  • Strong analytical, problem-solving, and communication skills.
  • Willingness to learn new technologies and collaborate across teams.
  • Experience with Databricks, dbt, or similar technologies is preferred.
  • Familiarity with CI/CD tools such as GitHub Actions or Azure DevOps is preferred.
  • Familiarity with APIs, JSON, event-driven architectures, or messaging systems is preferred.
  • Exposure to vulnerability scanning, secret scanning, or secure development practices is preferred.
  • Understanding of privacy regulations such as LGPD, GDPR, or similar frameworks is preferred.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Clinical Data Manager (Senior)

Bioptimus 11-50 research

Bioptimus is seeking a Senior Clinical Data Manager to help structure and harmonize real-world clinical data for its STELA program and support the development of AI-ready biomedical datasets.

AWS GCP Git NumPy Pandas Python Statistics
1 hour, 24 minutes ago

Manager, Data/ML Platform

Inspiren 11-50 Internet Software & Services

Inspiren is seeking a Senior Manager to lead its Data + ML Platform group, building the core infrastructure that turns multimodal resident and care data into real-time insights and powers analytics, ML, and next-generation product capabilities.

LLM Machine Learning MLOps
1 hour, 24 minutes ago

Senior Data Engineer

Exadel 1K-5K Internet Software & Services

Exadel is hiring a Senior Data Engineer to help build and lead scalable data ingestion and curation platforms for an international transportation technology client modernizing rail operations.

Agile Apache Airflow Apache Spark AWS Cassandra CI/CD Elasticsearch GCP Hadoop Hive Java MySQL PostgreSQL Python Scala SQL Teradata
1 hour, 39 minutes ago

ETL (Informatica) Developer

RCG-CyberMedia Federal IT services, cybersecurity, IT modernization, workforce training, and government contracting

CTEC is seeking an ETL (Informatica) Developer to modernize legacy mainframe systems and deliver data integration solutions in a federal environment.

Git HDFS Kafka Oracle SQL
1 hour, 54 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers