Skip to content
mimi

Data Engineer using Azure Data Factory and Databricks energy client

S.i. Systems

Toronto · On-site Contract Mid Level Today

About the role

Position

Data Engineer with experience using Azure Data Factory and Databricks for our energy sector client. ID 25-199

Job Overview

  • As an Azure and Databricks Data Engineer, you will be responsible for designing, building and supporting the data driven applications which enable innovative, customer centric digital experiences.
  • Will be working as part of a cross-discipline agile team who help each other solve problems across all business areas.
  • Will build reliable, supportable & performant data lake & data warehouse products to meet the organization’s need for data to drive reporting analytics, applications, and innovation.
  • Will employ best practice in development, security, and accessibility and design to achieve the highest quality of service for our customers.
  • Build and productionize modular and scalable data ELT/ETL pipelines and data infrastructure leveraging the wide range of data sources across the organization.
  • Build curated common data models designed by the Data Modelers that offer an integrated, business‑centric single source of truth for business intelligence, reporting, and downstream system use, in collaboration with Data Architect.
  • Work closely with infrastructure, and cyber teams and Senior Data Developers to ensure data is secure in transit and at rest.
  • Clean, prepare and optimize datasets for performance, ensuring lineage and quality controls are applied throughout the data integration cycle.
  • Support Business Intelligence Analysts in modelling data for visualization and reporting, using dimensional data modeling and aggregation optimization methods.
  • Troubleshoot issues related to ingestion, data transformation and pipeline performance, data accuracy and integrity.
  • Collaborate with Business Analysts, data scientists, Senior Data Engineers, data Data Analysts and, solution Architects and Data Modelers to develop data pipelines to feed our data marketplace.
  • Assist in identifying, designing, and implementing internal process improvements: automating manual processes, optimizing data delivery, re‑designing infrastructure for greater scalability, etc.
  • Work with tools in the Microsoft Stack; Azure Data Factory, Azure Data Lake, Azure SQL Databases, Azure Data Warehouse, Azure Synapse Analytics Services, Azure Databricks, Microsoft Purview, and Power BI.
  • Work within the agile SCRUM work management framework in delivery of products and services, including contributing to feature & user story backlog item development, and utilizing related Kanban/SCRUM toolsets.
  • Assist in building data catalog and maintenance of relevant metadata for datasets published for enterprise use.
  • Develop optimized, performant data pipelines and models at scale using technologies such as Python, Spark and SQL, consuming data sources in XML, CSV, JSON, REST APIs, or other formats.
  • Document as‑built pipelines and data products within the product description, and utilize source control to ensure a maintainable code‑base.
  • Implement orchestration of data pipeline execution designed by Senior Data Engineers to ensure data products meet customer latency expectations, dependencies are managed, and datasets are as up‑to‑date as possible, with minimal disruption to end‑customer use.
  • Create tooling in collaboration with senior data engineers and data architects to help with day to day tasks, and reduce toil via automation wherever possible.
  • Work with Continuous Integration/Continuous Delivery and Dev Ops pipelines to assist in automate infrastructure, code delivery and product enhancement isolation and proper release management and versioning.
  • Monitor the ongoing operation of in‑production solutions, assist in troubleshooting issues, and provide Tier 2 support for datasets produced by the team, on an as‑required basis.
  • Implement and manage appropriate access to data products via role‑based access control based on guidance from senior data engineers.
  • Write and perform automated unit and regression testing for data product builds, assist with user acceptance testing and system integration testing as required, and assist in design of relevant test cases based on guidance from Data Architects.
  • Participate in peer code review sessions.

Qualifications

  • Completion of a four‑year University education in computer science, computer/software engineering or other relevant programs within data engineering, data analysis, artificial intelligence, or machine learning.
  • Experience as a Data Engineer designing and building data pipelines.
  • Fluent in creating data processing frameworks using Python, PySpark, Spark

SOL and SOLExperience with Azure Data Factory, ADLS, Synapse Analytics and Databricks

  • Experience building data pipelines for Data Lakehouses and Data Warehouses
  • Good understanding of data structures and data processing frameworks
  • Knowledge of data governance and data quality principles
  • Effective communication skills to translate technical details to non‑technical stakeholders

Requirements

  • Four‑year university degree in Computer Science, Software Engineering, Data Engineering, Data Analysis, AI, Machine Learning, or a related field
  • Proven experience as a Data Engineer designing and building data pipelines
  • Proficiency in Python, PySpark, and Spark for data processing frameworks
  • Hands‑on experience with Azure Data Factory, Azure Data Lake Storage (ADLS), Azure Synapse Analytics, and Azure Databricks
  • Experience building pipelines for Data Lakehouses and Data Warehouses
  • Strong understanding of data structures and processing frameworks
  • Knowledge of data governance and data quality principles
  • Effective communication skills for translating technical details to non‑technical stakeholders

Responsibilities

  • Design, build, and support data‑driven applications enabling innovative, customer‑centric digital experiences
  • Develop reliable, supportable, and performant data lake and data warehouse products for reporting, analytics, and applications
  • Apply best practices in development, security, and accessibility to deliver high‑quality services
  • Build and productionize modular, scalable ELT/ETL pipelines and data infrastructure across a wide range of data sources
  • Create curated common data models in collaboration with Data Architects and Data Modelers to provide a single source of truth
  • Collaborate with infrastructure, cyber, and senior data development teams to ensure data security in transit and at rest
  • Clean, prepare, and optimize datasets, applying lineage and quality controls throughout the integration cycle
  • Support Business Intelligence Analysts with dimensional modeling, aggregation optimization, and visualization preparation
  • Troubleshoot ingestion, transformation, pipeline performance, data accuracy, and integrity issues
  • Partner with Business Analysts, Data Scientists, Senior Data Engineers, Data Analysts, Solution Architects, and Data Modelers to develop data pipelines for a data marketplace
  • Identify, design, and implement process improvements such as automation, delivery optimization, and scalable infrastructure redesign
  • Utilize Microsoft Azure stack tools (Azure Data Factory, Azure Data Lake, Azure SQL Databases, Azure Synapse Analytics, Azure Databricks, Microsoft Purview, Power BI)
  • Operate within an Agile Scrum framework, contributing to backlog items and using Kanban/Scrum toolsets
  • Build and maintain data catalogs and metadata for enterprise‑wide datasets
  • Develop optimized, high‑performance pipelines and models at scale using Python, Spark, and SQL, consuming XML, CSV, JSON, REST APIs, etc.
  • Document as‑built pipelines and data products, maintain source‑controlled codebases
  • Orchestrate pipeline execution to meet latency expectations, manage dependencies, and keep datasets up‑to‑date
  • Create tooling and automation in collaboration with senior engineers and architects to reduce manual toil
  • Work with CI/CD and DevOps pipelines to automate infrastructure, code delivery, and release management
  • Monitor production solutions, provide Tier 2 support, and troubleshoot issues as needed
  • Implement role‑based access control for data products per senior engineer guidance
  • Write and execute automated unit and regression tests; assist with UAT and integration testing
  • Participate in peer code‑review sessions

Skills

Azure Data FactoryAzure Data LakeAzure SQL DatabaseAzure Synapse AnalyticsAzure DatabricksMicrosoft PurviewPower BIPythonPySparkSparkSQLXMLCSVJSONREST APIsCI/CDDevOpsAgile ScrumKanbanSource Control (Git)Unit TestingRegression TestingData GovernanceData QualityRole‑Based Access ControlData ModelingDimensional Modeling

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free