Data Engineer

Talent To Hire Inc.

Remote · Canada Full-time Senior 1mo ago

About the role

About

Our client is building the next generation of industrial intelligence, transforming complex automotive and industrial data into real-time, actionable insights powered by machine learning.

We are seeking an experienced Data Engineer who thrives in high-scale, production ML environments and enjoys working with complex, messy datasets to build reliable, scalable data foundations for advanced analytics.

You will join a team of 13 engineers and data professionals, working in a highly collaborative, fast-moving environment. The role is remote-friendly, with strong cross-functional interaction across engineering, data science, and business teams.

What You’ll Do

Design, build, and maintain scalable data pipelines and ETL processes for large structured and unstructured datasets
Develop and optimize Spark-based data workflows supporting production ML systems
Collaborate closely with data scientists, ML engineers, and business stakeholders
Translate complex business needs into scalable, production-grade data solutions
Build feature engineering pipelines for time-series and predictive models
Ensure data quality, governance, security, and reliability across systems
Continuously improve data architecture, performance, and scalability

What You Bring

6+ years in data engineering or ML data pipeline development
Strong experience with Apache Spark, PySpark, Databricks, Delta Lake
Advanced skills in Python, SQL, and Airflow
Deep understanding of Medallion architecture and ETL design patterns
Experience building time-series features (rolling windows, lags, trend indicators)
Ability to work closely with ML teams and translate data into model-ready structures
Bachelor’s or Master’s in Computer Science, Engineering, or related field

Strong Advantages (Preferred Experience)

Background in retail, e-commerce, or supply chain environments dealing with large, messy, high-volume datasets
Experience working directly with machine learning teams or supporting ML model development
Hands-on experience in forecasting, demand planning, or similar data-heavy business domains
Experience integrating data from ERP/CRM/WMS systems (SAP, Oracle, legacy platforms)
Exposure to feature stores or ML training/serving consistency frameworks
Experience with IBM DataStage or legacy ETL modernization projects
Experience scaling distributed ML or time-series models in production environments

Why This Role

This is a strong fit for someone who enjoys working in complex, real-world data environments, especially with messy, high-volume retail-style data and close collaboration with ML teams. Retail or similar domains are highly valued.

You’ll be part of a 13-person high-impact team, building infrastructure that powers next-generation industrial and predictive intelligence.

This candidate would ideally be working out of Toronto/Ottawa/Montreal.

Skills

AirflowApache SparkDatabricksDelta LakeETLIBM DataStageMLPythonPySparkSQL

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free

Data Engineer

About the role

About

What You’ll Do

What You Bring

Strong Advantages (Preferred Experience)

Why This Role

Skills

Similar roles

Senior Database Engineer

Software Engineer (Rust)

Mid-Level IoT Engineer

Don't send a generic resume