Skip to content
mimi

Data Engineer

Talent To Hire Inc.

Remote · Canada Full-time Senior Today

About the role

About

Our client is building the next generation of industrial intelligence, transforming complex automotive and industrial data into real-time, actionable insights powered by machine learning.

We are seeking an experienced Data Engineer who thrives in high-scale, production ML environments and enjoys working with complex, messy datasets to build reliable, scalable data foundations for advanced analytics.

You will join a team of 13 engineers and data professionals, working in a highly collaborative, fast-moving environment. The role is remote-friendly, with strong cross-functional interaction across engineering, data science, and business teams.

What You’ll Do

  • Design, build, and maintain scalable data pipelines and ETL processes for large structured and unstructured datasets
  • Develop and optimize Spark-based data workflows supporting production ML systems
  • Collaborate closely with data scientists, ML engineers, and business stakeholders
  • Translate complex business needs into scalable, production-grade data solutions
  • Build feature engineering pipelines for time-series and predictive models
  • Ensure data quality, governance, security, and reliability across systems
  • Continuously improve data architecture, performance, and scalability

What You Bring

  • 6+ years in data engineering or ML data pipeline development
  • Strong experience with Apache Spark, PySpark, Databricks, Delta Lake
  • Advanced skills in Python, SQL, and Airflow
  • Deep understanding of Medallion architecture and ETL design patterns
  • Experience building time-series features (rolling windows, lags, trend indicators)
  • Ability to work closely with ML teams and translate data into model-ready structures
  • Bachelor’s or Master’s in Computer Science, Engineering, or related field

Strong Advantages (Preferred Experience)

  • Background in retail, e-commerce, or supply chain environments dealing with large, messy, high-volume datasets
  • Experience working directly with machine learning teams or supporting ML model development
  • Hands-on experience in forecasting, demand planning, or similar data-heavy business domains
  • Experience integrating data from ERP/CRM/WMS systems (SAP, Oracle, legacy platforms)
  • Exposure to feature stores or ML training/serving consistency frameworks
  • Experience with IBM DataStage or legacy ETL modernization projects
  • Experience scaling distributed ML or time-series models in production environments

Why This Role

This is a strong fit for someone who enjoys working in complex, real-world data environments, especially with messy, high-volume retail-style data and close collaboration with ML teams. Retail or similar domains are highly valued.

You’ll be part of a 13-person high-impact team, building infrastructure that powers next-generation industrial and predictive intelligence.

This candidate would ideally be working out of Toronto/Ottawa/Montreal.

Skills

AirflowApache SparkDatabricksDelta LakeETLIBM DataStageMLPythonPySparkSQL

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free