Skip to content
mimi

Data Engineer

CBL Solutions

Toronto · On-site Full-time Mid Level 1w ago

About the role

About

We are seeking a Data Engineer with strong experience in AWS-native data services and Informatica to design, build, and optimize scalable data pipelines. The role involves working with large, structured and semi-structured datasets to support analytics, reporting, and downstream business intelligence platforms in a regulated enterprise environment.

Key Responsibilities

  • Design, develop, and maintain scalable ETL/ELT pipelines using AWS Glue and Informatica (PowerCenter / IDMC / IICS)
  • Build and optimize analytical datasets using Amazon Redshift
  • Enable data exploration and ad-hoc analysis using Amazon Athena
  • Ingest data from multiple sources (RDBMS, files, APIs, cloud storage) into AWS data platforms
  • Ensure data quality, data validation, and reconciliation across ingestion and transformation layers
  • Implement partitioning, indexing, compression, and performance tuning strategies for Redshift and Athena
  • Work with structured formats (CSV, JSON, Parquet, Avro) in Amazon S3
  • Collaborate with BI, analytics, and data science teams to deliver curated datasets
  • Implement logging, monitoring, and error handling for data pipelines
  • Adhere to security, governance, and compliance standards (PII, encryption, IAM, auditability)
  • Support production issues, root cause analysis, and continuous optimization

Required Skills & Qualifications

Core Technical Skills

  • Strong hands-on experience with AWS Glue (ETL Jobs, Crawlers, Data Catalog)
  • Experience using Amazon Athena for querying large datasets in S3
  • Solid experience with Amazon Redshift (schema design, performance tuning, WLM, distribution styles)
  • Strong experience with Informatica (PowerCenter or Intelligent Data Management Cloud – IICS)
  • Proficiency in SQL (complex joins, window functions, optimization)
  • Hands-on experience with Python or Spark (PySpark)
  • Strong understanding of data warehousing concepts (fact/dimension modeling, star/snowflake schema)

Cloud & Data Ecosystem

  • Experience with Amazon S3, IAM, CloudWatch
  • Understanding of data lake and lakehouse architectures
  • Experience working with batch and near-real-time data processing
  • Familiarity with CI/CD pipelines for data workflows

Preferred / Nice-to-Have Skills

  • Experience in banking, financial services, or insurance domains
  • Exposure to AWS Step Functions, Lambda, EMR
  • Knowledge of data governance, metadata management, and lineage
  • Experience with Informatica Cloud (IDMC) advanced services
  • Familiarity with Agile / Scrum delivery models
  • Exposure to regulatory and compliance-driven data environments

Skills

AWS GlueAmazon AthenaAmazon RedshiftAmazon S3CloudWatchIAMInformaticaInformatica CloudPythonSparkSQL

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free