Data Engineer
CBL Solutions
Toronto · On-site · Full-time · Mid Level · 1w ago
About the role
We are seeking a Data Engineer with strong experience in AWS-native data services and Informatica to design, build, and optimize scalable data pipelines. The role involves working with large, structured and semi-structured datasets to support analytics, reporting, and downstream business intelligence platforms in a regulated enterprise environment.
Key Responsibilities
- Design, develop, and maintain scalable ETL/ELT pipelines using AWS Glue and Informatica (PowerCenter / IDMC / IICS)
- Build and optimize analytical datasets using Amazon Redshift
- Enable data exploration and ad-hoc analysis using Amazon Athena
- Ingest data from multiple sources (RDBMS, files, APIs, cloud storage) into AWS data platforms
- Ensure data quality, data validation, and reconciliation across ingestion and transformation layers
- Implement partitioning, indexing, compression, and performance tuning strategies for Redshift and Athena
- Work with structured and semi-structured file formats (CSV, JSON, Parquet, Avro) in Amazon S3
- Collaborate with BI, analytics, and data science teams to deliver curated datasets
- Implement logging, monitoring, and error handling for data pipelines
- Adhere to security, governance, and compliance standards (PII, encryption, IAM, auditability)
- Support production issues, root cause analysis, and continuous optimization
Required Skills & Qualifications
Core Technical Skills
- Strong hands-on experience with AWS Glue (ETL Jobs, Crawlers, Data Catalog)
- Experience using Amazon Athena for querying large datasets in S3
- Solid experience with Amazon Redshift (schema design, performance tuning, WLM, distribution styles)
- Strong experience with Informatica (PowerCenter, or Intelligent Data Management Cloud: IDMC / IICS)
- Proficiency in SQL (complex joins, window functions, optimization)
- Hands-on experience with Python or Spark (PySpark)
- Strong understanding of data warehousing concepts (fact/dimension modeling, star/snowflake schema)
Cloud & Data Ecosystem
- Experience with Amazon S3, IAM, CloudWatch
- Understanding of data lake and lakehouse architectures
- Experience working with batch and near-real-time data processing
- Familiarity with CI/CD pipelines for data workflows
Preferred / Nice-to-Have Skills
- Experience in banking, financial services, or insurance domains
- Exposure to AWS Step Functions, Lambda, EMR
- Knowledge of data governance, metadata management, and lineage
- Experience with Informatica Cloud (IDMC) advanced services
- Familiarity with Agile / Scrum delivery models
- Exposure to regulatory and compliance-driven data environments
Skills
AWS Glue, Amazon Athena, Amazon Redshift, Amazon S3, CloudWatch, IAM, Informatica, Informatica Cloud, Python, Spark, SQL