Lead Data Engineer

Intellectt Inc

Berkeley Heights · On-site Full-time Lead 2mo ago

About the role

Design, develop, and maintain scalable data pipelines for batch and real-time processing using AWS services
Build and optimize data lakes and data warehouses using Amazon S3, Redshift, and Glue
Develop robust ETL/ELT pipelines using Python, PySpark, and SQL
Implement efficient data modeling techniques such as star schema and dimensional modeling
Work with large-scale distributed systems using Hadoop and Apache Spark
Integrate AI/ML models into data pipelines to support advanced analytics
Automate infrastructure provisioning using Terraform (IaC)
Ensure data quality, governance, and security across pipelines
Collaborate with cross-functional teams including data scientists, analysts, and business stakeholders
Develop dashboards and reports using Power BI for business insights
Monitor and optimize performance of data pipelines and cloud resources.
Exposure to AI/ML frameworks (SageMaker, TensorFlow, etc.)

AI/MLAWSAthenaData EngineeringData ModelingEMRGlueHadoopIaCLambdaPower BIPythonPySparkRedshiftS3SageMakerSparkSQLStar SchemaTensorFlowTerraform

Stanbic Bank Tanzania

The Tatitlek Corporation

$165k – $185k/yr

Cosmoquick

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.