Lead Data Engineer
Intellectt Inc
Berkeley Heights · On-site · Full-time · Lead · Posted 3w ago
About the role
Key Skills Required
- AWS (S3, Redshift, Glue, Lambda, EMR, Athena)
- Data Engineering & Data Modeling (Star Schema, Snowflake, Dimensional Modeling)
- Python, PySpark, SQL
- Big Data Technologies (Hadoop, Spark)
- Infrastructure as Code (Terraform)
- AI/ML integration basics
- Visualization tools (Power BI)
Roles & Responsibilities
- Design, develop, and maintain scalable data pipelines for batch and real-time processing using AWS services
- Build and optimize data lakes and data warehouses using Amazon S3, Redshift, and Glue
- Develop robust ETL/ELT pipelines using Python, PySpark, and SQL
- Implement efficient data modeling techniques such as star schema and dimensional modeling
- Work with large-scale distributed systems using Hadoop and Apache Spark
- Integrate AI/ML models into data pipelines to support advanced analytics
- Automate infrastructure provisioning using Terraform (IaC)
- Ensure data quality, governance, and security across pipelines
- Collaborate with cross-functional teams including data scientists, analysts, and business stakeholders
- Develop dashboards and reports using Power BI for business insights
- Monitor and optimize performance of data pipelines and cloud resources
- Exposure to AI/ML frameworks (SageMaker, TensorFlow, etc.)
Skills
AI/ML, AWS, Athena, Data Engineering, Data Modeling, EMR, Glue, Hadoop, IaC, Lambda, Power BI, Python, PySpark, Redshift, S3, SageMaker, Spark, SQL, Star Schema, TensorFlow, Terraform