Engineer II, Enterprise Data Lakehouse
LPL Financial Global Capability Center
About the role
About
What if you could build a career where ambition meets innovation?
At LPL’s Global Capability Center, you'll find a collaborative culture where your voice matters, integrity guides every decision, and technology fuels progress. Your skills, talents, and ideas will redefine what's possible. LPL's success reflects its exceptional employees, who together pursue one noble purpose: empowering financial advisors to deliver personalized advice for all who need it. We’re proud to be expanding and reaching new heights in Hyderabad.
Join us as we create something extraordinary together.
Job Overview
We are seeking a hands-on Enterprise Data Lake/Lakehouse Engineer to design, build, and operate robust data lake and Lakehouse solutions that enable analytics, reporting, and AI-driven products. This role will be pivotal in bridging the gap between traditional data warehouses and modern data lakes, ensuring seamless data integration, governance, and accessibility for business intelligence and advanced analytics.
Responsibilities
- Implement and maintain scalable data lake and Lakehouse architectures using cloud-native services (AWS S3, Glue, Lake Formation, Delta Lake, Snowflake, etc.)
- Develop and optimize end-to-end data pipelines (batch and streaming) for ingesting, transforming, and storing structured and unstructured data at scale
- Integrate diverse data sources and ensure efficient, secure, and reliable data ingestion and processing
- Implement and enforce data governance, cataloging, lineage, and access controls (e.g., AWS DataZone / Glue Data Catalog or Unity Catalog, Collibra, Atlan)
- Collaborate with cross-functional teams (data scientists, BI engineers, product managers) to translate business needs into reliable, observable, and governed data products
- Drive adoption of modern data engineering frameworks (dbt, Airflow, Delta Live Tables, etc.) and DevOps practices (IaC, CI/CD, automated testing, monitoring)
- Champion data quality, security, and compliance (encryption, PII, GDPR, HIPAA, etc.) across all data lake/Lakehouse operations
- Mentor and guide team members, contribute to platform roadmaps, and promote best practices in data engineering and Lakehouse design
- Stay current with emerging trends in data Lakehouse technologies, open-source tools, and cloud platforms
What are we looking for?
We’re looking for strong collaborators who deliver exceptional client experiences and thrive in fast-paced, team-oriented environments. Our ideal candidates pursue greatness, act with integrity, and are driven to help our clients succeed. We value those who embrace creativity, continuous improvement, and contribute to a culture where we win together and create and share joy in our work.
Requirements
- 7+ years of experience in data engineering, software engineering, or cloud engineering, with at least 4 years focused on data lake or lakehouse environments in AWS
- Bachelor’s degree in Data Science, Computer Science or related field; Master’s degree preferred
- Experience establishing and developing high-performing engineering teams
- Demonstrable hands‑on experience with:
- Cloud data lake architectures: AWS S3, Glue, Lake Formation, Snowflake, or similar
- Data lake design patterns: raw, curated, consumption zones; medallion architecture
- Data versioning and schema evolution: e.g., Delta Lake, Apache Iceberg
- Data governance and cataloging: Unity Catalog, Collibra, Atlan, AWS Glue Data Catalog (experience in multiple tools preferred)
- Programming: Python and/or SQL (production code, reusable libraries, tests)
- Pipeline orchestration: Airflow, Step Functions, dbt, or similar
- DevOps for data: Terraform/CloudFormation, CI/CD, monitoring, and runbook creation
- Strong understanding of data modeling, data quality, and secure data onboarding/governance
- Experience with both batch and real‑time data processing
Preferences
- Experience with Spark, Snowflake or other big data frameworks.
- AWS and/or Snowflake architect or developer certifications.
- Demonstrated use of AI/ML tools to augment engineering productivity (prompting for code generation, LLMs for docs/tests, query optimization).
Skills & Tools
- AWS (S3, Glue, Lake Formation, IAM), Snowflake
- SQL, Python
- dbt, Airflow, Step Functions
- Terraform/CloudFormation, CI/CD (GitHub Actions, Jenkins)
- Observability (Dynatrace preferred, Datadog, Prometheus)
- LLM/AI augmentation tooling (preferred)
LPL Global Business Services, LLP - PRIVACY POLICY
Requirements
- 7+ years of experience in data engineering, software engineering, or cloud engineering, with at least 4 years focused on data lake or lakehouse environments in AWS
- Experience establishing and developing high-performing engineering teams
- Demonstrable hands-on experience with cloud data lake architectures: AWS S3, Glue, Lake Formation, Snowflake, or similar
- Demonstrable hands-on experience with data lake design patterns: raw, curated, consumption zones; medallion architecture
- Demonstrable hands-on experience with data versioning and schema evolution: e.g., Delta Lake, Apache Iceberg
- Demonstrable hands-on experience with data governance and cataloging: including any of the following (preferred experience in multiple tools) Unity Catalog, Collibra, Atlan, AWS Glue Data Catalog
- Demonstrable hands-on experience with programming: Python and/or SQL (production code, reusable libraries, tests)
- Demonstrable hands-on experience with pipeline orchestration: Airflow, Step Functions, dbt, or similar
- Demonstrable hands-on experience with DevOps for data: Terraform/CloudFormation, CI/CD, monitoring, and runbook creation
- Strong understanding of data modeling, data quality, and secure data onboarding/governance
- Experience with both batch and real-time data processing
Responsibilities
- Implement and maintain scalable data lake and Lakehouse architectures using cloud-native services (AWS S3, Glue, Lake Formation, Delta Lake, Snowflake, etc.)
- Develop and optimize end-to-end data pipelines (batch and streaming) for ingesting, transforming, and storing structured and unstructured data at scale
- Integrate diverse data sources and ensure efficient, secure, and reliable data ingestion and processing
- Implement and enforce data governance, cataloging, lineage, and access controls (e.g., AWS DataZone / Glue Data Catalog or Unity Catalog, Collibra, Atlan)
- Collaborate with cross-functional teams (data scientists, BI engineers, product managers) to translate business needs into reliable, observable, and governed data products
- Drive adoption of modern data engineering frameworks (dbt, Airflow, Delta Live Tables, etc.) and DevOps practices (IaC, CI/CD, automated testing, monitoring)
- Champion data quality, security, and compliance (encryption, PII, GDPR, HIPAA, etc.) across all data lake/Lakehouse operations
- Mentor and guide team members, contribute to platform roadmaps, and promote best practices in data engineering and Lakehouse design
- Stay current with emerging trends in data Lakehouse technologies, open-source tools, and cloud platforms
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free