All jobs

Engineer II, Enterprise Data Lakehouse

LPL Financial Global Capability Center

India · On-site Full-time Senior 1w ago

Apply with a tailored resume Save job

About the role

About

What if you could build a career where ambition meets innovation?

At LPL’s Global Capability Center, you'll find a collaborative culture where your voice matters, integrity guides every decision, and technology fuels progress. Your skills, talents, and ideas will redefine what's possible. LPL's success reflects its exceptional employees, who together pursue one noble purpose: empowering financial advisors to deliver personalized advice for all who need it. We’re proud to be expanding and reaching new heights in Hyderabad.

Join us as we create something extraordinary together.

Job Overview

We are seeking a hands-on Enterprise Data Lake/Lakehouse Engineer to design, build, and operate robust data lake and Lakehouse solutions that enable analytics, reporting, and AI-driven products. This role will be pivotal in bridging the gap between traditional data warehouses and modern data lakes, ensuring seamless data integration, governance, and accessibility for business intelligence and advanced analytics.

Responsibilities

Implement and maintain scalable data lake and Lakehouse architectures using cloud-native services (AWS S3, Glue, Lake Formation, Delta Lake, Snowflake, etc.)
Develop and optimize end-to-end data pipelines (batch and streaming) for ingesting, transforming, and storing structured and unstructured data at scale
Integrate diverse data sources and ensure efficient, secure, and reliable data ingestion and processing
Implement and enforce data governance, cataloging, lineage, and access controls (e.g., AWS DataZone / Glue Data Catalog or Unity Catalog, Collibra, Atlan)
Collaborate with cross-functional teams (data scientists, BI engineers, product managers) to translate business needs into reliable, observable, and governed data products
Drive adoption of modern data engineering frameworks (dbt, Airflow, Delta Live Tables, etc.) and DevOps practices (IaC, CI/CD, automated testing, monitoring)
Champion data quality, security, and compliance (encryption, PII, GDPR, HIPAA, etc.) across all data lake/Lakehouse operations
Mentor and guide team members, contribute to platform roadmaps, and promote best practices in data engineering and Lakehouse design
Stay current with emerging trends in data Lakehouse technologies, open-source tools, and cloud platforms

What are we looking for?

We’re looking for strong collaborators who deliver exceptional client experiences and thrive in fast-paced, team-oriented environments. Our ideal candidates pursue greatness, act with integrity, and are driven to help our clients succeed. We value those who embrace creativity, continuous improvement, and contribute to a culture where we win together and create and share joy in our work.

Requirements

7+ years of experience in data engineering, software engineering, or cloud engineering, with at least 4 years focused on data lake or lakehouse environments in AWS
Bachelor’s degree in Data Science, Computer Science or related field; Master’s degree preferred
Experience establishing and developing high-performing engineering teams
Demonstrable hands‑on experience with:
- Cloud data lake architectures: AWS S3, Glue, Lake Formation, Snowflake, or similar
- Data lake design patterns: raw, curated, consumption zones; medallion architecture
- Data versioning and schema evolution: e.g., Delta Lake, Apache Iceberg
- Data governance and cataloging: Unity Catalog, Collibra, Atlan, AWS Glue Data Catalog (experience in multiple tools preferred)
- Programming: Python and/or SQL (production code, reusable libraries, tests)
- Pipeline orchestration: Airflow, Step Functions, dbt, or similar
- DevOps for data: Terraform/CloudFormation, CI/CD, monitoring, and runbook creation
Strong understanding of data modeling, data quality, and secure data onboarding/governance
Experience with both batch and real‑time data processing

Preferences

Experience with Spark, Snowflake or other big data frameworks.
AWS and/or Snowflake architect or developer certifications.
Demonstrated use of AI/ML tools to augment engineering productivity (prompting for code generation, LLMs for docs/tests, query optimization).

Skills & Tools

AWS (S3, Glue, Lake Formation, IAM), Snowflake
SQL, Python
dbt, Airflow, Step Functions
Terraform/CloudFormation, CI/CD (GitHub Actions, Jenkins)
Observability (Dynatrace preferred, Datadog, Prometheus)
LLM/AI augmentation tooling (preferred)

LPL Global Business Services, LLP - PRIVACY POLICY

Requirements

7+ years of experience in data engineering, software engineering, or cloud engineering, with at least 4 years focused on data lake or lakehouse environments in AWS
Experience establishing and developing high-performing engineering teams
Demonstrable hands-on experience with cloud data lake architectures: AWS S3, Glue, Lake Formation, Snowflake, or similar
Demonstrable hands-on experience with data lake design patterns: raw, curated, consumption zones; medallion architecture
Demonstrable hands-on experience with data versioning and schema evolution: e.g., Delta Lake, Apache Iceberg
Demonstrable hands-on experience with data governance and cataloging: including any of the following (preferred experience in multiple tools) Unity Catalog, Collibra, Atlan, AWS Glue Data Catalog
Demonstrable hands-on experience with programming: Python and/or SQL (production code, reusable libraries, tests)
Demonstrable hands-on experience with pipeline orchestration: Airflow, Step Functions, dbt, or similar
Demonstrable hands-on experience with DevOps for data: Terraform/CloudFormation, CI/CD, monitoring, and runbook creation
Strong understanding of data modeling, data quality, and secure data onboarding/governance
Experience with both batch and real-time data processing

Responsibilities

Implement and maintain scalable data lake and Lakehouse architectures using cloud-native services (AWS S3, Glue, Lake Formation, Delta Lake, Snowflake, etc.)
Develop and optimize end-to-end data pipelines (batch and streaming) for ingesting, transforming, and storing structured and unstructured data at scale
Integrate diverse data sources and ensure efficient, secure, and reliable data ingestion and processing
Implement and enforce data governance, cataloging, lineage, and access controls (e.g., AWS DataZone / Glue Data Catalog or Unity Catalog, Collibra, Atlan)
Collaborate with cross-functional teams (data scientists, BI engineers, product managers) to translate business needs into reliable, observable, and governed data products
Drive adoption of modern data engineering frameworks (dbt, Airflow, Delta Live Tables, etc.) and DevOps practices (IaC, CI/CD, automated testing, monitoring)
Champion data quality, security, and compliance (encryption, PII, GDPR, HIPAA, etc.) across all data lake/Lakehouse operations
Mentor and guide team members, contribute to platform roadmaps, and promote best practices in data engineering and Lakehouse design
Stay current with emerging trends in data Lakehouse technologies, open-source tools, and cloud platforms

Skills

AWSAWS CloudFormationAWS GlueAWS IAMAWS Lake FormationAWS S3AirflowCI/CDDatadogDelta LakeDynatraceGitHub ActionsJenkinsLLMPythonPrometheusSQLSnowflakeStep FunctionsTerraformdbt

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free

Engineer II, Enterprise Data Lakehouse

About the role

About

Job Overview

Responsibilities

What are we looking for?

Requirements

Preferences

Skills & Tools

Requirements

Responsibilities

Skills

Similar roles

Sr. AI Engineer

Cybersecurity Senior Engineer

Data Engineer ( Databricks+ Cloud + ML Ops )

Don't send a generic resume