Big Data Software Engineer — Batch & OLAP

Andiamo

New York · On-site · Full-time · Lead · 3mo ago

About the role

Build robust batch pipelines and analytical stores for large-scale reporting, experimentation, and ML feature generation.

Responsibilities

Develop cost-efficient batch processing (Spark/Dataproc/EMR) and orchestration (Airflow/Dagster).
Design lakehouse tables (Iceberg/Delta/Hudi) with compaction, partitioning, and Z-ordering.
Implement semantic/metrics layers and OLAP acceleration (BigQuery/Snowflake/ClickHouse).
Enforce governance: lineage, access controls, PII handling, and retention policies.

Requirements

4+ years of experience in big data; expert-level SQL and performance tuning.
Strong understanding of storage formats, file layout, and query engines.

About Andiamo

Talent Partners for the AI Revolution. As a globally recognized staffing and consulting firm, we specialize in placing the top 2% of technology and go-to-market professionals with the world’s largest and most well-known companies.

For over 20 years, we've maintained the status of tier-one vendor for firms such as Palantir, Amazon, Fluidstack, Bloomberg, Relativity Space, Firefly, MasterCard, Visa, Two Sigma, Citadel, as well as other major financial services firms, elite hedge funds, Google-backed tech start-ups, and major software firms.

Our talent solutions include Permanent Placement, Contract Staffing, Executive Search, and Dedicated Recruiting Services (RPO). Find out more at www.andiamogo.com

Skills

Airflow, BigQuery, ClickHouse, Dataproc, Delta, EMR, Hudi, Iceberg, OLAP, SQL, Spark, Snowflake
