Skip to content
mimi

Data Engineer – Python/PySpark

Centraprise

Jersey City · Hybrid Contract Mid Level Today

About the role

Job Description

Experience

6+ Years

Core Skills

  • Programming & Data Engineering: Python, PySpark, Spark, Hadoop, Kafka, ETL pipelines, ELT pipelines, scalable data processing
  • Big Data Technologies: Big Data ecosystems, distributed data processing, structured & unstructured data handling
  • Database Technologies: SQL, NoSQL databases, data modeling, query optimization
  • Domain Expertise: KYC, AML, Client Onboarding, client lifecycle management, financial data processing
  • Cloud & Platform Technologies: Cloud platforms, scalable data infrastructure, performance optimization
  • Engineering Practices: Agile/Scrum, cross-functional collaboration, pipeline reliability, Spark job optimization

Key Responsibilities

  • Design, develop, and maintain scalable ETL/ELT data pipeline
  • Process and manage large-scale structured and unstructured datasets
  • Build KYC/AML data solutions supporting client onboarding workflows
  • Optimize PySpark/Spark jobs for scalability and performance
  • Collaborate with agile teams, business stakeholders, and engineering teams
  • Ensure reliability and efficiency of big data processing solutions

Skills

Big DataCloud platformsData modelingETLHadoopKafkaKYCNoSQLOnboardingPythonPySparkQuery optimizationSparkSQLScalable data processing

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free