C
Data Engineer – Python/PySpark
Centraprise
Jersey City · Hybrid Contract Mid Level Today
About the role
Job Description
Experience
6+ Years
Core Skills
- Programming & Data Engineering: Python, PySpark, Spark, Hadoop, Kafka, ETL pipelines, ELT pipelines, scalable data processing
- Big Data Technologies: Big Data ecosystems, distributed data processing, structured & unstructured data handling
- Database Technologies: SQL, NoSQL databases, data modeling, query optimization
- Domain Expertise: KYC, AML, Client Onboarding, client lifecycle management, financial data processing
- Cloud & Platform Technologies: Cloud platforms, scalable data infrastructure, performance optimization
- Engineering Practices: Agile/Scrum, cross-functional collaboration, pipeline reliability, Spark job optimization
Key Responsibilities
- Design, develop, and maintain scalable ETL/ELT data pipeline
- Process and manage large-scale structured and unstructured datasets
- Build KYC/AML data solutions supporting client onboarding workflows
- Optimize PySpark/Spark jobs for scalability and performance
- Collaborate with agile teams, business stakeholders, and engineering teams
- Ensure reliability and efficiency of big data processing solutions
Skills
Big DataCloud platformsData modelingETLHadoopKafkaKYCNoSQLOnboardingPythonPySparkQuery optimizationSparkSQLScalable data processing
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free