AL
Senior Data Engineer – AWS & Python
Artech LLC
Malvern · flexible Contract Senior $50 – $58/hr 2w ago
About the role
Role Overview
This role is for a senior data engineer who can design and build scalable, real-time data pipelines on AWS using Python and Spark. This is a senior-level data engineering role focused on building scalable, real-time (streaming) data pipelines using Amazon Web Services and Python-based technologies.
Key Responsibilities
- Build and maintain event-driven data pipelines using AWS services such as Kinesis, MSK/Kafka, Lambda, Step Functions, SQS/SNS, and Glue/EMR.
- Develop ETL/ELT workflows using Python and PySpark, ensuring performance, scalability, and cost efficiency.
- Implement and optimize Spark-based data transformations, partitioning strategies, and data processing frameworks.
- Design and manage data lake and warehouse structures using S3, Glue Catalog, Athena, and/or Redshift.
- Build streaming solutions with checkpointing, stateful transformations, idempotency, and schema evolution.
- Ensure high standards of data quality, observability, monitoring, and alerting (CloudWatch, Datadog, etc.).
- Implement data security best practices including IAM, encryption (KMS), networking, and governance.
- Create reusable frameworks, internal libraries, and CI/CD pipelines for automated deployments.
- Collaborate with data scientists, analysts, and business teams to deliver well-modeled, reliable datasets.
- Lead design reviews, mentor junior engineers, and contribute to engineering best practices.
Required Skills
- Overall 8+ yrs of experience
- 5+ years of professional experience in Data Engineering.
- Experience of working on Java is an advantage
- Strong expertise in Python and PySpark for large-scale data processing.
- Advanced hands-on experience with AWS (S3, Glue, EMR, Lambda, Step Functions, Kinesis/MSK, DynamoDB, Athena, Redshift).
- Deep experience building event-driven and streaming data pipelines.
- Strong SQL experience for analytical and ETL workloads.
- Hands-on experience with workflow orchestration tools such as Airflow or Step Functions.
- Experience with CI/CD, Git, and Infrastructure-as-Code (Terraform or CloudFormation).
- Strong understanding of distributed systems, Spark performance tuning, data modeling, and cloud cost optimization.
- Knowledge of data security, encryption, networking, and compliance best practices in cloud environments.
Company Benefits & Culture
- Competitive salary and benefits package.
- Opportunities for professional growth and advancement.
- Dynamic and collaborative work environment.
Skills
AWSAthenaCI/CDCloudFormationCloudWatchDatadogDynamoDBEMRGitGlueIAMJavaKinesisKMSLambdaMSKPythonPySparkRedshiftS3SparkSQSSNSStep FunctionsTerraformSQL
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free