Skip to content
mimi

Data Engineer (Google Cloud Platform)

Showman Staffing

Remote · Canada Full-time Mid Level 1mo ago

About the role

Role Overview:

We’re looking for a skilled Data Engineer to design, build, and optimize scalable, cloud-native data pipelines on Google Cloud Platform (Google Cloud Platform). The role involves extensive work with Apache Airflow, Spark, Python, and Scala to develop high-performance data solutions supporting analytics, streaming, and generative AI initiatives.

Key Responsibilities:

  • Develop, automate, and maintain batch and streaming ETL pipelines using Apache Airflow, Apache Spark, Python, and Scala.
  • Build and manage cloud-based data ecosystems on Google Cloud Platform (BigQuery, Bigtable, Dataproc, Pub/Sub, Cloud Storage, IAM, VPC).
  • Design and optimize SQL and NoSQL data models for data lakes and warehouses (BigQuery, MongoDB, Snowflake).
  • Write complex SQL queries for advanced data transformation, aggregation, and analytics optimization within BigQuery or equivalent platforms.
  • Apply modern Test Driven Development (TDD) methodologies for big data pipelines, ensuring test automation across Airflow workflows, Spark jobs, and transformation logic.
  • Apply data mesh and data-as-a-product principles to enable reusable and domain-driven datasets.
  • Implement real time ingestion with Kafka Connect and process streaming data using Spark Streaming, Apache Flink, or similar technologies.
  • Optimize data performance, scalability, and cost efficiency across Google Cloud Platform components.
  • Ensure compliance with PCI and PII data with standards such as GDPR, PCI DSS, SOX, and CCPA.
  • Integrate GenAI tools such as OpenAI, Gemini, and Anthropic LLMs for intelligent data quality and analytics enhancement.
  • Collaborate with stakeholders, data scientists, and full stack engineers to deliver trusted, documented, and reusable data products.

Required Qualifications:

  • Bachelor’s or Master’s in Computer Science, Data Engineering, or related field.
  • 5+ years of hands-on experience with large-scale data engineering in cloud environments.
  • Advanced skills using Python, Scala, Spark ecosystem, SQL to build data pipelines.
  • Strong Google Cloud Platform expertise (BigQuery, Bigtable, Dataproc, Pub/Sub, IAM, VPC).
  • Proficiency in SQL/NoSQL modeling and data architecture for cloud data lakes.
  • Familiarity with streaming frameworks (Kafka, Flume).
  • Experience handling sensitive data and ensuring regulatory compliance.
  • Working knowledge of Docker, CI/CD, and modern DevOps practices for data platforms.

Preferred Qualifications:

  • Experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
  • Contributions to open-source projects or internal developer tooling.
  • Prior experience building Customer Data Platforms (CDPs) inhouse.
  • Experience with AI-assisted developer tools (for example, IntelliJ plug-ins using OpenAI or Anthropic models), Codex CLI, Windsurf.

Skills

Apache AirflowApache SparkBigQueryBigtableCDPCCPACloud StorageCI/CDDockerDataprocDevOpsGDPRGeminiGoogle Cloud PlatformIAMKafkaKafka ConnectMongoDBNoSQLOpenAIPCI DSSPub/SubPythonScalaSpark StreamingSQLSnowflakeSOXTerraformVPC

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free