Skip to content
mimi

Senior Data Engineer

Altis Labs

Varennes · On-site Full-time Senior 1w ago

About the role

About Altis Labs

Altis Labs is the computational imaging company accelerating clinical trials with AI. We are on a mission to help get the most effective novel treatments to patients sooner.

Top 20 biopharma sponsors like AstraZeneca, Johnson & Johnson, and Bayer Pharmaceuticals use our AI models trained on the industry's largest cancer imaging database to measure treatment effect with greater confidence. Our fully‑automated AI models predict efficacy from clinical trial imaging data so that sponsors can optimize trial design and accelerate development of their most promising drugs.

Founded in 2019, Altis is a venture‑backed AI company headquartered in Toronto. We are actively growing our team in Canada and the US across functional areas.

About the Position

We are looking for a Senior Data Engineer to design, build, and operate robust data systems that power machine‑learning and analytics workflows in a medical and medical‑imaging context. This role sits at the intersection of data engineering, ML infrastructure, and cloud platforms, and will work across multiple cloud environments and compute backends.

You will be responsible for building scalable, reliable pipelines for medical imaging data (e.g., DICOM) and associated clinical metadata, supporting downstream ML research, model training, and production deployment. You should be comfortable operating in complex environments, collaborating closely with ML scientists, MLOps engineers, and software engineers.

This is a senior, hands‑on role with significant architectural responsibility.

Responsibilities

  • Design, build, and maintain scalable data pipelines for large‑scale medical imaging and clinical datasets
  • Ingest, validate, normalize, and version DICOM and related medical imaging formats
  • Build pipelines that support batch workloads
  • Ensure data quality, lineage, reproducibility, and auditability across systems

Medical & Imaging Context

  • Work with medical imaging data (CT, PET‑CT, MR, etc.) and associated metadata
  • Understand healthcare data constraints, including PHI handling, privacy, and compliance considerations
  • Collaborate with ML teams on feature extraction pipelines (e.g., volumes, segmentations, derived imaging features)
  • Operate across multi‑cloud environments (e.g., AWS, GCP, Slurm), including hybrid or on‑prem components
  • Deploy and manage data workloads on Kubernetes (writing Terraform and Helm charts or manifests where appropriate)
  • Work with containerized pipelines and distributed compute (e.g., GPU‑enabled workloads)
  • Optimize cost, performance, and reliability across cloud platforms
  • Partner closely with ML scientists, MLOps, and software engineers to support end‑to‑end workflows
  • Own systems in production: monitoring, alerting, debugging, and incident response
  • Contribute to technical direction, architecture decisions, and best practices
  • Mentor junior engineers and raise the overall data engineering bar

Qualifications

  • 6+ years of experience in data engineering or related roles
  • Strong experience building production‑grade data pipelines at scale
  • Excellent Python skills (and comfort with performance‑critical data workloads)
  • Deep understanding of data modeling, distributed systems, and ETL/ELT patterns
  • Hands‑on experience with medical imaging data, especially DICOM
  • Familiarity with medical or healthcare data contexts (clinical metadata, imaging workflows, regulatory constraints)
  • Experience working in regulated or privacy‑sensitive environments is strongly preferred
  • Experience operating data systems in at least one major cloud provider (AWS, GCP, Azure)
  • Working knowledge of Kubernetes (deploying workloads, debugging pods, managing resources)
  • Experience with containerized pipelines (Docker)

Nice to Have

  • Experience supporting ML or AI workloads, especially in imaging or healthcare
  • Familiarity with MLOps concepts (data versioning, experiment tracking, reproducibility)
  • Experience with GPU‑backed workloads and high‑performance compute
  • Knowledge of federated learning or distributed data architectures
  • Prior experience in startups or fast‑moving, ambiguous environments

Benefits

  • Competitive pay and generous equity participation
  • Coverage for medical, vision, and dental insurance
  • 4 weeks of vacation per year

Seniority Level

Mid‑Senior level

Employment Type

Full‑time

Job Function

Biotechnology Research, Hospitals and Health Care, and Software Development

Skills

AWSAzureDockerGCPHelmKubernetesPythonSlurmTerraform

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free