Sr. DevOps Consultant (Python, DataCenter Automation Engineer)
Centraprise
About the role
Sr. DevOps Consultant (Python, DataCenter Automation Engineer)
Toronto, Canada (3 Days Onsite/ Week)
12+ Months Contract
Job Description:
Core Focus:
Infrastructure testing, hardware validation, automation frameworks, and large-scale system reliability for compute clusters and data center environments.
Key Skills: • Infrastructure Testing & System Validation • Hardware Validation (Compute / AI Accelerators / Servers) • Automation Framework Development • Linux System Administration & Troubleshooting • Data Center Infrastructure Testing • System Performance, Reliability & Scalability Testing • Debugging & Root Cause Analysis
Technical Skills / Tools: • Programming: Python, Go, Scripting • Infrastructure: Linux, Compute Clusters • Networking: TCP/IP, Routing, Switching • Cluster Orchestration: Kubernetes, Slurm • Monitoring: Prometheus, Grafana • CI/CD: Infrastructure Testing Pipelines • Data Center Technologies
Responsibilities: • Build automation frameworks for infrastructure and system testing. • Design automated tests for AI accelerators, compute clusters, networking, and storage. • Validate performance, reliability, and scalability before production deployment. • Automate deployment validation and health checks across infrastructure clusters. • Troubleshoot hardware and infrastructure issues with engineering teams. • Develop scripts for system bring-up, diagnostics, and validation. • Document testing procedures, validation workflows, and operational playbooks.
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free