All jobs

Engineer

Huawei Canada

Edmonton · On-site Contract 3mo ago

Apply with a tailored resume Save job

About the role

Huawei Canada has an immediate 12-month contract opening for an Engineer.

About the team:

The Software-Hardware System Optimization Lab continuously improves the power efficiency and performance of smartphone products through software-hardware systems optimization and architecture innovation. We keep tracking the trends of cutting-edge technologies, building the competitive strength of mobile AI, graphics, multimedia, and software architecture for mobile phone products.

About the job:

Design and build scalable infrastructure to support Reinforcement Learning, Online Search, Recommendation Systems, large model fine-tuning and evaluation/deployment.
Develop efficient ML solutions for Recommendation Systems and RL problems, including Multi-Armed and Contextual Bandit, Tree Search, and Multi-Agent system orchestration.
Implement and optimize deep learning architectures, including custom Transformers for agentic and decision-making systems.
Apply search and optimization techniques to efficiently fine-tune RL and ML models.
Work with large multimodal models (LLMs, VLMs), analyze their components, and fine-tune them for task-specific applications.
Conduct systematic benchmarking, new papers reading, experimentation, and validation in both simulation and real-world product environments.
Collaborate closely with research teams to scale online RL training capabilities and improve system robustness and accuracy.
Explore and integrate emerging AI methodologies and tools into production platforms.

Job requirements

About the ideal candidate:

Master’s or PhD in Computer Science, Machine Learning, or a related field.
Excellent Python programming skills with strong software engineering practices.
Strong foundation in Reinforcement Learning, Deep Learning, Recommender Systems, and Transformer-based architectures.
Demonstrated experience implementing RL algorithms beyond academic prototypes.
Hands-on experience with PyTorch and distributed training frameworks such as DeepSpeed.
Proven research excellence, including at least one publication in top-tier venues (e.g., NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ICRA, RLC).
Familiarity with LLM post-training techniques such as RLHF, PPO/GRPO, SFT, LoRA, or MoE is a strong asset.
Experience with multi-agent RL systems or tool-use agents is an advantage.

Skills

Deep LearningDeepSpeedICCVICMLICRAICLRLLMsLoRAMachine LearningMoENeurIPSPPOPyTorchPythonReinforcement LearningRLHFRLCSFTTransformersVLMs

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free

Engineer

About the role

About the team:

About the job:

Job requirements

About the ideal candidate:

Skills

Similar roles

AI Developer

Regional Asset Manager

backend developer

Don't send a generic resume