Senior, Data Scientist
Walmart
About the role
Position Summary
What you'll do... Join Walmart and your work could help over 275 million global customers live better every week. Yes, we are the Fortune #1 company. But you’ll quickly find we’re a company who wants you to feel comfortable bringing your whole self to work. A career at Walmart is where the world’s most complex challenges meet a kinder way of life. Our mission spreads far beyond the walls of our stores. Join us and you'll discover why we are a world leader in diversity and inclusion, sustainability, and community involvement. From day one, you’ll be empowered and equipped to do the best work of your life. careers.walmart.com
About Walmart Global Tech Org
Imagine working in an environment where one line of code can make life easier for hundreds of millions of people and put a smile on their face. That is what we do at Walmart Global Tech. We are a team of 15,000+ software engineers, data scientists and service professionals within Walmart, the world’s largest retailer, delivering innovations that improve how our customers shop and empower our 2.2 million associates. To others, innovation looks like an app, service, or some code, but Walmart has always been about people. People are why we innovate, and people power our innovations. Being human led is our true disruption.
About Emerging Tech Team
The Emerging Tech team is passionate about solving customer and associate problems with the newest technologies. The team is responsible for creating breakthrough capabilities, delivering frictionless experiences, and making these technologies easily available to thousands of Walmart developers and 2.2 million associates. The applications and services built on these capabilities are used by hundreds of millions of customers daily. We are building new platforms to bring physical and digital world together.
What You Will Be Part Of
At Walmart’s Emerging Tech Extended Reality team, we own some of the most challenging, fascinating, and impactful work in the fields of Computer vision, Machine learning and Deep Learning for next generation Augmented Reality and Virtual Reality experiences.
We are looking for Applied Scientists/researchers/Computer vision engineers with algorithms, programming and/or systems background.
Key Responsibilities
- Design Multi-Modal Evaluation Frameworks: Develop and validate novel evaluation metrics for non-deterministic outputs, specifically video, image, for 3D assets, and audio.
- Build "AI-as-a-Judge" Systems: Fine-tune Vision-Language Models (VLMs) and Reward Models to serve as automated evaluators, creating scalable proxies for human judgment.
- Lead Experimentation & Causal Inference: Design and analyze A/B tests to measure the downstream business impact of GenAI content; apply causal inference techniques to understand how specific asset attributes drive user engagement.
- Orchestrate Human-in-the-Loop (RLHF) Strategy: Define protocols for human evaluation, managing the relationship with annotation partners to create high-quality "Golden Sets" for benchmarking and Reinforcement Learning from Human Feedback (RLHF).
- Strategic Cross-Functional Partnership: Collaborate with ML Engineers and Product Managers to establish "Go/No-Go" model launch criteria based on latency, safety, and perceptual quality standards.
- Research & Innovation: Stay current with state-of-the-art research in perceptual quality (e.g., FID, CLIP scores, VQA) and implement advanced techniques to detect hallucinations, artifacts, or bias in generated content.
Minimum Qualifications
- Master's degree in Computer Science with a specialization in Computer Vision, Machine Learning, or equivalent practical experience.
- 3+ years of experience with machine learning algorithms and tools.
- Strong foundation in statistical analysis, experimental design (A/B testing), and causal inference.
- Hands‑on experience with Generative AI evaluation (e.g., using LLMs/VLMs for evaluation, computing FID/IS/CLIP scores, or designing perceptual studies).
- Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow) for analyzing model outputs and building evaluation pipelines.
- Experience processing unstructured data (image, video, 3D meshes) for analytical purposes.
Preferred Qualifications
- PhD in Machine Learning, Computer Science, or a related technical field.
- Experience designing Reward Models for RLHF pipelines.
- Deep understanding of 3D geometry processing (meshes, point clouds) and how to mathematically quantify "3D quality" (e.g., mesh manifoldness, texture resolution).
- Experience with Crowdsourcing platforms and designing instructions for subjective human evaluation.
- Publication record or practical experience in Computational Photography, Computer Vision Quality Assessment, or Psychophysics.
- Experience with Big Data tools (Spark, SQL, BigQuery) for analyzing large‑scale experiment results.
Benefits
- Incentive awards for performance
- 401(k) match, stock purchase plan
- Paid maternity and parental leave, PTO, multiple health plans
- Medical, vision, and dental coverage
- Short‑term and long‑term disability, company discounts, Military Leave Pay, adoption and surrogacy expense reimbursement, and more
- PTO and/or PPTO for vacation, sick leave, holidays, or other purposes (amount depends on job classification and length of employment)
- Live Better U: Walmart‑paid education benefit program covering tuition, books, and fees
Equal Opportunity Employer
Walmart, Inc. is an Equal Opportunity Employer – By Choice. We believe we are best equipped to help our associates, customers and the communities we serve live better when we really know them. That means understanding, respecting and valuing unique styles, experiences, identities, ideas and opinions – while being inclusive of all people.
Compensation
- Annual salary range: $117,000.00 – $234,000.00
- Additional compensation includes annual or quarterly performance bonuses.
- Additional compensation for certain positions may also include stock.
Additional Minimum Qualifications (Alternative Paths)
- Option 1: Bachelor’s degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology, or related field and 3 years' experience in an analytics related field.
- Option 2: Master’s degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology, or related field and 1 year's experience in an analytics related field.
- Option 3: 5 years' experience in an analytics or related field.
Additional Preferred Qualifications
- Data science, machine learning, optimization models
- Master’s degree in Machine Learning, Computer Science, Information Technology, Operations Research, Statistics, Applied Mathematics, Econometrics
- Successful completion of assessments in Python, Spark, Scala, or R
- Experience with open source frameworks (e.g., scikit‑learn, TensorFlow, PyTorch)
- Background in creating inclusive digital experiences, knowledge of Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, assistive technologies, and integrating digital accessibility
Primary Location
1345 Crossman Ave, Sunnyvale, CA 94089‑1114, United States of America
Drug‑Free Workplace
Walmart and its subsidiaries are committed to maintaining a drug‑free workplace and have a zero‑tolerance policy regarding the use of illegal drugs and alcohol on the job. This policy applies to all employees and aims to create a safe and productive work environment.
Requirements
- Master's degree in Computer Science with a specialization in Computer Vision, Machine Learning, or equivalent practical experience.
- 3+ years of experience with machine learning algorithms and tools.
- Strong foundation in statistical analysis, experimental design (A/B testing), and causal inference.
- Hands-on experience with Generative AI evaluation (e.g., using LLMs/VLMs for evaluation, computing FID/IS/CLIP scores, or designing perceptual studies).
- Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow) for analyzing model outputs and building evaluation pipelines.
- Experience processing unstructured data (image, video, 3D meshes) for analytical purposes.
Responsibilities
- Develop and validate novel evaluation metrics for non-deterministic outputs, specifically video, image, for 3D assets, and audio.
- Fine-tune Vision-Language Models (VLMs) and Reward Models to serve as automated evaluators, creating scalable proxies for human judgment.
- Design and analyze A/B tests to measure the downstream business impact of GenAI content; apply causal inference techniques to understand how specific asset attributes drive user engagement.
- Define protocols for human evaluation, managing the relationship with annotation partners to create high-quality "Golden Sets" for benchmarking and Reinforcement Learning from Human Feedback (RLHF).
- Collaborate with ML Engineers and Product Managers to establish "Go/No-Go" model launch criteria based on latency, safety, and perceptual quality standards.
- Stay current with state-of-the-art research in perceptual quality (e.g., FID, CLIP scores, VQA) and implement advanced techniques to detect hallucinations, artifacts, or bias in generated content.
Benefits
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free