Distinguished AI Engineer
Capital One
About the role
Position Overview
At Capital One, we are creating responsible and reliable AI systems that change banking for good. Our industry-leading use of machine learning enables real‑time, personalized customer experiences—from alerting customers about unusual charges to answering their questions in real time. We are committed to building world‑class applied science and engineering teams to drive breakthrough product experiences and scalable, high‑performance AI infrastructure.
The Intelligent Foundations and Experiences (IFX) team is central to bringing our vision for AI to life. We collaborate with partners across the company to advance the state‑of‑the‑art in AI engineering, building and deploying proprietary solutions that deliver value to millions of customers.
Key Responsibilities
- Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI‑powered products that transform associate work and customer interactions.
- Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.
- Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch.
- Invent and introduce state‑of‑the‑art LLM optimization techniques to enhance scalability, cost, latency, and throughput of large scale production AI systems.
- Contribute to the technical vision and long‑term roadmap of foundational AI systems at Capital One.
Required Qualifications
- Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields with at least 8 years of experience developing AI/ML algorithms or technologies; or a Master's degree in a related field with at least 6 years of relevant experience.
- At least 8 years of programming experience with languages such as Python, Go, Scala, or Java.
Preferred Qualifications
- 8 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud).
- Experience architecting, designing, developing, integrating, delivering, and supporting complex AI systems.
- Proven ability to lead and mentor multiple engineering teams while influencing cross‑functional stakeholders up to the VP level.
- Hands‑on experience developing AI/ML algorithms (e.g. LLM Inference, Similarity Search, VectorDBs, Guardrails) using languages such as Python, C++, C#, Java, or Golang.
- Expertise in optimizing training and inference software for improved hardware utilization, latency, throughput, and cost.
- Passion for staying abreast of the latest AI research and applying novel techniques in production.
- Excellent communication and presentation skills with the ability to articulate complex AI concepts clearly.
Benefits & Perks
- Compensation: Cambridge, MA: $263,900 – $301,200; McLean, VA: $263,900 – $301,200; New York, NY: $287,800 – $328,500; San Francisco, CA: $287,800 – $328,500; San Jose, CA: $287,800 – $328,500.
- Performance‑Based Incentives: Eligible to earn cash bonus(es) and/or long‑term incentives (LTI), depending on plan specifics.
- Benefits: Comprehensive, competitive, and inclusive health, financial, and other benefits designed to support your total well‑being. (Details available on the job
Requirements
- Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related field with at least 8 years of AI/ML experience
- Master's degree in a related field with at least 6 years of AI/ML experience
- Minimum 8 years of programming experience in Python, Go, Scala, or Java
Responsibilities
- Partner with cross-functional engineers, research scientists, technical program managers, and product managers to deliver AI-powered products
- Design, develop, test, deploy, and support AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability
- Leverage Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch
- Invent and introduce state-of-the-art LLM optimization techniques to improve scalability, cost, latency, and throughput
- Contribute to the technical vision and long‑term roadmap of foundational AI systems
Benefits
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free