Skip to content
mimi

Data Scientist (Machine Learning)

Citigroup Inc.

Indiana · On-site Full-time Senior $80k – $120k/yr 3d ago

About the role

Data Science, Data Scientist – Analytics & Information Management (AIM), Gurugram

Excited to grow your career?

• We value our talented employees and strive to help associates grow professionally before recruiting new talent. If you think the open position is right for you, we encourage you to apply!

• Our people make all the difference in our success.

• We are seeking a Senior Data Scientist for our Business Analytics Team. The ideal candidate will drive the development and implementation of analytical solutions to support key business objectives for Banking Operations & Analytics as part of COO (Chief Operating Office).

Role & Responsibilities

• The role will be Data Scientist in the Data Science and Modeling for Banking Operations & Analytics Team, reporting to the AVP/VP leading the team.

• Incumbents will work with large, complex, and unstructured data using tools such as Python, PySpark, SQL, R, etc. to build modeling solutions for various business requirements.

• Primary focus areas include model building, model validations, model implementation, and model governance for multiple portfolios.

• Responsible for documenting data requirements, data collection/processing/cleaning, and exploratory data analysis, including statistical models/algorithms and data visualization techniques.

• May be referred to as Data Scientists.

• Collaborate with team members and business partners to jointly build model‑driven solutions using traditional methods (Linear, Logistic, Segmentation, etc.) as well as machine‑learning solutions.

• Work with model governance & fair lending teams to ensure compliance of models in accordance with Citi standards.

Core Competencies

• Business Obsession – Create client‑centric analytic solutions to business problems with a holistic view of multiple businesses.

• Analytic Project Execution – Own and deliver multiple complex analytic projects, translating business context into modeling solutions that create economic value.

• Domain Expert – Develop deep expertise in banking operations and analytics, operational risk, and related fields.

• Modeling and Tech Savvy – Stay current with the latest modeling techniques, machine‑learning and deep‑learning algorithms, sharing knowledge within the team.

• Statistical Mindset – Proficiency in basic statistics, hypothesis testing, segmentation, and predictive modeling.

• Communication Skills – Translate and articulate technical ideas to a larger audience, including influencing peers and senior management.

• Strong Project Management Skills

• Contribute to organizational initiatives, including competency development, training, and organizational building activities.

Qualifications

• 5+ years of experience in data analytics roles with proficiency in SQL, SAS, Python, PySpark, etc.

• Technical Skills

• Sound knowledge of machine‑learning, deep‑learning, and statistical modeling techniques.

• Experience with ML frameworks and Python libraries such as scikit‑learn, xgboost, Keras, NLTK, BERT, TensorFlow.

• Hands‑on experience in PySpark/Python/R programming and strong SQL experience.

• Experience with large datasets, data warehouses, and pulling data via programming.

• Strong background in statistical analysis.

• Experience with Transformers/LLMs (OpenAI, Claude, Gemini, etc.), prompt engineering, RAG architectures, and tools/frameworks such as TensorFlow, PyTorch, Hugging Face Transformers, LangChain/Graph, LlamaIndex.

• Understanding of transformers/language models.

• Familiarity with vector databases and fine‑tuning techniques.

• Experience developing and deploying AI solutions in partnership with tech and business teams.

• AI/Gen AI proficiency and thought leadership in financial/business analysis and/or credit/risk analysis with the ability to impact key business drivers via disciplined analytics.

Education

• Bachelor's or Master’s degree in Economics, Statistics, Mathematics, Information Technology, Computer Applications, Engineering, etc.

Time Type: Full time

Job Family Group: Decision Management

Job Family: Specialized Analytics (Data Science/Computational Statistics)

Job Level: C11

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

View Citi’s EEO Policy Statement and the Know Your Rights poster.

Requirements

  • 5+ years of experience in data analytics roles with proficiency in SQL, SAS, Python, PySpark, etc.
  • Sound knowledge of machine-learning, deep-learning, and statistical modeling techniques
  • Experience with ML frameworks and Python libraries such as scikit-learn, xgboost, Keras, NLTK, BERT, TensorFlow
  • Hands-on experience in PySpark/Python/R programming and strong SQL experience
  • Experience with large datasets, data warehouses, and pulling data via programming
  • Strong background in statistical analysis

Responsibilities

  • Drive the development and implementation of analytical solutions to support key business objectives for Banking Operations & Analytics
  • Build modeling solutions for various business requirements using tools such as Python, PySpark, SQL, R, etc.
  • Model building, model validations, model implementation, and model governance for multiple portfolios
  • Document data requirements, data collection/processing/cleaning, and exploratory data analysis
  • Collaborate with team members and business partners to jointly build model-driven solutions
  • Work with model governance & fair lending teams to ensure compliance of models in accordance with Citi standards

Skills

PythonPySparkSQLRMachine learningDeep learningStatistical modelingScikit-learnXgboostKerasNLTKBERTTensorFlow

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free