TA
Data Scientist (NLP)
The ACI Group, Inc.
Remote · US Contract 1mo ago
About the role
Description
Seeking a Data Scientist to support long term federal client engagements projects in the DC Metro area. The role will apply statistical programming, modeling, visualization techniques, data mining, and forecasting skills to analyze challenging public sector problems.
Responsibilities
- Pre-processing - Demonstrate the skills and experience to collect, clean, and prepare data sets for input into a computational model using Python. Strong candidates will explain various methods you have applied using common pre-processing functions such as stop word removal, stemming, lemmatization, and tokenization.
- Feature Engineering and Attribute Evaluation - Candidate must demonstrate experience with NLP feature engineering methods such as TF-IDF, word2vec, GloVe, and FastText identifying the key determinants for modeling that exist in the business process and within existing data sets as well as selecting evaluation protocols (model techniques).
- Modeling - Candidates will have practiced skills and experience selecting classification modeling techniques to fit the business problem. Examples will include techniques such as machine learning (ML) supervised and unsupervised learning, regression, neural networks and deep learning, natural language processing, etc.
- Validation - Strong candidates will describe their experience with investigating, reporting, and justifying model results.
- Visualization- Experience in presenting the results of their modeling activities, depicting the insights realized, and explaining the relevance of their results to the organization's business challenges.
Requirements
- Master's degree required, and PhD preferred in Statistics, Mathematics, Computer Science, or similar (NOTE: Will accept a Bachelor's Degree is Python skills are strong)
- High degree of experience utilizing Python to support NLP use cases such as Document Summarization, Named Entity Recognition, Sentiment Analysis, and/or Topic Modeling
- At least four years of experience developing scalable, production-ready NLP solutions using sci-kit learn, Keras, TensorFlow, PyTorch, Spark NLP.
- Experience using git/github to version control source code
- Experience leveraging transformer architecture to develop NLP models
- Experience with open source NLP packages such as Gensim, SpaCy, or NLTK.
- Experience with BERT, GPT-J, RoBERTa, T5 or other transformers
- Experience with GenAI and Prompt Engineering is a plus
- Experience in Databricks and MLFlow is a plus
- Experience with machine translation and transcription of foreign language documents using Microsoft Azure translation services is a plus
- Experience working in an AWS cloud environment and with related AWS services such as Bedrock and Textract
- Experience coordinating and maintaining user stories
- Must be a US citizen
- Must be able to obtain and maintain a Public trust security clearance
About The ACI Group
Since 1988, The ACI Group, a Baltimore-based staffing firm, has been committed to hiring the industry's leading professionals, and presenting exciting career opportunities. We have access to varied types of contract, permanent and contract-to-perm positions and offer a choice of employment options including a full benefits package.
Skills
AWS BedrockAWS TextractBERTDatabricksFastTextGensimGenAIGloVeGPT-JKerasMLFlowNLTKNLPPrompt EngineeringPyTorchPythonRoBERTaSpark NLPSpaCyT5TensorFlowTF-IDFTransformersUS CitizensWord2Vec
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free