Staff Data Engineer/Scientist

CACI International Inc

Chantilly · On-site Full-time Lead 4d ago

About the role

The Opportunity:

We are seeking a Staff Data Engineer/Scientist who wants to take on new and challenging problems. You will support the development of AI/ML algorithms across a range of disciplines, including large language models, natural language processing, and time-series predictive analytics. You will also join a team of excellent researchers and software developers who are eager to mentor and teach their craft.

Responsibilities:

Lead and mentor an interdisciplinary team consisting of both developers and researchers. The team's core focus is the implementation of ETL pipelines to support a variety of AI/ML and LLM solutions, which in turn address a broad range of customer challenges.

  • Assemble large, complex data sets to support AI/ML algorithm implementation
  • Build the infrastructure required for optimal extraction, transformation, and loading of data from various data sources
  • Curate and maintain data stored in support of metrics and evaluation
  • Implement Artificial Intelligence/Machine Learning algorithms
  • Identify, design, and implement internal process improvements, including redesigning infrastructure for greater scalability, optimizing data delivery, and automating manual processes
  • Develop software using Agile methodologies

Qualifications:

Required:

  • B.S. in data science, AI/ML, computer science, or related field
  • Minimum six (6) years of relevant experience as a Data Engineer/Scientist
  • Experience developing data pipelines and normalizing data with canonical Python packages (e.g. NumPy, Pandas, Polars)
  • Experience contributing on a team using version control (e.g. Git, GitLab, Bitbucket)
  • Active TS/SCI U.S. Government Security Clearance with a recent Full-Scope Polygraph (FSP)

Desired:

  • M.S. or PhD in data science, AI/ML, computer science, or related field
  • Experience with GitLab, DevSecOps utilizing test-driven development, containers (e.g. Docker, Docker Compose), cloud services (e.g. AWS), and tools for distributed computing (e.g. Spark, PySpark)
  • Experience leading an interdisciplinary team of researchers and software developers
  • Experience with any of the following:
    • Large Language Models and experience identifying ways to incorporate them into new domains and applications
    • Applying Transformer-based architectures to domains outside of Natural Language Processing (NLP), such as computer vision
    • Natural Language Processing algorithms such as BERT
    • Reinforcement learning and familiarity with Gymnasium/Gym, OpenEnv, TorchRL, RLlib, and Stable Baselines
    • Applying clustering algorithms and/or deep neural networks to real life problems
    • Implementing tracking and pattern-of-life algorithms
  • Experience with GenAIOps techniques (e.g. LLM-as-a-judge) and frameworks (e.g. Langfuse, MLflow, Arize Phoenix)
  • Experience with Machine Learning libraries and frameworks such as Hugging Face and LangChain
  • Experience with Linux
  • Familiarity with using AWS cloud computing resources such as EC2, S3, Lambda, Bedrock, etc.
  • Experience with any of the following additional languages: Java, C++, Rust, Go, and/or C#
  • Experience implementing algorithms on the GPU in Python or C++ using CUDA and other CUDA libraries
  • Experience in application deployment, virtualization, and containerization (e.g. Podman, Docker, Kubernetes, Rancher)
  • Experience shaping and writing proposals

What You Can Expect:

  • A culture of integrity. At CACI, we place character and innovation at the center of everything we do. As a valued team member, you'll be part of a high-performing group dedicated to our customers' missions and driven by a higher purpose - to ensure the safety of our nation.
  • An environment of trust. CACI values the unique contributions that every employee brings to our company and our customers - every day. You'll have the autonomy to take the time you need through a unique flexible time off benefit and have…

Skills

AWS, BERT, C#, C++, Docker, Docker Compose, EC2, GenAIOps, Git, GitLab, Go, Gymnasium/Gym, Hugging Face, Java, Kubernetes, Lambda, LangChain, Langfuse, Linux, LLM, MLflow, Natural Language Processing, NumPy, OpenEnv, Pandas, Podman, Polars, Python, Rancher, Reinforcement learning, RLlib, Rust, S3, Spark, Stable Baselines, TorchRL, Transformer-based architectures
