Mid-level AI Engineer/Data Engineer
Jobs via Dice
About the role
Dice is the leading career destination for tech experts at every stage of their careers. Our client, TrueHire Staffing LLC, is seeking the following. Apply via Dice today!
Job Title: Mid-level AI Engineer/Data Engineer (EAD)
Location: Philadelphia, PA
Duration: 12+ month contract
Interview: in-person
Should have:
• 4-8 years of software development/engineering experience, including AI and data engineering
• Experience in investment management or investment banking, working with FINANCIAL MARKET DATA pipelines, RAG, and vector databases
• Fluency in Python, API development, and streaming systems such as Kafka
We are building a platform that converts unstructured financial data (emails, corporate actions, index announcements) into high-quality, structured datasets used by financial institutions.
This is not a typical “LLM wrapper” role.
You will work on systems that:
• Extract data from noisy, inconsistent sources
• Validate and reconcile outputs across multiple inputs
• Ensure correctness, traceability, and auditability
The challenge is not just applying LLMs; it’s making them reliable in production for financial workflows.
What You’ll Work On
• Designing pipelines that process high-volume financial documents (batch + near real-time)
• Building LLM-powered extraction workflows (classification, parsing, summarization)
• Implementing validation layers (rule-based + model-based) to reduce hallucinations
• Developing retrieval systems using embeddings and vector search
• Architecting end-to-end systems: ingestion → processing → storage → serving
• Ensuring data quality, observability, and fault tolerance
• Collaborating with product to turn messy data into usable financial intelligence
Core Requirements
• Strong Python and backend/data engineering experience
• Experience building production data pipelines (ETL, streaming, or async systems)
• Solid understanding of distributed systems and failure modes
• Experience working with LLM-based systems in production:
  • Prompt design
  • Output validation
  • Retry/fallback strategies
  • Evaluation and monitoring
• Experience with data storage systems (SQL + NoSQL)
• Familiarity with cloud infrastructure (AWS or similar)
Preferred Experience
• Experience with RAG / vector search systems
• Background in financial data or capital markets
• Experience with streaming systems (Kafka, etc.)
• Experience building multi-step or agent-style workflows
Best Regards,
Ashish Singh
TrueHire Staffing LLC
5900 Balcones Drive, Suite 100, Austin, TX 78731