I
AI/ML & LLM
Innoventrics
Burnsville · On-site Full-time Senior $60 – $65/hr Today
About the role
About Us
We are seeking a skilled Data Engineer with expertise in Google Cloud Platform (GCP), AI/ML, and Large Language Models (LLMs) to design, build, and optimize scalable data pipelines and AI-driven solutions. The ideal candidate will work closely with data scientists, ML engineers, and business stakeholders to enable advanced analytics and intelligent applications.
Key Responsibilities
- Design, develop, and maintain scalable data pipelines on GCP
- Build and optimize ETL/ELT workflows using tools like Cloud Dataflow, Dataproc, and BigQuery
- Collaborate with AI/ML teams to deploy and support machine learning models in production
- Integrate and manage LLM-based applications (e.g., prompt engineering, fine-tuning, RAG pipelines)
- Develop data architectures supporting real-time and batch processing
- Ensure data quality, governance, and security best practices
- Optimize performance and cost of cloud-based data systems
- Work with APIs and external data sources for ingestion and processing
- Implement monitoring, logging, and alerting for data workflows
Required Skills
- Strong experience with Google Cloud Platform (GCP) services
- Hands-on experience with:
- BigQuery
- Cloud Storage
- Dataflow / Apache Beam
- Dataproc (Spark/Hadoop)
- Proficiency in Python and SQL
- Experience with ETL/ELT pipeline development
- Knowledge of machine learning workflows and model deployment
- Hands-on experience with LLMs (Large Language Models) such as GPT-based or open-source models
- Understanding of:
- Prompt engineering
- Retrieval-Augmented Generation (RAG)
- Vector databases (e.g., Pinecone, FAISS)
Preferred Qualifications
- Experience with Vertex AI on GCP
- Familiarity with Docker, Kubernetes
- Experience in Airflow / Cloud Composer
- Knowledge of CI/CD pipelines
- Exposure to data warehousing and data lake architectures
- Understanding of MLOps practices
Skills
Apache BeamBigQueryCloud DataflowCloud StorageDataprocDockerETLFAISSGCPGoogle CloudHadoopKubernetesLLMMachine LearningMLOpsPineconePythonRAGSQLSparkVertex AI
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free