Data Engineer – AI/ML
CodeHire Global Solutions
On-site | Full-time | Senior | $85k – $110k/yr | Posted 1w ago
About the role
We are seeking a skilled and experienced Data Engineer – AI/ML to join our growing data and analytics team. The ideal candidate will have strong expertise in building scalable data pipelines, supporting AI/ML workflows, and working with large-scale structured and unstructured datasets.
The candidate will work closely with Data Scientists, ML Engineers, and business teams to design, develop, and optimize data solutions that power machine learning and AI-driven applications.
Key Responsibilities
- Design, develop, and maintain scalable data pipelines and ETL/ELT workflows
- Build and optimize data architectures for AI/ML applications
- Integrate data from multiple structured and unstructured sources
- Support machine learning model training, deployment, and monitoring workflows
- Develop and maintain batch and real-time data processing solutions
- Collaborate with Data Scientists and ML Engineers to prepare datasets for modeling
- Ensure data quality and adherence to governance, security, and compliance standards
- Optimize database performance, query execution, and data storage solutions
- Automate data validation, monitoring, and pipeline orchestration processes
- Participate in code reviews, testing, deployment, and production support activities
Required Skills
- 6+ years of experience in Data Engineering
- Strong experience with Python and SQL
- Hands-on experience with big data technologies and distributed data processing
- Experience building ETL/ELT pipelines
- Strong knowledge of cloud platforms such as AWS, Azure, or GCP
- Experience with data warehousing and data lake solutions
- Knowledge of AI/ML workflows and model data preparation
- Experience with Spark, PySpark, or Databricks
- Strong understanding of REST APIs and data integration patterns
- Experience with version control and CI/CD pipelines
Preferred Skills
- Experience supporting AI/ML or Generative AI projects
- Familiarity with MLOps concepts and ML lifecycle management
- Experience with streaming technologies such as Kafka
- Knowledge of containerization tools like Docker and Kubernetes
- Experience with orchestration tools such as Airflow
- Strong analytical and troubleshooting skills
Technologies
- Python
- SQL
- Spark / PySpark
- Databricks
- Azure / AWS / GCP
- Airflow
- Kafka
- Docker
- Kubernetes
- Git
- CI/CD Pipelines
Pay
$85,000.00-$110,000.00 per year
Work Location
In person
Skills
AWS, Azure, Databricks, Docker, GCP, Git, Kafka, Kubernetes, Python, SQL, Spark, PySpark, Airflow, CI/CD Pipelines