Skip to content
mimi

AI Engineer

Above and Beyond Talent Acquisition, Inc

Paramus · On-site Contract $100 – $135/hr Yesterday

About the role

Position Title: Sr AI Engineer Location: Paramus, NJ Duration: 06 Month Contract Status: Hybrid Employment Type: Contract role on W-2 through Above and Beyond Talent Acquisition, Inc. (A&B Talent) Pay Range: USD $135 an hour W2

Client Info / Who they are:

Above and Beyond Talent Acquisition proudly represents our Client, a global leader in water, waste, and energy management. The client offers a full spectrum of water, waste, and energy management services, including water and wastewater treatment, commercial and hazardous waste collection and disposal, energy consulting, and resource recovery. The client helps commercial, industrial, healthcare, higher education, and municipality customers throughout North America. The client has approximately 10,000 employees working at over 350 locations across the continent.

Requirements / Who we are looking for: • 10–15 years of overall software engineering experience. • 5+ years of hands-on experience in Artificial Generative Intelligence, including LLMs, SLMs, RAG, and multi-agent systems. • Deep expertise in Google AI ecosystem: Gemini, Vertex AI, Google ADK, Google AI Studio, and Google Workspace integrations. • Proficiency in Python (primary) and familiarity with Node.js. • Strong background in cloud-native development on GCP. • Demonstrated experience with model fine-tuning (LoRA, QLoRA, PEFT) and model evaluation frameworks. • Solid understanding of MLOps, CI/CD for AI systems, and production deployment best practices. • Experience with multi-agent AI architectures using Semantic Kernel, or LangGraph. • Preferred Qualifications • Google Cloud Professional certifications ( Professional ML Engineer, Professional Cloud Architect) • Contributions to open-source AI/ML projects. • Experience with edge AI deployments and hybrid cloud-edge inference. • Familiarity with synthetic data generation pipelines. • Prior experience mentoring junior engineers or interns in AI/ML domains.

Performance Objectives / What you'll be doing:

1. Large & Small Language Model Engineering • Design, develop, and deploy Agents leveraging commercial LLMs such as Gemini (Google), GPT (OpenAI), and Claude Sonnet (Anthropic) for high-performance, large-context, and multimodal tasks. • Work with open-source/self-hosted LLMs including Mixtral (Mistral AI). • Architect and implement SLM-based solutions using lightweight models such as Phi-3 (Microsoft), Gemma (Google), and Mistral for resource-constrained environments. • Lead fine-tuning and customization of models using Vertex AI Tuning, Hugging Face Transformers, and parameter-efficient fine-tuning (PEFT) methods including LoRA and QLoRA. • Apply training frameworks such as PyTorch, TensorFlow, or JAX for model experimentation and development. • Generate synthetic data and evaluate models using HELM, lm-evaluation-harness, and custom benchmarks.

2. Google AI & Workspace Integration • Lead the design and implementation of AI-powered solutions deeply integrated with Google Workspace (Docs, Sheets, Drive, Gmail, Meet), Big Query and Lakehouse. • Architect and build intelligent agents and workflows using Google Agent Development Kit (ADK). • Leverage Google AI Studio as the primary IDE, VSCode for AI application development and prototyping. • Utilize Google Cloud Platform (GCP) services including: • Vertex AI for ML model training, tuning, and deployment • GKE (Google Kubernetes Engine) for container orchestration • Cloud Run for serverless deployment • Cloud Functions for event-driven AI tasks • Vertex AI Vector DBs for semantic search and retrieval

3. Design & Planning • Lead requirements gathering using Confluence for documentation and team collaboration. • Create detailed system architecture diagrams and AI workflows using Lucidchart. • Design UI/UX prototypes in Figma for AI-powered application interfaces. • Manage project delivery and sprint planning using Jira. • Oversee data preparation and management: cleaning, transforming, and organizing data for AI/ML workflows. • Conduct data analysis using Jupyter Notebooks and pandas for exploration and preprocessing. • Leverage Hugging Face Model Hub for model comparison, selection, and download.

4. Development Frameworks & Tools • Orchestrate LLM/SLM applications using LangChain, LlamaIndex, and LangGraph. • Build multi-agent systems with Semantic Kernel, and LangGraph. • Manage and optimize prompts using LangSmith and PromptLayer. • Deploy models locally with Ollama or at scale with vLLM for efficient inference. • Track experiments, metrics, and results with MLflow or Weights & Biases. • Manage code and data versioning with Git.

5. Vector Databases & Semantic Search • Implement semantic search and Retrieval-Augmented Generation (RAG) pipelines using Vertex AI Vector DBs and ChromaDB. • Design and optimize end-to-end RAG architectures for enterprise-grade knowledge retrieval.

6. Backend Development • Develop robust RESTful APIs using FastAPI (Python) or Express.js (Node.js). • Manage and secure APIs using Mulesoft, Apigee.

7. Frontend Development • Build modern user interfaces using React or Angular. • Utilize Material-UI for consistent, accessible, and modern UI components. • Prototype and plan UI/UX workflows using Figma.

8. Development Tools & Code Quality • Write and debug code in VS Code with Python and GitHub Copilot extensions. • Leverage GitHub Copilot for AI-assisted code suggestions and productivity. • Manage source code with GitHub or GitLab. • Enforce code quality and standards using SonarQube, ESLint, and Pylint.

9. Testing & Quality Assurance • Conduct LLM-specific testing using RAGAS and DeepEval for LLM/RAG pipeline evaluation. • Use LangSmith Evaluators for prompt testing and hallucination detection. • Write and execute unit tests using pytest. • Ensure output quality and reliability using LangChain Evaluators and custom metrics.

10. Deployment & Infrastructure • Orchestrate containers at scale with Kubernetes (K8s), and Google GKE. • Automate CI/CD pipelines using GitHub Actions or GitLab CI. • Support on-premise, cloud (GCP/Vertex AI), and hybrid infrastructure deployments including edge devices for local inference.

11. LLM Monitoring & Observability • Monitor LLM performance and usage with LangSmith and Weights & Biases. • Track and optimize AI infrastructure costs using OpenMeter and custom dashboards. • Set up continuous evaluation pipelines to ensure ongoing model quality and reliability. • Monitor application and model performance end-to-end with LangSmith observability tools.

Perks of working with US / What We offer: • Competitive Salary. • Health, dental, and vision insurance. • Company 401K plan

The salary, other compensation, and benefits information are accurate as of the date of this posting. The Company reserves the right to modify this information at any time, subject to applicable law. “Above and Beyond Talent is an equal opportunity employer and staffing firm.”

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free