ZC
Senior AI Product Manager / Engineer
Zenius Corporation
Rose Hill · On-site Full-time Senior 2w ago
About the role
Summary Of The Role
We're looking for a Senior AI Product Manager / Engineer to design and build end-to-end AI systems - from model deployment and optimization to autonomous agent orchestration. This is not a research-only role. You'll operate at the intersection of:
- LLMs & multimodal models
- agent frameworks and workflow orchestration
- production-grade infrastructure
You will help define how intelligent systems are built, deployed, and scaled in real-world environments.
What You'll Work On
- Design and implement autonomous AI agents capable of multi-step reasoning, tool use, and workflow execution
- Build agent orchestration systems (memory, planning, tool calling, state management)
- Deploy and serve models (LLMs, vision, multimodal) in production environments
- Optimize models via: fine-tuning (LoRA, full fine-tune), quantization (INT8, 4-bit, GGUF, etc.), distillation and performance tuning
- Develop multi-model pipelines (generation + retrieval + tools + agents)
- Integrate external tools/APIs into agent workflows
- Build evaluation systems for: reasoning quality, hallucination detection, task success rates
Core Responsibilities
AI / ML Systems
- Architect and implement end-to-end AI pipelines
- Work with open-source and proprietary models (LLMs, diffusion, etc.)
- Implement RAG systems, embeddings, and vector search
- Design prompting + system instruction strategies
- Improve latency, throughput, and cost efficiency
Infrastructure & Deployment
- Deploy models using modern stacks (containers, GPUs, serverless where applicable)
- Build scalable inference systems
- Manage model versioning, monitoring, and rollback strategies
- Work with distributed systems and async processing pipelines
Agent & Workflow Engineering
- Build custom agent frameworks or extend existing ones
- Implement: planning/reasoning loops, tool usage, memory (short-term + long-term)
- Design reusable workflows for real-world use cases
Software Engineering Excellence
- Write clean, maintainable, production-grade code (Python primarily)
- Design APIs and services for internal and external use
- Collaborate with product and design to ship user-facing features
Process & Engineering Rigor
- Write clear technical requirements (PRDs / tech specs)
- Produce and maintain technical documentation
- Conduct code reviews and enforce engineering standards
- Define evaluation metrics and testing strategies for AI systems
- Participate in architecture discussions and system design
Requirements
Must-Have
- 3-5+ years in software engineering, with a strong focus on AI/ML systems
- Hands-on experience with LLMs and/or multimodal models
- Experience building or working with AI agents or multi-step workflows
- Strong Python skills and familiarity with ML frameworks (PyTorch, etc.)
- Experience with: model deployment (Docker, cloud, GPU infra), fine-tuning, and/or quantization
- Solid understanding of: prompt engineering, RAG architectures, embeddings + vector databases
Nice-to-Have
- Experience with frameworks like LangGraph, LangChain, LlamaIndex, or custom agent systems
- Familiarity with model serving tools (vLLM, TensorRT, ONNX, etc.)
- Experience with distributed systems and high-scale APIs
- Background in performance optimization/systems engineering
- Contributions to open-source AI projects
What We Value
- Builders who ship, not just experiment
- Strong systems thinking (not just model-level thinking)
- Ability to move between research ideas production systems
- Clear communication and documentation habits
- Ownership mindset and product intuition
Why This Role
- Work on cutting-edge agent systems, not just wrappers
- High ownership and ability to shape architecture
- Build a full-stack AI platform, not a narrow feature
- Fast-moving environment with real-world impact
Skills
DockerLLMsLangChainLangGraphLlamaIndexONNXPyTorchPythonTensorRTvLLMcloudcontainersdistributed systemsembeddingsfine-tuninggpumodel deploymentmultimodal modelsprompt engineeringquantizationrag architecturesvector databasesvector search
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free