Skip to content
mimi

Lead Machine Learning Engineer - Speech Synthesis

Telnyx

Jasper · On-site Full-time Lead 5d ago

About the role

Join Telnyx, an industry leader revolutionizing global connectivity. We are dedicated to transforming traditional systems and delivering innovative solutions that solve real-world challenges. Our strong financial stability allows us to invest in cutting-edge technologies while fostering an environment of continuous learning and growth for our team.

We envision a future fueled by borderless connectivity and limitless innovation. By being part of our team, you will lay the groundwork for an interconnected world. We are searching for passionate individuals eager to contribute their skills and grow their careers in this evolving industry. The Impact You'll Drive

As a Lead Machine Learning Engineer specializing in Speech Synthesis, you will be a cornerstone in building Telnyx's next-generation speech synthesis systems. This is a greenfield opportunity, where you will define the architecture, stack, and best practices for training and deploying state-of-the-art multilingual text-to-speech (TTS) models that power our voice AI agents.

Your responsibilities will include: • Owning the technology stack from day one by designing and implementing ML training and inference pipelines for multilingual speech synthesis. • Creating low-latency TTS systems optimized for real-time, streaming speech generation with sub-100ms response times. • Building and fine-tuning cutting-edge multilingual TTS systems using modern architectures, including LLM-based methods. • Developing massive-scale data processing pipelines for text, audio, and phonetic data across numerous languages. • Running distributed training across multi-node GPU clusters, tracking results, and quickly iterating on experiments. • Collaborating with infrastructure and voice platform teams to deploy models globally. • Evaluating and implementing emerging techniques and bringing them to production-grade systems. Technologies You'll Work With • Infrastructure: Docker, Kubernetes, Ray, Kubeflow, MLflow, Weights & Biases • Data Systems: Kafka, Redis, PostgreSQL, Parquet • Defining the stack that supports distributed training, data processing, and inference for global deployment. What We're Looking For • 6+ years of experience in machine learning or speech systems engineering. • Hands-on expertise with neural TTS, speech synthesis, or similar areas (ASR, voice cloning, multilingual modeling). • A proven track record of tackling challenging problems in multilingual TTS or related fields. • Experience with LLM-based approaches for speech synthesis. • Strong proficiency in Python and PyTorch. • Experience in deploying models efficiently using frameworks like ONNX or TensorRT. • Leadership experience in guiding small teams and setting technical direction. • A production-oriented mindset focused on building fast, stable, and maintainable systems.

By joining Telnyx, where voice technology, infrastructure, and AI intersect, your work on multilingual TTS will be central to our vision of creating intelligent, real-time global communications. Apply now to be part of this exciting journey!

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free