San Francisco, California, United States Hybrid Full-time $160,000 - $250,000/year

Bland AI is hiring a Machine Learning Engineer, TTS Systems

About the Role

What You'll Do

Own the end-to-end deployment of neural text-to-speech models, ensuring they operate reliably at scale with minimal latency. You'll refine inference pipelines using advanced post-training methods to enhance both audio fidelity and system throughput. Work closely with engineering and research teams to integrate new capabilities, run controlled experiments, and continuously improve live systems. Design robust, scalable infrastructure that supports expressive, multi-speaker, and controllable voice synthesis. Establish clear standards for monitoring, reliability, and performance optimization across production environments.

Requirements

  • Proven experience deploying large neural TTS models in production, either on cloud platforms or on-premises.
  • Deep technical knowledge of inference optimization techniques including quantization, kernel tuning, and efficient batching.
  • Familiarity with real-time audio processing constraints and strategies to maintain quality under low-latency demands.
  • Strong grasp of distributed systems, GPU utilization, and scalable backend architectures.
  • Ability to troubleshoot and resolve issues affecting voice quality, system performance, or uptime.
  • Adaptability to fast-moving environments with a hands-on approach to system ownership.

Preferred Qualifications

  • Contributions to open-source TTS or audio processing frameworks.
  • Background in telephony, live communication systems, or enterprise voice applications.

Benefits

  • Comprehensive healthcare coverage including dental and vision
  • Meaningful equity in a rapidly growing company
  • Access to all necessary tools and equipment for effective work
  • Hybrid work model with options for remote work within the U.S. or in-office collaboration in San Francisco
  • Modern office space located in Jackson Square, SF, featuring rooftop views
Required Skills
Deploying neural TTS modelsTTS inference optimizationQuantizationKernel optimizationBatching strategiesGRPODistributed systemsGPU accelerationReal-time audio processingScalable production infrastructureDiagnosing performance issuesCloud deploymentOn-prem deploymentRLHFDPO Deploying neural TTS modelsTTS inference optimizationQuantizationKernel optimizationBatching strategiesGRPODistributed systemsGPU accelerationReal-time audio processingScalable production infrastructureDiagnosing performance issuesCloud deploymentOn-prem deploymentRLHFDPO
Ready to relocate and code from paradise?

Thailand or Vietnam — your office, your rules

Iglu offers relocation to Bangkok, Chiang Mai, Ho Chi Minh City, or Hong Kong. Full employment, legal setup, and a community of 200+ digital professionals.

Relocation to 5 countries
Full legal work setup
Developer community access
Work-life balance culture
Explore locations
Relocation support included
About company
Bland AI

Transform your enterprise communication with Bland AI. Automate inbound and outbound phone calls using AI that sounds human. Perfect for sales, customer support, and operations with customizable voices and seamless integrations.

Bland AI provides a robust platform for building, deploying, and monitoring voice AI agents that handle real-time conversations. The platform supports industries such as healthcare, insurance, and financial services, enabling businesses to automate repetitive calls, improve customer service, and reduce operational costs.

With self-hosted infrastructure optimized for speed, security, and reliability, Bland AI offers dedicated instances, global voice delivery, and full data privacy. The platform integrates with major telephony providers and enterprise systems via API, SIP, or batch processing.

All jobs at Bland AI Visit website
Job Details
Department Engineering
Category data
Posted 2 months ago