Automating Audiobooks: Lessons from the Trenches of AI Speech Synthesis

What if you could transform your text into a compelling audiobook—without ever recording a word? In this talk, we’ll dive into using Azure AI Speech to turn long-form content into immersive audiobooks.

You’ll discover the differences between real-time and batch synthesis, the quirks of SSML formatting, how to instruct Azure OpenAI to generate valid rich SSML markup to explicitly direct the speech synthesis, what it takes to train a custom voice, and the strengths and pitfalls of AI-generated narration.

Expect real-world examples, hands-on insights, and a discussion on the evolving role of AI in storytelling.

 

Share this on...