Azure Text-To-Speech Userscript

Hello everyone!

I made a userscript that I wanted to share. With this script you can generate the text to speech audio for your lesson and it will automatically upload the audio and generate the timestamps.

You do need a Microsoft Azure account to be able to use their neural text to speech, but the process is not very difficult and the selection of voices you can use is great. They do have a free tier in which you can get up to 500k characters per month of text to speech and I believe they give you $200 credit when you sign up if you ever exceed this limit.

Anyways, I made this script because I am studying Japanese and the generated audio and timestamp does not sync very well which makes it hard to use the audio when using sentence viewer. This script fixes that issue and allows me to use voices that I like from Microsoft Azure.

2 Likes

Hi Kevin,
Not sure if you’ve seen this… GitHub - rany2/edge-tts: Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

You might be able to set it up so that it doesn’t need an azure api key.