How to generate timestamps that actually work?

AvecLeCoeur · January 13, 2024, 11:45pm

Goal: I have an mp3 file and the corresponding text. I would like to have the audio properly aligned with the text in a LingQ lesson.

What I have tried: Import a lesson –
I upload the mp3 file and paste the text into the text box. Hit Save.
Then I open the lesson and hit the “generate timestamps” link on the left side of the page.

Is that the correct procedure? It does not result in any improvement. The audio and text are not aligned.

In practice, I get a more satisfactory result by not adding any text and simply asking LingQ to generate it. Then I fix the text (or not).

I could continue this way, but I can see that a better result is technically possible, so I ask: Am I doing something wrong, or does LingQ just not support this operation?

The language is Danish, if that matters.

Thanks!

sp06 · January 14, 2024, 7:52am

Let me know if you have luck! Lol. Ive been on this platform for nearly 2 years and haven’t seen an improvement in this area except for when whisper sync was added. But since it takes so long I rarely use it. Also you can only upload very small clips which I think is silly as I would like to do it for a whole audiobook and don’t have the time to do it 40 times for a single book.

Pr0metheus · January 15, 2024, 3:29am

After seeing this thread, I was playing about with this tonight and I found that if you hit “Regenerate Lesson” and then hit “Generate timestamps” it kinda works.

I couldn’t fully test it - I only had a short MP3 clip from the lesson I was using, so it didn’t generate the correct timestamps, but it did at least generate them, which is better than the results I was getting before. It may be that if you have the correct length of MP3 file, it might get pretty close. Worth a try anyway. It’s late now and I’m about to hit the sack, but I will do further testing tomorrow.

Pr0metheus · January 15, 2024, 11:40am

I just tried it again this morning, with the full audio of a lesson, and it works like a charm. So the procedure I used this morning was as follows:

Edit the lesson.
Regenerate Lesson (this is the important bit).
Perform the next two steps immediately after regenerating the lesson (i.e. don’t click out of the webpage - I believe in order for it to work, the text has to appear on the page in the regenerated format)
Add the audio file.
Generate timestamps.

I think you can also do all this after you already have the audio file loaded, but I did not test the procedure that way. [edit] I just tried it, and that works too.

So the procedure is simple:

Edit the lesson.
Regenerate Lesson.
Generate Timestamps.

Hope this helps!

nfera · January 15, 2024, 1:14pm

You have to make sure the .mp3 and the transcript are exactly the same. You can’t have an introduction in the podcast/audiobook which is not in the transcript or stuff written in the transcript, which is not actually said. Furthermore, it often has issues with music, such as is common in introductions for podcasts.

If you really need timestamps, I recommend using YouTube content. Alternatively, use Whisper to generate the transcript.

Pr0metheus · January 15, 2024, 1:20pm

I just tested it this morning, and I definitely had words in the transcript which weren’t said, and it still worked fine. But you’re right in that it’s definitely a good idea to remove any music and make sure the transcript and the audio match as perfectly as possible before trying to generate the timestamps.

bamboozled · January 15, 2024, 1:37pm

You have to click the “generate timestamps” button 2x! (regenerate lesson is an unrelated feature)
here is a video: https://youtu.be/5x85QPpABfk

Pr0metheus · January 15, 2024, 1:52pm

I’ve never clicked on “Generate Timestamps” twice. “Regenerate Lesson” can’t be unrelated, since if you click on it, “Generate Timestamps” works (with only one click) and if you don’t, it doesn’t. Maybe it’s a bug, but it definitely works.

LuPeng · January 15, 2024, 2:57pm

If you have a text, it’s a good idea to convert it to Word format (.docx) before importing and - as @nfera wrote - make sure that the transcript matches the audio exactly. Then the results are great.

Speech to text unfortunately leads to quite a few errors, especially in Danish, i.e. words and grammatical forms that do not exist in Danish, fx in “Koen paa isen”. This may be better in other languages.