Incorrect chunking of mp3 for transcription (audio cuts off)

This is not a new issue.
The idea of transcription is great, but the implementation has one tiny bug which would be trivial to fix.
The text is properly chunked into sentences. HOWEVER, the audio cuts off before the end of the sentence. This may be a rounding error, not sure.
Any way, this is so easy to fix, just add 500ms to the end of each audio chunk. It is better to overlap a bit then to cut off the end of the sentence!

I can’t believe this hasn’t been fixed yet. Who would want to pay for transcription when the implementation is so shoddy… Please have someone spend one hour on this, it’s not rocket science.

2 Likes

If someone wants to try, import this mp3 file (link expires in 3 days)
Go to sentence view and listen to the first five or six phrases
Even if you don’t know Turkish, looking at the text you can see that for many sentences the audio cuts off too fast

1 Like

我不得不说,英语也有同样的问题,这个问题相比其他的问题,算是很小的问题了

We’ll look into this and see what can be done. Thanks.

It’s worse than I thought
Look at this video
First you’ll see some completely out of sync text vs audio
Then at the end you’ll see that hundreds of audio have the same text line
Please share this with your team, something really big is broken in “audio transcription”
It’s unusable…
That link says Filen but if you click on it it’s a video, okay?

1 Like

@balat This is reported and will be investigate further.

你的这段课程里面的文字是否因为词汇太多,被分为了两节课?但是两节课其中的第一节课的语音是完整的,没有像课程一样被分割?如果是那样的话,完整的语音,匹配不了不完整的文字,时间戳到后面会全部错乱,应该是这个逻辑。

No, this is just when importing a single mp3 file in the browser: Import / Audio (Import audio for transcription)

Okay, thank you Zoran. Looking forward to this being fixed.
It will be absolutely amazing.
In theory it’s a terrific feature, that’s one of the reasons I signed up for premium.

Hello @balat Could you please look into the Settings/Reader/Voice setting. What voice is specified there? Thank you :hibiscus:

Hi, we are importing an mp3, so it’s not about voice settings… The voice is the voice of the mp3

@balat thank you for clarification. I tried my MP3 file and it is fine. Could you please share your file, so I can check it? Also please try to import this file again - will there be the same mistakes? Thank you for your help!

Hi, sorry about the delay.
It is worse than ever. Just tried it again. Opened a new ticket.
If you’d like to try it, I uploaded the file.
The steps to replicate are on this thread (on my screen there’s no contrast, but the words “this thread” are a link you can click on).

1 Like