Mass Importing of GLOSS dialogues to LingQ - a call to action

Hello all.

I have started the process of mass importing all the content from GLOSS (global language online support system) to LingQ. For those of you who know GLOSS is a treasure trove of short dialogues/audio segments for many languages that have few resources online, the beautiful thing is that for the most part each one of lessons have an accompanying transcript.

I have tried to let the AI transcribe the lessons from the raw audio but for a lot of languages the results have been less than stellar. (due to the audio quality or the language itself not being recognised well)

This was spurred on by a video a watched of Steve where he mentions the power of listening to a narrow amount of content with a high degree of repetition. I have been doing so recently with the Iraqi dialogues from GLOSS and i few that it has really been beneficial to me, a few of the clips i can almost recite by heart now haha. I have felt a massive improvements in my comprehension and even my fluency when speaking with Iraqi friends.

The problem is though the process of downloading each audio Mp3, uploading it to Lingq, grabbing the transcript from the GLOSS website, going back to LingQ. Changing the Lesson text to the correct non AI transcribed version… it is quite tedious to say the least. Especially considering there are literally hundreds of dialogues available per Arabic dialect. (not to mention the other languages that are available that i would like to upload eventually… looking at you Farsi)

For now i want to focus on importing all the Arabic content. I am organising them neatly into seperate courses based on the dialect plus the corresponding level on GLOSS it came from. e.g ‘GLOSS - IRAQI - Level 1’

If anyone wants to join me in getting this treasure trove of content onto LingQ I think it would be a worthwhile endeavour for all of us studying Languages lacking in graded learning material with transcripts. Perhaps whoever is interested can pick a language and work solely on importing the content from that Language to better utilise our combined efforts.

3 Likes

Great idea, I’d like to help. A few things though.

  • A systematic way of cataloguing things (I.e, naming, labelling)
  • A way to stop duplication of course

Since you’ve already started, we can follow your lead?

perhaps we could use this thread to co-ordinate. i am already half way through importing the iraqi arabic dialouges i dont think it would take us long if we had a few people.

The way i am labelling them is like this (GLOSS - LANGUAGE NAME - Level 1 etc) and then the lesson names are just the same as they are on gloss. Maybe we could make a whatts app group to make it easier to work together.

Hey, I’d be down. I’m doing Arabic now too, and can pitch in.

1 Like

I’m currently half way the Iraqi uploads. Do you wanna tackle the Levantine dialogues?

unfortunately you have to go to the ‘lesson source’ sub page on each GLOSS lesson to copy the text. It’s the easiest way to do it if all be it cumbersome. Apart from that you can download the mp3 of the lessons quite easily

I think it’s probably easier if I make us a what’s app group for this project. Thoughts?

I could join that. DM me. I’m not there yet with dialects; I’m still doing MSA. I would start with Egyptian.

Here, this all could be a private onversation.

2 Likes

The Spanish and Russian resources on GLOSS are also quite impressive!

It would be great to have these imported as Lingq courses too.

1 Like

I have just completed all the dialogues from Gloss Iraqi. I am now working on MSA and Russian. I have changed things up and instead of having seperate courses for each level i think it is more straight forward to just put all the lessons from each corresponding language into one course, it became too messy the other way. As long as they are uploaded in order from easiest to hardest i don’t think it should make a huge difference.

3 Likes

Where are they? Are they uploaded publicly?