So, I did some experiments with conversion between English and romance languages, after reading various threads here about the subject. I figured the community can elaborate on these since foreign translated editions may be more easily available to others. Data would be especially interesting from slavic language editions of the same books.
This is very non scientific, but it may be able to provide some data points.
The Experiment: Compare unique word counts of different language editions of the same book to get an idea of the ratio of known words needed in different languages to understand the same book.
Tool: Calibre ā Edit Book ā Tools ā Reports to obtain total and unique word counts.
Iāll use English as the common denominator comparison value of ā1ā
Here is what I have so far:
LES MISERABLES
French Original
Total Words: 515,420
Unique Words: 35,652
English Edition
Total Words: 576,039
Unique Words: 28,019
Unique word ratio French to English: 1.27
THE CHILDREN OF CAPTAIN GRANT
French Original
Total Words: 192,221
Unique Words: 18,328
English Edition
Total Words: 155,596
Unique Words: 12,263
Unique word ratio French to English: 1.49
AROUND THE WORLD IN 80 DAYS
French Original
Total Words: 67,524
Unique Words: 9,749
English Edition
Total Words: 63,890
Unique Words: 7,712
Unique word ratio French to English: 1.26
PRIDE AND PREJUDICE
English Original
Total Words: 125,352
Unique Words: 7,260
French Edition
Total Words: 106,396
Unique Words: 9,623
Unique word ratio French to English: 1.32
THE ENGLISH SPY
English original:
Total Words: 110,246
Unique Words: 11,349
Spanish Edition
Total Words: 114,234
Unique Words: 14,193
Unique word ratio Spanish to English: 1.25
Though there are many caveats to this experiment, it seems to me that the ratios here are pretty consistent between Romance languages and English back and forth translations. With an average ratio of 1.31
Iād be very curious to see numbers for German and other languages for the same titles.