Dataset for all ISO639 code sorted by country/territory?

Mats Blakstad mats.gbproject at
Sat Sep 17 07:10:26 CDT 2016

I manage to find a dataset on the website of Ethnologue, though it doesn't
look like open source, need to check with them exactly how I'm allowed to
use it:

Thanks for the explanation Phillippe. I know it is not an easy issue. Look
for different resources on the web, any specific links or feedbacks would
be helpful.

On 17 September 2016 at 13:35, Philippe Verdy <verdy_p at> wrote:

> Not all languages are sorted, only those for which there are released data
> in CLDR.
> And languages frequently belong to several countries/territories at the
> same time, with different official or recognized status (itself independant
> of the number of actual speakers, which is very frequently roughly
> estimated).
> Some countries are giving official statistics about their national or
> regional languages, but frequently these stats are old, or underestimated
> or overestimated for political reasons, or some languages are mixed as if
> they were only one, or simply discarded if it is considered locally as a
> secondary language, even if the official language is superficially
> understood but taken as a primary one.
> Statistics are also forgetting native speakers living abroad in a
> diaspora, or secondary learners of a language taught in foreign countries.
> 2016-09-17 11:19 GMT+02:00 Mats Blakstad <mats.gbproject at>:
>> Hi
>> Is there any dataset that contains all languages in the world sorted by
>> country/territory?
>> I found this at Unicode, but seems like only containing the most spoken
>> languages in each country and not the smaller once:
>> tory_language_information.html
>> Thanks in advance for help.
>> Best regards
>> Mats Blakstad
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the Unicode mailing list