Endangered Languages


Visualization

Introduction

I am exploring, and visualizing, critically endangered languages with around 0-50 speakers  around the world. I am contrasting these data points with the first official language of each country. I am then forming conclusions and making generalizations based on the data. 

Explanation, Inspiration

I was inspired by duolingo; I had been heavily anticipating duolingo’s launch of Yiddish and was mildly disappointed when it was released. I was reading the discussion forums regarding its launch and who would contribute and there was some discussion of how it’s hard to get exact spelling or an agreed upon definition because most of the people contributing had only had some of the language passed down. In 2019 duolingo announced their endangered language course for Navajo and Hawaiin and I wondered how much power an app could have when it comes to both preserving and then encouraging users to revive nearly extinct languages. The Endangered Languages Project has this map but it didn’t list the name of the language anywhere, just grouped the dots by region. I wanted to kind of reverse their map and have the names listed with the color representing a range of the number of speakers. 

Datasets and tools

I used two datasets, one CSV and one Shapefile to create this map. The two datasets I used were:  Endangered languages: the full list and Resource Watch data 2015 I used QGIS to manipulate the files into looking how I envisioned my map (kinda, sorta!) 

Methods and process

I gave myself time to understand or try to understand the little quirks of QGIS and I think it helped a bit, I wanted to represent the world and the most critically endangered languages in the most comprehensive way possible.

I played around with the world languages color scales as well as the color for the dots that represent the number of speakers. I consulted my peer reviewer about colors and sizing of the dots. While I appreciated their advice I continued to change things because this was a visualization I felt strongly about and therefore am not sure I will ever be satisfied with! I had to hide many labels because the labels were messed up. I’m not sure how to explain it concisely but I tried to represent a 0-200 range and yet even with every other range unchecked there were labels for languages whose number of speakers fell outside of the selected range. 

Visualizations and interpretation

Legend:

Legend to use for the following visualizations

Extinct Languages Visualizations:

North America
South America
Africa

From the visuals I can conclude that each continent seems to have lost a similar amount of indigenous languages over the years. North America’s are, however, less evenly dispersed than the extinct languages around South America and Africa.

Critically Endangered Languages:

Africa and the languages that are critically endangered.
North America and its critically endangered languages.

It was surprising to me that North America still had so many indigenous languages that were still being spoken as of 2015, even by 5-50 people. It appears that the same places where languages have gone extinct in North America also contain a myriad of endangered languages on the brink of extinction. The endangered languages are more concentrated in Africa, for the most part.


I thought it was interesting that the largest concentration of endangered points were in South Asia, nearing the top of Australia.

Europe & Asia’s endangered languages.

Reflections

I think for what I know and what I learned it was successful! I liked it. I think learning how to delete things and studying the documentation more closely or knowing exactly what to look up would change some things for me. I would like to keep working on this visualization though, I find the topic really interesting and the information seems relevant to some extent, with the rise of language learning apps and programs (mango languages, lingoda, etc.). I would also spend more time checking the languages are correct and trying to separate the dots. Finally, I would like to have a way to ensure the dots are landing in an exact location. I wasn’t sure how to go about checking that.

Here are snippets of some initial visuals, trial and error stage:

I wanted this to show the number of people who still spoke the language but I couldn’t center the number under the tag for the life of me.
This was made with the world languages data set/ shape file.

Citations

Endangered languages: the full list | News | theguardian.com. (2011, April 15). The Guardian. Retrieved April 10, 2023, from https://www.theguardian.com/news/datablog/2011/apr/15/language-extinct-endangered#data

Home. (n.d.). YouTube. Retrieved April 10, 2023, from https://resourcewatch.org/data/explore/soc_071_world_languages?section=Discover&selectedCollection=&zoom=3&lat=0&lng=0&pitch=0&bearing=0&basemap=dark&labels=light&layers=%255B%257B%2522dataset%2522%253A%252220662342-dcdd-4a42-9f58-bcc80217de71%2522%252C%25

Knowledge sharing. (n.d.). Endangered Languages Project. Retrieved April 10, 2023, from https://www.endangeredlanguages.com/userquery/

Wen, A. (2019, November 6). Duolingo Preserves Endangered Languages — Business Today Online Journal. Business Today Online Journal. Retrieved April 10, 2023, from https://journal.businesstoday.org/bt-online/2019/duolingo-preserves-endangered-languages