Network Map: Top 1000 Frequent Occurring Surnames Between Races


Lab Reports, Networks

Introduction

I was able to interpret data provided by studying information from the 2010 Census. The data presents a normalized list of the top 1000 shared surnames between six, general and world race groups as defined by the US census. By creating a network visualization the intention is to determine the strengths between races and popular surnames. Specifically, my intentions are to connect surnames across six race groups, and ultimately see the connections and note any strengths in number of occurrences.

Discussion

I researched several network maps that cluster occurrences and sort them into specific groups.

Network Map: Hashtag cooccurrence network of tweets about Trump

“Hashtag cooccurrence network of tweets about Trump (collected thanks to Twitter search API and using the simple query « trump ») during the election night” (Cortext, 2017).

Proust Topic Network

Jeff Drouin “actually spends some time looking at the words that connect the topics and their meaning” (Meeks, 2011).

Stanford digital humanities projects and participants sorted by modularity

“Stanford digital humanities projects and participants sorted by modularity” (Meeks, 2010).

Materials and Procedures

Open Refine helps to consolidate the columns and information and spread them into rows so that software like Tableau and Gephi can read the data properly (in the order the the program can build structured maps and graphs).

Filtering and categorizing in OpenRefine

Filtering and categorizing in OpenRefine

Once the spreadsheet was cleaned up, the CSV file was taken into Gephi. The graphing software performs several functions that calculates the strength between primary nodes. In this case, the primary nodes are the six groups that are separated by common race groups.

Building the network in Gephi

Building the network in Gephi

Results

Though there are origins associated with surnames, they are not contingent to one race. This network map shows surnames and their association to different groups of races as defined by the census. It is easy to understand that surnames are not exclusive to a specific race, but the map offers more insight into how surnames are shared between races. Broadly, surnames seemed to be shared across the six groups mostly with the Latinx and White communities. More specificaly, one can infer that Black, Multi-racial, and Latinx groups share a high count of surnames with White communities. The same can be said that the other race groups share a high count of surnames with Latinx communities.

Network Map: Top 1000 Frequent Occurring Surnames

The network map shows the strength of ties between races and the top 1000 surnames. For example the number one surname that is shared across races is Smith.

Future Directions

I should break up the network map to show strengths between specific groups. This way more of the map is visible and the strength between the nodes is more prominent. For example, I can map and create a visualization of the strength of number of occurrences of a surname between Latinx and White, or between Black and White.

I should also note that this map can be supplemented with statistical information. In the future, I hope to culminate my data with the support of statistical graphs that can be created in Tableau. This will support the results with complementary data about the number of times each surname occurs in each race group. Tableau graphs are powerful in creating a separation between the categories of data, to see a clearer reference.

References

Cortext. (2017, March 17). Hashtag cooccurrence network of tweets about Trump. Retrieved July 13, 2018, from https://www.cortext.net/hashtag-cooccurrence-network-of-tweets-about-trump/

Meeks, E. (2011, July 6). Topic Networks in Proust. Retrieved July 13, 2018, from https://dhs.stanford.edu/algorithmic-literacy/topic-networks-in-proust/

Meeks, E. (2010, October 13). The Digital Humanities as a Network Map. Retrieved July 13, 2018, from https://dhs.stanford.edu/the-digital-humanities-as/the-digital-humanities-as-a-network-map/

Olson, R. (2016, March 04). Revisiting the vaccine visualizations. Retrieved July 5, 2018, from http://www.randalolson.com/2016/03/04/revisiting-the-vaccine-visualizations/

United States Department of Commerce. Bureau of the Census. Frequently Occurring Surnames from the 2010 Census. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2017-05-29. https://doi.org/10.3886/E100668V1