{"id":18572,"date":"2020-07-02T17:17:39","date_gmt":"2020-07-02T21:17:39","guid":{"rendered":"http:\/\/studentwork.prattsi.org\/infovis\/?p=18572"},"modified":"2020-07-02T17:17:47","modified_gmt":"2020-07-02T21:17:47","slug":"social-circles-in-the-twitter-verse","status":"publish","type":"post","link":"https:\/\/studentwork.prattsi.org\/infovis\/visualization\/social-circles-in-the-twitter-verse\/","title":{"rendered":"Social Circles in the Twitter-Verse"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1070\" height=\"570\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Network-lab-photo.png?fit=840%2C447&amp;ssl=1\" alt=\"\" class=\"wp-image-18580\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Network-lab-photo.png?w=1070&amp;ssl=1 1070w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Network-lab-photo.png?resize=300%2C160&amp;ssl=1 300w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Network-lab-photo.png?resize=1024%2C545&amp;ssl=1 1024w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Network-lab-photo.png?resize=768%2C409&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Network-lab-photo.png?resize=800%2C426&amp;ssl=1 800w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Network-lab-photo.png?resize=338%2C180&amp;ssl=1 338w\" sizes=\"auto, (max-width: 840px) 100vw, 840px\" \/><\/figure>\n\n\n\n<p><strong>Introduction<\/strong><\/p>\n\n\n\n<p>Social networks have become increasingly popular in the past decade. Mobile applications and websites such as Instagram, Twitter, and Facebook are becoming the primary means of communication for both individuals and businesses alike. But what can these social networks tell us about our communication patterns and social structures? By analyzing network data, we can see how different users (or \u201cnodes\u201d) are connected. A connection between nodes can be anything from a \u201cfriendship\u201d, a mention, a comment, or a private message. We call these connections <em>edges<\/em>, and they can provide valuable insight into how information passes through society and different social groups.<\/p>\n\n\n\n<p>In this lab, I looked at data that was gathered from Twitter. The nodes represent distinct Twitter users, and the edges represent interactions between users \u2013 ie, mentions, retweets, or follows. By analyzing the data with tools in Gephi, I was able to identify a few distinct social groups. The visualization revealed how certain groups seem to be more connected to each other than to the greater Twitter universe. Using this visualization, we can start to make predictions about how information is shared and\/or consumed within these groups, and how the concept of an \u2018information echo chamber\u201d can come to exist.<\/p>\n\n\n\n<p><strong>Design References<\/strong><\/p>\n\n\n\n<p>Before starting this project, I watched a <a href=\"https:\/\/www.youtube.com\/watch?v=dSx5_PjaWVE\">youtube tutorial<\/a> on how to visualize communities. The video helped me to understand how to use some of the available tools in Gephi, as well as gave me some ideas for how to present my data (picture of the example visualization from the video is below). Perhaps the main takeaway from the video was to make good color choices for the different communities, or social circles, represented in the visualization. While Gephi generates random color schemes when you partition the data by modularity class, sometimes a few color adjustments to the will make the data more visually appealing.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"568\" height=\"403\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Inspiration.png?resize=568%2C403&#038;ssl=1\" alt=\"\" class=\"wp-image-18574\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Inspiration.png?w=568&amp;ssl=1 568w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Inspiration.png?resize=300%2C213&amp;ssl=1 300w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Inspiration.png?resize=254%2C180&amp;ssl=1 254w\" sizes=\"auto, (max-width: 568px) 100vw, 568px\" \/><\/figure>\n\n\n\n<p><strong>Materials<\/strong><\/p>\n\n\n\n<p>I created my visualization using <a href=\"https:\/\/gephi.org\/\">Gephi<\/a>, a network data analysis software that can be downloaded for free. The <a href=\"https:\/\/snap.stanford.edu\/data\/ego-Twitter.html\">data<\/a> that I used was gathered from Twitter and made publicly available by Stanford University.<\/p>\n\n\n\n<p><strong>Methods<\/strong><\/p>\n\n\n\n<p>I began creating this visualization by first downloading my dataset and saving it in a CSV format. I briefly looked over the data in a plain text editor to make sure it was normalized and formatted properly. After confirming that my data was in good shape, I then imported it into Gephi to start the visualization.<\/p>\n\n\n\n<p>Upon importing the data, the initial visualization is a hairball of nodes and edges. My first step to achieving some visual clarity was to re-size the nodes based on degree (ie, how many connections they had). Using the tools on the left-hand side of the screen, I made nodes with higher degrees of connectivity appear larger, and nodes with lower degrees appear smaller. Pictured below is the tool and settings that I used to achieve this.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"275\" height=\"246\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Node-re-size.png?resize=275%2C246&#038;ssl=1\" alt=\"\" class=\"wp-image-18575\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Node-re-size.png?w=275&amp;ssl=1 275w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Node-re-size.png?resize=201%2C180&amp;ssl=1 201w\" sizes=\"auto, (max-width: 275px) 100vw, 275px\" \/><\/figure>\n\n\n\n<p>My next step was to analyze the network to see if there were certain nodes that were more closely connected to each other. In other words, I wanted to see if there were distinct social circles within my network that communicated with each other more than with other nodes on the network. To do this, I found the Statistics tab on the right-hand side of the screen and ran the Modularity process (pictured below). This found a handful of Twitter communities within my network.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"229\" height=\"251\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Modularity.png?resize=229%2C251&#038;ssl=1\" alt=\"\" class=\"wp-image-18576\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Modularity.png?w=229&amp;ssl=1 229w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Modularity.png?resize=164%2C180&amp;ssl=1 164w\" sizes=\"auto, (max-width: 229px) 100vw, 229px\" \/><\/figure>\n\n\n\n<p>After identifying my Twitter communities, I then wanted to visually separate them by color. To do this, I used the partitioning feature (pictured below). When I ran this, Gephi randomly selected a palette of colors to assign to each module. However, I made a few adjustments to the colors so that they would appear more distinctly on the visualization.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"279\" height=\"240\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Partition.png?resize=279%2C240&#038;ssl=1\" alt=\"\" class=\"wp-image-18577\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Partition.png?w=279&amp;ssl=1 279w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Partition.png?resize=209%2C180&amp;ssl=1 209w\" sizes=\"auto, (max-width: 279px) 100vw, 279px\" \/><\/figure>\n\n\n\n<p>My next step was to eliminate nodes with low degrees of connectivity in order to \u201cclean up\u201d the visualization and only show the nodes with higher degrees. I accomplished this by using the filter on the right-hand side of the screen, and selecting the In-Degree Range option in the Topology category. Pictured below, I found that only displaying nodes with 40+ degrees provided the maximum amount of useful information with a minimal amount of visual clutter.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"243\" height=\"527\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/filter.png?resize=243%2C527&#038;ssl=1\" alt=\"\" class=\"wp-image-18578\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/filter.png?w=243&amp;ssl=1 243w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/filter.png?resize=138%2C300&amp;ssl=1 138w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/filter.png?resize=83%2C180&amp;ssl=1 83w\" sizes=\"auto, (max-width: 243px) 100vw, 243px\" \/><\/figure>\n\n\n\n<p>Finally, I ran the ForceAtlas2 Layout on my visualization using mostly out-of-the-box settings. I made a handful of adjustments to some of the visual aspects of the image, such as labeling the nodes and adjusting the opacity of both the nodes and edges before saving it as a PDF. Pictured below is the final product.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"726\" height=\"574\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Twitter-Network-Black.png?resize=726%2C574&#038;ssl=1\" alt=\"\" class=\"wp-image-18579\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Twitter-Network-Black.png?w=726&amp;ssl=1 726w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Twitter-Network-Black.png?resize=300%2C237&amp;ssl=1 300w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2020\/07\/Twitter-Network-Black.png?resize=228%2C180&amp;ssl=1 228w\" sizes=\"auto, (max-width: 726px) 100vw, 726px\" \/><\/figure>\n\n\n\n<p><strong>Reflection<\/strong><\/p>\n\n\n\n<p>The biggest challenge in this lab was learning the software. Gephi is an extremely powerful program with lots of features that I would like to explore further. Many of the features in Gephi lend themselves to different types of network datasets. Given that social media data represents only one type of network, I would really like to try more visualizations in the future with different data to see the full extent of what Gephi is capable of.<\/p>\n\n\n\n<p><strong>References<\/strong><\/p>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"video-wrapper\"><span class=\"embed-youtube\" style=\"text-align:center; display: block;\"><iframe loading=\"lazy\" class=\"youtube-player\" width=\"840\" height=\"473\" src=\"https:\/\/www.youtube.com\/embed\/dSx5_PjaWVE?version=3&#038;rel=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;fs=1&#038;hl=en-US&#038;autohide=2&#038;wmode=transparent\" allowfullscreen=\"true\" style=\"border:0;\" sandbox=\"allow-scripts allow-same-origin allow-popups allow-presentation allow-popups-to-escape-sandbox\"><\/iframe><\/span><\/div>\n<\/div><\/figure>\n<\/div><\/div>\n\n\n\n<p><a href=\"https:\/\/snap.stanford.edu\/data\/ego-Twitter.html\">https:\/\/snap.stanford.edu\/data\/ego-Twitter.html<\/a><\/p>\n\n\n\n<p><a href=\"https:\/\/gephi.org\/\">https:\/\/gephi.org\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction Social networks have become increasingly popular in the past decade. Mobile applications and websites such as Instagram, Twitter, and Facebook are becoming the primary means of communication for both individuals and businesses alike. But what can these social networks tell us about our communication patterns and social structures? By analyzing network data, we can&hellip;<\/p>\n","protected":false},"author":718,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[1],"tags":[105,106,108,125,110],"coauthors":[533],"class_list":["post-18572","post","type-post","status-publish","format-standard","hentry","category-visualization","tag-network","tag-network-visualization","tag-social-media","tag-social-network","tag-twitter"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paBdcV-4Py","_links":{"self":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/18572","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/users\/718"}],"replies":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/comments?post=18572"}],"version-history":[{"count":1,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/18572\/revisions"}],"predecessor-version":[{"id":18581,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/18572\/revisions\/18581"}],"wp:attachment":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media?parent=18572"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/categories?post=18572"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/tags?post=18572"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/coauthors?post=18572"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}