{"id":11312,"date":"2018-11-07T01:52:27","date_gmt":"2018-11-07T06:52:27","guid":{"rendered":"http:\/\/studentwork.prattsi.org\/infovis\/?p=11312"},"modified":"2019-01-10T23:54:45","modified_gmt":"2019-01-11T04:54:45","slug":"marvel-hero-social-network-gephi","status":"publish","type":"post","link":"https:\/\/studentwork.prattsi.org\/infovis\/labs\/marvel-hero-social-network-gephi\/","title":{"rendered":"Marvel Hero Social Network &#8211; Gephi"},"content":{"rendered":"<h2><strong>Introduction<\/strong><\/h2>\n<p>As everyone\u2019s childhood, superheroes were also a major part of my childhood. I was fascinated by Spiderman and always dreamt of becoming one when I grow up. I\u2019m pretty sure it\u2019s every child\u2019s dream to become his favorite superhero one day. Fast-forward to November 2018, I\u2019m still mesmerized by superheroes, but the only difference is that I don\u2019t dream of becoming one now.<\/p>\n<p>When I talk of superheroes, the first thing that strikes to me is the Marvel Cinematic Universe. It\u2019s been more than a decade since the first Marvel superhero movie &#8211; Iron Man came out and till now they have released 20 movies featuring different superheroes. Since 2008, Marvel h<span style=\"font-size: 1.0625rem\">as built a fanbase of millions.<\/span><\/p>\n<p>Being one in those millions, I always keep in mind that I\u2019m constantly updated with the latest content related to the Marvel Cinematic Universe. While doing that I came across a dataset based on Marvel social network which included occasions where two or more superheroes appeared together in the same franchise. In order to do a network visualization and analysis of the data set, I used Gephi.<\/p>\n<p>By doing a network visualization of this dataset, I wanted to map out large clusters of superheroes who appeared in the same franchise.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"size-large wp-image-11314 aligncenter\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/elijah-o-donnell-378338-unsplash-1.jpg?resize=840%2C459\" alt=\"\" width=\"840\" height=\"459\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/elijah-o-donnell-378338-unsplash-1.jpg?resize=1024%2C560&amp;ssl=1 1024w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/elijah-o-donnell-378338-unsplash-1.jpg?resize=300%2C164&amp;ssl=1 300w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/elijah-o-donnell-378338-unsplash-1.jpg?resize=768%2C420&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/elijah-o-donnell-378338-unsplash-1.jpg?w=1680 1680w\" sizes=\"auto, (max-width: 840px) 100vw, 840px\" \/><\/p>\n<h2><\/h2>\n<h2><\/h2>\n<p>&nbsp;<\/p>\n<h2><strong>Inspiration\/Critique<\/strong><\/h2>\n<p>After I came across this dataset, I wanted to see the type of network visualizations that are already available related to the superhero datasets. This helped me in making my visualization better and understandable. Some of the examples that I liked were:<\/p>\n<ol>\n<li><strong>Pierre Gutierrez\u2019s Marvel Social Graph Analysis<\/strong><\/li>\n<\/ol>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11316 aligncenter\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/marvel_graph_50.jpg?resize=763%2C428\" alt=\"\" width=\"763\" height=\"428\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/marvel_graph_50.jpg?w=763&amp;ssl=1 763w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/marvel_graph_50.jpg?resize=300%2C168&amp;ssl=1 300w\" sizes=\"auto, (max-width: 763px) 100vw, 763px\" \/><\/p>\n<h6 style=\"text-align: center\">Source:\u00a0<a href=\"https:\/\/blog.dataiku.com\/2015\/05\/19\/marvel-social-graph-analysis\">https:\/\/blog.dataiku.com\/2015\/05\/19\/marvel-social-graph-analysis<\/a><\/h6>\n<p>I came across this visualization while going through an article on Dataiku. My main idea was inspired by this visualization. I like the way it clearly bifurcates different movie characters and represents them in different colors. It looks clean from a design point of view and can be understood easily.<\/p>\n<p><strong>2.\u00a0F\u00e9lix Luginb\u00fchl\u2019s Social Network of the Marvel Cinematic Universe<\/strong><\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11317 aligncenter\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/nkHCxCaVHW.gif?resize=636%2C594\" alt=\"\" width=\"636\" height=\"594\" \/><\/p>\n<h6 style=\"text-align: center\">Source:\u00a0<a href=\"http:\/\/felixluginbuhl.com\/network\/\">http:\/\/felixluginbuhl.com\/network\/<\/a><\/h6>\n<p>While I was looking for inspirations, this visualization caught my attention. It is such that it has the characters of the Marvel movies randomly scattered and when you click on a character, it tells you which movies you can find that character in. It also has some user controls such as zoom in and zooms out for the users. I really liked the way this visualization was created.<\/p>\n<p>&nbsp;<\/p>\n<h2><strong>Materials<\/strong><\/h2>\n<ol>\n<li><a href=\"https:\/\/github.com\/gephi\/gephi\/wiki\/Datasets\">The Marvel Social Network Gephi file<\/a> &#8211; This network of superheroes was constructed by Cesc Rossell\u00f3, Ricardo Alberich, and Joe Miro from the University of the Balearic Islands. The data was collected by Infochimps and transformed and enhanced by Kai Chang.<\/li>\n<li><a href=\"https:\/\/gephi.org\/\">Gephi<\/a> &#8211; It is an open-source software for network visualization and analysis. It helps data analysts to intuitively reveal patterns and trends, highlight outliers and tells stories with their data. It uses a 3D render engine to display large graphs in real-time and to speed up the exploration.<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<h2><strong>Method to Create This Visualization<\/strong><\/h2>\n<ol>\n<li><strong>Selecting the Dataset<\/strong><\/li>\n<\/ol>\n<p>Selecting the right dataset has always been a nightmare for me. As I struggled while finding the right dataset in my previous lab (Tableau), this lab was no less. With so much free data available, I was facing difficulty in narrowing down the dataset topic. After much effort, I came across a dataset which was related to the Marvel universe. The best part about this dataset was that it was a Gephi file and it didn\u2019t require any cleanup. Now that I had the dataset\/ Gephi file, it was time to proceed to the main step.<\/p>\n<p>2.\u00a0<strong>Data Visualization Using Gephi<\/strong><\/p>\n<p>In order to make a network visualization from my dataset, I started exploring Gephi. It was my first interaction with this software, so it took me a while to understand the whole interface. When I started the visualization process, I realized that the dataset was huge, and I need to cut down the dataset. This is where I applied the \u2018Range (Degree)\u2019 filter to it. Once the filter was applied, only .34% of the total nodes and .29% of the total edges were visible. After this, I moved to the layout selection, where I selected \u2018ForceAtlas 2\u2019. I then used &#8216;Expansion&#8217; to increase the spacing between the clusters. I did try out the other layouts but, in my opinion, ForceAtlas 2 + Expansion produced the best output. I could clearly see four different clusters of my data scattered in a triangular shape. It was now time to separate each cluster by assigning different colors. So, I decided to run the Modularity function and then assigned colors to the Nodes based on Modularity Class.<\/p>\n<p>For formatting, I moved to the \u2018Preview\u2019 tab where I changed the font style and font size. I also played with the thickness option under the \u2018Edges\u2019 menu.<\/p>\n<p>After all this, my network visualization was ready, and it can be viewed under the \u2018Findings\u2019 section of this report.<\/p>\n<p>&nbsp;<\/p>\n<h2><strong>Findings<\/strong><\/h2>\n<p>The final visualization with the ForceAtlas 2 +\u00a0Expansion layout represented data in four clusters. I could clearly identify 3 clusters &#8211; \u2018Fantastic Four\u2019 and \u2018X-Men\u2019 and characters affiliated to the \u2018Avengers\u2019. I think the Avengers group included characters from the movies and comic books.<\/p>\n<p>I also used some filtering option for my visualization and the statistics were as follows &#8211;<\/p>\n<p>Average Degree &#8211; 35.027<\/p>\n<p>Modularity &#8211; 0.299<\/p>\n<p>Range (Degree) Settings &#8211; From 924 to 2189<\/p>\n<p>&nbsp;<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11318 aligncenter\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/1-1.png?resize=840%2C490\" alt=\"\" width=\"840\" height=\"490\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/1-1.png?w=2479&amp;ssl=1 2479w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/1-1.png?resize=300%2C175&amp;ssl=1 300w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/1-1.png?resize=768%2C448&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/1-1.png?resize=1024%2C597&amp;ssl=1 1024w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/1-1.png?w=1680 1680w\" sizes=\"auto, (max-width: 840px) 100vw, 840px\" \/><\/p>\n<h6 style=\"text-align: center\">Visualization I created using Gephi<\/h6>\n<h2><\/h2>\n<p>&nbsp;<\/p>\n<h2><strong>Reflections<\/strong><\/h2>\n<p>Overall, I enjoyed exploring Gephi but I think it takes time to get used to it. After working on this project, I feel that Gephi restricts the user and does not give as much freedom as Tableau.<\/p>\n<p>Though it is a very powerful tool to make amazing network visualizations, it comes with its drawbacks. The biggest drawback that I feel in this software is the inability to undo any action. It was difficult for me to experiment with the software without the undo button. In order to tackle this inability, I had to save a new version of that particular project after every change I made. As a user, it was very frustrating for me. Also, I think some small features like zoom in and zoom out also worked against my mental model.<\/p>\n<p>Talking about my dataset, I feel that my dataset was very big for this software to process. So next time when I will use Gephi to make any visualization, I will make sure that my dataset is not that big. This will help in making better visualizations.<\/p>\n<p>In all, Gephi has a lot of potential provided it is designed in the right way.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction As everyone\u2019s childhood, superheroes were also a major part of my childhood. I was fascinated by Spiderman and always dreamt of becoming one when I grow up. I\u2019m pretty sure it\u2019s every child\u2019s dream to become his favorite superhero one day. Fast-forward to November 2018, I\u2019m still mesmerized by superheroes, but the only difference&hellip;<\/p>\n","protected":false},"author":548,"featured_media":11314,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[149,342],"tags":[282,283,206,207],"coauthors":[352],"class_list":["post-11312","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-labs","category-networks","tag-data","tag-datavisualization","tag-gephi-network-visualization","tag-marvel"],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/11\/elijah-o-donnell-378338-unsplash-1.jpg?fit=2333%2C1275&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paBdcV-2Ws","_links":{"self":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/11312","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/users\/548"}],"replies":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/comments?post=11312"}],"version-history":[{"count":10,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/11312\/revisions"}],"predecessor-version":[{"id":11413,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/11312\/revisions\/11413"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media\/11314"}],"wp:attachment":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media?parent=11312"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/categories?post=11312"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/tags?post=11312"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/coauthors?post=11312"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}