{"id":9910,"date":"2018-07-17T11:57:18","date_gmt":"2018-07-17T15:57:18","guid":{"rendered":"http:\/\/studentwork.prattsi.org\/infovis\/?p=9910"},"modified":"2019-01-11T00:06:38","modified_gmt":"2019-01-11T05:06:38","slug":"zacharys-karate-club","status":"publish","type":"post","link":"https:\/\/studentwork.prattsi.org\/infovis\/labs\/zacharys-karate-club\/","title":{"rendered":"Zachary\u2019s Karate Club"},"content":{"rendered":"<p><span style=\"font-weight: 400\">Zachary\u2019s Karate Club is a well-known dataset that describes the relationships in a university karate club, used by Wayne W. Zachary in his paper \u201c<\/span><span style=\"font-weight: 400\">An Information Flow Model for Conflict and Fission in Small Groups.\u201d This dataset is famous for its clear depiction of community structure, which occurs when nodes of a network can be grouped into densely connected sets. In the case of Zachary\u2019s Karate Club, the network can be split into two groups centered around Mr. Hi, the karate teacher, and John A, the club president, and the network accurately predicts how the karate club splits into two new clubs after an argument regarding pay causes a divide between Hr. Hi and John A. By recording the meetings of group members outside the context of the club itself, the network shows which club members will join which new club in 33 of 34 cases.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Many visualizations of Zachary\u2019s Karate Club have been created since the original paper was published in 1977. The original visualization of this network places each node (member of the club) of the network in a circular pattern and then draws the edges (relationships outside of the context of the club) in straight lines between them.<\/span><\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-9912\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/Social_Network_Model_of_Relationships_in_the_Karate_Club.png?resize=477%2C517\" alt=\"The original visualization of Zachary's Karate Club.\" width=\"477\" height=\"517\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/Social_Network_Model_of_Relationships_in_the_Karate_Club.png?w=477&amp;ssl=1 477w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/Social_Network_Model_of_Relationships_in_the_Karate_Club.png?resize=277%2C300&amp;ssl=1 277w\" sizes=\"auto, (max-width: 477px) 100vw, 477px\" \/><\/p>\n<p><span style=\"font-weight: 400\">In this visualization, nodes 1 and 34 are Mr. Hi and John A respectively. The visualization had the requirement of being printed in black and white, and it does clearly show that nodes 1 and 34 are the most connected of the network. However, because of how dense the relationships are for Mr. Hi and John A, the upper half of the visualization becomes difficult to read. It is also difficult to tell the two new clubs apart from each other given that every line and node is represented in the same manner.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Zachary\u2019s Karate Club was used again in 2002 by Michelle Girvan and Mark Newman to demonstrate community structure in their paper <\/span><a href=\"http:\/\/www.pnas.org\/content\/99\/12\/7821\"><span style=\"font-weight: 400\">\u201cCommunity structure in social and biological networks.\u201d<\/span><\/a><span style=\"font-weight: 400\"> In this paper, they use several different methods to visualize the data.<\/span><\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-9913\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/F4.large_.jpg?resize=840%2C1348\" alt=\"Michelle Girvan and Mark Newman's visualizations.\" width=\"840\" height=\"1348\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/F4.large_.jpg?w=1122&amp;ssl=1 1122w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/F4.large_.jpg?resize=187%2C300&amp;ssl=1 187w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/F4.large_.jpg?resize=768%2C1232&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/F4.large_.jpg?resize=638%2C1024&amp;ssl=1 638w\" sizes=\"auto, (max-width: 840px) 100vw, 840px\" \/><\/p>\n<p><span style=\"font-weight: 400\">Figure A uses a computer generated network to lay out each member of the club and groups them by Mr. Hi (gray square) and John A (white circle). This makes the division in the group much more clear. Figure B demonstrates the same network using Girvan and Newman\u2019s method which calculates \u201cbetweenness\u201d for each node, defined as \u201cthe number of shortest paths between pairs of vertices that run along it.\u201d If two nodes share the same number of shortest paths, their dividing point is weighted equally in the hierarchy, thus splitting the network into groups. While this method clearly splits the club into two groups, it becomes difficult to see Mr. Hi and John A as central to those splits. Figure C shows Zachary\u2019s Karate Club split using edge-independent path counts and it fails to split the network accurately.<\/span><\/p>\n<p><span style=\"font-weight: 400\">For my own visualization, I used Gephi to create a network using the Zachary\u2019s Karate Club dataset available on the <\/span><a href=\"https:\/\/github.com\/gephi\/gephi\/wiki\/Datasets\"><span style=\"font-weight: 400\">Gephi Wiki<\/span><\/a><span style=\"font-weight: 400\">. This dataset comes preformatted for Gephi\u2019s needs but the visualization is created by the user. The original Zachary\u2019s Karate Club dataset is weighted by several measures of friendship, but the most frequently used dataset is this lighter unweighted version, as it still demonstrates the same groups without the added information.<\/span><\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-9914\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/karate-club-3-01-1.png?resize=840%2C383\" alt=\"My visualization.\" width=\"840\" height=\"383\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/karate-club-3-01-1.png?w=1921&amp;ssl=1 1921w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/karate-club-3-01-1.png?resize=300%2C137&amp;ssl=1 300w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/karate-club-3-01-1.png?resize=768%2C350&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/karate-club-3-01-1.png?resize=1024%2C466&amp;ssl=1 1024w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/karate-club-3-01-1.png?w=1680 1680w\" sizes=\"auto, (max-width: 840px) 100vw, 840px\" \/><\/p>\n<p><span style=\"font-weight: 400\">My visualization uses a network divided by color to demonstrate the split between each group. Gephi\u2019s default settings detect four groups but since I knew the story of Zachary\u2019s Karate Club as I approached the visualization, I adjusted the settings to display only two groups. Like Zachary\u2019s original network, this network accurately predicts the division for 33 of 34 of the members (number 9 was predicted to ally with John A but in actuality opted for Mr. Hi). The color and layout help show the two groups, like Figure A of Givan and Newman\u2019s example, but I also adjusted the node size to represent the number of connections each club member has to other members. This helps visualize the connectedness of that club member, similar to what I liked about Zachary\u2019s first map. In order to better show the story of Zachary\u2019s Karate Club, I also used names rather than numbers for the central characters, which brings the viewer\u2019s eye directly to Mr. Hi and John A so they can see that these individuals are the driving force of this division.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Other than the number of groups, several other adjustments had to be made in order to achieve this visualization in Gephi. Gephi defaults to curved edges, which in this case resulted in a muddled appearance. The straight lines in this visualization keeps the relationships between each node clear. I also sized the labels according to the node size, though in Gephi the relationship between the smallest and largest sizes turned out to be too extreme for readability. Unfortunately, adjusting the location and size of labels in Gephi is nigh impossible, so I exported the network to svg in order to edit the labels in Illustrator, only to find that the nodes weren\u2019t exporting. It turns out that a plugin called Polygon Shaped Nodes causes the issue, and uninstalling the plugin fixes the problem. Once I had the entire graph in Illustrator, I was able to adjust label size and position, attempting to keep a balance between readability and relationship to node size. Though the image itself is high resolution and readable when seen at a 1:1 scale, this graph does have difficulty being read at smaller scales. However, the weight given to the most connected nodes tells the most important story in the network, which is how the club divided into two new clubs.<\/span><\/p>\n<p><span style=\"font-weight: 400\">If I were to carry this project forward I would like to explore how weighted relationships might affect the visualization. I would also like to explore how different shapes may change how the viewer understands the story. My visualization shows the split centered around Mr. Hi and John A, but visualizations like Givan and Newman\u2019s Figure B seem to have a less political viewpoint with the same information. I would also want to explore further how to best show that Member 9 is predicted to join John A but actually joined Mr. Hi. So far the best I came up with was using a key for colors vs. shapes but I wanted to create a visualization where a key wouldn\u2019t be required, so I shelved that idea for now and focused on this final product. Overall, Zachary\u2019s Karate Club is a well-studied social network that was very interesting to learn from.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Zachary\u2019s Karate Club is a well-known dataset that describes the relationships in a university karate club, used by Wayne W. Zachary in his paper \u201cAn Information Flow Model for Conflict and Fission in Small Groups.\u201d This dataset is famous for its clear depiction of community structure, which occurs when nodes of a network can be&hellip;<\/p>\n","protected":false},"author":537,"featured_media":9914,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[149,342],"tags":[89,106],"coauthors":[349],"class_list":["post-9910","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-labs","category-networks","tag-network-graphs","tag-network-visualization"],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/07\/karate-club-3-01-1.png?fit=1921%2C875&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paBdcV-2zQ","_links":{"self":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/9910","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/users\/537"}],"replies":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/comments?post=9910"}],"version-history":[{"count":2,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/9910\/revisions"}],"predecessor-version":[{"id":9916,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/9910\/revisions\/9916"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media\/9914"}],"wp:attachment":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media?parent=9910"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/categories?post=9910"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/tags?post=9910"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/coauthors?post=9910"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}