{"id":7212,"date":"2017-08-15T18:55:17","date_gmt":"2017-08-15T22:55:17","guid":{"rendered":"http:\/\/research.prattsils.org\/?p=7212"},"modified":"2017-08-15T18:55:17","modified_gmt":"2017-08-15T22:55:17","slug":"final-exploring-interdisciplinarity-higher-education","status":"publish","type":"post","link":"https:\/\/studentwork.prattsi.org\/infovis\/visualization\/final-exploring-interdisciplinarity-higher-education\/","title":{"rendered":"Final: Exploring the Interdisciplinarity of Higher Education"},"content":{"rendered":"<h3>Introduction<\/h3>\n<p>Academic publishing is a large, lucrative business. From textbooks to journal articles and many formats in between, the publishing and distribution of scholarly materials rake in\u00a0over twenty-five\u00a0billion\u00a0dollars a year (Bluestone, 2015).<\/p>\n<p>I have the pleasure of working for a giant in the industry, ITHAKA Harbors, Inc. Specifically, I work for a subsidiary called JSTOR, a digital platform which licenses journals and makes their content available to various subscribers in higher education. The journals are organized by subject to facilitate efficient and efficacious discovery, but inevitably these groupings prove outdated, as fields of study become more interdisciplinary and less easily defined. To ensure that JSTOR remains both relevant and innovative, the content development department is in constant conversation about the boundaries of current subjects and the potential of new ones.<\/p>\n<p>For the last six months, I have collected data on the latest developments in higher education. My data collection, or, more aptly, data selection process is outlined in Appendix I. The purpose of the dataset is two-fold:<\/p>\n<ul>\n<li>To provide a snapshot of the current trends in higher education<\/li>\n<li>To inform general content development inquiries<\/li>\n<\/ul>\n<p>In addition to producing this data, I am also charged with designing visualizations to help communicate key conclusions. 
During conversations with many interested parties within ITHAKA, the desire to highlight and explore the interdisciplinarity of the data arose repeatedly.<\/p>\n<h3>Discussion<\/h3>\n<p>Given its utility as a display of relationships and interconnectedness, a network visualization is appropriate to show the most commonly occurring subjects and those which are most frequently associated.<\/p>\n<p>In his search to discern the centrality of Hegel in the history of philosophy, Adam Hogan shares a vis created by his friend using Gephi:<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/www.designandanalytics.com\/sites\/default\/files\/Philo-final-low-res.png\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-7215\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infoshow\/wp-content\/uploads\/sites\/2\/2017\/08\/Philo-final-low-res-620x576.png?resize=620%2C576\" alt=\"\" width=\"620\" height=\"576\" \/><\/a><\/p>\n<p>The graphic shows Hegel as a highly influential figure, but Hogan was dubious, saying that \u201c[his] money would have been on Plato\u201d (Hogan, 2012). Using data scraped from Wikipedia and relying on his own experience with the history of philosophy, Hogan tests his theory by recreating the vis with a few key changes.<\/p>\n<p>First, Hogan changes the edges from undirected to directed, a better proxy for the inspired\/inspirer relationship. The result shows no discernible differences. Next, Hogan swaps the \u201cdefault Authority measure\u201d in Gephi (Hogan, 2012) for PageRank, \u201cthe secret sauce that Google used\u201d to decide which sites were most important in search results (Sullivan, 2016). Certain that this alteration would be the key to a more accurate vis, Hogan met with disappointment again.<\/p>\n<p>Eventually, Hogan decided that the issue lay within the data and did not involve Gephi at all. I can certainly relate, as my own vis has had problems I have not been able to resolve in Gephi. 
The main difference is that I have <a href=\"https:\/\/media.giphy.com\/media\/OPU6wzx8JrHna\/giphy.gif\" target=\"_blank\" rel=\"noopener noreferrer\">no one to blame for an unruly dataset but myself<\/a>.<\/p>\n<p>Here\u2019s Hogan\u2019s vis again (<a href=\"http:\/\/www.designandanalytics.com\/visual-social-network-analysis-in-R-and-gephi-part-II\" target=\"_blank\" rel=\"noopener noreferrer\">part 2<\/a>), but with <a href=\"http:\/\/www.designandanalytics.com\/philosophers-gephi\/\" target=\"_blank\" rel=\"noopener noreferrer\">an interactive interface<\/a>.<\/p>\n<p>Because the audience for my vis will almost certainly value a movable, scalable graphic over a PDF, I\u2019m committed to untangling the complications of SigmaJS to produce something similar.<\/p>\n<p>The following visualization is a result of manipulating a JSTOR dataset.<\/p>\n<p><a href=\"http:\/\/diging.github.io\/tethne\/api\/tutorial.mallet.html\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-7216\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infoshow\/wp-content\/uploads\/sites\/2\/2017\/08\/semantic_network-620x533.png?resize=620%2C533\" alt=\"\" width=\"620\" height=\"533\" \/><\/a><\/p>\n<p>I am exclusively interested in the aesthetics of this vis, however. In particular, I\u2019m partial to substituting the node labels for the nodes and to relative node (font) sizing according to \u201cstructural importance (betweenness centrality)\u201d. Also, the edges are thicker for \u201cmore strongly\u2026 associated\u201d words 
(\u201cGenerating and Visualizing\u2026\u201d, 2015).<\/p>\n<p>Nodus Labs produced a vis with color-coded groupings to show the structure of Russian protest groups.<\/p>\n<p><a href=\"http:\/\/noduslabs.com\/cases\/russian-protest-network-analysis-facebook-gephi-netvizz\/\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-7217\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infoshow\/wp-content\/uploads\/sites\/2\/2017\/08\/group-without-putin-510x378.png?resize=510%2C378\" alt=\"\" width=\"510\" height=\"378\" \/><\/a><\/p>\n<p>Fortunately, they outlined their process, and in Future Directions, I share my intent to apply their approach to a more representative dataset than what is used in this lab.<\/p>\n<h3>Materials<\/h3>\n<p>I used a computer with the following programs: Gephi (0.9.1) and the SigmaJS plugin (for Gephi 0.9.0), R, and Excel.<\/p>\n<h3>Methods and Results<\/h3>\n<p>Data selection is outlined in Appendix I. From the larger dataset, I cut out the Subjects field and pasted it into a CSV file. This file proved too large to parse through in R, so I pared down the CSV again to the first 100 records. I uploaded this CSV file to my R workspace and applied <a href=\"https:\/\/docs.google.com\/document\/d\/1O_KH_4OGMdnj54w1wLcLaG4CtnWbo5PbRC2a5sx4-Hg\/edit\" target=\"_blank\" rel=\"noopener noreferrer\">the R code provided by Professor Chris Sula<\/a> for generating all possible edges and attaching the frequency of their occurrence within the data. The code includes a line which saves the edges table to a new CSV file. In Excel, I trimmed the cells to remove the leading spaces that would cause duplicate data in Gephi. I also removed subjects which were not considered \u201cofficial,\u201d e.g. \u201cApplicable JSTOR Subject Not Found.\u201d<\/p>\n<p>The edges table was uploaded to Gephi with Type set to Undirected. 
I copied Node names into the Labels column and assigned the field containing frequency to the edges\u2019 Weight field. Then, I calculated betweenness and Weighted Degree.<\/p>\n<p>I applied the following layouts to the final vis in this order: OpenOrd, Expansion (ten times), Noverlap. I set node color to #75b5ff and node size to a scale of 5 to 100 by Weighted Degree. I set edge color on a scale from pale to deep orange by edge\u2019s weight and edge thickness by the same measure.<\/p>\n<p>To be consistent with other visualizations I\u2019ve made for my office, I chose #D1D1D1 as the background color.<\/p>\n<p>I performed calculations for diameter and density. The diameter was 1, which is expected because each subject in the small dataset was connected to every other subject at least once. The density was high for the same reason.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-7218\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infoshow\/wp-content\/uploads\/sites\/2\/2017\/08\/Final-Vis-620x620.png?resize=620%2C620\" alt=\"\" width=\"620\" height=\"620\" \/><\/p>\n<h3>Future Directions<\/h3>\n<p>I experimented with several layouts before resetting and applying the final mix. 
While they produced interesting visuals, they did not show the groupings I envisioned (like those of the Russian protest groups vis).<\/p>\n<p>Here&#8217;s the Fruchterman Reingold<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-7219\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infoshow\/wp-content\/uploads\/sites\/2\/2017\/08\/Fruchterman-Reingold-620x396.png?resize=620%2C396\" alt=\"\" width=\"620\" height=\"396\" \/><\/p>\n<p>and the Force Atlas:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-7220\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infoshow\/wp-content\/uploads\/sites\/2\/2017\/08\/Force-Atlas-620x396.png?resize=620%2C396\" alt=\"\" width=\"620\" height=\"396\" \/><\/p>\n<p>For future iterations, I will try still more layouts, starting with the procedure of Nodus Labs.<\/p>\n<p>I would like the vis to be interactive before presenting it to my supervisors (one of whom explicitly expressed her desire to see this done). Unfortunately, because of unknown complications with SigmaJS, this version is only available as an image or from the Gephi file. Embedding the vis in a site on our intranet with the ability to be manipulated and modified for the user\u2019s specific interest is the ideal, so I will continue troubleshooting.<\/p>\n<p>Further, the data used to create this vis were not a representative sampling. The most prominent nodes (like Public Policy &amp; Administration) do not reflect the statistics taken of the original data (which show other subjects to be most popular by frequency of occurrence and other measures). The vis is accurate, though, in displaying which subjects occur together (Health Policy, Public Health, and Health Sciences are strongly linked, as are Health Policy and Public Policy &amp; Administration). 
Because Gephi was limited in the size of the imported file just like R, the full dataset is not an option for analysis. Therefore, I will work with the other data scientists and information professionals in my office to select \u201cbetter\u201d records for analysis and visualization.<\/p>\n<p>On a strictly aesthetic basis, coworkers who have viewed the vis explained that the color scheme is misleading and the overlay of visual cues causes confusion. In class, we discussed that the use of color only stretches so far, so I will be revisiting those design decisions as well.<\/p>\n<p>Additionally, the data are private, so the final visualization won\u2019t be shareable outside of my company. Next steps for this vis thus include making it publicly available, so that all interested parties have access to this wealth of information concerning higher education.<\/p>\n<h3>References<\/h3>\n<p>Bluestone, Marisa (2015). U.S. Publishing Industry\u2019s Annual Survey Reveals Nearly $28 Billion Revenue in 2015. <em>Association of American Publishers<\/em>. Retrieved from http:\/\/newsroom.publishers.org\/us-publishing-industrys-annual-survey-reveals-nearly-28-billion-in-revenue-in-2015\/<\/p>\n<p>Generating and Visualizing Topic Models with Tethne and MALLET (2015). <em>ASU Digital Innovation Group<\/em>. Retrieved from http:\/\/diging.github.io\/tethne\/api\/tutorial.mallet.html<\/p>\n<p>Hogan, Adam (2012). Visualizing the History of Philosophy as a social network: The Problem with Hegel. <em>Design &amp; Analytics<\/em>. Retrieved from http:\/\/www.designandanalytics.com\/visualizing-the-history-of-philosophy-as-a-social-network-the-problem-with-hegel<\/p>\n<p>Nodus Labs (2011). Network Analysis of Russian Protest Groups on Facebook using Gephi and Netvizz. <em>Nodus Labs<\/em>. Retrieved from http:\/\/noduslabs.com\/cases\/russian-protest-network-analysis-facebook-gephi-netvizz\/<\/p>\n<p>Sullivan, Danny (2016). 
RIP Google PageRank score: A retrospective on how it ruined the web. <em>Search Engine Land<\/em>. Retrieved from http:\/\/searchengineland.com\/rip-google-pagerank-retrospective-244286<\/p>\n<p>&nbsp;<\/p>\n<h3>Appendix I: Data Selection Process<\/h3>\n<ol>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Move from highest to lowest ranked on THE World University Rankings<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Find institutional website.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Utilize Google advanced search shorthand to locate relevant sub-sites. The following slides explain this process.<\/span>\n<ol>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Key search terms are listed in a later slide.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">If information is difficult to glean from Google search, use the search function (and all other available tools) on the institution\u2019s home page.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Read every relevant site that has the institutional site name as its base (e.g. for University of California, Berkeley, peruse all pertinent sites with the base \u201cberkeley.edu\u201d).<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Relevance refers to the potential for each initiative to produce new scholarship as a \u201ctangible priorit[y]\u201d of the institution.<\/span><\/li>\n<\/ol>\n<\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Pull data from these sites and organize in Access Database.<\/span>\n<ol>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">If no date is found, record is excluded from data set.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Consult JSTOR subject page to determine appropriate classification, if necessary (e.g. 
Sustainability includes \u201cenergy,\u201d \u201cenergy policy,\u201d etc.)<\/span><\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction Academic publishing is a large, lucrative business. From textbooks to journal articles and many formats in between, the publishing and distribution of scholarly materials rake in\u00a0over twenty-five\u00a0billion\u00a0dollars a year (Bluestone, 2015). I have the pleasure of working for a giant in the industry, ITHAKA Harbors, Inc. Specifically, I work for a subsidiary called JSTOR,&hellip;<\/p>\n","protected":false},"author":225,"featured_media":7215,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1],"tags":[],"coauthors":[],"class_list":["post-7212","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-visualization"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paBdcV-1Sk","_links":{"self":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/7212","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/users\/225"}],"replies":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/comments?post=7212"}],"version-history":[{"count":0,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/7212\/revisions"}],"wp:feature
dmedia":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/"}],"wp:attachment":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media?parent=7212"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/categories?post=7212"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/tags?post=7212"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/coauthors?post=7212"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}