{"id":1858,"date":"2015-12-01T02:58:04","date_gmt":"2015-12-01T02:58:04","guid":{"rendered":"http:\/\/listheory.prattsils.org\/?p=1858"},"modified":"2015-12-01T02:58:04","modified_gmt":"2015-12-01T02:58:04","slug":"sciencescapes-approach-to-big-data-in-the-realm-of-scientific-research","status":"publish","type":"post","link":"https:\/\/studentwork.prattsi.org\/foundations\/2015\/12\/01\/sciencescapes-approach-to-big-data-in-the-realm-of-scientific-research\/","title":{"rendered":"Sciencescape\u2019s Approach to Big Data in the Realm of Scientific Research"},"content":{"rendered":"<p>The current image of consumer technology, personal computers and front-end applications, is one of simplicity; small boxes that house smaller components and microchips, departing further and further away from the hulking, esoteric, push-pin and vacuum tube monoliths of the past. This is, of course, ideal, as the growingly tech minded society compels its citizens to participate more wholly within its self-designated bounds.<\/p>\n<p>&nbsp;<\/p>\n<figure style=\"width: 522px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/i.ytimg.com\/vi\/1Ve5u7jfxds\/maxresdefault.jpg\" alt=\"\" width=\"522\" height=\"294\" \/><figcaption class=\"wp-caption-text\">Even Google\u2019s server rooms express the hushed, confident, uniformity of its homepage<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400\">But this picture belies the massive data swarm on the other side of the front end. We might only catch glimpses of it through our interfaces, but Big Data is consuming the world, for good and ill.<\/span><\/p>\n<p><span style=\"font-weight: 400\">In their article \u201cCRITICAL QUESTIONS FOR BIG DATA,\u201d Danah Boyd &amp; Kate Crawford characterize the concept:<\/span><\/p>\n<blockquote><p><span style=\"font-weight: 400\">We define Big Data as a cultural, technological, and scholarly phenomenon that rests on the interplay of: <\/span><\/p>\n<p><span style=\"font-weight: 400\">(1) Technology: maximizing computation power and algorithmic accuracy to gather, analyze, link, and compare large data sets. <\/span><\/p>\n<p><span style=\"font-weight: 400\">(2) Analysis: drawing on large data sets to identify patterns in order to make economic, social, technical, and legal claims. <\/span><\/p>\n<p><span style=\"font-weight: 400\">(3) Mythology: the widespread belief that large data sets offer a higher form of intelligence and knowledge that can generate insights that were previously impossible, with the aura of truth, objectivity, and accuracy (663).<\/span><\/p><\/blockquote>\n<p><span style=\"font-weight: 400\">This interplay implies the Big Data approach to research and thought, where the potential of technology to facilitate, manipulate, process, and analyse torrents of data roots the belief that Big Data holds meaningful value in understanding the world thus constructed.<\/span><\/p>\n<p><i><span style=\"font-weight: 400\">Sciencescape<\/span><\/i><span style=\"font-weight: 400\"> very much falls under the banner of this paradigm: it\u2019s data set, the universe of scientific academic research and journal articles. I am intensely interested in the site, because of its attempt to organize, for our ease of access, a nebula that doubles in output every nine years (Van Noorden). To this end, the site claims to have indexed 24 million articles.<\/span><\/p>\n<p><span style=\"font-weight: 400\">This number, in itself, is not impressive, as big academic indexes are common. Instead, the novelty arises from <\/span><i><span style=\"font-weight: 400\">Sciencescape<\/span><\/i><span style=\"font-weight: 400\">\u2019s<\/span> <span style=\"font-weight: 400\">method of presentation and management. Here is a video from their <a href=\"https:\/\/sciencescape.org\/about\">About <\/a>section:<\/span><\/p>\n<p><iframe loading=\"lazy\" width=\"1000\" height=\"563\" src=\"https:\/\/www.youtube.com\/embed\/XaUKZf00ub8?feature=oembed\" frameborder=\"0\" allow=\"autoplay; encrypted-media\" allowfullscreen><\/iframe><\/p>\n<p><span style=\"font-weight: 400\">From this clip, it appears as though <\/span><i><span style=\"font-weight: 400\">Sciencescape <\/span><\/i><span style=\"font-weight: 400\">wishes to establish a web of scholarly data through computational algorithmic methods to pick out key concepts such as subjects, authors, institutions, <\/span><i><span style=\"font-weight: 400\">etc.<\/span><\/i><span style=\"font-weight: 400\">, as a means to order them in a way that exposes their topic, field of study, and impact. Big Data methodology for Big Data sets, <\/span><i><span style=\"font-weight: 400\">Sciencescape <\/span><\/i><span style=\"font-weight: 400\">works to provide meaningful orientation in the realm of science at a level that would seem too grand for more traditional approaches. The site offers free accounts to access to their database, so let us now explore the site\u2019s aspirations in their implementation.<\/span><\/p>\n<p><span style=\"font-weight: 400\">The way we will discover the functionality of the site is through a contemplation of the elements as they present themselves on the guided tour of <\/span><i><span style=\"font-weight: 400\">Sciencescape<\/span><\/i><span style=\"font-weight: 400\">. I do this not only to easily distinguish features of note, but as a means to peer into the minds of the site\u2019s creators; I want to get an idea of what they think as valuable, novel, and unfamiliar, in their approach to Big Data analysis.<\/span><\/p>\n<p><span style=\"font-weight: 400\">First, the overall presentation of the site before log-in feels very much like the now familiar startup interface (though the company\u2019s headquarters is in Toronto). <\/span><\/p>\n<p><a href=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1863\" src=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture.png\" alt=\"Capture\" width=\"1900\" height=\"932\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400\">Very professional and enticing toward potential users and investors alike, this web page contains all the standard links to the information about the project and those behind it. Clicking on the about tab allows a user to learn about the functionality of the site. Of particular note, the publishing partners sub-tab:<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture2.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-1864\" src=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture2-300x253.jpg\" alt=\"Capture2\" width=\"300\" height=\"253\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400\">From the about tab, we are also able to access the guided tour, which allows us to bypass log-in. <\/span><\/p>\n<p><b>PAPERS<\/b><\/p>\n<p><span style=\"font-weight: 400\">The first stop is a page for an individual paper:<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture31.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1866\" src=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture31.jpg\" alt=\"Capture3\" width=\"1919\" height=\"927\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400\">This particular web portal holds many interactive elements. For instance, we can follow the writers and journal, into the past and into the future when new publications are added. If the paper is available to us, then we can receive a .pdf. Other web 2.0 mainstays like \u201cAdd to Library,\u201d \u201cBroadcast,\u201d and \u201cBookmark\u201d also appear, as well as buttons to facebook, twitter, and email.<\/span><\/p>\n<p><span style=\"font-weight: 400\">At the bottom of the page, there are lists of citations to and from this specific paper, but also \u201crecommended readings\u201d and \u201cfields.\u201d I am not certain as to what factors go into the recommendations, perhaps relevancies with authors and topics, but the \u201cFields\u201d tab interests me the most. The list is populated with associated concepts: all of which expand through time, in order to encompass other articles.<\/span><\/p>\n<p><b>FIELDS<\/b><\/p>\n<p><span style=\"font-weight: 400\">The next main section of the tour presents an example of these \u201cFields.\u201d Displayed is the timeline associated with AIDS research:<\/span><\/p>\n<p><a href=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture4.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone  wp-image-1867\" src=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture4.jpg\" alt=\"Capture4\" width=\"737\" height=\"301\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400\">These field pages, I think, really display the greatest value in <\/span><i><span style=\"font-weight: 400\">Sciencescape<\/span><\/i><span style=\"font-weight: 400\">. This research timeline shows the sudden jump in scientific awareness of AIDS during the period of April-July 1983, when 825 citations mentioned it. The three months before show only 30 citations. This tool is immediately helpful in accounting for the history of the disease as science has seen it.<\/span><\/p>\n<p><span style=\"font-weight: 400\">The other significant value on this page is the \u201cEigenfactor,\u201d which <\/span><i><span style=\"font-weight: 400\">Sciencescape <\/span><\/i><span style=\"font-weight: 400\">uses to rate papers\u2019 impact. In the field of AIDS research, we see that a paper published in May 1983 influenced others to the greatest amount. <\/span><\/p>\n<p><span style=\"font-weight: 400\">The Eigenfactor is one of the defining characteristics of whole engine:<\/span><\/p>\n<blockquote><p><span style=\"font-weight: 400\">The Eigenfactor algorithms take into account not only the citations, but where the citations come from. \u00a0This is similar to how modern search engine algorithms rank websites. \u00a0A citation from a highly referenced journal, article or author carries more weight than one from a journal nobody has read (https:\/\/sciencescape.org\/eigenfactor).<\/span><\/p><\/blockquote>\n<p><span style=\"font-weight: 400\">With this factor, scientists have a potential metric for significance that accounts for the entire web of data, rather than just explicitly referenced citations.<\/span><\/p>\n<p><b>THE FINAL STOPS<\/b><\/p>\n<p><span style=\"font-weight: 400\">The last two sections of the tour deal with the facilities in place to allow someone to keep up to date on her research. She can follow elements, such as journals, universities, authors, and fields. New papers associated with these concepts then appear in her Feed as they are published. Also, an extra level of organization exists; called \u201cStreams.\u201d these groups can be constructed by the follower, as a means to further link the items of which she follows. She can sort by these streams to focus her attention on one area of association.<\/span><\/p>\n<p><a href=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture5.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1868\" src=\"http:\/\/listheory.prattsils.org\/wp-content\/uploads\/Capture5.jpg\" alt=\"Capture5\" width=\"1896\" height=\"648\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400\">Overall, the tour was promising to me, as an information scientist. Ideally, this specific conglomeration of social and semantic tools should increase relevancy and visibility over the extreme output of scientific research. Nevertheless, I worry about the implementation of such algorithms over such big data sets; I am not a bio-scientist, and so I cannot judge how well <\/span><i><span style=\"font-weight: 400\">Sciencescape<\/span><\/i><span style=\"font-weight: 400\">\u2019s metrics work in coordinating fields and impacts. Certainly, more inquiry is needed, but the concepts that underlie the site are compelling.<\/span><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p style=\"text-align: center\">Works Cited<\/p>\n<p style=\"text-align: left\">danah boyd &amp; Kate Crawford (2012) CRITICAL QUESTIONS FOR BIG DATA, Information, Communication &amp; Society, 15:5, 662-679,<\/p>\n<p style=\"text-align: left\">Van Noordan, Richard &#8220;Global Scientific Output Doubles Every Nine Years.\u201d <i>News blog<\/i>. Web. 30 Nov. 2015. &lt;http:\/\/blogs.nature.com\/news\/2014\/05\/global-scientific-output-doubles-every-nine-years.html&gt;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The current image of consumer technology, personal computers and front-end applications, is one of simplicity; small boxes that house smaller components and microchips, departing further and further away from the hulking, esoteric, push-pin and vacuum tube monoliths of the past. This is, of course, ideal, as the growingly tech minded society compels its citizens to [&hellip;]<\/p>\n","protected":false},"author":319,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[],"class_list":["post-1858","post","type-post","status-publish","format-standard","hentry","category-articles"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/posts\/1858","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/users\/319"}],"replies":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/comments?post=1858"}],"version-history":[{"count":0,"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/posts\/1858\/revisions"}],"wp:attachment":[{"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/media?parent=1858"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/categories?post=1858"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/foundations\/wp-json\/wp\/v2\/tags?post=1858"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}