{"id":8589,"date":"2018-03-01T00:23:15","date_gmt":"2018-03-01T05:23:15","guid":{"rendered":"http:\/\/studentwork.prattsi.org\/infovis\/?p=8589"},"modified":"2018-03-03T22:43:06","modified_gmt":"2018-03-04T03:43:06","slug":"info-vis-lab-2-tableaupublic","status":"publish","type":"post","link":"https:\/\/studentwork.prattsi.org\/infovis\/labs\/info-vis-lab-2-tableaupublic\/","title":{"rendered":"Info Vis Lab 2: TableauPublic"},"content":{"rendered":"<p><a href=\"https:\/\/public.tableau.com\/profile\/kevin.cosenza#!\/vizhome\/IVLab2-MortalityData\/Dashboard1\">Link to the Visualization<\/a><\/p>\n<p>For this lab the goal was to twofold: to become familiar with OpenRefine and TableauPublic, and to create more complex visualizations using cleaned data. The first visualization is \u201cDeaths by Gender and Decade,\u201d which is a bar graph that displays the total deaths for each decade, split by male and female. The second visualization is \u201cDeath Totals by Year,\u201d which displays the death totals for each year in a dot chart. The final visualization is \u201cDeaths by Age Group,\u201d which shows the number of deaths for each age group (in intervals of ten years) via a bar graph. These visualizations are designed to display the data with different filters so that alternate trends can be recognized and analyzed.<\/p>\n<p>There were three visualizations that I used as inspiration. The first was \u201c<a href=\"https:\/\/blogs.cdc.gov\/nchs-data-visualization\/deaths-in-the-us\/\">Age-adjusted Death Rates and Life Expectancy at Birth<\/a>,\u201d based on NCHS data. This visualization showed the country\u2019s death rate from 1900-2013, with a filter for both Race and Sex. I used this as the basis for the \u201cDeaths by Gender and Decade\u201d data sorting, since I had the Sex data available. The second example I used was \u201c<a href=\"https:\/\/www.cdc.gov\/nchs\/data-visualization\/mortality-leading-causes\/\">Age-adjusted Death Rates for All Causes: United States, 1999\u20132015<\/a>,\u201d which came from the CDC. I liked the way this dot chart looked when using yearly data over a period of time, which is how the \u201cDeath Totals\u201d data looks when filtered to a single decade. The third visualization is \u201c<a href=\"https:\/\/vizhub.healthdata.org\/mortality\/5q0-analysis\">Under-5 Mortality Rate, 2016<\/a>\u201d from healthdata.org. The age range visualizations in the top right section made me realize that I could group my age data, therefore making it easier to parse.<\/p>\n<p>In order to create these visualizations, the original dataset had to be converted to a CSV file and then purged of unnecessary columns in OpenRefine. The data is from <a href=\"http:\/\/www.mortality.org\/cgi-bin\/hmd\/country.php?cntr=USA&amp;level=1\">mortality.org<\/a>, and it is a record of all the reported deaths in the United States from 1933-2014, with the subject\u2019s age, sex, and year of death. TableauPublic was then used to create the three visualizations.<\/p>\n<p>I chose these three visualizations because I wanted to display each of the facets that was included in the dataset. \u201cDeaths by Gender and Decade\u201d uses the Year and the Sex facets, \u201cDeath Totals by Year\u201d uses just the Year facet and the raw death count, and \u201cDeaths by Age Group\u201d uses the death count and age facet (though the ages are grouped by decade).<\/p>\n<p>I had some trouble with the \u201cDeath Totals by Year\u201d visualization, mainly because the dataset originally had data for the year 2015. This data skewed the entire set because the year contained 640+ records, whereas the rest of the years didn\u2019t have more than 450 records each. This resulted in a jump of over 2 million deaths, which heavily altered the visualization\u2019s range Once I removed the 2015 data, the scale become much more reasonable. I also had issues with the \u201cDeaths by Gender and Decade,\u201d which was due to the unintuitive nature of the facets. I repeatedly tried to drag the Color filter over to the facets, instead of the other way around.<\/p>\n<p>When it came to the \u201cDeaths by Age Group\u201d visualization, there were many entries that had a +\/-5-year margin of error and were therefore designated as occurring at the midpoint of their perspective decades (i.e. 35, 45, 55, etc.). Since this would skew the data if it was presented in individual years, I grouped the data by decade, making it more accurate and easier to understand.<\/p>\n<p>Overall, I found cleaning up my data and columns in OpenRefine useful, but I had to go back to it a few times once I started visualizing my data in TableauPublic. It was frustrating to have to go back and re-upload the data to TableauPublic after I removed the 2015 data, especially since it wasn\u2019t clear that it would be an issue even after I looked at the data in OpenRefine.<\/p>\n<p>In terms of usability, I often found TableauPublic to be incomprehensible and nonsensical. Instead of being able to rearrange data or filters, I ended up simply undoing steps until I was back to before the problem occurred. In one instance, I wanted to change the labeling of the Year axis to be listed by decades instead of individual years. This proved to be impossibly difficult, as there was no way to edit the axis without right-clicking on it. In this case, however, the years were so packed together that there was no space on the axis that didn\u2019t correspond (when clicked) to a specific year. It turned out that the problem was that the Year data was set to \u201cdiscrete\u201d instead of \u201ccontinuous,\u201d so the program believed there to be distinct and unchangeable breaks between each year. I mainly focused on watching the tutorial videos on how to clean up and then input data, and I wish I\u2019d spent more time looking at how to actually create the visualizations.<\/p>\n<p>One feature I did like is the ability to use a visualization as a filter. By filtering my data via \u201cDeaths by Gender and Decade,\u201d I was able to notice the drop in child mortality (from ages 0-9) that occurs as the decades pass, even as the total deaths rise (See Fig. 1 and Fig. 2).<\/p>\n<div id=\"attachment_8783\" style=\"width: 278px\" class=\"wp-caption aligncenter\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-8783\" class=\"wp-image-8783 size-medium\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/03\/TableauPublic-Lab1.png?resize=278%2C300\" alt=\"\" width=\"278\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/03\/TableauPublic-Lab1.png?resize=278%2C300&amp;ssl=1 278w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/03\/TableauPublic-Lab1.png?w=508&amp;ssl=1 508w\" sizes=\"auto, (max-width: 278px) 100vw, 278px\" \/><p id=\"caption-attachment-8783\" class=\"wp-caption-text\">Fig. 1: 1930&#8217;s Data<\/p><\/div>\n<div id=\"attachment_8784\" style=\"width: 268px\" class=\"wp-caption aligncenter\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-8784\" class=\"wp-image-8784 size-medium\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/03\/TableauPublic-Lab2.png?resize=268%2C300\" alt=\"\" width=\"268\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/03\/TableauPublic-Lab2.png?resize=268%2C300&amp;ssl=1 268w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/03\/TableauPublic-Lab2.png?w=489&amp;ssl=1 489w\" sizes=\"auto, (max-width: 268px) 100vw, 268px\" \/><p id=\"caption-attachment-8784\" class=\"wp-caption-text\">Fig. 2: 1970&#8217;s Data<\/p><\/div>\n<p>Ultimately, while I\u2019m quite happy with the visualizations produced in TableauPublic, I need a lot more time to learn its quirks and shortcuts so that the next dashboard I create won\u2019t be as much of a grind.<\/p>\n<h4 style=\"text-align: center\">Visualization Examples<\/h4>\n<p><u><a href=\"https:\/\/vizhub.healthdata.org\/mortality\/5q0-analysis\">https:\/\/vizhub.healthdata.org\/mortality\/5q0-analysis<\/a><\/u><\/p>\n<p><u><a href=\"https:\/\/www.cdc.gov\/nchs\/data-visualization\/mortality-leading-causes\/\">https:\/\/www.cdc.gov\/nchs\/data-visualization\/mortality-leading-causes\/<\/a><\/u><\/p>\n<p><u><a href=\"https:\/\/blogs.cdc.gov\/nchs-data-visualization\/deaths-in-the-us\/\">https:\/\/blogs.cdc.gov\/nchs-data-visualization\/deaths-in-the-us\/<\/a><\/u><\/p>\n<h4 style=\"text-align: center\">Data Source<\/h4>\n<p><u><a href=\"http:\/\/www.mortality.org\/cgi-bin\/hmd\/country.php?cntr=USA&amp;level=1\">http:\/\/www.mortality.org\/cgi-bin\/hmd\/country.php?cntr=USA&amp;level=1<\/a><\/u><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Link to the Visualization For this lab the goal was to twofold: to become familiar with OpenRefine and TableauPublic, and to create more complex visualizations using cleaned data. The first visualization is \u201cDeaths by Gender and Decade,\u201d which is a bar graph that displays the total deaths for each decade, split by male and female.&hellip;<\/p>\n","protected":false},"author":227,"featured_media":8783,"comment_status":"open","ping_status":"open","sticky":false,"template":"template-centered.php","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[149],"tags":[115,154,155,156],"coauthors":[],"class_list":["post-8589","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-labs","tag-data-visualization","tag-lab-report","tag-openrefine","tag-tableaupublic"],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2018\/03\/TableauPublic-Lab1.png?fit=508%2C548&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paBdcV-2ex","_links":{"self":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/8589","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/users\/227"}],"replies":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/comments?post=8589"}],"version-history":[{"count":7,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/8589\/revisions"}],"predecessor-version":[{"id":8901,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/8589\/revisions\/8901"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media\/8783"}],"wp:attachment":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media?parent=8589"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/categories?post=8589"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/tags?post=8589"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/coauthors?post=8589"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}