{"id":24824,"date":"2021-07-05T23:52:20","date_gmt":"2021-07-06T03:52:20","guid":{"rendered":"https:\/\/studentwork.prattsi.org\/infovis\/?p=24824"},"modified":"2021-07-05T23:56:55","modified_gmt":"2021-07-06T03:56:55","slug":"mens-tennis-stars-1991-2016","status":"publish","type":"post","link":"https:\/\/studentwork.prattsi.org\/infovis\/labs\/mens-tennis-stars-1991-2016\/","title":{"rendered":"Men\u2019s Tennis Stars: 1991-2016"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"840\" height=\"473\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/Rafael-Nadal-Novak-Djokovic-and-Roger-Federer-all-breeze-into-Wimbledon-quarter-finals-7.jpg?resize=840%2C473&#038;ssl=1\" alt=\"\" class=\"wp-image-24825\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/Rafael-Nadal-Novak-Djokovic-and-Roger-Federer-all-breeze-into-Wimbledon-quarter-finals-7.jpg?w=1024&amp;ssl=1 1024w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/Rafael-Nadal-Novak-Djokovic-and-Roger-Federer-all-breeze-into-Wimbledon-quarter-finals-7.jpg?resize=300%2C169&amp;ssl=1 300w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/Rafael-Nadal-Novak-Djokovic-and-Roger-Federer-all-breeze-into-Wimbledon-quarter-finals-7.jpg?resize=768%2C432&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/Rafael-Nadal-Novak-Djokovic-and-Roger-Federer-all-breeze-into-Wimbledon-quarter-finals-7.jpg?resize=800%2C450&amp;ssl=1 800w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/Rafael-Nadal-Novak-Djokovic-and-Roger-Federer-all-breeze-into-Wimbledon-quarter-finals-7.jpg?resize=320%2C180&amp;ssl=1 320w\" sizes=\"auto, (max-width: 
840px) 100vw, 840px\" \/><figcaption>The Big Three in men&#8217;s tennis. Source: <a href=\"https:\/\/www.sanantoniobasketballcourts.com\/court-builder-news\/the-big-three-in-men-s-tennis-who-retires-first\" target=\"_blank\" rel=\"noreferrer noopener\">Grand Slam Courts<\/a><\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">Introduction<\/h2>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">Every year, it seems like it\u2019s the same handful of tennis players vying for the championship of the four major tournaments. I\u2019m not a big tennis fan, but I could easily list out the top five or so tennis players in the world, only because they haven\u2019t changed in the past two decades \u2014 or so it seems.<\/p>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">Sports data lends itself easily to network visualizations: it\u2019s often the same players or same teams playing together in a tournament or championship series. And having rounds of elimination means that the top players amass connections\/edges every time they proceed to the next round. By mapping out the men\u2019s tennis matches from the past few decades (1991-2016), I would be able to quickly identify the dominant players who have demonstrated remarkable career longevity.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Methods<\/h2>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">The dataset I found comes from a larger source of <a href=\"https:\/\/datahub.io\/sports-data\/atp-world-tour-tennis-data\">ATP World Tour data on Data Hub<\/a>. The data comes directly from the ATP Tour website, and is organized by years\/decades: 1877-1967, 1968-1990, 1991-2016, and 2017 (when it was last updated). I am only focusing on the 1991-2016 dataset, since those are the players that I know best. 
(Note: The ATP dataset only includes men\u2019s tennis, not women\u2019s \u2014 it would be interesting to compare the two, however.)<\/p>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">First, I cleaned and prepped the dataset so it was ready to be imported into Gephi. Using R, I removed all columns except for three: Winner, Loser, and Tournament. Then I renamed the \u201cWinner\u201d and \u201cLoser\u201d columns to \u201cSource\u201d and \u201cTarget,\u201d respectively. I also examined the \u201cTournament\u201d variable and saw that it contained 125 different values. I left those in for now and quickly imported the table into Gephi just to see what kind of result I would get. I made it a directed graph, ranking by out degree to visualize the winners \u2014 using both size and color of node to indicate significance.<\/p>\n\n\n\n<figure class=\"wp-block-image alignwide size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"840\" height=\"719\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all-1024x877.png?resize=840%2C719&#038;ssl=1\" alt=\"\" class=\"wp-image-24826\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all.png?resize=1024%2C877&amp;ssl=1 1024w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all.png?resize=300%2C257&amp;ssl=1 300w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all.png?resize=768%2C658&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all.png?resize=1536%2C1315&amp;ssl=1 1536w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all.png?resize=2048%2C1754&amp;ssl=1 2048w, 
https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all.png?resize=800%2C685&amp;ssl=1 800w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all.png?resize=210%2C180&amp;ssl=1 210w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_directed_all.png?w=1680 1680w\" sizes=\"auto, (max-width: 840px) 100vw, 840px\" \/><figcaption>Out-degree graph of dataset containing all tournaments.<\/figcaption><\/figure>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">Sure enough, the results showed some predictable names (Roger Federer, Andre Agassi) but also a lot of surprises (Tommy Haas? Lleyton Hewitt as the central figure?). It was also odd to see someone like Pete Sampras represented in a smaller node than someone like Carlos Moya.<\/p>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">I went back to the dataset and filtered it down to just the four major tournaments: U.S. Open, Australian Open, Wimbledon, and French Open (Roland Garros). There are a lot of small tournaments (in Beijing, Bangkok, Tel-Aviv, Las Vegas, etc.) that the major players simply don\u2019t participate in, and including those matches alongside the majors would skew the results.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Results<\/h2>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">With this smaller dataset, I now have 1355 nodes and 16099 edges. The average degree per node is 11.881, the network diameter is 10, and the graph density is 0.009, or only 0.9%, which is pretty low. 
I created two graphs with directed edges: one for out degree (to see how many times a player has won) and one for in degree (to see how many times a player has lost).<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"630\" height=\"1024\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners-630x1024.png?resize=630%2C1024&#038;ssl=1\" alt=\"\" class=\"wp-image-24828\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?resize=630%2C1024&amp;ssl=1 630w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?resize=185%2C300&amp;ssl=1 185w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?resize=768%2C1248&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?resize=945%2C1536&amp;ssl=1 945w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?resize=1260%2C2048&amp;ssl=1 1260w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?resize=800%2C1300&amp;ssl=1 800w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?resize=111%2C180&amp;ssl=1 111w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?w=1786&amp;ssl=1 1786w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_out-degree_winners.png?w=1680 
1680w\" sizes=\"auto, (max-width: 630px) 100vw, 630px\" \/><figcaption>Out-degree graph of dataset with the four major tournaments.<\/figcaption><\/figure>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">The out-degree graph now makes much more sense. As suspected, the prominent nodes are the \u201cbig three\u201d of men\u2019s tennis: Federer, Nadal, and Djokovic. These three have dominated men\u2019s tennis since 2003 (through today!). Federer seems to be the top player of the group, represented by a larger node, and the close proximity of Nadal and Djokovic seems to indicate that they often play each other; they are also known to be each other\u2019s primary rivals.<\/p>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">Because this dataset starts in 1991, the graph also shows the main players of the 1990s and early 2000s, namely Andre Agassi and Pete Sampras, found in the lower left section of the graph. Other nodes in the region all belong to that same era: Michael Chang, Todd Martin, etc.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"576\" height=\"1024\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers-576x1024.png?resize=576%2C1024&#038;ssl=1\" alt=\"\" class=\"wp-image-24829\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers.png?resize=576%2C1024&amp;ssl=1 576w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers.png?resize=169%2C300&amp;ssl=1 169w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers.png?resize=768%2C1364&amp;ssl=1 768w, 
https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers.png?resize=865%2C1536&amp;ssl=1 865w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers.png?resize=1153%2C2048&amp;ssl=1 1153w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers.png?resize=800%2C1421&amp;ssl=1 800w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers.png?resize=101%2C180&amp;ssl=1 101w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_in-degree_losers.png?w=1690&amp;ssl=1 1690w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><figcaption>In-degree graph of dataset with the four major tournaments.<\/figcaption><\/figure>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">The in-degree graph shows which players competed often in these four major tournaments but lost the most matches. Here, no single figure really stands out; most of the nodes seem to be roughly the same small\/medium size. 
It\u2019s noticeable, however, how much the prominent figures from the out-degree graph have now shrunk into much smaller nodes.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"566\" height=\"1024\" src=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1-566x1024.png?resize=566%2C1024&#038;ssl=1\" alt=\"\" class=\"wp-image-24831\" srcset=\"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?resize=566%2C1024&amp;ssl=1 566w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?resize=166%2C300&amp;ssl=1 166w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?resize=768%2C1389&amp;ssl=1 768w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?resize=849%2C1536&amp;ssl=1 849w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?resize=1132%2C2048&amp;ssl=1 1132w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?resize=800%2C1447&amp;ssl=1 800w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?resize=100%2C180&amp;ssl=1 100w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?w=1790&amp;ssl=1 1790w, https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/tennis_majors_by-major-1.png?w=1680 1680w\" sizes=\"auto, (max-width: 566px) 100vw, 566px\" \/><figcaption>Out-degree graph with edges color-coded 
by tournament. Purple = French Open; Green = US Open; Orange = Wimbledon; Blue = Australian Open.<\/figcaption><\/figure>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">I was also curious to see if I could partition and color-code the edges by tournament. I know from the dataset that each tournament accounts for roughly 25% of the total, so they\u2019re all evenly represented. The graph, however, did not give me any meaningful insight. From this visualization, it doesn\u2019t seem like any of the prominent players (Federer, Nadal, Djokovic, Agassi, Sampras) excelled at a particular tournament more than the other three.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Reflections<\/h2>\n\n\n\n<p class=\"has-dark-gray-color has-text-color\">Visualizing the matches of the four major tennis tournaments as a network allowed me to quickly identify the players that have dominated the sport (for men) since 1991. It also suggests that a tennis player\u2019s successful career can be long, very long \u2014 lasting even decades. Looking at the careers of each individual player, this seems to be true. In 2018, Federer won his 20th Grand Slam title at the age of 36, a record he now shares with Nadal. Djokovic trails closely behind at 19 titles. Serena Williams won her 23rd title at the age of 35. This may be due to advancements in medicine and equipment, as well as players training to play more skillfully rather than relying as much on their athleticism.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction Every year, it seems like it\u2019s the same handful of tennis players vying for the championship of the four major tournaments. 
I\u2019m not a big tennis fan, but I could easily list out the top five or so tennis players in the world, only because they haven\u2019t changed in the past two decades \u2014&hellip;<\/p>\n","protected":false},"author":3036,"featured_media":24825,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[149,342],"tags":[],"coauthors":[1389],"class_list":["post-24824","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-labs","category-networks"],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/studentwork.prattsi.org\/infovis\/wp-content\/uploads\/sites\/3\/2021\/07\/Rafael-Nadal-Novak-Djokovic-and-Roger-Federer-all-breeze-into-Wimbledon-quarter-finals-7.jpg?fit=1024%2C576&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paBdcV-6so","_links":{"self":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/24824","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/users\/3036"}],"replies":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/comments?post=24824"}],"version-history":[{"count":3,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/24824\/revisions"}],"predecessor-version":[{"id":24833,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/posts\/24824\/revisions\/24833"}],"wp:featuredmedia":[{"embeddable":tru
e,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media\/24825"}],"wp:attachment":[{"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/media?parent=24824"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/categories?post=24824"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/tags?post=24824"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infovis\/wp-json\/wp\/v2\/coauthors?post=24824"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}