MOMA Analysis

July 23, 2020 / 0 comments

Dataset: My project is the MOMA archive analysis report. I get a JSON file from MOMA GitHub. It has many different variables, ‘title’, ‘Nationality’, ‘Gender’, ‘medium’, etc. Each class has those variables. Interest: I’m curious about which artist has the most amount of works collected by the MOMA library? What material is the most popular…

Read more →

Lyrics Sentiment Analysis

July 22, 2020 / 0 comments

For this final project, I conducted a sentiment analysis on the top 40 songs from five of my favorite artists – Drake, Ariana Grande, Mariah Carey, Ne-Yo, and Rihanna. A sentiment analysis is the process of determining wether writing is positive, negative, or neutral. I thought that it would be particularly interesting to see how…

Read more →

Hard to Count Populations & US Census

July 22, 2020 / 0 comments

This project uses publicly available Census Bureau data to explore the correlation between response rate and resource allocation as tied to the decennial U.S. Census and American Community Survey (ACS), with an eye specifically on different populations at the state and then national level. Of course, the process of allocating funds is complex, with many…

Read more →

Duterte’s Drug War

July 22, 2020 / 0 comments

This project examines data provided by the Columbia University School of Journalism’s Stabile Center for Investigative Journalism. Compelled by the lack of available information on EJKs by the Philippine government, Stabile Center attempted to verify deaths under the anti-drug campaign and released their data and research as a downloadable Microsoft excel files for three cities,…

Read more →

Analysing sneakers sales in StockX

July 22, 2020 / 0 comments

GOALS: To understand the popularity, price stability and premium value of the Adidas Yeezy and Nike Jordan Team Size: 1 Time Frame: 2 weeks Tools: Python, Selenium, MongoDB, Flask, Plotly, (Heroku) My Role: Data Engineering, Data Visualization If the answer is YES; you’re interested in reselling or buying; Here’s the interesting findings in StockX where…

Read more →

Pop science media & scholarly discourse

July 22, 2020 / 0 comments

This project was done in conjunction with INFO 628 Data Librarianship. It takes the popular science podcast Radiolab and converts the entities mentioned in the podcast to RDF to analyze whether Linked Open Data could be used to connect the general public with scholarly research at the places where they are already accessing it..

Asian Restaurants Price Map

July 21, 2020 / 0 comments

Project Description The Project attempts to create maps of three types of restaurants in the New York City based on their price settings. And analyze is based on what reflect on the maps. All the data sources are requested through the Yelp API. The project has two parts: a Python script that makes an API…

Read more →

Automated Image Classification and the Metropolitan Museum

July 21, 2020 / 0 comments

A visual, and quantitative analysis of Google Vision, Amazon Rekognition, and Human generated tags for the collection highlights of the Metropolitan Musuem of New York. With artificial intelligences’ increasing influence over the way we search for and interpret content in digital domains it can be difficult to dissect where the qualities of labels and associations…

Read more →

Mapping the 2018 Squirrel Census of Central Park

July 21, 2020 / 0 comments

“It Seems I Hit the Squirrel Jackpot!” is a project that explores data from the 2018 Squirrel Census of Central Park. Census takers counted every single squirrel in park and recorded a number of observations that, when put together, provide a comprehensive picture of both Central Park’s squirrels and its people. This project, which was…

Read more →

Mapping NYC Music

July 21, 2020 / 0 comments

Music has such a storied role in the culture of New York City, particularly live music. Streaming music services started to flourish around 2015 and have since become the primary way people listen to music. We initially created this project to explore what streaming services were trending on a national level compared to what artists…

Read more →

SAAM API Artists

July 21, 2020 / 0 comments

The SAAM API Artists Project extrapolates demographic data from the American Art API in the hopes of illuminating the work of under represented artists. Using the .get request I gathered information from the API (GetSAAMData.py) and transformed it into structured data (SAAMFinal.py). The next step in my project was an attempt to gather ethnicity or…

Read more →

Dungeons & Python

July 20, 2020 / 0 comments

A project using Python programming to create a 5th Edition Dungeons & Dragons character generator. Dungeons & Dragons is a tabletop roleplaying game in which each player creates a character with a combination of traits, skills, and equipment to generate a unique game-playing experience. The customization options are nearly endless — and at times, that…

Read more →

21st-Century Collecting Trends

July 20, 2020 / 0 comments

This project uses Python to retrieve and filter object metadata for the collections of three different art museums in the United States: The Metropolitan Museum of Art in New York City; The Cleveland Museum of Art; and the Harvard Art Museums in Cambridge, MA. The aim of the project is to provide flat data for…

Read more →

Oscar Winners of 2011-2020

July 20, 2020 / 0 comments

In this project, I used the data of The Oscar Award, 1927 – 2020, and Movie Database (IMDB Alternative) to make visualizations of Oscar winners of 2011-2020. Number of Oscars and IMDB rating are both good indicators of the qualification of a film, and therefore the visualizations I created could give the overall ranking of…

Read more →

Planned Construction and Airbnb

July 20, 2020 / 0 comments

This project was carried out to function as a tool for analyzing and geospatially visualizing planned construction in New York City in terms of potential future Airbnb revenue. The end product could be used either internally by Airbnb, or externally by the City of New York, financial companies, or other stakeholders. For example, Airbnb could…

Read more →

Time Series of Stocks

July 20, 2020 / 0 comments

This project compares the stock prices of four companies i.e “Facebookâ€, “Amazonâ€, “Google†and “Netflix†for the last two years. Through the Yahoo Finance API all the historical data was called to get the “dateâ€, “open priceâ€, “close priceâ€, “highâ€, “lowâ€, “volume†and “adjusted close†for the four companies which was then saved as json…

Read more →

Women in Movies

July 19, 2020 / 0 comments

There is a common situation that in movies, female are more frequently to be set as secondary characters. The existing movie dataset shows that the percentage of women speaking characters is far less than men, similar situation as directors, writers, and producers. I want to make comparative visualization to give people more intuitive information..

Just Keep Swimming

July 19, 2020 / 0 comments

The original goal of this project was to map the movement of sea turtles. I was inspired by an article that I read a few years ago in Nautilus magazine, Where the Wild Things Go. Oliver Uberti wrote the article and also created the gorgeous illustrated maps of elephant and sea turtle migration data. “To…

Read more →

Building a Seed Library Catalogue

July 19, 2020 / 0 comments

Some public libraries are adding seed libraries: patrons can request a seed, usually for an edible plant, from the library’s catalog, grow it, and then “return” the seeds by donating seeds saved from the mature plant. Unlike most library materials, these seeds often don’t have structured metadata; a list of seeds may just be available…

Read more →

Auto-Poems

July 19, 2020 / 0 comments

With this project, I wanted to create new collage poems from a txt file of all of the poems from the Poetry Foundation site. The code will pull random lines from the txt file and write it to a new file. Run the code as many times as you’d like to create a poem of…

Read more →