Scraping for Library Jobs

February 3, 2019 / 0 comments

For this project, I used Python to attempt to scrape job listings from three popular professional associations’ websites: Society of American Archivists (SAA) American Library Association (ALA) Association for Information Science and Technology (ASIS&T;) I first scraped the websites for links to detailed job postings and stored the URLs in a JSON file. Then I…

Read more →

Images of the Solar System

February 3, 2019 / 0 comments

This project uses APIs to collect image records from cultural heritage and science institutions. I used DPLA’s API (https://pro.dp.la/developers/api-codex) and NASA’s Image and Video Library API (https://api.nasa.gov/api.html#Images). I’ve gathered content related to the plants of our solar system (including Pluto), sorted the results by planet, and pulled out relevant information about the image, including but…

Read more →

NYC HIV/AIDS Services and Facts Dashboard

February 3, 2019 / 0 comments

Project Overview This dashboard includes facts and figures related to NYC HIV/AIDS data available through NYC Open Data. This dashboard also includes charts that provide location and contact information for HIV testing sites and HIV/AIDS services locations across the city’s five boroughs. The facts and figures are only available for the years 2011 through 2015…

Read more →

Web Scraping of Turner Paintings

February 3, 2019 / 0 comments

Project Inspired by the Tate collection, which houses the largest collection of Joseph Mallord William Turner works, I created a timeline based on the artworks Turner produced while visiting the Isle of Wight, UK. The timeline follows the artist’s first trip to the island in 1975, as he made his way onto the north of…

Read more →

U.S. Documentaries, 1878 to 2017

February 3, 2019 / 0 comments

The goal of this project is to use data from IMDB to compare the growth of documentary versus non-documentary movies made in the United States from 1878 to 2017 and to compare the gender breakdown of different jobs on documentary crews. Currently, the initial data has been gathered to do the analysis but further cleaning…

Read more →

Stock Ticker Capital Appreciation Comparison

February 2, 2019 / 0 comments

The Objective of this projects was to use various coding languages and platforms to visualize data in a more digestible way by none tech savvy audiences. It is important to display financial information in an understandable manner. Stocks Tickers performance are a method of getting a clear view of a companies performance in real time….

Read more →

Some New Classics

February 2, 2019 / 0 comments

Countless lists of “100 Classics to Read” (or similar) exist for those works which have been defined as “capital L” Literature, works that stand the test of time. When one thinks about reading “the classics” as it pertains to literature, certain books and authors come to mind, many of which are books from white and/or…

Read more →

Exploring Chinese Traditional Medicine

February 2, 2019 / 0 comments

For this final project, the goal is to use python to perform web scraping and collect data that would generate meaningful visualizations. I chose to explore the topic of Traditional Chinese Medicine (CTM). I wanted to learn what is the most commonly used herb in all formulas that I could find online. I decided to…

Read more →

Where are the MTA Art Works?

February 2, 2019 / 0 comments

Brief This project is a simple Python web scraping and data mapping practice. It is an assignment of my Python class at Pratt Institute: Programing for Culture Heritage, instructed by Matthew Miller. This project was my first attempt to scrape the web for a raw data and to visualize it. In this project, I scraped…

Read more →

Sentiment Analysis of “#gene_editing” tweets

February 2, 2019 / 0 comments

The final project of Programming for Culture Heritage is an integrity project with API request, data scraping and cleaning, semantical analyze, save data into csv file, and data visualization by python and tableau. RRecently, A Chinese researcher used CRISPR – Cas9 technology that created the first gene-edited twin babies. CRISPR is easier to use and…

Read more →

NYC Park Monuments

February 1, 2019 / 0 comments

These scripts are designed to pull information from a publically available csv about New York park monuments and combine that information with data harvested from wikidata into a single file. The goal is to produce a file containing all this information that can be easily analyzed and used to expose interesting correlations between information about…

Read more →

Organizing Image Collections

February 1, 2019 / 0 comments

Blend Images is a commercial stock photography collection of approximately 100,000 images produced by over 200 photographers. This project explores how Python may be used as a tool to create separate sub-collections by searching string attributes, generating separate metadata for those collections, and moving or copying jpeg files from the original directory to new folders….

Read more →

IMDB visualization

February 1, 2019 / 0 comments

Python code using IMDB library to get the movies data from IMDB website, from 1980s to present. And try to use those data to create the timeline data visualization with the tableau. Sorry, your browser doesn’t support embedded videos..

Anxiety of Influence

February 1, 2019 / 0 comments

Reception of the Fantastic in 19th-Century Spain My objective with this project was to address the contentious issue of whether or not Spanish authors in the 19th century, had access to translations of non-Spanish language Fantastic texts. Specifically, I built a data set and used it to analyze frequency, location, and quantity of publications. The…

Read more →

Analyzing Changes in Citi Bike Trips after the Introduction of E-Bikes

February 1, 2019 / 0 comments

For my project, conducted data analysis on how Citi Bike trips have changed after Citi Bike introduced about 200 e-bikes on August 20th, 2018. To do so, first I grouped the docks into the neighborhoods in which the docks are located for the period before the introduction of e-bikes and the period after their introduction….

Read more →

Hopper at the Theatre

April 12, 2018 / 0 comments

This project investigates the cultural and theatrical landscape of Edward Hopper’s New York using linked open data (LOD) technologies. Leveraging the unparalleled collection of Hopper’s art and archival materials held by the Whitney Museum of American Art, Hopper at the Theatre contextualizes the artist’s personal and professional geographies towards the end of the interwar period…

Read more →

Time(s) Splitter

April 12, 2018 / 0 comments

Time(s) Splitter by Richard Goldstein, 2017 The Programing for Cultural Heritage course provided me with my first Python encounter. In the background of learning its basic programming syntax and functions, William Burroughs kept creeping into my mind with Python being a means to scrape, recompose, and clarify or give new meaning to text. Using the…

Read more →

contamiNation

April 12, 2018 / 0 comments

View Map What is this? What does it mean? This map was created in order to make localized data on lead contamination of United States public water sources available to the public. Such contamination stems from corrosion of old lead and copper plumbing fixtures that may be part of a building’s plumbing or the public…

Read more →

Spotlight on New York Vaudeville

April 12, 2018 / 0 comments

New York Public Library (NYPL) has embarked on a program to build an open database of the performing arts called Ensemble. Volunteers are transcribing theater playbills from the NYPL’s collection. As part of the program, 200 theater playbills from 1911-1922 have been transcribed and are available online. Each playbill has its own webpage that contains…

Read more →

Endangered Plants of Westchester and Fairfield Counties

April 12, 2018 / 0 comments

While it is easy to get lists of threatened and endangered plants by state, it is not so easy to do this by county. The Greenwich Land Trust has tasked me with providing them with a list of endangered plants on their lands (over 700 acres) in lower Fairfield county adjacent to southern Westchester county….

Read more →