Visualisation of Airbnb data in Paris


Visualization

Introduction

This visualisation is an exploration of the Airbnb data in Paris. It analysis the spread of the Airbnb’s as per locality and then further explores proximity to the metro as a parameter. The overlay of the road network map gives idea of proximity to roads and not just the metro. Finally the overlay of the arrondissements map gives an idea of the spread of these Airbnb’s across different localities with transportation as a key data point.

Road Network Paris
Airbnb and metro proximity

Airbnb data with metro proximity and road network

Dataset

Airbnb csv data for Parishttps://www.kaggle.com/datasets/thedevastator/airbnb-prices-in-european-cities

Paris road network shapefilehttps://geodata.lib.utexas.edu/catalog/stanford-db591kz1574

Paris arrondissements shapefilehttps://data.metabolismofcities.org/library/maps/38838/

The data sheet explores Airbnb data in Europe in terms of prices and has city-wise data available as csv files. The data was accessed using Kaggle. The datasheet was a fairly tidy one and didn’t require much cleaning. The data covers the following attributes.

Column nameDescription
realSumThe total price of the Airbnb listing. (Numeric)
room_typeThe type of room being offered (e.g. private, shared, etc.). (Categorical)
room_sharedWhether the room is shared or not. (Boolean)
room_privateWhether the room is private or not. (Boolean)
person_capacityThe maximum number of people that can stay in the room. (Numeric)
host_is_superhostWhether the host is a superhost or not. (Boolean)
multiWhether the listing is for multiple rooms or not. (Boolean)
bizWhether the listing is for business purposes or not. (Boolean)
cleanliness_ratingThe cleanliness rating of the listing. (Numeric)
guest_satisfaction_overallThe overall guest satisfaction rating of the listing. (Numeric)
bedroomsThe number of bedrooms in the listing. (Numeric)
distThe distance from the city centre. (Numeric)
metro_distThe distance from the nearest metro station. (Numeric)
lngThe longitude of the listing. (Numeric)
latThe latitude of the listing. (Numeric)
Data Attributes Airbnb

The shape files for mapping were obtained from 2 different sources, one with the roadway mapping and one with the region/arrondissement mapping.

Tools -used

Tools that were used to make this visualisation were Excel, Open Refine, QGIS and Figma. The report was made and published using WordPress.

Process

The making of this visualisation required tools like OpenRefine, excel, QGIS and Figma. Each tool served a separate purpose

Research

The first phase of my process involved researching for data sources that had geographic attributes as a key parameter. I started looking at housing data and then found travel and bnb data in this domain which further intrigued my interest.

My second phase of research involved looking for open-source shapefiles for maps of Paris and I came across a road-network file which I thought could add a lot of perspective to my visualisation and finally found a locality map as well.

Open-Refine- Data Cleaning

I used Open Refine to clean the data. The data was fairly clean, hence I just exported it as a csv

QGIS Data Representation

I had to import the csv in QGIS and add layers of the road network shape file and arrondissements shape file.

Visualisations & Observations

1. Airbnb locations – I overlaid the Airbnb locations over the arrondissement map to get a visualisation of the spread. I used dots to represent the location
2. Metro proximity – I used an expression to highlight metro areas and specifically mark airbnb’s with proximity less than 200 mts to the metro.
if( "metro_dist" <0.2, "metro_dist",0)

3. Metro proximity and road network – I overlaid the above mappings on the road-network map by exporting files in the same dimensions and using Figma to overlay them.

Reflection and Critique

Limitations

QGIS is an open-source software with documentation that is quite detailed for a few cases and minimal for a few others. The software sometimes crashes and I had challenges with importing .shp files. It also sometimes renders .shx in different co-ordinates which again was a challenge while producing this visualisation.

Positives

QGIS is a good tool for geo-spatial visualisations, simple expressions can help generate a lot of insightful data. The layering feature is quite powerful for geographic visualisation of data.

Peer Critique and changes

My final chart had some issues with overlay and it was suggested to me that I find a way to overlay a map without which the visualisation looks incomplete. I worked on it using Figma as a tool and I also eventually found a locality map which matched the co-ordinate system of my data thus improving the visualisation.

Bibliography

Airbnb Prices in European Cities. (n.d.). Retrieved April 11, 2023, from https://www.kaggle.com/datasets/thedevastator/airbnb-prices-in-european-cities

Vasserot Road Network (1810-1836) – University of Texas Libraries GeoData. (n.d.). Retrieved April 11, 2023, from https://geodata.lib.utexas.edu/catalog/stanford-db591kz1574

Arrondissements de Paris | Metabolism of Cities Data Hub. (n.d.). Retrieved April 11, 2023, from https://data.metabolismofcities.org/library/maps/38838/

QGIS Tutorials and Tips — QGIS Tutorials and Tips. (n.d.). Retrieved April 11, 2023, from https://www.qgistutorials.com/en/index.html