Marvel Social Network Visualization


Lab Reports, Networks

Introduction

Marvel started in 1939 as Timely Publications, and by the early 1950s, had generally become known as Atlas Comics. The Marvel branding began in 1961, the year that the company launched The Fantastic Four and other superhero titles created by Steve Ditko, Stan Lee, Jack Kirby and many others.

Marvel counts among its characters such well-known superheroes as Spider-Man, Iron Man, Captain America, Thor, the Hulk, Captain Marvel, Black Panther, Deadpool, Doctor Strange, Wolverine, Daredevil, Ghost Rider and the Punisher, such teams as the Avengers, the X-Men, the Fantastic Four, the Inhumans and the Guardians of the Galaxy, and antagonists including Thanos, Doctor Doom, Apocalypse, Magneto, Red Skull, Green Goblin, Ultron, Doctor Octopus, Loki, Kingpin and Venom. Most of Marvel’s fictional characters operate in a single reality known as the Marvel Universe, with most locations mirroring real-life places; many major characters are based in New York City.

This visualization will present an overview of Marvel’s superhero networks.

 

Materials

  1. The Marvel Social Network Gephi file – This network of superheroes was constructed by Cesc Rosselló, Ricardo Alberich, and Joe Miro from the University of the Balearic Islands. The data was collected by Infochimps and transformed and enhanced by Kai Chang.
  2. Gephi – It is an open-source software for network visualization and analysis. It helps data analysts to intuitively reveal patterns and trends, highlight outliers and tells stories with their data. It uses a 3D render engine to display large graphs in real-time and to speed up the exploration.

 

Inspiration

After choosing the dataset, I found this fan-made visualization which is also about the Marvel networking mining. Closeness Centrality and Yifan Hu algorithm is applied to this network. It distances the communities that are independent to the network.  Main characters, or characters with more connections, have bigger nodes. And node colors indicate different superhero communities. 

Link to Graph: https://www.slideshare.net/MuthuArasan/marvel-graph-network-mining

Method

Dataset:

I found the data set on Gefi wiki. I chose it because I’ve personally very interested in the subject. The problem with it is the dataset is too huge that Gephi tends to crash quite often.

Gephi:

  • Filter: The dataset is very huge and since I mainly want to present the more relative characters, I used the filter function and exclude the characters that has degree lower than 10 (The range of degree for this dataset is from 1 to 2189).
  • Layout: I first chose to apply the ForceAtlas2 as the layout. Although “Dissuade Hubs” and “Prevent Overlap” were applied to centralize the key characters(nodes with higher degrees) and prevent the overlap between so many different characters, somehow the main characters were always seemed to be drifting away from the centre. So I tried to use the Yifan Hu algorithm. To avoid the nodes from overlapping, I set the Optimal Distance to 9000.0. In my experience, Yifan Hu is easier and faster to run while Gephi crashed a lot times trying to run ForceAtlas2.
  • Node Size: I adjusted the node size by their degrees. Nodes with bigger size corresponds to the key players like Iron Man, Captain America, Spiderman etc.
  • Node Color: I adjusted the node color by Modularity Class. Different superhero groups are distributed with different colors. Avengers-centered characters are green, Spiderman-centered characters are blue, Doctor Strange-centered characters are pink, X-Men are purple, Fantastic Four are black.

 

Results

This is the final visualization of the Marvel social networks.

 

From the graph we can see the most important characters are Captain America, Spiderman, Iron Man,Wolverine, and Mr. Fantastic. We can also see how the avengers community–Captain America, Iron Man, Thor and Spiderman are highly connected while the X-Men group and the Fantastic Four are relatively separated.

Closeness Centrality shows which nodes are connected to other nodes that have high degree. It’s a measure of influence of a node in a network. According to this, the highest influencer in the network is Captain America followed by Spider-man.

 

Reflection

I still have a lot of confusions about how to use Gephi. I hope I could explore more about it and thus be able to better understand the complicated functions.