Programming Assignment 2: Visualize Network Data
Parse the characters and movie information from the List of Star Wars Characters web page.
| Film | Frequency |
|---|---|
| The Clone Wars | 117 |
| undefined | 103 |
| Ep. I: The Phantom Menace | 61 |
| Ep. III: Revenge of the Sith | 58 |
| Ep. VI: Return of the Jedi | 57 |
| Ep. II: Attack of the Clones | 56 |
| Ep. VII: The Force Awakens | 49 |
| Rebels | 42 |
| Ep. IV: A New Hope | 36 |
| Ep. V: The Empire Strikes Back | 35 |
| Ep. VIII: The Last Jedi | 30 |
| Rogue One | 28 |
| Forces of Destiny | 14 |
| Solo: A Star Wars Story | 12 |
| The Clone Wars TV series | 3 |
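A frequency table like the one above could be produced by tallying how many characters are linked to each film. The following is a minimal sketch, assuming the scraped data is a list of (character, film) pairs and that each frequency is a count of distinct characters. The sample pairs and the name `film_frequencies` are illustrative and not taken from the original program.

```python
from collections import Counter

# Illustrative (character, film) pairs; the real list would come from the scraped page.
appearances = [
    ("Yoda", "Ep. I: The Phantom Menace"),
    ("Yoda", "The Clone Wars"),
    ("Character A", "The Clone Wars"),
    ("Character A", "Rebels"),
    ("Character B", "Ep. I: The Phantom Menace"),
    ("Character C", "The Clone Wars"),
    ("Character C", "The Clone Wars"),  # duplicate pair, counted once
]


def film_frequencies(pairs):
    """Return (film, character_count) tuples, most frequent film first."""
    unique_pairs = set(pairs)  # drop repeated character/film pairs
    counts = Counter(film for _, film in unique_pairs)
    # Sort by descending count, then by film title so ties are deterministic.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    print("| Film | Frequency |")
    print("|---|---|")
    for film, count in film_frequencies(appearances):
        print(f"| {film} | {count} |")
```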
The Star Wars Universe
What is the data set that you chose? Why? Inspired by Félix Luginbühl's Which Marvel Characters and Movies are the Most Central? (http://felixluginbuhl.com/network/), I chose to explore a similar question for Star Wars. Using Wikipedia's list of Star Wars characters (https://en.wikipedia.org/wiki/List_of_Star_Wars_characters), I wrote a program to scrape the page and identify each character and the movies associated with that character. Since my family are Star Wars fans, I also thought it would be an interesting and fun project.
Did you use a subset of the data? If so, what was it? Yes. From the web page, I used only the “Name” and “Portrayal” columns to extract the character and movie information.
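The column extraction could be sketched as follows. This is a minimal example using only Python's standard `html.parser` module, assuming the page data sits in an HTML table whose header row contains “Name” and “Portrayal” cells. The sample HTML, the class `TableRows`, and the function `name_portrayal_pairs` are hypothetical and stand in for the real page and the original program.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect each table row as a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None outside a row
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def name_portrayal_pairs(html):
    """Return (name, portrayal) tuples from a table with those header cells."""
    parser = TableRows()
    parser.feed(html)
    parser.close()
    header, *body = parser.rows
    name_idx = header.index("Name")
    portrayal_idx = header.index("Portrayal")
    needed = max(name_idx, portrayal_idx)
    # Skip rows that are too short to hold both columns.
    return [(row[name_idx], row[portrayal_idx]) for row in body if len(row) > needed]


# Made-up sample table; the real input would be the downloaded page source.
SAMPLE_HTML = """
<table>
  <tr><th>Name</th><th>Portrayal</th><th>Description</th></tr>
  <tr><td>Example Character</td><td>Example Actor (Ep. IV: A New Hope)</td><td>Made-up row</td></tr>
  <tr><td>Another Character</td><td>Another Actor (Rebels)</td><td>Made-up row</td></tr>
</table>
"""

if __name__ == "__main__":
    for name, portrayal in name_portrayal_pairs(SAMPLE_HTML):
        print(name, "->", portrayal)
```

The movie titles would still need to be pulled out of the “Portrayal” text in a later step; that parsing is not shown here because the source does not describe it.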
Are there any particular aspects of your visualization to which you would like to bring attention? After preparing the data, the information displays nicely in an interactive web page. Selecting a character highlights the movies associated with that character and the other characters associated with those movies. Selecting a movie highlights the characters associated with it. The screenshot above highlights Yoda and Ep. I: The Phantom Menace.
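The selection behavior described here amounts to neighbor lookups in a two-mode (character and movie) network. The following is a minimal sketch of that logic, assuming the data is a list of (character, film) pairs. The function names and the sample pairs are illustrative and do not reflect how the interactive page is actually implemented.

```python
from collections import defaultdict


def build_index(pairs):
    """Build lookups in both directions between characters and films."""
    films_by_character = defaultdict(set)
    characters_by_film = defaultdict(set)
    for character, film in pairs:
        films_by_character[character].add(film)
        characters_by_film[film].add(character)
    return films_by_character, characters_by_film


def highlight_character(character, films_by_character, characters_by_film):
    """Return the character's films and the other characters in those films."""
    films = set(films_by_character.get(character, set()))
    related = set()
    for film in films:
        related |= characters_by_film[film]
    related.discard(character)  # the selected character is not its own neighbor
    return films, related


def highlight_film(film, characters_by_film):
    """Return the characters associated with a film."""
    return set(characters_by_film.get(film, set()))


# Illustrative pairs only; the real pairs would come from the scraped page.
sample_pairs = [
    ("Yoda", "Ep. I: The Phantom Menace"),
    ("Yoda", "The Clone Wars"),
    ("Character A", "Ep. I: The Phantom Menace"),
    ("Character B", "The Clone Wars"),
    ("Character C", "Rebels"),
]

if __name__ == "__main__":
    by_character, by_film = build_index(sample_pairs)
    print(highlight_character("Yoda", by_character, by_film))
    print(highlight_film("Ep. I: The Phantom Menace", by_film))
```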
What do you think the data and your visualization show? They show the relationships among Star Wars characters, linked through the movies in which they appear.