###Introduction
The Quantified Self (QS) is a movement motivated to leverage the synergy of wearables, analytics, and “Big Data”. This movement exploits the ease and convenience of data acquisition through the internet of things (IoT) to feed the growing obsession of personal informatics and quotidian data. The website http://quantifiedself.com/ is a great place to start to understand more about the QS movement.
The value of the QS for our class is that its core mandate is to visualize and generate questions and insights about a topic that is of immense importance to most people – themselves. It also produces a wealth of data in a variety of forms. Therefore, designing this project around the QS movement makes perfect sense because it offers you the opportunity to be both the data and question provider, the data analyst, the vis designer, and the end user. This means you will be in the unique position of being capable of providing feedback and direction at all points along the data visualization/analysis life cycle. The goal of the project is to collect, analyze and visualize the data using the tools and methods covered in class. Additionally, using the data-driven approach, I will create a summary which answers the following questions based on the data collected 1) What is the spending by month in 2019? 2) Which category costed the most in 2019? 3) How often do i use credit card? 4) What does the March and November spending look like? 5) WHich is the most frequent merchandise that i had visited? ### Data preparation
## Parsed with column specification:
## cols(
## Date = col_character(),
## Week = col_character(),
## Name = col_character(),
## Holder = col_character(),
## Number = col_character(),
## Category = col_character(),
## Amount = col_double()
## )
There are 7 variables we are interested in:
Card Number
###Q1: What is the spending by month in 2019?
## geom_path: Each group consists of only one observation. Do you need to adjust
## the group aesthetic?
My spending by month here is quite spreadout, and this graph gives me a general idea about how it goes by month. I summarized the total amount spent in each month and usd bar chart display the numbers. From the graph i can tell that the average spending by month is around $2500. June has been my most spending month,since that month i bought a watch for my mom.Sweet memory for her,remarkable memory for my statement.
###Q2: Which category costed the most in 2019?
I have a accidental event in Merchandise& Supplies as i menthoned before, thats because of a watch. Besides of that, the most spending category for me are“restaurant”and "travel. When with the Amex points in mind, this gives me an idea about how should i use this credit card to get more points.
###Q3: How often do i use credit card?
##
## Attaching package: 'plotly'
## The following object is masked from 'package:ggplot2':
##
## last_plot
## The following object is masked from 'package:stats':
##
## filter
## The following object is masked from 'package:graphics':
##
## layout
From the plot we can see that, still, the most spending is Merchandise. Besides, in November, the second spending is Travel/Entertainment. I checked detailed information. The auto insurance was renewed in March and Thanksgiving holiday was in November, which caused the majority of the spending.
###Q4: What does the March and November spending look like?
Q5:WHich is the most frequent merchandise that i had visited?
## I decided to use a scatterplot because it gives me a lot of information at a glance about my purchase behavior over time. I can see that my and Fashion purchases tend to be larger in amount each time. Transportation mainly consists of small purchases, but they are numerous. Also, it looks like recently the only large purchase I made was for Merchandise , and I’m keeping my purchases to below $100. You can hover each dot to get more information about the purchase.