Data 607 Assignment week 9

Overview:

This particular session will cover on how to connect to New York times(NYT) API through Rstudio and load the data in the form of data frame. First we have to create an API key. In order to create that we have to sign up to NYT using the link: https://developer.nytimes.com/accounts/create, after signing we can create the API key using link : https://developer.nytimes.com/my-apps/new-app . Once we have created the API key with NYT we can easily fetch the data in form of JSON file in Rstudio and then convert it into a data frame for further investigations.

Our aim is to find the top ranking book in the Fiction Genre on date: 01/20/2023. In order to achieve our goal we have to set up our Rstudio environment. I will be using the following packages called in the code chunk:

# Setting up the environment
library(tidyverse)
library(httr)
library(jsonlite)
library(RCurl)

First I will use GET() function from httr package to secure a connection with the NYT API. The following link contains the date and genre of the books. These links are pre given on NYT API page: (link)

res = GET("https://api.nytimes.com/svc/books/v3/lists/2023-01-20/hardcover-fiction.json?api-key=Sbf8cqn7VjRc3DFuYTQ0Z5Y9Pe852kHM")
res

## Response [https://api.nytimes.com/svc/books/v3/lists/2023-01-20/hardcover-fiction.json?api-key=Sbf8cqn7VjRc3DFuYTQ0Z5Y9Pe852kHM]
##   Date: 2023-03-25 05:36
##   Status: 200
##   Content-Type: application/json; charset=UTF-8
##   Size: 35 kB

When the status = 200 it means that our connection is established with the API and we can import our required data from the link. Since the data imported will be in the JSON format so we will have to use functions from jsonlite package. First we will contents of the link into a variable in the form of Character using rawToChar() function from Base R.

a <- rawToChar(res$content)

Then, we can use fromJSON() function from jsonlite package by setting the argument flatten to TRUE and converting it into data frame using data.frame() function.

data <- fromJSON(a, flatten = TRUE)|>
  data.frame()

After that we can use select and rename function from tidyverse to extract the required results and save it into a variable as data frame.

book_lists <-data|>
  select(results.books.rank, results.books.title,results.books.author,results.books.contributor,results.books.publisher,status)|>
  rename(Status = status, Title=results.books.title, Author=results.books.author, Contributor=results.books.contributor, Rank=results.books.rank, Publisher =results.books.publisher)

knitr::kable(book_lists)

Rank	Title	Author	Contributor	Publisher	Status
1	LESSONS IN CHEMISTRY	Bonnie Garmus	by Bonnie Garmus	Doubleday	OK
2	THE HOUSE IN THE PINES	Ana Reyes	by Ana Reyes	Dutton	OK
3	WITHOUT A TRACE	Danielle Steel	by Danielle Steel	Delacorte	OK
4	THE BOYS FROM BILOXI	John Grisham	by John Grisham	Doubleday	OK
5	DEMON COPPERHEAD	Barbara Kingsolver	by Barbara Kingsolver	Harper	OK
6	FAIRY TALE	Stephen King	by Stephen King	Scribner	OK
7	TOMORROW, AND TOMORROW, AND TOMORROW	Gabrielle Zevin	by Gabrielle Zevin	Knopf	OK
8	MAD HONEY	Jodi Picoult and Jennifer Finney Boylan	by Jodi Picoult and Jennifer Finney Boylan	Ballantine	OK
9	THE MIDNIGHT LIBRARY	Matt Haig	by Matt Haig	Viking	OK
10	BABEL	R.F. Kuang	by R.F. Kuang	Harper Voyager	OK
11	AGE OF VICE	Deepti Kapoor	by Deepti Kapoor	Riverhead	OK
12	A WORLD OF CURIOSITIES	Louise Penny	by Louise Penny	Minotaur	OK
13	THE VILLA	Rachel Hawkins	by Rachel Hawkins	St. Martin’s	OK
14	REMARKABLY BRIGHT CREATURES	Shel Van Pelt	by Shelby Van Pelt	Ecco	OK
15	DREAMLAND	Nicholas Sparks	by Nicholas Sparks	Random House	OK

I will just end it by making a simple plot to see if everything is working fine the data frame.

ggplot()+
  geom_bar(book_lists, mapping = aes(x=Publisher, fill = Publisher))+coord_flip()+theme_bw()+theme(legend.position = "none")

Data 607 Assignment week 9

Umer Farooq

2023-03-21

Overview: