Introduction

This data set provides information about book checkouts from the Seattle Public Library such as a book’s checkout type, material, checkout year and month, number of checkouts, etc… This information dates from 2017-2023 and only contains items that were checked out at least 10 times a month.The trend I will be analyzing is the flow and direction that the number of checkouts take over the years for specific books and types of books.

Summary Information

This dataset highlights many changes over time concerning the borrowing habits of those who use the services of the Seattle Public Library. For instance, the year with the most ebook checkouts was 2020 with . This makes sense as this was the height of the COVID-19 pandemic, which would have caused people to move indoors and obtain books electronically. The year with the least number of ebook checkouts was 2023. This makes sense as 2023 is just starting, but with this trend, it looks like 2023 will have just as many, if not even more checkouts than previous years. Additionally, by analyzing specific books such as “Outside”, a better conclusion is made about the trends of book checkouts throughout the years. Specifically, the year with the most checkouts of this book was 2018 while the year with the least number of checkouts was 2022. Finally, by seeing an overall difference in the number of checkouts throughout the years, we are able to make a better conclusion about the trend of checkouts. In this case, I used 2022 as it was the last full year of the dataset and found that 2017 had 1668954 more checkouts than the year 2022. This may be for many reasons such as an increasingly more accessible library of texts found online other than the Seattle Public Library. This data overal shows varying trends of checkouts at different levels of specificity.

The Dataset

The data was collected and published by the Seattle Public Library.

The dataset includes information on checkouts from the Seattle Public Library system, including the title of the borrowed item, the author, the publication year, the publisher, the ISBN, and the number of checkouts by item, the usage type, checkout type, material type, checkout year and month ranging from from 2017 to 2023. This dataset has 12 columns and 816453 rows.

The data was likely collected electronically through the library’s checkout system and stored in a database. It is unclear if the data includes all checkouts or a sample.

The data was likely collected to track the popularity of different titles and evaluate the trends of book checkouts throughout the Seattle Public Library system

Ethical questions could include ensuring the privacy of borrower data and avoiding biased analyses or interpretations. It is also important to consider whether the data includes sensitive information such as borrowers’ names or addresses and take steps to protect that information.

Possible limitations or problems could include incomplete or inaccurate data, lack of context, and biases caused by social and political trends. The dataset also does not include information on the specific locations that the books were taken after they were checked out where items were checked out, which could limit the ability to draw conclusions about borrowing patterns in specific locations.

Your Choice

This line plot was included as a way of measuring two very specific cases/authors. In this situation, choosing Dr. Seuss and J.K. Rowling was a method to ensure that both analyzed authors were well known, especially to younger audiences. This graph indicates that both authors over the years have seen great decreases in checkouts, especially Dr. Seuss. Both authors saw their lowest number of checkouts in about 2021 and have been steadily increasing from there, but definitely not to their original capacity.