Antonio Rubiera
12/12/2019
The Securities and Exchange Commission requires U.S. stock-issuing, or publicly listed companies to file a large number of reports. These reports contain financial data annotated with text of varying lengths. In this shiny app, we have collected a small sample of recent annotations contained in the financial reports of three companies with different types of operations, and different styles of text annotation. Apple is here as an example of terse language, and GE is here as an example of verbose text. Walmart is included here to show a large retailer.
The shiny app is located here:
The text is transformed using the tm_map function of the tm packages, one of the NLP (Natural Language Processing) packages. After turning the text into a VCorpus in tm, we:
The word clouds can give us a quick feel for the language of a company. For example, the word “loss” is larger, and therefore more frequent, in our text sample for General Electric, than the word “benefit.”