19th October 2024

Introduction

  • Overview of the project.
  • Objectives: To analyze text data from Twitter, Blogs, and News sources.
  • Scope: Focus on n-gram frequency analysis.

Methodology

  • Data Collection:
    • Sources: Twitter, Blogs, News.
    • Pre-processing: Cleaning and tokenizing text data.
  • Analysis Techniques:
    • N-gram frequency calculation.
    • Visualization using ggplot2.

Results

  • Key Findings:
    • High-frequency n-grams identified in Twitter data.
    • Comparison of n-gram usage across different sources.

Word Cloud of News N-grams

Word Cloud of News N-grams = 3

Word Cloud of News N-grams = 3