Apache HTTP Server Log Analysis

Sayli Gharat -09 & Srobona Roychowdhury -22
30/04/2017


Business Analytics using R & Shiny Apps
PGDM-Research & Business Analytics
2016-2018

Project Objectives

To perform Apache HTTP Server Log Analysis which will help us to

  • to enhance our knowledge about R
  • to develop exploratory & visualization skills
  • to enable us to use language & coding skills in complex situations

Project Overview

The project had to be done across multiple steps which included

  • understanding a sample HTTP access log file
  • reading a log file using file input control in shiny app
  • analyzing as required to find out about requests sent to server, timestamp , host,etc
  • visualizing results in shiny app

Screenshot of the application

alt text

Analytics

This web application when provided with a any suitable log file, helps us in analyzing the log entries as follows:

  • Number of page hits and bytes with respect to date
  • Number of page hits and bytes with respect to hour
  • Number of page hits and bytes with respect to host
  • Number of page hits and bytes with respect to method
  • Number of page hits and bytes with respect to url

    mainly to know the patterns in traffic to a particular webpage

Meeting Project Objectives

Objectives were met using:

  • appropriate functions of R
  • visualization tools in shiny app

Experience with R

For Data Analytics, there seems to be no better language than R especially when it comes to very large datasets.With R multiples objectives can be met successfully in spite of how large the operations are