Introduction

Nowdays, a massive amount of data has been generated for every single second, which is increasing the need for data analysis. The data is not only in form of figures, but also in text. That leads to the term of data mining and text mining in data science.

The following parts are dedicated for describing some typical outputs of data analysis excerpted from Hien, N.N. (2016).

Data mining

Data mining is the computational process of sketching patterns in data sets by using several methods such as statistics. The following part is an expample taken from Hien, N.N. (2016).

LPS implementation publications with respects to Time (1988-2015)

To show how big the research gap of Lean production system (LPS) implementation in pharmaceutical small-medium sized enterprises (pSMEs) exists in the body of knowledge (BoK), we can obtain the data on Google Scholar in terms of research papers on the topic by using the following search terms:

Syntax for searching on Google scholar:

  1. LPS implememtation in General: “allintitle: lean manufacturing OR implementation OR practices OR production OR”sucess factors" OR barriers -beef -meat -fat -weight -overweight -glucose -pigs -lamb -sheep -tissue -pork -tissues"

  2. LPS implememtation for SMEs: “allintitle: lean SMEs manufacturing OR implementation OR practices OR production OR”sucess factors" OR barriers -beef -meat -fat -weight -overweight -glucose -pigs -lamb -sheep -tissue -pork -tissues"

  3. LPS implememtation for pharmaceutical SMEs: “allintitle: lean pharmaceutical industry manufacturing OR implementation OR practices OR production OR”sucess factors" barriers -beef -meat -fat -weight -overweight -glucose -pigs -lamb -sheep -tissue -pork -tissues"

The result will show the number of research papers whose title contains these keyworks. These articles are filtered in accordance with time period by using the sorting function of Google sholar.

And then the data can be plotted as the following information chart from which the history of the Bok on lean implementation can instantly be captured.

The history of Lean publications

Lean publications in summary

Text mining

Text mining involves in analyzing data in form of text. A thoudsand of words will be analyzed to discover the pattern of words to deduce the research outcomes

Bar chart of literature review

The following bar chart shows the database of 30 reviewed research papers which will be used as an input for Text mining: