Introduction

Nowdays, a massive amount of data has been generated for every single second, which is increasing the need for data analysis. The data is not only in form of figures, but also in text. That leads to the term of data mining and text mining in data science.

The following parts are dedicated for describing some typical outputs of data analysis excerpted from Hien, N.N. (2016).

Data mining

Data mining is the computational process of sketching patterns in data sets by using several methods such as statistics. The following part is an expample taken from Hien, N.N. (2016).

LPS implementation publications with respects to Time (1988-2015)

To show how big the research gap of Lean production system (LPS) implementation in pharmaceutical small-medium sized enterprises (pSMEs) exists in the body of knowledge (BoK), we can obtain the data on Google Scholar in terms of research papers on the topic by using the following search terms:

Syntax for searching on Google scholar:

  1. LPS implememtation in General: โ€œallintitle: lean manufacturing OR implementation OR practices OR production ORโ€sucess factors" OR barriers -beef -meat -fat -weight -overweight -glucose -pigs -lamb -sheep -tissue -pork -tissues"

  2. LPS implememtation for SMEs: โ€œallintitle: lean SMEs manufacturing OR implementation OR practices OR production ORโ€sucess factors" OR barriers -beef -meat -fat -weight -overweight -glucose -pigs -lamb -sheep -tissue -pork -tissues"

  3. LPS implememtation for pharmaceutical SMEs: โ€œallintitle: lean pharmaceutical industry manufacturing OR implementation OR practices OR production ORโ€sucess factors" barriers -beef -meat -fat -weight -overweight -glucose -pigs -lamb -sheep -tissue -pork -tissues"

The result will show the number of research papers whose title contains these keyworks. These articles are filtered in accordance with time period by using the sorting function of Google sholar.

And then the data can be plotted as the following information chart from which the history of the Bok on lean implementation can instantly be captured.

The history of Lean publications

Lean publications in summary

Text mining

Text mining involves in analyzing data in form of text. A thoudsand of words will be analyzed to discover the pattern of words to deduce the research outcomes

Bar chart of literature review

The following bar chart shows the database of 30 reviewed research papers which will be used as an input for Text mining:

Summary of Literature review

The following bar chart indicates the number of lean influencing factors associated with the author of reviewed research papers

Network Of Lean barriers

The result of text mining on the literature review is illustrated by the following network

The light green bubbles on the network of lean barriers indicates the authors of reviewed research papers. The faded orange bubbles show that lean barriers were reported less than four times, while normal orange bubbles represent the barriers reported around 4 and 7 times found in the literature. The lean barriers, which were declared more than 7 times by the authors, are considered as critical lean obstacles on the path toward LPS. The node of โ€œknowledgeโ€, โ€œresistanceโ€, and โ€œtrainingโ€ are the abbreviation for the โ€œlack of knowledgeโ€, โ€œresistance to leanโ€ and โ€œlack of trainingโ€ barrier respectively. These nodes are the most significant lean barriers because they have dense links to more than 7 authors.

Once one of any red nodes is moved in a any direction, the entire system of nodes will follow.

The red sun of lean barriers

The thin red lines laid out around the outer circle indicate the authors of reviewed research papers. The red curved lines within the red sun represent paths leading to the authorsโ€™ research results on lean obstacles. The size of quadrangular objects sketched around the circle shows how many times a barrier was detected through the text mining process. The faded orange objects show that lean barriers were reported less than 4 times or 4 red curved lines getting into the objects, while normal orange bubbles represent the barriers reported around 4 and 7 times found in the literature. The lean barriers illustrated by the dark red objects, which were claimed more than 7 times by the authors, are considered as critical lean barriers on the path toward LPS. The node of โ€œknowledgeโ€, โ€œresistanceโ€, and โ€œtrainingโ€ are the abbreviation for the โ€œlack of knowledgeโ€, โ€œresistance to leanโ€ and โ€œlack of trainingโ€ barrier respectively. These nodes are the most significant lean barriers due to the assumption that they have more than 7 red lines coming from the reviewed papers.

Bar chart of barriers

The following bar chart is to summarize the number of lean barriers found in the reviewed research papers

Network Of success factors

The light green bubbles on the network of lean success factors indicate the authors of reviewed research papers. The faded orange bubbles show that success factors were reported less than two times, while normal orange bubbles represent the success factors reported around 3 and 5 times found in the literature. The lean success factors, which were stressed more than 5 times by the authors, are considered as critical factors on the path toward LPS. The node of โ€œleadershipโ€, โ€œknowledgeโ€, โ€œskillsโ€, โ€œhumanโ€, and โ€œculture/changeโ€ are the abbreviation for the โ€œleadership managementโ€, โ€œknowledge on leanโ€, โ€œlean skillsโ€, โ€œhuman aspectsโ€, and โ€œcultural changeโ€ barrier in the same order. These nodes are the most critical lean success factors because they have dense links to more than 6 authors.

Once one of any dark orange nodes is moved in a any direction, the entire system of nodes will be changed in shape.

Circlic network of success factors

The size of a bubble shows how many times a lean success factor was identified by the literature. The arrows show the suggestions proposed by the authors presented by the light green bubbles on the network. The orange bubbles show that success factors were reported less than 2 times or 2 arrows, while yellow bubbles represent the success factors reported around 3 and 5 times found in the literature. The lean success factors represented by the dark green bubbles, which were stressed more than 5 times by the authors, are considered as critical success factors on the path toward LPS. The node of โ€œleadershipโ€, โ€œknowledgeโ€, โ€œskillsโ€, โ€œhumanโ€, and โ€œculture/changeโ€ are the abbreviation for the โ€œleadership managementโ€, โ€œknowledge on leanโ€, โ€œlean skillsโ€, โ€œhuman aspectsโ€, and โ€œcultural changeโ€ in the same order. These nodes are the most critical lean success factors because of the assumption that they have more than 6 arrows coming from the reviewed literature.

Bar chart of success factors

To summarize the above network, the bar chart will be used again as below:

Lean implementation diagram

The following interative diagram is not structured well at this moment, but you can play on it by grasping the vertical line.

Reference

Hien, N.N (2016). Implementation of lean production systems for small-medium sized enterprises. Unpublished PhD Thesis, Technical Univesity of Berlin, Germany