Actuarial Mathematics

Krishna Kumar Shrestha

2/14/2020

Data Analysis

Data analysis is a method in which data is collected and organized so that one can derive helpful information from it for specific purpose.

Forms of Data Analysis

Descriptive

Descriptive analysis is an important first step for conducting statistical analyses. It gives you an idea of the distribution of your data, helps you detect outliers , and enable you identify associations among variables, thus preparing you for conducting further statistical analyses.

Two key measures, or parameters, used in a descriptive analysis are

Inferential analysis

Inferential statistics are usually the most important part of a dissertation’s statistical analysis. Inferential statistics are used to allow a researcher to make statistical inferences, that is draw conclusions about the study population based upon the sample data.

Predictive analysis

It involve using of past data to predict about future outcomes.

Data Analysis Process

Data Analysis Process

Data Analysis Process

Data sources

Data collection plays a very crucial role in the statistical analysis. In research, there are different methods used to gather information, all of which fall into two categories, i.e. primary data, and secondary data.

Primary data

Secondary Data

Ways reduce the effect of bias

Different ways of data collection affect analysis :

Big data

The properties that can lead data to be classified as ‘big’ include:

Data security, privacy and regulation

In the design of any investigation, consideration of issues related to data security, privacy and complying with relevant regulations should be paramount. It is especially important to be aware that combining different data from different ‘anonymised’ sources can mean that individual cases become identifiable.

Another point to be aware of is that just because data has been made available on the internet, doesn’t mean that that others are free to use it as they wish. This is a very complex area and laws vary between jurisdictions.

Reproducible research

Reproducibility refers to the idea that when the results of a statistical analysis are reported, sufficient information is provided so that an independent third party can repeat the analysis and arrive at the same results.

Replication can be hard, or expensive or impossible, for example if:

Due to the possible difficulties of replication, reproducibility of the statistical analysis is often a reasonably alternative standard

Elements required for reproducibility

Problems without Reproducibility

Doing things ‘by hand’ is very likely to create problems in reproducing the work. Examples of doing things by hand are:

The value of reproducibility

Many actuarial analyses are undertaken for commercial, not scientific, reasons and are not published, but reproducibility is still valuable:

Issues that reproducibility does not address: