Answer 1:

Value: the value that can be derived from accessing and analyzing big data;

Veracity: the trustworthiness of the data, including the discrepancies and inconsistencies found in it;

Variety: a combination of data types that are being dumped into the system;

Volume: the sheer amount of data being generated each second;

Velocity: the speed at which data is generated and at which changes occur across different datasets.

Answer 2: Volume and value are, in my opinion, the two most important V’s, and volume above all will be the most important one to deal with in the future. The number of data sets and data sources is growing at an exponential rate, and that growth is projected to continue in the coming years.

Answer 3:

a: 4.5 petabytes = 4,500 terabytes;

b: 6.813 zettabytes = 6.813 x 10^3 exabytes;

c: 50 million petabytes = 50 zettabytes.
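
As a quick check on these conversions, here is a minimal sketch that assumes decimal (SI) units, where each step from terabyte to petabyte to exabyte to zettabyte is a factor of 1,000. The names `SI_EXPONENT` and `convert` are made up for this illustration and are not part of the original question.

```python
from decimal import Decimal

# Assumption: decimal (SI) units, so 1 unit = 10**exponent bytes.
SI_EXPONENT = {
    "terabyte": 12,
    "petabyte": 15,
    "exabyte": 18,
    "zettabyte": 21,
}


def convert(value, from_unit, to_unit):
    """Convert a quantity between SI byte units using powers of ten."""
    # Decimal avoids floating-point rounding on values such as 6.813.
    shift = SI_EXPONENT[from_unit] - SI_EXPONENT[to_unit]
    return Decimal(str(value)) * Decimal(10) ** shift


print(convert("4.5", "petabyte", "terabyte"))        # a: numerically 4500
print(convert("6.813", "zettabyte", "exabyte"))      # b: numerically 6813
print(convert(50_000_000, "petabyte", "zettabyte"))  # c: numerically 50
```

Moving to a smaller unit multiplies by a positive power of ten, and moving to a larger unit multiplies by a negative one, which is why zettabytes to exabytes gives a larger number.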

Answer 4:

If X predicts Y, this does not mean X causes Y. Accurate prediction depends heavily on measuring the correct variables. Prediction is quite difficult, so a better strategy would be to approximate the parabolic equation with a linear equation, so that we can confidently predict the successive values. In fact, with the current equation, the parabola is concave down, so the values drop past the vertex, which does not allow us to predict. Therefore, using a linear equation would be ideal.
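
The idea of replacing a concave-down parabola with a line can be sketched as follows. The coefficients `A`, `B`, and `C` are hypothetical placeholders, since the equation from the question is not reproduced here, and the tangent line is only one possible way to build the linear approximation.

```python
# Hypothetical concave-down parabola y = A*x**2 + B*x + C (A < 0).
# These coefficients are placeholders, not the ones from the question.
A, B, C = -1.0, 4.0, 0.0


def parabola(x):
    """Evaluate the parabola at x."""
    return A * x**2 + B * x + C


def vertex_x():
    """x-coordinate of the vertex, where the values stop rising."""
    return -B / (2 * A)


def tangent_line(x0):
    """Return (slope, intercept) of the tangent to the parabola at x0."""
    slope = 2 * A * x0 + B  # derivative of the parabola at x0
    intercept = parabola(x0) - slope * x0
    return slope, intercept


xv = vertex_x()
print(xv, parabola(xv), parabola(xv + 1))  # values drop past the vertex

slope, intercept = tangent_line(1.0)
print(slope, intercept)  # linear approximation y = slope*x + intercept
```

With these placeholder values the vertex sits at x = 2, the parabola falls from 4 to 3 one step past it, and the tangent at x = 1 is the line y = 2x + 1.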