The aim of this project is to perform business analytics for a Polish flower seller that is the largest producer of daffodils in the country, with 3 other flower types rounding off its offerings. The analysis uses Q1 2020 sales data, which is not integrated because it comes from different reporting systems.
The business requirements are as follows:
Combine the datasets, so it is possible to monitor monthly sales trends for each of the 4 flower types at each store location.
Take note of any discrepancies or apparent faults in the data, so the company can use that information to amend their systems.
Prepare a presentation in R Markdown with 5 slides maximum, showing the most important observations from your data analysis.
Do not modify the input files in any way.
Prepare the R code for data import in such a way that you will be able to import data for more than 3 months with no modifications to your code.
Prepare the R Markdown presentation in such a way that it will automatically update when more than 3 months of data is used as an input.
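The import requirement above could be met by reading every file found in a folder instead of naming each month explicitly. The following is a minimal sketch in base R, not the project's actual import code: the function name `import_sales`, the folder name `data`, the CSV format and the assumption that all files share the same columns are illustrative only.

```r
# Sketch: import every monthly sales file found in a folder.
# ASSUMPTIONS: the folder name, the ".csv" pattern and an identical
# column layout across files are hypothetical, not taken from the project.
import_sales <- function(data_dir = "data", pattern = "\\.csv$") {
  files <- list.files(data_dir, pattern = pattern, full.names = TRUE)
  if (length(files) == 0) {
    stop("No input files found in ", data_dir)
  }
  # Read each file (read-only) and remember where every row came from.
  tables <- lapply(files, function(path) {
    df <- read.csv(path, stringsAsFactors = FALSE)
    df$source_file <- basename(path)
    df
  })
  # Stack the monthly tables into one data frame.
  do.call(rbind, tables)
}
```

Because the file list is discovered at run time, placing a fourth or fifth month's file in the folder would be picked up without changing the code, and the input files themselves are only read, never written.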
The following discrepancies were observed in the data.
In the sales summary files, Polish letters in store names are replaced with “?”.
There is a store named “Parviflora” with no Store ID. It may be the Parviflora headquarters, but this is not clear.
The Katowice store appears to have closed before January: its only transaction is a return recorded in January. Because a flower was returned, the store still appears in the January sales list.
The name of the Swiebodzin store does not begin with ‘Parviflora’. There is also no sales information about Daffodils for this store.
The Daffodils sales report contains a store with ID “345”, for which no store information is available.
The Daffodils sales file contains another store with a 10-digit ID. This ID appears to have been copied from the store's location number.
To summarize, two stores on the Daffodil sales list have no store information, and two stores (the headquarters and Swiebodzin) have no Daffodil sales. These could correspond to each other, but they cannot be matched without further information. The Swiebodzin store ID is “570”, and IDs of the form 5xx generally appear to belong to retail stores. One possibility is that Swiebodzin's daffodil sales were recorded under store ID 345, in which case store 345 could be assigned to Swiebodzin.
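Unmatched stores of this kind can be listed mechanically by comparing the IDs in the two sources. The sketch below uses base R on invented data: the data frame and column names are hypothetical, the IDs “601” and “1234567890” are made up for illustration, and the three-digit ID format is an assumption based on the IDs mentioned above.

```r
# Hypothetical example data; only the IDs 345 and 570 come from the report.
stores <- data.frame(
  store_id = c("570", "601", NA),
  store_name = c("Swiebodzin", "Parviflora Example", "Parviflora"),
  stringsAsFactors = FALSE
)
daffodil_sales <- data.frame(
  store_id = c("601", "345", "1234567890"),
  amount = c(100, 40, 60),
  stringsAsFactors = FALSE
)

# Store IDs in the sales file that are missing from the store list.
unknown_ids <- setdiff(daffodil_sales$store_id, stores$store_id)

# Stores that never appear in the daffodil sales file.
no_daffodils <- stores[!(stores$store_id %in% daffodil_sales$store_id), ]

# IDs that do not have three characters (ASSUMED to be the normal format).
odd_format <- daffodil_sales$store_id[nchar(daffodil_sales$store_id) != 3]
```

On this sample, `unknown_ids` holds the two sales-side IDs without store information and `no_daffodils` holds the two stores without daffodil sales. The code only lists candidates; deciding whether they belong together still needs information from the company.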
There is a further discrepancy for the store with the 10-digit ID. The sum of this store's transactions in March 2020 differs from the reported total transaction amount, which is presumably a miscalculation. As a result, the total transaction amount for Daffodil sales in March is wrong, and consequently so is the overall Daffodil sales total.
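A reconciliation of this kind can be automated by recomputing each total from the individual transactions and comparing it with the reported figure. The following base R sketch shows the idea on invented numbers: the column names, the store ID, the amounts and the 0.01 tolerance are all assumptions, not values from the sales files.

```r
# Hypothetical figures for one store and month; every value is invented.
transactions <- data.frame(
  store_id = rep("1234567890", 3),
  month = rep("2020-03", 3),
  amount = c(120.5, 80.0, 49.5),
  stringsAsFactors = FALSE
)
reported <- data.frame(
  store_id = "1234567890",
  month = "2020-03",
  reported_total = 260.0,
  stringsAsFactors = FALSE
)

# Recompute the total per store and month from the individual transactions.
computed <- aggregate(amount ~ store_id + month, data = transactions, FUN = sum)
names(computed)[names(computed) == "amount"] <- "computed_total"

# Join with the reported totals and flag rows that differ beyond a
# small tolerance (ASSUMED to be 0.01 currency units).
check <- merge(reported, computed, by = c("store_id", "month"))
check$difference <- check$reported_total - check$computed_total
check$mismatch <- abs(check$difference) > 0.01
```

Rows where `mismatch` is `TRUE` are the store and month combinations whose reported total does not agree with the underlying transactions, which is the pattern described for March 2020.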