Introduction
The Jester data set includes 4.1 million joke ratings gathered from 1999 to 2003. In this project, I’ll create a Spark based recommendation system using ALS matrix factorization.
References
Goldberg, K. et al. (2000) ‘Eigentaste: A Constant Time Collaborative Filtering Algorithm’, Information Retrieval. Kluwer Academic Publishers, 4(2), pp. 133–151. doi: 10.1023/A:1011419012209.
LS0tCnRpdGxlOiAiRmluYWwgUHJvamVjdCBQcm9wb3NhbCIKc3VidGl0bGU6ICJEQVRBLTYxMiwgU3VtbWVyIDIwMTkiCmF1dGhvcjogIkZlcm5hbmRvIEZpZ3VlcmVzIgpvdXRwdXQ6IGh0bWxfbm90ZWJvb2sKLS0tCgoKIyMgSW50cm9kdWN0aW9uCgpUaGUgSmVzdGVyIGRhdGEgc2V0IGluY2x1ZGVzIDQuMSBtaWxsaW9uIGpva2UgcmF0aW5ncyBnYXRoZXJlZCBmcm9tIDE5OTkgdG8gMjAwMy4gSW4gdGhpcyBwcm9qZWN0LCBJJ2xsIGNyZWF0ZSBhIFNwYXJrIGJhc2VkIHJlY29tbWVuZGF0aW9uIHN5c3RlbSB1c2luZyBBTFMgbWF0cml4IGZhY3Rvcml6YXRpb24uCgojIyBSZWZlcmVuY2VzCgpHb2xkYmVyZywgSy4gZXQgYWwuICgyMDAwKSDigJhFaWdlbnRhc3RlOiBBIENvbnN0YW50IFRpbWUgQ29sbGFib3JhdGl2ZSBGaWx0ZXJpbmcgQWxnb3JpdGht4oCZLCBJbmZvcm1hdGlvbiBSZXRyaWV2YWwuIEtsdXdlciBBY2FkZW1pYyBQdWJsaXNoZXJzLCA0KDIpLCBwcC4gMTMz4oCTMTUxLiBkb2k6IDEwLjEwMjMvQToxMDExNDE5MDEyMjA5Lgo=