Pleasure to be at Royal High
School!
https://www.instagram.com/reel/CbBHDk_gVng/?igshid=MDJmNzVkMjY=
What Math level would be good enough to pursue STEM?
https://www.weforum.org/agenda/2021/10/stem-degrees-most-valuable/
U.S population is about 332 million and 18 year and older
are about 258.3 million in 2020. Nearly 8.6 million STEM jobs in the
U.S..
https://www.comparably.com/salaries/salaries-for-hollywood-actor
## Job Prob Median Expected
## 1 Celebrities/Art/Sport/Youtuber 20,000/258,300,000=0.000077 $252,016 $19.4
## 2 STEM 8,600,000/258,300,000=0.033 $90,000 $2970
## [1] "X156 times =15631%"
-No advice for Math lovers, because they’re better than me!
-Practice books help from a foundation, and invest about 30 minutes of
your time daily.
-Once you finish a whole book, buy the same
subject workbook from a different company, and do the same process.
(1) Statisticians
Bachelors or Masters
Programs used : SAS, SPSS, R or MATLAB
Bureau of Statistics, SAT corporation, Probability analyst for sport
teams (NBA), Analyst (Cheesecake Factory, Gas/Electric companies,
Finance)
Salary
(2) Data scientist
Masters or Doctorate
Programs used : R, Python, SQL,
SAS, or MATLAB
Overlaps with computer science.
Tech
companies : Google, Amazon, Apple, Airbnb, Instacart etc
Salary
(3) Bio statistician
Masters or Doctorate
Programs used : SAS, R, Matlab
Bio and Pharmaceutical companies, Health companies: Amgen, Kaiser
Permanente, lots of them in California
Salary
(4) Acualist
Bachelors or Masters (Test based), Calculus based.
Programs
used : Excel, SAS
Insurance, Finance, East Coast or San Francisco
Salary
What is Deep Learning? https://youtu.be/6M5VXKLf4D4
Bayes’ theorem
https://en.wikipedia.org/wiki/Bayes%27_theorem
library(tidyverse)
library(tidymodels)
library(tidytext)
library(tokenizers)
library(hcandersenr)
library(magrittr) # needs to be run every time you start R and want to use %>%
library(dplyr)
library(RColorBrewer)
library(wordcloud)
library(NLP)
library(tm)
library(slam)
Bag-of-words model: the following is a very simple function to
tokenize the text, i.e. separate all the words, filter out irrelevant
stuff and in this case only retain words that consist of more than 3
characters (so stop words such as “the”, “a”, “an”, “in” are not part of
the analysis):
length(pos_tokens)
## [1] 414468
print(pos_tokens[1:50])
## [1] "films" "adapted" "from" "comic" "books"
## [6] "have" "plenty" "success" "whether" "they're"
## [11] "about" "superheroes" "batman" "superman" "spawn"
## [16] "geared" "toward" "kids" "casper" "arthouse"
## [21] "crowd" "ghost" "world" "there's" "never"
## [26] "really" "been" "comic" "book" "like"
## [31] "from" "hell" "before" "starters" "created"
## [36] "alan" "moore" "eddie" "campbell" "brought"
## [41] "medium" "whole" "level" "'80s" "with"
## [46] "12-part" "series" "called" "watchmen" "moore"
length(neg_tokens)
## [1] 370966
print(neg_tokens[1:50])
## [1] "plot" "teen" "couples" "church" "party"
## [6] "drink" "then" "drive" "they" "into"
## [11] "accident" "guys" "dies" "girlfriend" "continues"
## [16] "life" "nightmares" "what's" "deal" "watch"
## [21] "movie" "sorta" "find" "critique" "mind-fuck"
## [26] "movie" "teen" "generation" "that" "touches"
## [31] "very" "cool" "idea" "presents" "very"
## [36] "package" "which" "what" "makes" "this"
## [41] "review" "even" "harder" "write" "since"
## [46] "generally" "applaud" "films" "which" "attempt"
Train what words(Tokens) are in the positive text file, and
what words(Tokens) are in the negative text file. Also Train what
words(Tokens) aren’t in the Positive, and what words(Tokens) aren’t in
the Negative. Through that, the prediction model calculates the
probability of occurrence for tokens.
# positive
calc_Sentiment("This is a wonderful movie. I really loved it!")
## [1] "positive"
calc_Sentiment("I found this film so awesome, it made me cry")
## [1] "positive"
# negative
calc_Sentiment("This is a horrible movie. I really hated it!")
## [1] "negative"
calc_Sentiment("I have never seen such crap. The director should be fired")
## [1] "negative"
calc_Sentiment("Misery and Stand By Me were the best adaptations up until this one, now you can add Shawshank to that list. This is simply one of the best films ever made and I know I am not the first to say that and I certainly won't be the last. The standing on the IMDb is a true barometer of that. #3 as of this date and I'm sure it could be number 1. So I'll just skip all the normal praise of the film because we all know how great it is. But let me perhaps add that what I find so fascinating about Shawshank is that Stephen King wrote it. King is one of the best writers in the world. Books like IT and the Castle Rock series are some of the greatest stories ever told. But his best adaptations are always done by the best directors. The Shining was brilliantly interpreted by Kubrick and of course the aforementioned Misery and Stand By Me are both by Rob Reiner. Now Frank Darabont comes onto the scene and makes arguably the best King film ever. He seems to understand what King wants to say and he conveys that beautifully. What makes this film one of the best ever made is the message it conveys. It is one of eternal hope. Andy Dufresne, played by Tim Robbins, has been sent to prison for a crime he did not commit. But he never loses hope. He never gives up his quest to become a free man again. His years of tenacity, patience and wits keep him not only sane, but it gives his mind and a spirit a will to live. This film has a different feel to it. There has never been anything like it before and I don't know if there will again. I'm not going to say any more about this film, it has already been said, but just suffice to say that I am glad that Forrest Gump won best picture in 94. I would have been equally glad if Pulp Fiction or Shawshank would have won. It is that good of a movie and one that will be appreciated for years to come.")
## [1] "positive"
calc_Sentiment("Brett Kelly - super cheap director located in Canada with a huge potential to become 'worst director ever born' (nomination for 'Worst movie ever made' is also a must for pretty much every single feature he directs) did it again....I mean seriously? 'Jurassic Shark' (yeah I know it rather wasn't original title and was changed because from the marketing point of view it sounds 'hot') is one of the worst piece of garbage you will ever encounter. It makes Asylum movies look like a spectacular Hollywood blockbusters(but then again Asylum spends at least 50-100k for their movies). Kelly's modus operandi is 'we have a free 10k, let's shoot the movie') and it shows on the screen. Acting was never even remotely close to decent in his movies but with 'Jurassic Shark' it reaches the bottom(or something below bottom if it exists). Two blonde bimbos(not really attractive by any means) sitting in bikini on the beach for the first few minutes of the movie are asking to be bitch-slapped for doing what they are doing(which I don't know what is but not acting, that's for sure) and the director should be mutilated for casting them. As far as the special effects go, there aren't any, but if you are asking about 'horrible special effects wannabes' - yes sir, there are quite a few. From the piece of wood called 'shark' to cgi shark which looks so bad, that I don't even know what can I compare with it? (probably only sand castles build by mentally disabled 5 years old kids). I could go on and on(others did it as I see) but I really have no desire to write any longer about this piece of garbage. There is absolutely nothing good to be said about this movie and even though Brett Kelly did one watchable movie in the past 'Prey for the Beast' (and remember, I said 'watchable' not 'decent') I won't be fooled ever again and won't buy any of his movies again. Let them stay where they belong - in a trash bin.")
## [1] "negative"