The link to this article can be found here
Data science continues to evolve as one of the most promising and in-demand career paths for skilled professionals. Today, successful data professionals must advance past the traditional skills of analyzing large amounts of data, data mining, and programming skills. In order to uncover useful intelligence for their organizations, data scientists must master the full spectrum of the data science life cycle and possess a level of flexibility and understanding to maximize returns at each phase of the process.
“The ability to take data — to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it — that’s going to be a hugely important skill in the next decades.”
## [[1]]
## [1] "Quoted from: Hal Varian"
##
## [[2]]
## [1] "Occupation: chief economist at Google and UC Berkeley professor of information sciences, business, and economics"
The Data Science Life Cycle
Following the image above, effective data scientists are able to
# These skills are required in all industries, causing skilled data scientists to be increasingly valuable to companies.
In the past decade, data scientists have become well-rounded, data-driven individuals with high-level technical skills who are capable of building complex quantitative algorithms to organize and synthesize large amounts of information used to answer questions and drive strategy in their organization. This is coupled with the experience in communication and leadership needed to deliver tangible results to various stakeholders across an organization or business.
Data scientists need to be curious and result-oriented, with exceptional industry-specific knowledge and communication skills that allow them to explain highly technical results to their non-technical counterparts. They possess a strong quantitative background in statistics and linear algebra as well as programming knowledge with focuses in data warehousing, mining, and modeling to build and analyze algorithms.
In reality, about 80% of what data scientists do is data cleaning.