Date article was published: June 28, 2019

Summary

This article stresses the importance of knowing SQL for data scientists. SQL helps data scientists with an understanding of relational databases and is one of the most important languages in data science.

## [1] "Easy to Learn and Use"               "Understanding your Dataset"         
## [3] "Integrates with Scripting Languages" "Manage huge volumes of data"        
## [5] "A Gateway to Data Science Jobs"

When trying to find a job in data science, one would wonder what the most important skills are. A foundation in Python and R sure, but a language that connects data and databases together is important. That language is SQL. SQL uses simple structure and English words for its commands and functions. SQL integrates with Python and R and it can also work with huge amounts of data. The ability to store, update, access control, and manipulate datasets are highly marketable skills. In conclusion, SQL is a must know language for data scientists.

Basic Article Stats

Claps Responses
484 1

Tags

## [1] "Data Science"     "Sql"              "Analytics Vidhya" "Big Data"        
## [5] "MySQL"

There are many areas that SQL can be applied to. Each area has huge amounts of data that needs to be stored. The data in there can be manipulated, stored, or deleted.

What is SQL?

SQL stands for “Structured Query Language” and it is a programming language that is used to create, maintain, and retrieve relational databases. It is critical in accessing, updating, inserting, manipulating and modifying data. It was started in the 1970s.

Areas of Application

## [1] "computer software" "health"            "manufacturing"    
## [4] "transport"         "banking"

SQL can be used in many other database management systems. This is beneficial because it allows flexibility.

Database Management System Integrations

## [1] "MySQL"                "Microsoft SQL Server" "Oracle Database"     
## [4] "dBase"

Author Profile: Francis Onyango

Headshot

Headshot

Data Science Enthusiast | Python | SQL | R | Learning | Blogger @ www.yourdataguy.org

Thoughts

I agree that SQL is an important skill for a data scientist to know since it develops understanding of relational databases. Since SQL can be integrated with Python and R, it is an important tool to have. This means that when someone is working in Python or R, they will be able to interact with the database in the environment. Being able to work with huge datasets is a marketable skill for any company since data is the new gold. I found it useful that the article showed how important SQL is to data scientists. This will make me want to build a strong familiarity with SQL. Basically, it showed how knowing SQL can give you an edge.

Plots

Plot 1 and the Code

orange_plot <- ggplot(Orange, aes(x=age, y=circumference, color = Tree)
             ) + 
            geom_point()

print(orange_plot + ggtitle("Relationship Between Circumference of Orange Trees and Age"))

Plot 2 and the Code

header <- "Relationship between Chicken Weights and Feed"

chicken_data <- ggplot(chickwts, aes(x=feed, y=weight, color = feed)
             ) + 
            geom_point()

print(chicken_data + ggtitle("Relationship Between Chicken Weights and Feed"))