SQL for Data Science

As data collection has increased exponentially, so has the need for people skilled at using and interacting with data; to be able to think critically, and provide insights to make better decisions and optimize their businesses. This is a data scientist, “part mathematician, part computer scientist, and part trend spotter” (SAS Institute, Inc.). According to Glassdoor, being a data scientist is the best job in America; with a median base salary of $110,000 and thousands of job openings at a time. The skills necessary to be a good data scientist include being able to retrieve and work with data, and to do that you need to be well versed in SQL, the standard language for communicating with database systems.

 

Coursera Plus banner featuring three learners on a blue background.

Week 1:Getting Started and Selecting & Retrieving Data with SQL

In this module, you will be able to define SQL and discuss how SQL differs from other computer languages. You will be able to compare and contrast the roles of a database administrator and a data scientist, and explain the differences between one-to-one, one-to-many, and many-to-many relationships with databases. You will be able to use the SELECT statement and talk about some basic syntax rules. You will be able to add comments in your code and synthesize its importance.

Week 2: Filtering, Sorting, and Calculating Data with SQL

In this module, you will be able to use several more new clauses and operators including WHERE, BETWEEN, IN, OR, NOT, LIKE, ORDER BY, and GROUP BY. You will be able to use the wildcard function to search for more specific or parts of records, including their advantages and disadvantages, and how best to use them. You will be able to discuss how to use basic math operators, as well as aggregate functions like AVERAGE, COUNT, MAX, MIN, and others to begin analyzing our data.

Week 3: Subqueries and Joins in SQL

In this module, you will be able to discuss subqueries, including their advantages and disadvantages, and when to use them. You will be able to recall the concept of a key field and discuss how these help us link data together with JOINs. You will be able to identify and define several types of JOINs, including the Cartesian join, an inner join, left and right joins, full outer joins, and a self join. You will be able to use aliases and pre-qualifiers to make your SQL code cleaner and efficient.

Week 4:Modifying and Analyzing Data with SQL

In this module, you will be able to discuss how to modify strings by concatenating, trimming, changing the case, and using the substring function. You will be able to discuss the date and time strings specifically. You will be able to use case statements and finish this module by discussing data governance and profiling. You will also be able to apply fundamental principles when using SQL for data science. You’ll be able to use tips and tricks to apply SQL in a data science context.

Sadie St. Lawrence, Instructor

AI Strategy Consultant for Accenture Applied Intelligence

 

Ready to get started?

Learn More or Enroll Now

Data science continues to evolve and grow, and whether a learner is looking to break into the field or brush up on skills, Coursera has courses for every level.  Here is the list of the top 10 data science courses to help you find the right content for your goals.

Top 10 Data Science Courses

  1. Google Data Analytics Professional Certificate
  2. IBM Data Science Professional Certificate
  3. Python for Everybody from the University of Michigan
  4. Machine Learning from Stanford University
  5. Learn SQL Basics for Data Science from UC Davis
  6. Deep Learning from DeepLearning.AI
  7. DeepLearning.AI TensorFlow Developer Professional Certificate
  8. Natural Language Processing from DeepLearning.AI
  9. Data Visualization with Tableau from UC Davis
  10. Generative Adversarial Networks (GANs) from DeepLearning.AI
Coursera Learner working on a presentation with Coursera logo and