SQL for Data Science

A 3 month course which helps apply SQL for data science using real world examples for practice

  • Domain : CSE
Enroll Now View demo

A Quick Overview

SQL for Data Science Course is designed to take students from a beginner level to advanced level in the field of SQL. The course is covered in a manner so that SQL capabilities can be utilized for extensive data exploration. Each week thoroughly covers the syntax for SQL commands as well as a hands-on session on the RDBMS tool. Data science domain deals with data and results in the form of decisions that can be used to boost various businesses. It is an art of extracting meaningful insights from the available data. SQL is used everywhere from small to big companies. It is a declarative language where focus is on doing rather than how it is performed. Integration with programming languages like R, Python is easy with SQL. Playing with a large dataset is clearly possible with SQL. This course will teach basic select commands to complex cursors by taking real life dataset examples.

Moreover in this course, students work on real time datasets such as analysing consumer behaviour from an ecommerce data set as Project 1 and Career options of Alumni database as Project 2. Both projects deal with real time data sets and give students complete scope for exploration and thorough understanding of SQL.


COURSE SYLLABUS

1Introduction

  • Introduction to Data Science
  • Applications of Data Science
  • Why is SQL required for Data Science?
  • Database Management System (DBMS)
  • Relational Database Management System (RDBMS)
  • Basic terminology in RDBMS
  • Data Constraints
  • Entity Relationship Model
  • What is SQL
  • Categories of SQL Commands
  • Hands on executing simple SQL statements on RDBMS tool

2Database Creation, Manipulation

  • Detailed SQL Data types
  • Creating database
  • Create Table
  • Using Constraints
  • Insert Table
  • Altering Table structure
  • Dropping Database and Table
  • Delete, Update
  • Hands on importing sample database schema

3Database Selection

  • Select statement
  • Removing Duplicates
  • Use of Alias
  • Use of Where
  • Use of Wildcards
  • Limit clause
  • Arithmetic Operators
  • Mathematical Functions
  • Hands on creating backups and restore for large databases

4Database Selection

  • Generating Strings
  • String Functions
  • Date Functions
  • Conversion Functions

5Database Selection

  • Comparison Operators
  • Logical Operators
  • Order By
  • Group By
  • Aggregate Functions
  • Using aggregate functions with Group by clause
  • Union Operator
  • Sub-query

6Querying Multiple Tables

  • Need to Join Multiple Tables
  • Cartesian Product
  • Inner Join
  • Left Join
  • Right Join
  • Self Join
  • Delete Join
  • Update Join
  • Hands on Joining more than two tables in sample database

7Data Exploration

  • What is Data Exploration?
  • Structure of Data
  • Understanding E-R Diagram
  • How to Use SQL for Data Exploration
  • Significance of
    • Joins
    • Sub queries
    • Inbuilt functions
  • Other important capabilities of SQL for data exploration
  • Hands On
  • Working with NULL values
  • Making trends in Data
  • Identifying Outliers
  • Creating Data Summary

8Index, View, Transaction

  • Creating Index
  • Use of Index
  • Type of Index and Index Strategies
  • Views
  • Views as Weapon for Data Analysis
  • Multi user database
  • What is Transaction
  • Save points
  • Hands on working on Multi user database environment

9Querying with Conditions

  • Querying with Conditions
  • Searched Case Expression
  • Simple Case Expression
  • Applications of Case Expression
  • Common Error Codes
  • Hands on working with Json type data

10Stored Procedures

  • Stored Procedures as friend for Data Analysis
  • Creating Stored Procedures
  • Removing Stored Procedures
  • Altering Stored Procedures
  • Conditional Statements
  • Loops
  • Hands on working with Cursors

11Integrating SQL with Excel

  • Hands on
  • Accessing MySQL data with MS Excel
  • Running SQL statements with Excel
  • Combining Excel and SQL statements for data representation

12Integrating SQL with Python

  • Hands on
  • Working with Python
  • Accessing SQL data with Python
  • Running basic SQL statements with Python
  • Running inbuilt python functions on SQL data


Projects Overview

Project 1

Highlights

The business model Customer to Customer (C2C) allows customers to do business with each other. This model is growing fast with e-commerce platforms where sellers may be required to pay some amount and buyer can buy it without paying anything. E-Commerce website brings the seller and buyer to the same platform. 

Analyzing the user's database will lead to understanding the business perspective. Behavior of the users can be traced in terms of business with exploration of the user’s database. 

Dataset: One .csv file with name users_data with 98913 rows and 27 columns

Tasks to be performed

  1. Create new schema as ecommerce
  2. Import .csv file users_data into MySQL
  3. Run SQL command to see the structure of table
  4. Run SQL command to select first 100 rows of the database
  5. How many distinct values exist in table for the field "country" and "language"
  6. Check whether male users are having maximum followers or female users.
  7. Calculate the total number of users that
    1. Uses Profile Picture in their Profile
    2. Uses Application for Ecommerce platform
    3. Uses Android app
    4. Uses ios app
  8. Calculate the total number of buyers for each country and sort the result in descending order of total number of buyers.
  9. Calculate the average number of sellers for each country and sort the result in ascending order of total number of sellers.
  10. Display name of top 10 countries having maximum products pass rate.
  11. Calculate the number of users on an ecommerce platform for different language choices.
  12. Check the choice of female users about putting the product in a wishlist or to like socially on an ecommerce platform.
  13. Check the choice of male users about being seller or buyer.
  14. Which country is having maximum number of buyers?
  15. List the name of 10 countries having zero number of sellers.
  16. Display record of top 110 users who have used ecommerce platform recently.
  17. Calculate the number of female users those who have not logged in since last 100 days.
  18. Display the number of female users of each country at ecommerce platform.
  19. Display the number of male users of each country at ecommerce platform.
  20. Calculate the average number of products sold and bought on ecommerce platform by male users for each country.

 

Project 2

Highlights

Once a student is passed out from a Institute or College, he/she is known as Alumni of the Institute. Alumni’s career growth plays important role in Institute’s ranking and other networking activities. In this project, career choices of alumni of two Universities will be analyzed with respect to their passing year as well as the course they completed. 

Dataset: Six .csv file (Alumni record of College A and College B) Higher Studies, Self Employed and Service/Job record 

  • College_A_HS ~ Higher Studies Record of College A
  • College_A_SE ~ Self Employed Record of College A
  • College_A_SJ ~ Service/Job Record of College A
  • College_B_HS ~ Higher Studies Record of College B
  • College_B_SE ~ Higher Studies Record of College B
  • College_B_SJ ~ Higher Studies Record of College B

Tasks to be performed

  1. Create new schema as alumni
  2. Import all .csv files into MySQL
  3. Run SQL command to see the structure of six tables
  4. Display first 1000 rows of tables (College_A_HS, College_A_SE, College_A_SJ, College_B_HS, College_B_SE, College_B_SJ) with Python.
  5. Import first 1500 rows of tables (College_A_HS, College_A_SE, College_A_SJ, College_B_HS, College_B_SE, College_B_SJ) into MS Excel.
  6. Perform data cleaning on table College_A_HS and store cleaned data in view College_A_HS_V.
  7. Perform data cleaning on table College_A_SE and store cleaned data in view College_A_SE_V.
  8. Perform data cleaning on table College_A_SJ and store cleaned data in view College_A_SJ_V.
  9. Perform data cleaning on table College_B_HS and store cleaned data in view College_B_HS_V.
  10. Perform data cleaning on table College_B_SE and store cleaned data in view College_B_SE_V.
  11. Perform data cleaning on table College_B_SJ and store cleaned data in view College_B_SJ_V.
  12. Make procedure to use string function/s for converting record of Name, FatherName, MotherName into lower case for views (College_A_HS_V, College_A_SE_V, College_A_SJ_V, College_B_HS_V, College_B_SE_V, College_B_SJ_V) 
  13. Import the created views (College_A_HS_V, College_A_SE_V, College_A_SJ_V, College_B_HS_V, College_B_SE_V, College_B_SJ_V) into MS Excel and make pivot chart for location of Alumni. 
  14. Make data summary for all six views (College_A_HS_V, College_A_SE_V, College_A_SJ_V, College_B_HS_V, College_B_SE_V, College_B_SJ_V) for demographic analysis through SQL. 
  15. Make data summary for views (College_A_SJ_V, College_B_SJ_V) for company choices of Alumni through SQL. 
  16. Make data summary for the decision of being entrepreneur of Alumni of particular domain through SQL. 
  17. Analyze the choice of course after being graduated from College for College A and College B through SQL. (Use the cleaned data only)
  18. Analyze the choice of Institute after being graduated from College for College A and College B through SQL (Use the cleaned data only)
  19. Analyze the choice of higher studies in India and abroad for College A and College B through SQL (Use the cleaned data only)
  20. Calculate the percentage of career choice of College A and College B Alumni (w.r.t Higher Studies, Self Employed and Service/Job) 




Flexible Course Fees

Choose the plan that’s right for you

Basic

2 Months Access

$94.99

Per month for 3 months

  • Access Duration : 2 Months
  • Mode of Delivery : Online
  • Project Portfolio : Available
  • Certification : Available
  • Email Support : Available
  • Forum Support : Available
Premium

Lifetime Access

$203.55

Per month for 3 months

  • Access Duration : Lifetime
  • Mode of Delivery : Online
  • Project Portfolio : Available
  • Certification : Available
  • Individual Video Support : 12/ Month
  • Group Video Support : 12/ Month
  • Email Support : Available
  • Forum Support : Available
  • Telephone Support : Available
  • Dedicated Support Engineer : Available

WHO IS THIS COURSE FOR ?


SOFTWARE COVERED


Testimonials

Companies hire from us

See all

Certification

  • Top 5% of the class will get a merit certificate
  • Course completion certificates will be provided to all students
  • Build a professional portfolio
  • Automatically link your technical projects
  • E-verified profile that can be shared on LinkedIn

SKILL LYNC WORKS TO GET YOU A JOB

See all

Frequently Asked Questions

1Who can take your course?

Students are required to have basic knowledge of database management systems.

2What is included in your course?

Learning SQL for data science field, relational database creation, storage, querying, analyzing, splitting and merging, mathematical and other functions required for data manipulation of hands on real world data.

3What will the student gain from your course?

  • Creating, inserting, updating and querying of relational database
  • Methods to understand data patterns
  • Data Cleaning and Computation
  • Using SQL for Data Analysis & Data Visualization

4What software skills are you teaching and how well are these tools used in the industry?

SQL, Relational Database tool, these tools are widely used in industry for database storage.

5What real world application for the tools and techniques will you teach in this course?

  • RDBMS tools are extensively used in Organizations for managing different types of data
  • The course will cover hands on examples with real world data

6Which companies use these techniques and for what?

SQL is standard for storing relational databases in all large enterprises irrespective of its domain. Few big names are Amazon, Flipkart, Microsoft, Google , Ola, Zomato, Paytm

7How is your course going to help me in my path to MS or PhD?

This course will cover SQL basics and hands on with a large dataset for data analysis. Students interested for doing specialization in Data Analytics, Data Science or related fields will get good knowledge of SQL for strong concept building.

8How is this course going to help me get a job?

SQL experts are in very much demand and industry is always looking for the right talent.


You Might Also Be Interested In

Related Courses

See all

The Skill-Lync Advantage

See all