AND DATA VISUALIZATION
Course Code- INT 350
Continuous Assessment-II
Important Guidelines:
Rank the StudentName values based on StudentMarks; the rank is to be stored in a new column,
StudentRank. [5 Marks]
3. Using your own dataset, create the following plots in Python:
a. Histogram
b. Bar chart
c. Heatmap
[5 Marks]
4. Using your own dataset, perform data cleaning and data handling. [5 Marks]
5. Draw a boxplot in Python to detect outliers, using your own dataset. [5 Marks]
6. Explain the necessity of data visualization. [5 Marks]
SQL AND DATA VISUALISATION (INT-351) CA-2
Set-2
1. Create the table given below.
In the table above, apply the RANK() function and add 3 more students: Peter, Bob and Kim.
[5 Marks]
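A minimal sketch of how RANK() could be applied, run through Python's built-in sqlite3 module. The table referred to in the question is not reproduced here, so the table name `students`, the column names (borrowed from the StudentName / StudentMarks wording used elsewhere in this paper) and all marks are assumptions for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema; the original table is not shown in the paper.
conn.execute("CREATE TABLE students (StudentName TEXT, StudentMarks INTEGER)")
# Only the names come from the question; the marks are made up.
conn.executemany(
    "INSERT INTO students VALUES (?, ?)",
    [("Peter", 85), ("Bob", 92), ("Kim", 85)],
)

# RANK() gives tied marks the same rank and then skips the next rank.
rows = conn.execute(
    """
    SELECT StudentName,
           StudentMarks,
           RANK() OVER (ORDER BY StudentMarks DESC) AS StudentRank
    FROM students
    ORDER BY StudentRank, StudentName
    """
).fetchall()

for row in rows:
    print(row)
```

With these sample marks, Kim and Peter tie, so both receive rank 2 and no student would receive rank 3.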
2. Create a scatter plot and a pair plot in Python using your own dataset. [5 Marks]
3. Distinguish between clustered and non-clustered indexing. [5 Marks]
4. Create a bar graph and a line chart in Python using your own dataset for data visualization.
[5 Marks]
5. Analyse the outliers in your dataset using a boxplot. [5 Marks]
6. Create a bar chart and a stacked bar chart using your own dataset. [5 Marks]
[5 marks]
[5 marks]
4 Aman 10
7 Raghav 11
8 Sameer 12
11 Science
12 Biology
[5 marks]
4. Choose a dataset of your choice and create the following visualizations:
a) Box plot
b) Scatter plot
c) Heat map
d) Line plot
[5 marks]
5. Using pair plots and heat maps, perform multivariate and bivariate analysis,
respectively. [5 marks]
6. Describe in detail:
a) Distribution plot
b) The difference between bar and stacked bar charts
[5 marks]
Using these tables, write a SQL query to return the top 3 'online' orders and their
associated locations based on revenue generated. [5 marks]
year total_sale
2015 23000
2016 25000
2017 34000
2018 32000
2019 33000
Use lag() and lead() function to compare annual sale amounts across years. [5 Marks]
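One way the comparison could be sketched, using Python's built-in sqlite3 module and the year/total_sale figures listed above. The table name `annual_sales` and the output column aliases are assumptions, not names given in the question.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table name; the rows are the figures listed in the question.
conn.execute("CREATE TABLE annual_sales (year INTEGER, total_sale INTEGER)")
conn.executemany(
    "INSERT INTO annual_sales VALUES (?, ?)",
    [(2015, 23000), (2016, 25000), (2017, 34000), (2018, 32000), (2019, 33000)],
)

# LAG() looks one row back and LEAD() one row ahead within the ordered window.
rows = conn.execute(
    """
    SELECT year,
           total_sale,
           LAG(total_sale)  OVER (ORDER BY year) AS previous_year_sale,
           LEAD(total_sale) OVER (ORDER BY year) AS next_year_sale,
           total_sale - LAG(total_sale) OVER (ORDER BY year) AS change_from_previous
    FROM annual_sales
    ORDER BY year
    """
).fetchall()

for row in rows:
    print(row)
```

The first year has no previous row and the last year has no next row, so LAG() and LEAD() return NULL (None in Python) there.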
5. Using any dataset of your choice perform bivariate analysis and interpret each graph –
a) Line Charts
b) Bar Graph
c) Box plots
[5 Marks]
6. Briefly discuss stacked bar graphs with an example. [5 Marks]
1. Write an SQL query that reports the best seller by total sales price. If there is a tie,
report them all.
[5 Marks]
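A possible approach to the tie-aware best-seller query, sketched with Python's sqlite3 module. The Sales table layout is not shown in this paper, so the columns `seller_id` and `price` and the sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical columns and data; sellers 1 and 3 both total 2800.
conn.execute("CREATE TABLE Sales (seller_id INTEGER, price INTEGER)")
conn.executemany(
    "INSERT INTO Sales VALUES (?, ?)",
    [(1, 2000), (1, 800), (2, 800), (3, 2800)],
)

# Compare each seller's total with the largest total, so every tied seller is kept.
rows = conn.execute(
    """
    SELECT seller_id
    FROM Sales
    GROUP BY seller_id
    HAVING SUM(price) = (
        SELECT MAX(total)
        FROM (SELECT SUM(price) AS total FROM Sales GROUP BY seller_id)
    )
    ORDER BY seller_id
    """
).fetchall()

print(rows)
```

Using ORDER BY ... LIMIT 1 instead would return only one seller, which is why the sketch compares against the maximum total.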
2. Write an SQL query that reports the buyers who have bought S8 but not iPhone. Note
that S8 and iPhone are products present in the Product table.
[5 Marks]
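The "bought one product but not another" pattern could be sketched as follows with Python's sqlite3 module. Only the Product table name and the product names S8 and iPhone come from the question; the columns `product_id`, `product_name` and `buyer_id`, the Sales layout and all rows are assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schemas and sample rows.
conn.execute("CREATE TABLE Product (product_id INTEGER, product_name TEXT)")
conn.execute("CREATE TABLE Sales (buyer_id INTEGER, product_id INTEGER)")
conn.executemany(
    "INSERT INTO Product VALUES (?, ?)", [(1, "S8"), (2, "G4"), (3, "iPhone")]
)
conn.executemany(
    "INSERT INTO Sales VALUES (?, ?)", [(1, 1), (2, 2), (3, 1), (3, 3)]
)

# Count S8 and iPhone purchases per buyer, then keep buyers with S8 > 0 and iPhone = 0.
rows = conn.execute(
    """
    SELECT s.buyer_id
    FROM Sales AS s
    JOIN Product AS p ON p.product_id = s.product_id
    GROUP BY s.buyer_id
    HAVING SUM(CASE WHEN p.product_name = 'S8' THEN 1 ELSE 0 END) > 0
       AND SUM(CASE WHEN p.product_name = 'iPhone' THEN 1 ELSE 0 END) = 0
    ORDER BY s.buyer_id
    """
).fetchall()

print(rows)
```

The same conditional-count idea would carry over to similar questions, such as retailers who bought one item but not another.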
3. Write an SQL query that reports the products that were only sold in spring 2019,
that is, between 2019-01-01 and 2019-03-31 inclusive.
[5 Marks]
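A sketch of the "only sold in this date range" check using Python's sqlite3 module. The columns `product_id` and `sale_date` and the sample rows are hypothetical; dates are stored as ISO text (YYYY-MM-DD), which compares correctly as strings.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical columns and rows; dates are ISO-formatted text.
conn.execute("CREATE TABLE Sales (product_id INTEGER, sale_date TEXT)")
conn.executemany(
    "INSERT INTO Sales VALUES (?, ?)",
    [(1, "2019-01-21"), (2, "2019-02-17"), (2, "2019-06-02"), (3, "2019-05-13")],
)

# A product qualifies only if its earliest and latest sales both fall in the range.
rows = conn.execute(
    """
    SELECT product_id
    FROM Sales
    GROUP BY product_id
    HAVING MIN(sale_date) >= '2019-01-01'
       AND MAX(sale_date) <= '2019-03-31'
    ORDER BY product_id
    """
).fetchall()

print(rows)
```

Product 2 is excluded here because one of its sales falls in June, even though another falls inside the range.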
4. From the following dataframe, write a SQL query to classify the values as even or odd.
Return "Even" for even numbers and "Odd" for odd numbers.
[5 Marks]
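The even/odd labelling could be sketched with a CASE expression and the modulo operator, run here through Python's sqlite3 module. The dataframe from the question is not reproduced in this paper, so the table name `numbers`, the column `value` and the sample values are assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table and values.
conn.execute("CREATE TABLE numbers (value INTEGER)")
conn.executemany("INSERT INTO numbers VALUES (?)", [(7,), (10,), (0,), (13,)])

# value % 2 is 0 exactly when the value is even.
rows = conn.execute(
    """
    SELECT value,
           CASE WHEN value % 2 = 0 THEN 'Even' ELSE 'Odd' END AS parity
    FROM numbers
    ORDER BY value
    """
).fetchall()

for row in rows:
    print(row)
```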
5. Write an SQL query that returns all product names of the products in the Sales table
along with their selling year and price.
[5 Marks]
6. Write an SQL query that returns the total quantity sold for every product id.
[5 Marks]
SQL AND DATA VISUALISATION (INT-351) CA-2
SET – 8
1. From the above dataframes, write a SQL query to find those retailers who have
bought 'key board' but not 'mouse'. Return retailer ID.
[5 Marks]
2. From the following dataframe, write a SQL query to display those items that were
only sold in the 2nd quarter of 2020, i.e. from April 1st to the end of June.
Return item code and item description.
[5 Marks]
3. From the following dataframe, write a SQL query to find the highest purchase with its
corresponding item for each customer. If several purchases share the same quantity, return
the smallest item code.
Sort the output by increasing customer_id. Return customer ID, lowest item code and
purchase quantity.
[5 Marks]
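A sketch of the per-customer maximum with a tie-break on item code, using ROW_NUMBER() through Python's sqlite3 module. The dataframe is not shown in this paper, so the table name `purchases`, the columns `item_code` and `purchase_quantity`, and the rows are hypothetical; only customer_id is named in the question.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table, columns and rows.
conn.execute(
    "CREATE TABLE purchases (customer_id INTEGER, item_code TEXT, purchase_quantity INTEGER)"
)
conn.executemany(
    "INSERT INTO purchases VALUES (?, ?, ?)",
    [
        (101, "I003", 10),
        (101, "I001", 10),
        (101, "I002", 4),
        (102, "I005", 7),
        (102, "I004", 3),
    ],
)

# Number each customer's purchases: largest quantity first, smallest item code on ties.
rows = conn.execute(
    """
    SELECT customer_id, item_code, purchase_quantity
    FROM (
        SELECT customer_id, item_code, purchase_quantity,
               ROW_NUMBER() OVER (
                   PARTITION BY customer_id
                   ORDER BY purchase_quantity DESC, item_code ASC
               ) AS rn
        FROM purchases
    )
    WHERE rn = 1
    ORDER BY customer_id
    """
).fetchall()

for row in rows:
    print(row)
```

Customer 101 has two purchases of quantity 10, so the smaller item code, I001, is the one kept.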
4. From the following dataframes, write a SQL query to display those managers who have
average experience for each scheme.
[5 Marks]
5. From the above dataframes, write a SQL query to find the schemes that are executed
by the minimum number of employees. Return scheme code.
[5 Marks]
6. From the following dataframes, write a SQL query to find the experienced
managers who execute the schemes. Return scheme code and scheme manager ID.
[5 Marks]
SQL AND DATA VISUALISATION (INT-351) CA-2
Set-9
1. Suppose you're given the following tables called 'orders' and 'order_info'. The table
'orders' shows revenue values for unique orders along with the associated channel ('online'
or 'in_store') while the table 'order_info' shows the order's ID along with its location.
Using these tables, write a SQL query to return the top 3 'online' orders and their
associated locations based on revenue generated. [5 marks]
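A minimal sketch of the top-3 query, run through Python's sqlite3 module. The table names `orders` and `order_info` and the channel values come from the question; the column names `order_id`, `channel`, `revenue` and `location`, and every row, are assumptions because the tables themselves are not reproduced in this paper.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical columns and sample rows.
conn.execute("CREATE TABLE orders (order_id INTEGER, channel TEXT, revenue INTEGER)")
conn.execute("CREATE TABLE order_info (order_id INTEGER, location TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        (1, "online", 100),
        (2, "in_store", 500),
        (3, "online", 300),
        (4, "online", 250),
        (5, "online", 50),
    ],
)
conn.executemany(
    "INSERT INTO order_info VALUES (?, ?)",
    [(1, "North"), (2, "South"), (3, "East"), (4, "West"), (5, "North")],
)

# Keep online orders only, attach each order's location, and take the 3 largest revenues.
rows = conn.execute(
    """
    SELECT o.order_id, o.revenue, i.location
    FROM orders AS o
    JOIN order_info AS i ON i.order_id = o.order_id
    WHERE o.channel = 'online'
    ORDER BY o.revenue DESC
    LIMIT 3
    """
).fetchall()

for row in rows:
    print(row)
```

The in_store order has the highest revenue in this sample but is filtered out by the WHERE clause before the ranking happens.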
year total_sale
2015 23000
2016 25000
2017 34000
2018 32000
2019 33000
Use lag() and lead() function to compare annual sale amounts across years. [5 Marks]
4. Take any dataset of your choice and perform outlier analysis using boxplots. Write the
interpretation of the graph. [5 Marks]
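When interpreting a boxplot, it can help to compute the same quantities the plot draws. The sketch below uses only Python's standard library and applies the common 1.5 × IQR rule for flagging outliers; the sample values are made up, and plotting libraries may compute quartiles with a slightly different interpolation method.

```python
from statistics import quantiles

# Hypothetical sample; 102 is planted as an obvious outlier.
values = [10, 12, 12, 13, 12, 11, 14, 13, 15, 102]

# method="inclusive" interpolates linearly between the sorted data points.
q1, median, q3 = quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Points beyond 1.5 * IQR from the quartiles are treated as outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [v for v in values if v < lower_fence or v > upper_fence]

print(f"Q1={q1}, median={median}, Q3={q3}, IQR={iqr}")
print(f"fences=({lower_fence}, {upper_fence}), outliers={outliers}")
```

In a boxplot of this sample, the box would span Q1 to Q3, and the value 102 would appear as an isolated point above the upper whisker.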
5. Using any dataset of your choice perform bivariate analysis and interpret each graph –
a) Line Charts
b) Bar Graph
c) Box plots
[5 Marks]
6. Briefly discuss stacked bar graphs with an example. [5 Marks]
SQL AND DATA VISUALISATION (INT-351) CA-2
Set-10
1. Given the 'orders' and 'order_info' tables, write a SQL query to find the total revenue
generated from 'in_store' orders. [5 marks]
Table: employee_salaries
5. Using a dataset of your choice, create a scatter plot to visualize the relationship
between two variables. Interpret the scatter plot and discuss any correlation you
observe. [5 Marks]
6. Describe the key differences between a bar chart and a histogram in data
visualization. Provide scenarios where each type of chart is more appropriate. [5 Marks]
1. Given the 'orders' and 'order_info' tables, write a SQL query to find the order with
the highest revenue and its associated location, regardless of the channel. [5 marks]
Table: customer_feedback
4. Discuss the concept of data preprocessing in the context of machine learning. Why is it
important, and what are some common preprocessing steps? [5 Marks]
5. Use Python's Seaborn library to create a heatmap to visualize the correlation matrix
of variables in a dataset. Interpret the heatmap and explain the relationships between
variables. [5 Marks]
6. Compare and contrast a line chart and a scatter plot in data visualization. Provide
examples of scenarios where each type of chart is more suitable. [5 Marks]
1. Given the 'orders' and 'order_info' tables, write a SQL query to find the total revenue
generated from 'online' orders. [5 marks]
2. Consider a table 'product_inventory' with the following structure:
Table: product_inventory
5. Using a dataset of your choice, create a bar chart to compare the sales performance of
different product categories. Interpret the chart and discuss any trends or insights.
[5 Marks]
6. Define and describe the use cases for box plots in data visualization. Provide an
example dataset where a box plot would be particularly informative. [5 Marks]
SerialNo  RegistrationNumber  Name                            RollNumber  Status  Set no
1         12113296            Surya Pratap Singh              RK21UGA01   P       1
2         12113435            Garikipati Satvik               RK21UGA02   P       2
3         12113072            Aniket Sharma                   RK21UGA03   P       3
4         12113048            Nitin Kumar Mundhra             RK21UGA04   P       4
5         12112308            Shaik Ismail                    RK21UGA05   P       5
6         12112090            Aravind Kontham                 RK21UGA06   P       6
7         12112233            Ankit Yadav                     RK21UGA07   P       7
8         12111860            Chettupalli Rohan Apuroop       RK21UGA08   P       8
9         12111835            Mallekedi Shrivardhan           RK21UGA09   P       9
10        12111599            Badugu Naveen Babu              RK21UGA10   P       10
11        12111519            Maddipatla Durga Prasad         RK21UGA11   P       11
12        12110761            Manish Chetla                   RK21UGA12   P       12
13        12113626            Anuj Shrivatri                  RK21UGA13   P       1
14        12100958            Ansh Chourasia                  RK21UGA14   P       2
15        12109373            Mabel Blossom                   RK21UGA15   P       3
16        12108517            Badam Venkata Suyameendra       RK21UGA16   P       4
17        12107223            Kurukuri Krishna Vamsi          RK21UGA17   P       5
18        12106498            Jhulesh Swami                   RK21UGA18   P       6
19        12106435            Chanchal Alam                   RK21UGA19   P       7
20        12107073            Garadala Mokshith Sai           RK21UGA20   P       8
21        12105160            Gopal Gupta                     RK21UGA21   P       9
22        12105231            Vaggu Arun                      RK21UGA22   P       10
23        12105513            Ankit Siwach                    RK21UGA23   P       11
24        12101448            Tanu Verma                      RK21UGA24   P       12
25        12101273            Chikkam Chiranjeevi Sai Murari  RK21UGA25   P       1
26        12103123            Jandhyala Sai Kashyap           RK21UGA26   P       2
27        12114271            Manchigarla Devadas             RK21UGA27   P       3
28        12115067            Khushi Gupta                    RK21UGA28   P       4
29        12114925            Sachin Kumar                    RK21UGA29   P       5
30        12114795            Rohit Kumar                     RK21UGA30   P       6
31        12114691            Atishay Chugh                   RK21UGA31   P       7
32        12116676            Kartik Saxena                   RK21UGA32   P       8
33        12103357            Ayush Raj Singh                 RK21UGB33   P       9
34        12101119            Vaibhav Kohli                   RK21UGB34   P       10
35        12101377            Nikhil Mahana                   RK21UGB35   P       11
36        12102363            Kaki Himanth                    RK21UGB36   P       12
37        12105702            Gaurav Kumar                    RK21UGB37   P       1
38        12106222            Komal Pasumarthy                RK21UGB38   P       2
39        12104835            Pentakota Srinu                 RK21UGB39   P       3
40        12105016            Shravani Deshpande              RK21UGB40   P       4
41        12106923            Uday Singh Senger               RK21UGB41   P       5
42        12108642            Aditya Parashar                 RK21UGB42   P       6
43        12108377            Harsha Vardhan Naidu Nalli      RK21UGB43   P       7
44        12109308            Simran Rathor                   RK21UGB44   P       8
45        12109290            Madrasu Venkata Manikanta       RK21UGB45   P       9
46        12109200            Vemula Sahith Reddy             RK21UGB46   P       10
47        12109553            Neelarapu Sanjay                RK21UGB47   P       11
48        12015922            Piyush Kumar                    RK21UGB48   P       12
49        12113608            Adhwaith S                      RK21UGB49   P       1
50        12113531            Rayan Rafeek                    RK21UGB50   P       2
51        12110942            Yanamaddi Vamshi                RK21UGB51   P       3
52        12111005            Koppula Jaya Vardhan Reddy      RK21UGB52   P       4
53        12110554            Shaik Julfeen Ahmadh            RK21UGB53   P       5
54        12110600            Santhosh Kumar S                RK21UGB54   P       6
55        12112064            Kota Venkat Sandeep Reddy       RK21UGB55   P       7
56        12112397            Santosh James Kalyani           RK21UGB56   P       8
57        12114817            Nalla Purushotham Raj           RK21UGB57   P       9
58        12115033            Ashwini Kumar                   RK21UGB58   P       10
59        12115007            Satyam Gupta                    RK21UGB59   P       11
60        12115732            Kondi Manish                    RK21UGB60   P       12
61        12115337            Muhammed Basil CP               RK21UGB61   P       1
62        12115135            Gummala Chandrakanth            RK21UGB62   P       2
63        12115288            Pulkit Narsaria                 RK21UGB63   P       3
64        12115196            Kanishka K R                    RK21UGB64   P       4