Sppu Dsbda QP Nov - Dec - 2023

This document contains 8 questions on data science and big data analytics. It covers topics such as the data analytics cycle, roles in analytics projects, types of analytics, clustering algorithms, text analysis techniques, performance metrics, and data visualization tools. The questions involve explaining concepts, listing types or steps, performing calculations, and discussing applications.

Uploaded by yakoxap698

Total No. of Questions : 8]                         SEAT No.
P-7545                                              [Total No. of Pages : 3

[6180]-53
T.E. (Computer Engineering)
DATA SCIENCE AND BIG DATA ANALYTICS
(2019 Pattern) (Semester - II) (310251)

Time : 2½ Hours]                                    [Max. Marks : 70
Instructions to the candidates :
1) Answer Q1 or Q2, Q3 or Q4, Q5 or Q6, Q7 or Q8.
2) Neat diagrams must be drawn wherever necessary.
3) Figures to the right side indicate full marks.
4) Assume suitable data if necessary.
5) Use of Scientific calculator is permitted.

Q1) a) Explain the Data Analytics Cycle and its phases with a suitable diagram. [8]
    b) List and explain the various activities involved in identifying potential data resources as part of the discovery phase of the Data Analytics Life Cycle. [9]

OR

Q2) a) List and explain the key roles for a successful analytics project. [8]
    b) Write short notes on : [9]
       i) Common Tools for Model Building
       ii) Model Selection for Data Analytics

Q3) a) List and explain the various types of analytics in Big Data. [9]
    b) Calculate the support and confidence values for all the possible itemsets. [9]

       Transaction ID    Items bought
       1                 Onion, Potato, Cold Drink
       2                 Onion, Burger, Cold Drink
       3                 Eggs, Onion, Cold Drink
       4                 Potato, Milk, Eggs
       5                 Potato, Burger, Cold Drink, Milk, Eggs
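
The support and confidence calculation asked for in Q3 b) can be sketched as below. This is an illustrative sketch only, not a model answer: it assumes the usual definitions (support of an itemset = fraction of transactions containing it; confidence of X -> Y = count(X and Y) / count(X)), and the helper names `count`, `support` and `confidence` are hypothetical.

```python
from itertools import combinations

# The five transactions from the Q3 b) table.
transactions = [
    {"Onion", "Potato", "Cold Drink"},
    {"Onion", "Burger", "Cold Drink"},
    {"Eggs", "Onion", "Cold Drink"},
    {"Potato", "Milk", "Eggs"},
    {"Potato", "Burger", "Cold Drink", "Milk", "Eggs"},
]

def count(itemset):
    """Number of transactions that contain every item of the itemset."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)

def support(itemset):
    """Fraction of transactions that contain the itemset."""
    return count(itemset) / len(transactions)

def confidence(antecedent, consequent):
    """Confidence of the rule antecedent -> consequent.

    Assumes the antecedent occurs in at least one transaction.
    """
    both = set(antecedent) | set(consequent)
    return count(both) / count(antecedent)

# Support of every itemset that occurs in at least one transaction.
items = sorted(set().union(*transactions))
supports = {}
for size in range(1, len(items) + 1):
    for combo in combinations(items, size):
        s = support(combo)
        if s > 0:
            supports[combo] = s

for combo, s in supports.items():
    print(combo, s)

# Confidence of the two rules between Onion and Cold Drink.
print(confidence({"Onion"}, {"Cold Drink"}))
print(confidence({"Cold Drink"}, {"Onion"}))
```

For example, {Onion, Cold Drink} appears in transactions 1, 2 and 3, so its support is 3/5; Onion appears in the same three transactions, so the confidence of Onion -> Cold Drink is 3/3, while Cold Drink appears in four transactions, so the confidence of Cold Drink -> Onion is 3/4.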

OR
Q4) a) Explain the need for logistic regression along with its various types. [9]
    b) Explain the following terms with suitable examples. [9]
       i) Removing duplicates from a dataset
       ii) Handling missing data
Q5) a) Suppose the task is to cluster the following points (with (x, y) representing location) into three clusters: A1(2, 10), A2(2, 5), A3(8, 4), B1(5, 8), B2(7, 5), B3(6, 4), C1(1, 2), C2(4, 9). The distance function is Euclidean distance. Suppose initially we assign A1, B1 and C1 as the center of each cluster, respectively. Use the k-means algorithm to show only the first round of execution with the cluster centers. [8]
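
One round of k-means on the Q5 a) data can be sketched as below. This is a minimal illustration, not a model answer: it assumes the standard procedure (assign each point to the nearest center by Euclidean distance, then recompute each center as the mean of its members), and the cluster names `cluster1` to `cluster3` and the helpers `assign` and `recompute` are hypothetical.

```python
import math

# Points and initial centers exactly as given in Q5 a).
points = {
    "A1": (2, 10), "A2": (2, 5), "A3": (8, 4),
    "B1": (5, 8), "B2": (7, 5), "B3": (6, 4),
    "C1": (1, 2), "C2": (4, 9),
}
centers = {
    "cluster1": points["A1"],
    "cluster2": points["B1"],
    "cluster3": points["C1"],
}

def euclidean(p, q):
    """Euclidean distance between two 2-D points."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def assign(points, centers):
    """Assignment step: attach each point to its nearest center."""
    clusters = {name: [] for name in centers}
    for label, p in points.items():
        nearest = min(centers, key=lambda name: euclidean(p, centers[name]))
        clusters[nearest].append(label)
    return clusters

def recompute(points, clusters):
    """Update step: each new center is the mean of its members.

    Assumes every cluster has at least one member, which holds here
    because each initial center is itself one of the points.
    """
    new_centers = {}
    for name, members in clusters.items():
        xs = [points[m][0] for m in members]
        ys = [points[m][1] for m in members]
        new_centers[name] = (sum(xs) / len(xs), sum(ys) / len(ys))
    return new_centers

clusters = assign(points, centers)
new_centers = recompute(points, clusters)
print(clusters)
print(new_centers)
```

After this first round the clusters are {A1}, {A3, B1, B2, B3, C2} and {A2, C1}, with updated centers (2, 10), (6, 6) and (1.5, 3.5).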

    b) Explain the following Text Analysis steps with suitable examples. [9]
       i) Part-of-speech (POS) tagging
       ii) Lemmatization

OR

Q6) a) Given the confusion matrix, calculate Accuracy, Precision, Recall and Error rate with description on Diabetic Risk. [8]

                                        Predicted classes
       Classes                          Diabetic Risk - Yes    Diabetic Risk - No
       Actual     Diabetic Risk - Yes   90                     210
       classes    Diabetic Risk - No    140                    9560
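
The four metrics asked for in Q6 a) can be computed as sketched below. This is an illustration, not a model answer: it assumes the table is read with rows as actual classes and columns as predicted classes, treats "Diabetic Risk - Yes" as the positive class, and uses the common definitions of each metric. The variable names are hypothetical.

```python
# Cell values as read from the Q6 a) table (rows = actual, columns = predicted).
tp = 90     # actual Yes, predicted Yes (true positives)
fn = 210    # actual Yes, predicted No  (false negatives)
fp = 140    # actual No,  predicted Yes (false positives)
tn = 9560   # actual No,  predicted No  (true negatives)

total = tp + fn + fp + tn

accuracy = (tp + tn) / total      # share of all cases classified correctly
precision = tp / (tp + fp)        # share of predicted Yes that are actually Yes
recall = tp / (tp + fn)           # share of actual Yes that were predicted Yes
error_rate = (fp + fn) / total    # share of all cases classified wrongly

print(f"Accuracy   : {accuracy:.4f}")
print(f"Precision  : {precision:.4f}")
print(f"Recall     : {recall:.4f}")
print(f"Error rate : {error_rate:.4f}")
```

With these values the total is 10000 cases, giving accuracy 9650/10000 = 0.965, precision 90/230 (about 0.391), recall 90/300 = 0.30 and error rate 350/10000 = 0.035. The high accuracy alongside low recall reflects how few of the cases are actually "Diabetic Risk - Yes".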

b) Explain the Text Preprocessing steps with suitable example. [9]


Q7) a) List a few data visualization tools and discuss any four applications of data visualization along with the use of the various plots with Python/R or a suitable tool. [9]
    b) List the challenges of Data Visualization. Explain the types of visualization with examples. [9]

OR
Q8) a) Explain in detail the Hadoop Ecosystem with suitable diagram along with the various components. [9]
    b) Write a short note on the following. [9]
       a) MapReduce
       b) Pig

