Quiz 4 - Data Preparation

This document is a quiz on data preparation techniques. It covers topics like data quality issues, imputing missing data, outliers, feature selection, and principal component analysis (PCA). The questions ask about identifying data quality issues, replacing missing values, defining outliers, using domain knowledge to address issues, feature selection methods, the goal of feature sets, properties of zero-normalized data, and statements that are not true about PCA.

Uploaded by Mr.Padmanaban V

1. Which of the following is NOT a data quality issue?

- Inconsistent data
- Scaled data
- Missing values
- Duplicate data

2. Imputing missing data means to

- replace missing values with something reasonable.
- drop samples with missing values.
- replace missing values with outliers.
- merge samples with missing values.
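
For readers reviewing the concept, here is a minimal sketch of one common imputation strategy, filling gaps with the mean of the observed values. The helper name `impute_with_mean` and the sample `ages` list are assumptions made up for illustration; they are not part of the quiz.

```python
from statistics import mean

def impute_with_mean(values):
    """Return a copy of values with None entries replaced by the observed mean.

    Hypothetical helper for illustration; None stands in for a missing value.
    """
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute: no observed values")
    fill = mean(observed)
    return [fill if v is None else v for v in values]

# Made-up sample data with two missing entries.
ages = [20.0, None, 30.0, 40.0, None]
imputed_ages = impute_with_mean(ages)
print(imputed_ages)
```

Mean imputation is only one option; the median or a domain-specific default may be more reasonable when the feature is skewed.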

3. A data sample with values that are considerably different from the rest of the data samples in the dataset is called a/an _____________.

- Outlier
- Invalid data
- Noise
- Inconsistent data

4. Which one of the following examples illustrates the use of domain knowledge to address a data quality issue?

- Simply discard the samples that lie significantly outside the distribution of your data
- Drop samples with missing values
- Merge duplicate records while retaining relevant data
- None of these

5. Which of the following is NOT an example of feature selection?

- Adding an in-state feature based on an applicant's home state.
- Re-formatting an address field into separate street address, city, state, and zip code fields.
- Removing a feature with a lot of missing values.
- Replacing a missing value with the variable mean.

6. Which one of the following is the best feature set for your analysis?

- Feature set with the smallest set of features that best capture the characteristics of the data for the intended application
- Feature set with the smallest number of features
- Feature set with the largest number of features
- Feature set that contains exclusively re-coded features

7. The mean value and the standard deviation of a zero-normalized feature are

- mean = 0 and standard deviation = 0
- mean = 1 and standard deviation = 0
- mean = 0 and standard deviation = 1
- mean = 1 and standard deviation = 1
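
The statistics in this question can be checked numerically. The sketch below assumes that "zero-normalized" means z-score standardization (subtract the mean, divide by the standard deviation) and uses the population standard deviation. The function name `zero_normalize` and the `heights_cm` values are made up for illustration.

```python
from statistics import fmean, pstdev

def zero_normalize(values):
    """Z-score each value: (v - mean) / population standard deviation.

    Hypothetical helper for illustration only.
    """
    mu = fmean(values)
    sigma = pstdev(values)
    if sigma == 0:
        raise ValueError("cannot normalize a constant feature")
    return [(v - mu) / sigma for v in values]

# Made-up sample feature.
heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0]
z = zero_normalize(heights_cm)

# Compare these two statistics with the answer options above.
print("mean:", round(fmean(z), 6))
print("std: ", round(pstdev(z), 6))
```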

8. Which of the following is NOT true about PCA?

- PCA stands for principal component analysis
- PC1 and PC2, the first and second principal components, respectively, are always orthogonal to each other.
- PC1, the first principal component, captures the largest amount of variance in the data along a single dimension.
- PCA is a dimensionality reduction technique that removes a feature that is very correlated with another feature.
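
To make the PCA properties above concrete, here is a minimal pure-Python sketch for the two-feature case, where the principal directions have a closed form. It is an illustration under stated assumptions, not a general implementation: the helper `pca_2d`, the use of the population covariance, and the sample `points` are all made up for this example.

```python
import math

def pca_2d(points):
    """Closed-form PCA for two features (hypothetical helper, illustration only)."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    centered = [(x - mx, y - my) for x, y in points]

    # Entries of the 2x2 population covariance matrix [[a, b], [b, c]].
    a = sum(dx * dx for dx, _ in centered) / n
    c = sum(dy * dy for _, dy in centered) / n
    b = sum(dx * dy for dx, dy in centered) / n

    # Angle of the direction that maximizes the projected variance.
    theta = 0.5 * math.atan2(2 * b, a - c)
    pc1 = (math.cos(theta), math.sin(theta))
    pc2 = (-math.sin(theta), math.cos(theta))

    def variance_along(d):
        return a * d[0] ** 2 + 2 * b * d[0] * d[1] + c * d[1] ** 2

    # Reduce to one dimension by projecting each centered point onto PC1.
    scores = [dx * pc1[0] + dy * pc1[1] for dx, dy in centered]
    return pc1, pc2, variance_along(pc1), variance_along(pc2), scores

# Made-up sample with two strongly correlated features.
points = [(1.0, 1.1), (2.0, 1.9), (3.0, 3.2), (4.0, 3.8), (5.0, 5.1)]
pc1, pc2, var1, var2, scores = pca_2d(points)
dot = pc1[0] * pc2[0] + pc1[1] * pc2[1]

print("PC1:", pc1, "variance:", var1)
print("PC2:", pc2, "variance:", var2)
print("PC1 . PC2 =", dot)
```

In this sketch the one-dimensional `scores` are built from both original features, since both components of `pc1` are nonzero for this sample.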
