
Full name: Nguyễn Tuấn Hưng

MSSV: K194141723
MID TERM

1. Explain the effect of Type I and Type II errors on a model's result. Give an example.

A Type I error, often known as a false positive, happens when a researcher incorrectly rejects a true null hypothesis. This means you report that your findings are significant when in fact they occurred by chance.

A Type II error (false negative) occurs when you fail to detect an effect that is actually present. In practice, your study may not have had sufficient statistical power to identify a significant effect.

Example: Based on your mild symptoms, you decide to get tested for COVID-19.
There are two types of errors that could occur:

Type I error (false positive): the test result indicates that you have coronavirus when
you don't.

Type II error (false negative): the test result indicates that you are free of
coronavirus, while you are in fact infected.
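The two error types in the COVID-19 example can be sketched with a few invented test results (all values below are hypothetical, purely for illustration):

```python
# `truth` is the real infection status, `test` is the test result
# (1 = positive); both lists are invented for illustration.
truth = [0, 0, 1, 1, 1, 0, 1, 0]
test = [1, 0, 1, 0, 1, 0, 0, 0]

# Type I error (false positive): test says infected, person is not.
type_1 = sum(1 for t, p in zip(truth, test) if t == 0 and p == 1)
# Type II error (false negative): test says healthy, person is infected.
type_2 = sum(1 for t, p in zip(truth, test) if t == 1 and p == 0)

print(type_1, type_2)  # 1 2
```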

2. In your opinion, what is the most important thing to do in data mining process?
Explain and give example.

The most important step in data mining is data cleaning. It is important because feeding dirty data directly into mining can disrupt operations and produce erroneous findings.

For example, data cleaning includes removing or imputing missing values, correcting spelling errors, standardizing formats across the data set, repairing issues such as missing codes and empty fields, and finding and removing duplicate records.
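A minimal pure-Python sketch of these cleaning steps, applied to an invented set of records (names and values are hypothetical):

```python
# Invented raw records with typical quality problems.
rows = [
    {"name": "An", "age": 21},
    {"name": "An", "age": 21},     # exact duplicate
    {"name": "Binh", "age": None},   # missing value
    {"name": " Chi ", "age": 23},    # inconsistent whitespace
]

cleaned, seen = [], set()
for row in rows:
    if row["age"] is None:       # drop records with missing values
        continue
    name = row["name"].strip()   # standardize the text field
    key = (name, row["age"])
    if key in seen:              # drop duplicate records
        continue
    seen.add(key)
    cleaned.append({"name": name, "age": row["age"]})

print(cleaned)  # two unique, complete records remain
```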

3. Is it possible to apply classification method for a dataset if both input and output
are interval? Explain.
The answer is no, not directly. Classification requires a categorical (discrete) target, so when the output is measured on an interval scale the problem is really a regression problem. Classification only becomes applicable if the interval output is first discretized into classes, for example by binning the values into "low", "medium", and "high" groups.
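As a sketch, an interval-valued output can be made classifiable by discretizing it first (the data and cutoffs below are invented for illustration):

```python
# Hypothetical continuous output values.
scores = [3.2, 7.8, 5.1, 9.4, 1.0, 6.6]

def to_class(y, cutoffs=(4.0, 7.0)):
    # Map a continuous value to one of three invented class labels.
    if y < cutoffs[0]:
        return "low"
    if y < cutoffs[1]:
        return "medium"
    return "high"

labels = [to_class(y) for y in scores]
print(labels)  # ['low', 'high', 'medium', 'high', 'low', 'medium']
```

After binning, any ordinary classifier can be trained on `labels` instead of the raw interval values.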

4. Give your comments on the yellow line shown below.

In my opinion, this model's performance is poor, even useless. Its AUC is lower than 0.5, which implies that its predictive ability is worse than random guessing; in fact, simply inverting its predictions would perform better.
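This can be sketched by computing AUC as a rank statistic on invented scores: an AUC below 0.5 means the model ranks negatives above positives, so flipping the scores does better than the model itself.

```python
# Hypothetical labels and model scores, invented for illustration.
y_true = [0, 0, 1, 1]
scores = [0.9, 0.6, 0.4, 0.2]

def auc(y, s):
    # AUC = probability that a randomly chosen positive is scored
    # above a randomly chosen negative (ties count half).
    pos = [si for yi, si in zip(y, s) if yi == 1]
    neg = [si for yi, si in zip(y, s) if yi == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

print(auc(y_true, scores))                # 0.0: worse than random
print(auc(y_true, [-s for s in scores]))  # 1.0: the flipped model
```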

5. Is it possible to have equal “macro avg” and “weighted avg”? Explain.

Yes, it is possible. The macro average weights every class equally, while the weighted average weights each class by its support. The two therefore coincide when all classes have the same support, or when the per-class scores themselves are equal.
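A small sketch with invented per-class precisions shows the two averages coinciding when the supports are equal:

```python
# Hypothetical per-class precisions and supports.
precision = {"0": 0.8, "1": 0.6}
support = {"0": 50, "1": 50}  # equal support per class

macro = sum(precision.values()) / len(precision)
total = sum(support.values())
weighted = sum(precision[c] * support[c] for c in precision) / total

print(macro, weighted)  # the two averages coincide
```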

7. Should we use the same train/test split ratio for any cases of data mining? Explain
why and give example.

There is no fixed rule for splitting a dataset into training and test sets; the right ratio depends on the dataset. Too few training samples can leave the model underfit, while too few test samples make the performance estimate unreliable. A sufficiently large test set better reflects how the model will behave in the real world. There are two competing concerns: with less training data, the parameter estimates have greater variance; with less testing data, the performance statistic has greater variance. Broadly speaking, you should divide the data so that neither variance is too high. For example, with millions of records even a 99/1 split leaves a large test set, whereas a dataset of a few hundred records may call for a 70/30 split or cross-validation.
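The trade-off can be sketched with a simple reproducible split helper (the data and the ratio below are invented for illustration):

```python
import random

def train_test_split(data, test_ratio, seed=0):
    # Shuffle indices reproducibly, then cut off the test portion.
    idx = list(range(len(data)))
    random.Random(seed).shuffle(idx)
    n_test = int(len(data) * test_ratio)
    test = [data[i] for i in idx[:n_test]]
    train = [data[i] for i in idx[n_test:]]
    return train, test

data = list(range(100))
train, test = train_test_split(data, test_ratio=0.2)
print(len(train), len(test))  # 80 20
```

Changing `test_ratio` shifts samples between the two competing concerns: a larger test set stabilizes the performance estimate at the cost of noisier parameter estimates.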

9. What happens if “True Negative + False Positive = 0”? Give your opinion about a model that produces this phenomenon.

“True Negative + False Positive” counts the samples whose actual label is negative ('0'). If this sum is zero, the test set contains no actual negative samples at all. Specificity and the false positive rate then become undefined (0/0), so the evaluation cannot tell whether the model is able to recognize negatives. In my opinion, this phenomenon says more about a badly skewed test set than about the model itself, and the model should be re-evaluated on data that contains both classes.
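A small sketch with invented labels shows how this arises:

```python
# Hypothetical labels: every true label is positive, so there are
# no actual negatives and TN + FP must be zero.
y_true = [1, 1, 1, 1]
y_pred = [1, 0, 1, 1]

tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)

print(tn + fp)  # 0: no actual negatives in the test set
```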

10. Explain the effect of overfitting & underfitting to the result. Give example

- Overfitting: Overfitting happens when a model learns the detail and noise in the training data to the extent that it negatively impacts the model's performance on new data. Two main effects follow. First, the model performs deceptively well on the training set even though its error on the test set remains high. Second, because the training sample is limited and non-representative, predictions may be very poor once the model is applied to a wider sample.

- Underfitting: The model is unable to capture the underlying patterns of the training data, resulting in poor predictions on both the training and the test data.
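Both effects can be sketched on synthetic data (all numbers invented): a "model" that memorizes the training set overfits, while one that ignores the input underfits, and both predict poorly on unseen test data.

```python
import random

# Synthetic data from a linear trend y ~ 2x plus Gaussian noise.
rng = random.Random(0)
train = [(float(x), 2 * x + rng.gauss(0, 1)) for x in range(20)]
test = [(x + 0.5, 2 * (x + 0.5) + rng.gauss(0, 1)) for x in range(20)]

def mse(model, data):
    # Mean squared error of a prediction function over (x, y) pairs.
    return sum((model(x) - y) ** 2 for x, y in data) / len(data)

# "Overfit" model: memorize every training point, predict 0 elsewhere.
lookup = dict(train)
def overfit(x):
    return lookup.get(x, 0.0)

# "Underfit" model: ignore x and always predict the training mean of y.
mean_y = sum(y for _, y in train) / len(train)
def underfit(x):
    return mean_y

print(mse(overfit, train))  # exactly 0: perfect on the training set
print(mse(overfit, test))   # large: memorization does not generalize
print(mse(underfit, test))  # large: the trend in x is never learned
```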

13.
