You are on page 1of 7

Go Back to Machine Learning

Course Content

ML Quiz 3

Type : Graded Quiz

Attempts : 1/1

Questions : 9

Time : 30m

Due Date : Feb 13, 11:59 PM

Your Score : 10/10

Instructions

Attempt History

Attempt #1
Feb 13, 6:55
PM
Marks: 10

Q No: 1 Correct
Answer Marks:
1/1

You have created a document-term matrix of the data, treating every tweet as one
document. Which of the following is correct, in regard to the document term
matrix?

A) Removal of stop words(like 'in', 'a', 'to' etc) from the data will affect the
dimensionality of the data
B) Stemming of words in the data will reduce the dimensionality of the data
C) Converting all the words in lowercase will not affect the dimensionality of the
This study data
source was downloaded by 100000840671323 from CourseHero.com on 05-29-2022 10:23:57 GMT -05:00

https://www.coursehero.com/file/134042755/ML-Quiz-3-Machine-Learning-Great-Learningdocx/ 1/6
B

A&B You Selected

Choices A and B are correct because stop word removal will decrease the number of features in the matrix, Ste

Q No: 2 Correct Answer


Marks:
1/1
The number of times a term occurs in a document is
called its
Term Frequency You Selected

Matrix Rate

Inverse Document Frequency

Correct Answer
Q No: 3
Marks:
1/1
Bigram is a two-word sequence of words like “please turn”, “turn your”, or ”your
homework”

True You Selected

False

This study source was downloaded by 100000840671323 from CourseHero.com on 05-29-2022 10:23:57 GMT -05:00
Correct Answer
Q No: 4
https://www.coursehero.com/file/134042755/ML-Quiz-3-Machine-Learning-Great-Learningdocx/
Marks: 1/1

The following are an example of which type of data?

A) Satellite Imagery
B) Email
C) Social Media
D) Mobile Data

Unstructured Data You Selected

Structured Data

Unstructured data files often include text and multimedia content. Examples include e-mail messages

Correct Answer
Q No: 5
Marks:
1/1
In a document-term-matrix - columns correspond to the documents while rows
correspond to the words

True

False You Selected

A document-term matrix or term-document matrix is a mathematical matrix that describes the frequency of te

Correct Answer
Q No: 6
Marks:
1/1
Assume you have a text file with 1000 tweets. Document term matrix (DTM)
is created to treat every tweet as one document after removing the special
characters and stop words. Which of the following word is definitely not
expected
This study to bebypart
source was downloaded of DTM?
100000840671323 from CourseHero.com on 05-29-2022 10:23:57 GMT -05:00

https://www.coursehero.com/file/134042755/ML-Quiz-3-Machine-Learning-Great-Learningdocx/
Document

Have You Selected

Remove

Inside

As stop words are removed from the document, Have will not be part of DTM

Correct Answer
Q No: 7
Marks:
1.50/1.50

How many Bigrams can be generated from the following sentence, after
performing all the text cleaning steps?

Sentence: “#Great - Learning is the best institute * to learn @data _ science.”

6 You Selected

After performing stopword removal, stemming and punctuation replacement the text becomes great learn b
Bigrams – great learn, learn best, best institute, institute learn, learn data, data science

This study source was downloaded by 100000840671323 from CourseHero.com on 05-29-2022 10:23:57 GMT -05:00

https://www.coursehero.com/file/134042755/ML-Quiz-3-Machine-Learning-Great-Learningdocx/
Q No: 8 Correct
Answer Marks:
1.50/1.50

What is the number of words with a frequency count greater than one in the
following sentence without cleaning the text?

Note: inverted commas and full stops are not part of the sentence.

Sentence: “I have participated in the Great Learning Hackathon and got 3rd rank in the
Hackathon”.

3 You Selected

Words with frequency count> 1: Hackathon, the, in

Q No: 9 Correct
Answer Marks:
1/1

In linguistic morphology, is the process for reducing inflected words


to their root form.

Tokenization

Stemming You Selected

Text-proofing

Rooting

This study source was downloaded by 100000840671323 from CourseHero.com on 05-29-2022 10:23:57 GMT -05:00

https://www.coursehero.com/file/134042755/ML-Quiz-3-Machine-Learning-Great-Learningdocx/
"Stemming is the process of reducing inflection in words to their root forms such as
Comments:
mapping a group of words to the same stem even if the stem itself is not a valid word in
the Language + Add comments

This study source was downloaded by 100000840671323 from CourseHero.com on 05-29-2022 10:23:57 GMT -05:00

https://www.coursehero.com/file/134042755/ML-Quiz-3-Machine-Learning-Great-Learningdocx/
This study source was downloaded by 100000840671323 from CourseHero.com on 05-29-2022 10:23:57 GMT -05:00

https://www.coursehero.com/file/134042755/ML-Quiz-3-Machine-Learning-Great-Learningdocx/
Powered by TCPDF (www.tcpdf.org)

You might also like