Professional Documents
Culture Documents
Machine Learning
• Hierarchical Clustering
• Sentiment Analysis
• Word Embeddings
• Ensemble Methods
• Spam detection
• translation
• sentimental analysis
• text summarization
Proprietary content. ©Great Learning. All Rights Reserved.
Unauthorized use or distribution prohibited.
3
Text: Unstructured Data
A 1 1 1 1 1
B 1 1 1 1 1
• 2-gram (bigram):
John likes likes to play soccer to play John is is reading reading a a book
A 1 1 1 1
B 1 1 1 1
• Remove punctuations
• Remove numbers
Term 2
…
…
Term M