Movie Rating Prediction with NLP

Uploaded by

Rishubh Gandhi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views3 pages

Movie Rating Prediction with NLP

Uploaded by

Rishubh Gandhi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Experiment 7

Code:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
data = pd.read_csv('ratings_sample.csv')
data.head()
data['movie_id'] = data['movie_id'].str.replace('+', ' ')
data.describe()
data.info()
data.isnull().sum()
data = data.dropna()
data.isnull().sum()
data.info()
# Assign unique integer IDs to each distinct movie
data['movie_id'] = pd.factorize(data['movie_id'])[0]
data['production_companies'] = pd.factorize(data['production_companies'])[0]
data['production_countries'] = pd.factorize(data['production_countries'])[0]
import nltk
nltk.download('stopwords')
nltk.download('wordnet')
nltk.download('omw-1.4')
import re
from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
# Initialize WordNet lemmatizer and stopwords
lemmatizer = WordNetLemmatizer()
stop_words = set(stopwords.words('english'))
# Function to preprocess text
def preprocess_text(text):
text = re.sub(r'<[^>]+>', '', text)
text = re.sub(r'[^a-zA-Z]', ' ', text)
text = text.lower()
words = word_tokenize(text)
words = [lemmatizer.lemmatize(word) for word in words if word not in stop_words]
processed_text = ' '.join(words)
return processed_text
# Apply preprocessing to the 'overview' column
data['overview'] = data['overview'].apply(preprocess_text)
from sklearn.feature_extraction.text import TfidfVectorizer
tfidf_vectorizer = TfidfVectorizer(max_features=1000)
overview_features = tfidf_vectorizer.fit_transform(data['overview'])
overview_features_array = overview_features.toarray()
# Split the genres into individual genres
genres_list = data['genres'].str.split(' ')
# Get unique genres
unique_genres = set(genre for sublist in genres_list for genre in sublist)
for genre in unique_genres:
data[genre] = data['genres'].str.contains(genre).astype(int)
data.drop('genres', axis=1, inplace=True)
data.info()

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
tfidf_vectorizer = TfidfVectorizer(max_features=1000)
overview_features = tfidf_vectorizer.fit_transform(data['overview'])
combined_features = overview_features
X_train, X_test, y_train, y_test = train_test_split(combined_features, data['rating'], test_size=0.2,
random_state=42)
lr_model = LinearRegression()
lr_model.fit(X_train, y_train)
y_pred = lr_model.predict(X_test)
mse = mean_squared_error(y_test, y_pred)
print("Mean Squared Error:", mse)
Output:

Flask App for Text Classification and Analysis
No ratings yet
Flask App for Text Classification and Analysis
6 pages
NLTK Text Analysis and Sentiment Review
No ratings yet
NLTK Text Analysis and Sentiment Review
22 pages
Spam Detection with Python and NLP
No ratings yet
Spam Detection with Python and NLP
3 pages
RecommenderSystem File
No ratings yet
RecommenderSystem File
24 pages
NLP Lab
No ratings yet
NLP Lab
18 pages
NLP with NLTK: Restaurant Reviews Analysis
No ratings yet
NLP with NLTK: Restaurant Reviews Analysis
5 pages
Sentiment Analysis with Python Code
No ratings yet
Sentiment Analysis with Python Code
7 pages
N011 Lab4 NLP
No ratings yet
N011 Lab4 NLP
12 pages
Text Preprocessing and Sentiment Analysis
No ratings yet
Text Preprocessing and Sentiment Analysis
13 pages
Restaurant Review Sentiment Analysis
No ratings yet
Restaurant Review Sentiment Analysis
3 pages
NLP Tushar
No ratings yet
NLP Tushar
21 pages
Python NLP Techniques Guide
No ratings yet
Python NLP Techniques Guide
18 pages
Product Review Sentiment Analysis
No ratings yet
Product Review Sentiment Analysis
2 pages
NLP2 Prasen
No ratings yet
NLP2 Prasen
6 pages
Document Classification with Naive Bayes
No ratings yet
Document Classification with Naive Bayes
9 pages
Research on Text Topic Modeling
No ratings yet
Research on Text Topic Modeling
26 pages
Text Classification with Word2Vec
No ratings yet
Text Classification with Word2Vec
3 pages
NLP Lab Manual for B.E. Students
No ratings yet
NLP Lab Manual for B.E. Students
21 pages
Bag of Words and N-grams Analysis
No ratings yet
Bag of Words and N-grams Analysis
7 pages
News Classification with TF-IDF and PCA
No ratings yet
News Classification with TF-IDF and PCA
2 pages
LSTM Sentiment Analysis on IMDB Reviews
No ratings yet
LSTM Sentiment Analysis on IMDB Reviews
18 pages
Top News Categories Analysis and Model
No ratings yet
Top News Categories Analysis and Model
4 pages
TF-IDF Feature Extraction Guide
No ratings yet
TF-IDF Feature Extraction Guide
7 pages
NLP Transformer-Based Models Used For Sentiment Analysis: 1. BERT
No ratings yet
NLP Transformer-Based Models Used For Sentiment Analysis: 1. BERT
98 pages
T SNE Visualization of Amazon Reviews With Polarity Based Color Coding+
No ratings yet
T SNE Visualization of Amazon Reviews With Polarity Based Color Coding+
29 pages
Python CA 4
No ratings yet
Python CA 4
9 pages
Experiment 3 Word2Vec Custom Vectors Generation and Performing Classification
No ratings yet
Experiment 3 Word2Vec Custom Vectors Generation and Performing Classification
4 pages
Sentiment Classification with AIML Techniques
No ratings yet
Sentiment Classification with AIML Techniques
31 pages
Sentiment Analysis with NLTK
No ratings yet
Sentiment Analysis with NLTK
4 pages
Code Text
No ratings yet
Code Text
4 pages
Sample
No ratings yet
Sample
6 pages
Spell Correction Using Word Probabilities
No ratings yet
Spell Correction Using Word Probabilities
10 pages
Amazon Food Reviews Analysis
No ratings yet
Amazon Food Reviews Analysis
37 pages
FINDS Algorithm Implementation in Python
No ratings yet
FINDS Algorithm Implementation in Python
22 pages
Transformer Models for Sentiment Analysis
No ratings yet
Transformer Models for Sentiment Analysis
45 pages
Extra Feature NLP
No ratings yet
Extra Feature NLP
5 pages
Assignment
No ratings yet
Assignment
6 pages
Mobile Phone Review Analysis Code
No ratings yet
Mobile Phone Review Analysis Code
7 pages
NLP Pipeline for Sentiment Analysis
No ratings yet
NLP Pipeline for Sentiment Analysis
7 pages
British Airways Forage Report
No ratings yet
British Airways Forage Report
12 pages
Movie Review Sentiment Analysis
No ratings yet
Movie Review Sentiment Analysis
18 pages
Assign 3
No ratings yet
Assign 3
1 page
Feature Extraction Techniques in NLP
No ratings yet
Feature Extraction Techniques in NLP
10 pages
PyTerrier Indexing Lab Guide
No ratings yet
PyTerrier Indexing Lab Guide
7 pages
Sentiment Analysis with TextBlob & Matplotlib
No ratings yet
Sentiment Analysis with TextBlob & Matplotlib
34 pages
Deep Learning for Image Captioning
No ratings yet
Deep Learning for Image Captioning
58 pages
Foundations of Python For AI
No ratings yet
Foundations of Python For AI
67 pages
Data Visualization and SVM Experiments
No ratings yet
Data Visualization and SVM Experiments
19 pages
Module 2 Feature Engineering and Text Representation
No ratings yet
Module 2 Feature Engineering and Text Representation
19 pages
Movie Recommendation System
No ratings yet
Movie Recommendation System
2 pages
Topic Classifierby David Caleb
No ratings yet
Topic Classifierby David Caleb
7 pages
Python NLP Exercises and Code
No ratings yet
Python NLP Exercises and Code
12 pages
Naive Bayes
No ratings yet
Naive Bayes
1 page
Naïve Bayes Classifier Implementation
No ratings yet
Naïve Bayes Classifier Implementation
5 pages
10253.exp 5
No ratings yet
10253.exp 5
12 pages
Code
No ratings yet
Code
18 pages
Predicting Yelp Restaurant Ratings
No ratings yet
Predicting Yelp Restaurant Ratings
10 pages
NLP Text Processing Techniques
No ratings yet
NLP Text Processing Techniques
5 pages
Mids Practical 3
No ratings yet
Mids Practical 3
2 pages
USAP Semester I & IV Timetable 2023
No ratings yet
USAP Semester I & IV Timetable 2023
5 pages
NT 040624430
No ratings yet
NT 040624430
6,968 pages
Mobile Computing Unit 3
No ratings yet
Mobile Computing Unit 3
86 pages
Computer Networks Lab Manual for B.Tech
No ratings yet
Computer Networks Lab Manual for B.Tech
26 pages
PS3 To PS4 - FirmwareUpdateGuide - 202203
No ratings yet
PS3 To PS4 - FirmwareUpdateGuide - 202203
5 pages
IMAWOP 9 en
No ratings yet
IMAWOP 9 en
106 pages
Guidelines Computer System Architecture
No ratings yet
Guidelines Computer System Architecture
4 pages
Internship Report ML
No ratings yet
Internship Report ML
27 pages
EN 50575:2014+A1:2016 Overview
No ratings yet
EN 50575:2014+A1:2016 Overview
26 pages
Lecture Notes DPCPS Unit 2
No ratings yet
Lecture Notes DPCPS Unit 2
24 pages
Rhyming Bingo
No ratings yet
Rhyming Bingo
11 pages
GitHub - Peggy1502 - Fraud-Detection-Handbook - Machine Learning For Credit Card Fraud Detection - Practical Handbook
No ratings yet
GitHub - Peggy1502 - Fraud-Detection-Handbook - Machine Learning For Credit Card Fraud Detection - Practical Handbook
5 pages
CSS COC1 Week 5-8 Correction Guide
No ratings yet
CSS COC1 Week 5-8 Correction Guide
5 pages
Control Center and Waste Management Plan
No ratings yet
Control Center and Waste Management Plan
15 pages
Yr10 - CS - 23-24 - T2 - W6 - L3 - Intro To Database
No ratings yet
Yr10 - CS - 23-24 - T2 - W6 - L3 - Intro To Database
29 pages
Dafson's Healthcare - Corporate Deck2023
No ratings yet
Dafson's Healthcare - Corporate Deck2023
11 pages
1MRK504172-BEN - en - G - Product Guide, Transformer Protection RET650 Version 2.2
No ratings yet
1MRK504172-BEN - en - G - Product Guide, Transformer Protection RET650 Version 2.2
111 pages
Quick Guide On Operations Management With Analytics v2024
No ratings yet
Quick Guide On Operations Management With Analytics v2024
207 pages
G12 IT TG 2023 Web
100% (3)
G12 IT TG 2023 Web
161 pages
EXO Word Prediction with LSTM
No ratings yet
EXO Word Prediction with LSTM
7 pages
Płutø of Lautech Ams103 Questions and Answers For Test and Exam (Reviewed
No ratings yet
Płutø of Lautech Ams103 Questions and Answers For Test and Exam (Reviewed
10 pages
Agriculture Timeline in Australia
No ratings yet
Agriculture Timeline in Australia
5 pages
ATA W SUBURBIA Tecamacpowercenter
No ratings yet
ATA W SUBURBIA Tecamacpowercenter
43 pages
2marks - DS
No ratings yet
2marks - DS
22 pages
Rear Door Lining Removal Guide
No ratings yet
Rear Door Lining Removal Guide
2 pages
Overview of Computer-Aided Manufacturing
No ratings yet
Overview of Computer-Aided Manufacturing
3 pages
Sap MM Resume
100% (1)
Sap MM Resume
3 pages
Comptia: Exam Questions N10-009
No ratings yet
Comptia: Exam Questions N10-009
40 pages
(MPCE Note - Session 1) Maritime Peplink Certified Engineer - Selecting The Equipment
No ratings yet
(MPCE Note - Session 1) Maritime Peplink Certified Engineer - Selecting The Equipment
56 pages
ECSS E ST 10C System Engineering General Requirement
No ratings yet
ECSS E ST 10C System Engineering General Requirement
100 pages
ITSecurity Regulations
No ratings yet
ITSecurity Regulations
9 pages
AI Agents: Types and Functions
No ratings yet
AI Agents: Types and Functions
88 pages
Getting Started Guide: 5G Toolbox™
No ratings yet
Getting Started Guide: 5G Toolbox™
112 pages
BD5935 en
No ratings yet
BD5935 en
6 pages

Movie Rating Prediction with NLP

Uploaded by

Movie Rating Prediction with NLP

Uploaded by

Experiment 7

from sklearn.model_selection import train_test_split

You might also like