Fake News Detection with Python

This document provides an overview of a project to detect fake news articles using natural language processing and machine learning techniques in Python. It introduces what fake news is and describes using scikit-learn libraries for classification. The document outlines the prerequisites, packages, and machine learning algorithms used, including Flask, NumPy, Pandas, regular expressions, stopwords, PorterStemmer, TFIDFVectorizer, train-test splitting, logistic regression, and calculating accuracy scores.

Uploaded by

Adarsh Lenin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

278 views14 pages

Fake News Detection with Python

Uploaded by

Adarsh Lenin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

FAKE NEWS DETECTION

BY
• ADARSH LENIN
• ATHUL P
• BIMAL MURALI
• NIDHIN PHILIP ALEX
INTRODUCTION

• What is Fake News?

• A type of yellow journalism, fake news encapsulates pieces of news that may be hoaxes and is generally
spread through social media and other online media. This is often done to further or impose certain
ideas and is often achieved with political agendas. Such news items may contain false and/or
exaggerated claims, and may end up being viralized by algorithms, and users may end up in a filter
bubble.
• Fake News Detection in Python
• In this project, we have used various natural language processing techniques and machine learning
algorithms to classify fake news articles using sci-kit libraries from python.
FLOWCHART
PREREQUISITES
• PYTHON
• FLASK
• HTML
• CSS
FLASK

• Flask is a web framework, it’s a Python module that lets you develop web
applications easily. It’s has a small and easy-to-extend core: it’s a
microframework that doesn’t include an ORM (Object Relational Manager) or
such features.
• It does have many cool features like url routing, template engine. It is a WSGI
web app framework.
PACKAGES

NUMPY

NumPy, which stands for Numerical Python, is a library consisting of

multidimensional array objects and a collection of routines for
processing those arrays. Using NumPy, mathematical and logical
operations on arrays can be performed
PANDAS

Pandas is an open source Python package that is most widely used

for data science/data analysis and machine learning tasks. It is built
on top of another package named Numpy, which provides support
for multi-dimensional [Link] for crerating and storing data
frames
REGULAR EXPRESSION

Regular Expression, is a sequence of characters that

forms a search [Link] can be used to check if a
string contains the specified search pattern.
STOPWORDS

The stopwords in “nltk” library are the most common words in data.
They are words that you do not want to use to describe the topic of your
content. Words that doesn’t add much value to a paragraph
PORTERSTEMMER

The Porter stemming algorithm (or 'Porter stemmer') is a process for

removing the commoner morphological and inflexional endings from words
in English. It gives root word for a particular word
TFIDFVECTORIZER

Term frequency-inverse document frequency is a text vectorizer

that transforms the text into a usable vector. It combines 2
concepts, Term Frequency (TF) and Document Frequency (DF). The
term frequency is the number of occurrences of a specific term in
a document.
TRAIN AND SPLIT

The train-test split is used to estimate the performance of machine

learning algorithms that are applicable for prediction-based
Algorithms/Applications. This method is a fast and easy procedure to
perform such that we can compare our own machine learning model
results to machine results.
LOGISTIC REGRESSION

Logistic Regression is a Machine Learning classification algorithm that is

used to predict the probability of a categorical dependent variable. In
logistic regression, the dependent variable is a binary variable that
contains data coded as 1 (yes, success, etc.) or 0 (no, failure, etc.).
ACCURACY SCORE

The accuracy_score method is used to calculate the accuracy of either the

faction or count of correct prediction in Python Scikit learn. Mathematically
it represents the ratio of the sum of true positives and true negatives out of
all the predictions

Machine Learning in Python Main Developments and T
100% (1)
Machine Learning in Python Main Developments and T
44 pages
Data Exploration and Visualization Guide
100% (1)
Data Exploration and Visualization Guide
23 pages
Big Data Mining Framework Overview
No ratings yet
Big Data Mining Framework Overview
30 pages
Programming For Data Science - Assignment 1
No ratings yet
Programming For Data Science - Assignment 1
2 pages
Complete Roadmap To Learn Python For Data Analysis
No ratings yet
Complete Roadmap To Learn Python For Data Analysis
5 pages
Python OOP Guide for Developers
No ratings yet
Python OOP Guide for Developers
94 pages
Practical R Programming Guide
No ratings yet
Practical R Programming Guide
103 pages
NLTK: Python for Natural Language Processing
No ratings yet
NLTK: Python for Natural Language Processing
23 pages
Smart Traffic Management with IoT & ML
No ratings yet
Smart Traffic Management with IoT & ML
6 pages
Database Administrator
No ratings yet
Database Administrator
17 pages
Python Programming Workshop Overview
No ratings yet
Python Programming Workshop Overview
108 pages
Power BI Training Course Guide
100% (1)
Power BI Training Course Guide
6 pages
H2o Training Day
No ratings yet
H2o Training Day
180 pages
Feature Engineering for Regression Models
No ratings yet
Feature Engineering for Regression Models
23 pages
1.introduction To Python For Data Science
No ratings yet
1.introduction To Python For Data Science
6 pages
Python PPT
No ratings yet
Python PPT
60 pages
Python Programming for Data Science I
No ratings yet
Python Programming for Data Science I
6 pages
Mobile-Based SIWES Placement Recommendation System (A Case Study of Nigerian Universities)
No ratings yet
Mobile-Based SIWES Placement Recommendation System (A Case Study of Nigerian Universities)
7 pages
Introduction to Machine Learning
100% (1)
Introduction to Machine Learning
17 pages
K-means Clustering Explained
No ratings yet
K-means Clustering Explained
13 pages
ENG 202: Computers and Engineering Object Oriented Programming in PYTHON
No ratings yet
ENG 202: Computers and Engineering Object Oriented Programming in PYTHON
56 pages
Understanding Data Science Basics
100% (1)
Understanding Data Science Basics
31 pages
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
100% (1)
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
3 pages
Python Data Structures for Business Analytics
No ratings yet
Python Data Structures for Business Analytics
56 pages
Introduction to Python Programming
No ratings yet
Introduction to Python Programming
14 pages
Beginners Python Cheat Sheet PCC Plotly PDF
No ratings yet
Beginners Python Cheat Sheet PCC Plotly PDF
2 pages
Data Visualization Ebook
No ratings yet
Data Visualization Ebook
15 pages
Python Programming Fundamentals Guide
100% (1)
Python Programming Fundamentals Guide
7 pages
Module 6 Data Visualiztion Matplotlib
No ratings yet
Module 6 Data Visualiztion Matplotlib
69 pages
Blockchain-Enabled Multi-Drone COVID-19 Response
No ratings yet
Blockchain-Enabled Multi-Drone COVID-19 Response
20 pages
Python For Data Science
No ratings yet
Python For Data Science
5 pages
Rapidminer 4.6 Tutorial
100% (1)
Rapidminer 4.6 Tutorial
695 pages
Python Data Science
No ratings yet
Python Data Science
25 pages
Weka Tutorial
No ratings yet
Weka Tutorial
2 pages
Scikit Learn Docs
100% (1)
Scikit Learn Docs
1,810 pages
Essential Python Libraries for Data Science
No ratings yet
Essential Python Libraries for Data Science
12 pages
Jupiter Notebook Tricks
100% (1)
Jupiter Notebook Tricks
9 pages
Python Data Types: Lists, Tuples, Sets, Dictionaries
No ratings yet
Python Data Types: Lists, Tuples, Sets, Dictionaries
83 pages
Python Classes and Objects Guide
No ratings yet
Python Classes and Objects Guide
6 pages
Python Programming Basics and Data Analysis
No ratings yet
Python Programming Basics and Data Analysis
53 pages
Python Control Flow Statements and Loops: Pynative
No ratings yet
Python Control Flow Statements and Loops: Pynative
16 pages
Python Notes For Beginners (Autosaved)
No ratings yet
Python Notes For Beginners (Autosaved)
52 pages
Introduction to Natural Language Processing
No ratings yet
Introduction to Natural Language Processing
45 pages
Machine Learning With Python.
0% (1)
Machine Learning With Python.
13 pages
Spark SQL PPT 3.2.3 and 3.2.4
No ratings yet
Spark SQL PPT 3.2.3 and 3.2.4
17 pages
CIS 519 Machine Learning Assignment 2
No ratings yet
CIS 519 Machine Learning Assignment 2
12 pages
Data Analysis Tutorial
No ratings yet
Data Analysis Tutorial
152 pages
TOP 21 DATA SCIENCE PROJECTS - Part 1
No ratings yet
TOP 21 DATA SCIENCE PROJECTS - Part 1
6 pages
A719552767 - 20992 - 7 - 2019 - Lecture10 Python OOP
No ratings yet
A719552767 - 20992 - 7 - 2019 - Lecture10 Python OOP
15 pages
Python Basics for Data Science
100% (1)
Python Basics for Data Science
8 pages
ML Roadmap
No ratings yet
ML Roadmap
11 pages
Database Management Systems by Raghu Ramakrishnan: Special Features of Book
No ratings yet
Database Management Systems by Raghu Ramakrishnan: Special Features of Book
3 pages
Fake News Detection
100% (1)
Fake News Detection
25 pages
Fake News Detection Methodology Guide
No ratings yet
Fake News Detection Methodology Guide
9 pages
Explore and Reduce The Spreading of Fake News Using Machine Learning
No ratings yet
Explore and Reduce The Spreading of Fake News Using Machine Learning
5 pages
Fake News Detection: Using Machine Learning & Python (Predicting Website)
No ratings yet
Fake News Detection: Using Machine Learning & Python (Predicting Website)
13 pages
Project Report
No ratings yet
Project Report
12 pages
Questions Answers Chapter Wise
No ratings yet
Questions Answers Chapter Wise
4 pages
Final Presentation Fake News Detection
No ratings yet
Final Presentation Fake News Detection
11 pages
File Organization Design and Analysis Guide
No ratings yet
File Organization Design and Analysis Guide
2 pages
Exam Professional Data Engineer Topic 1 Question 88 Discussion - ExamTopics
No ratings yet
Exam Professional Data Engineer Topic 1 Question 88 Discussion - ExamTopics
1 page
Chapter9 Network Management Updated
No ratings yet
Chapter9 Network Management Updated
26 pages
Python Programming Model Question Paper
No ratings yet
Python Programming Model Question Paper
41 pages
Exp 3
No ratings yet
Exp 3
6 pages
On-Bright Confidential To Bona: Quasi-Resonant Flyback PWM Controller General Description Features
No ratings yet
On-Bright Confidential To Bona: Quasi-Resonant Flyback PWM Controller General Description Features
14 pages
End To End Cloud Migration Plan From On Premises To Clloud
No ratings yet
End To End Cloud Migration Plan From On Premises To Clloud
19 pages
History of Early Computing Devices
No ratings yet
History of Early Computing Devices
14 pages
NM Record Asif
No ratings yet
NM Record Asif
46 pages
Certification of G-PON Equipment Ensures Optimal Performance in The Field
No ratings yet
Certification of G-PON Equipment Ensures Optimal Performance in The Field
15 pages
IT Support Requests for Ethiopian Bank
No ratings yet
IT Support Requests for Ethiopian Bank
4 pages
Infotech JSS2 Database 2 WK 9
0% (1)
Infotech JSS2 Database 2 WK 9
3 pages
30 Beginner C Programming Exercises With Solutions
No ratings yet
30 Beginner C Programming Exercises With Solutions
22 pages
Graphics Standards 04-MAR-21
100% (1)
Graphics Standards 04-MAR-21
44 pages
Computational Thinking in Science Course
No ratings yet
Computational Thinking in Science Course
3 pages
G Suite vs. Office 365: Security and Management Tools
No ratings yet
G Suite vs. Office 365: Security and Management Tools
2 pages
Harsha Verse DSA CrashCourse Resources
No ratings yet
Harsha Verse DSA CrashCourse Resources
9 pages
How To Perform A Clean Uninstall of Autodesk Products On Windows
No ratings yet
How To Perform A Clean Uninstall of Autodesk Products On Windows
2 pages
PowerVault ME4 - A Disk Firmware Update Is Available For Your System - Dell US
No ratings yet
PowerVault ME4 - A Disk Firmware Update Is Available For Your System - Dell US
6 pages
Awscdk
No ratings yet
Awscdk
341 pages
Operating Systems Exam Questions 2020
No ratings yet
Operating Systems Exam Questions 2020
3 pages
Class 10 Computer
No ratings yet
Class 10 Computer
2 pages
Learn Azure Fundamentals for AZ-900 Exam
No ratings yet
Learn Azure Fundamentals for AZ-900 Exam
1 page
The Fs - createReadStream Method - Dustin John Pfister at Github Pages
No ratings yet
The Fs - createReadStream Method - Dustin John Pfister at Github Pages
7 pages
Introduction To Computer System
No ratings yet
Introduction To Computer System
2 pages
C Language Programming Fundamentals Guide
No ratings yet
C Language Programming Fundamentals Guide
15 pages
CICS Debugging for System Programmers
No ratings yet
CICS Debugging for System Programmers
58 pages
Ace Analytics JavaScript Metrics System
No ratings yet
Ace Analytics JavaScript Metrics System
11 pages
VLAN Important Notes To Review
No ratings yet
VLAN Important Notes To Review
10 pages
BIOS Master Password Generator For Laptops
No ratings yet
BIOS Master Password Generator For Laptops
1 page

Fake News Detection with Python

Uploaded by

Fake News Detection with Python

Uploaded by

FAKE NEWS DETECTION

• What is Fake News?

NumPy, which stands for Numerical Python, is a library consisting of

Pandas is an open source Python package that is most widely used

Regular Expression, is a sequence of characters that

The Porter stemming algorithm (or 'Porter stemmer') is a process for

Term frequency-inverse document frequency is a text vectorizer

The train-test split is used to estimate the performance of machine

Logistic Regression is a Machine Learning classification algorithm that is

The accuracy_score method is used to calculate the accuracy of either the

You might also like