Analyzing The Federalist Papers

The document summarizes the Federalist Papers dataset used for the text analysis example. It explains that the Federalist Papers were written between 1787 and 1788 by Alexander Hamilton, James Madison, and John Jay to argue for the adoption of the US Constitution. It describes the corpus, including the essays that are filtered out by author, and then outlines the nine-step guided example performed on the corpus.

Original Title

2017 11 16 - Text Analysis - Federalist

Uploaded by

cjon

TEXT ANALYSIS EXAMPLE
The Federalist Papers: Session 1

Credit: SAS Institute

Pages 38 - 63 of that big PDF
The Federalist Papers

A collection of 85 documents, written between 1787 and 1788.
During the post-Revolutionary War era, they argued for New York's adoption of the US Constitution (sans Bill of Rights).

A collaborative work by Alexander Hamilton, James Madison, and John Jay.
[Portraits: Madison, Hamilton, Jay]
Timeline
1788: All of the papers have been published, but they are unattributed.
1804: Before his duel with Aaron Burr, Hamilton gives his attorney a list of authorship, just in case. He dies from the duel.
1818: Madison releases his own list, with some differences.
He attributes the differences to a hastily assembled initial list.
Some essays are changed to a collaboration between Madison and Hamilton.
Most of the differences are where Madison claims authorship of essays Hamilton took credit for.
There is also one pretty-much-definite typo regarding one of Jay's essays.
Our Corpus

85 essays, subsetted to 77
51 Hamilton (Train)
14 Madison (Train)
12 Disputed (Predict)
190,000+ words of free text (old-school natural language)
8,752 unique words
Two exam bonus points if you email me AFTER class with the data discrepancy shown here and an explanation of why it happened.
Guided Example Step 1

Create project (or open existing one)

Add Library pointing to the folder containing your data
Create a new Data Source. Changes:
Author's role = Label
Target's level = Nominal
Create a new Diagram
Add your new Data Source to your new Diagram
Guided Example Step 2

Add a Filter node from the Sample ribbon.

We want to customize this, so set the default options for Class and Interval filtering to None.
Then go into the menu for Class variables and exclude:
Records pertaining to Jay (we know he didn't write the disputed essays)
Records pertaining to the Hamilton and Madison collaborations (blended styles)
Filter by selecting Target = 2 or 3

Check In: Do you have 77 items remaining?
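
The filtering logic in this step can also be sketched outside the GUI. The following is a minimal Python illustration, not the Filter node itself. The record layout and author labels are hypothetical, and the counts of 5 Jay essays and 3 Hamilton and Madison collaborations are an assumption chosen to account for the 8 essays removed from the original 85.

```python
# Hypothetical stand-in for the corpus: one (row_id, author_label) pair per essay.
# The 51/14/12 counts come from the corpus slide; the 5 Jay and 3 joint essays
# are assumed so that the total comes to 85.
counts = {
    "Hamilton": 51,
    "Madison": 14,
    "Disputed": 12,
    "Jay": 5,
    "Hamilton and Madison": 3,
}

records = []
for author, n in counts.items():
    for _ in range(n):
        # row_id is just a running counter, not the real essay number.
        records.append((len(records) + 1, author))

# Class filter: exclude Jay and the blended-style collaborations.
excluded = {"Jay", "Hamilton and Madison"}
kept = [rec for rec in records if rec[1] not in excluded]

print(len(records), "essays before filtering")
print(len(kept), "essays after filtering")
```

If the filter is set up as described, the second count should match the Check In value.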

Guided Example Step 3

Ensure a binary target using the Metadata node (Utility)

Enter the Target menu.
Change our target variable to a Binary level.
Run.
Guided Example Step 4

Text Parsing node.

Stop List = DMTXT.FederalistStop.
Find Entities = Standard.

Stop list: a list of words to omit automatically.

Run.
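
As a rough picture of what a stop list does during parsing, here is a small Python sketch. It is not the Text Parsing node: the stop words below are a hypothetical stand-in (the contents of DMTXT.FederalistStop are not reproduced here), the sample sentence is invented, and entity detection is left out entirely.

```python
import re

# Hypothetical stop list, for illustration only.
STOP_WORDS = {"the", "of", "to", "and", "a", "in", "is", "be", "that", "it"}

def parse(text, stop_words=STOP_WORDS):
    """Lowercase the text, split it into word tokens, and drop stop-list terms."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [tok for tok in tokens if tok not in stop_words]

# Invented sample sentence, not a quotation from the essays.
sample = "The power of the union is vested in the people."
print(parse(sample))  # ['power', 'union', 'vested', 'people']
```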
Guided Example Step 5

Text Filter node.

Term Weight = Inverse Document Frequency

A term is weighted more heavily the rarer it is across documents.
Minimum Number of Documents = 2

Run.
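
To make the two settings concrete, the sketch below computes an inverse document frequency weight and drops terms that appear in fewer than two documents. The formula log2(N / df) + 1 is one common IDF variant and is an assumption here; the node's exact formula may differ. The tiny document set is made up.

```python
import math
from collections import Counter

def idf_weights(docs, min_docs=2):
    """Return {term: weight} for terms found in at least min_docs documents.

    Weight (assumed variant): log2(N / df) + 1, where N is the number of
    documents and df is the number of documents containing the term.
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for tokens in docs:
        doc_freq.update(set(tokens))  # count each term once per document
    return {
        term: math.log2(n_docs / df) + 1
        for term, df in doc_freq.items()
        if df >= min_docs
    }

# Made-up token lists standing in for parsed essays.
docs = [
    ["union", "power", "people"],
    ["union", "power", "senate"],
    ["union", "people", "militia"],
    ["union", "treaty"],
]
weights = idf_weights(docs)
for term in sorted(weights, key=lambda t: (weights[t], t)):
    print(f"{term}: {weights[term]:.2f}")
```

Here "union" occurs in every document and gets the lowest weight, while "senate", "militia", and "treaty" each occur in only one document and are filtered out.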
Guided Example Step 6

Text Cluster node.

Exact or Maximum Number = Exact

Number of Clusters = 2
We want exactly 2 clusters, because we want to bucket documents into Madison or Hamilton only

Run.
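
For intuition about forcing exactly two clusters, here is a small Python sketch using a basic k-means loop with k = 2. This is a swapped-in technique for illustration and is not claimed to be the algorithm the Text Cluster node uses. The vectors are hypothetical term-weight pairs.

```python
def dist2(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(points) for col in zip(*points)]

def two_means(points, max_iter=100):
    """Split points into exactly two clusters with a basic k-means loop."""
    # Deterministic start: the first point, and the point farthest from it.
    centroids = [points[0], max(points, key=lambda p: dist2(p, points[0]))]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            0 if dist2(p, centroids[0]) <= dist2(p, centroids[1]) else 1
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for k in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centroids[k] = mean(members)
    return labels

# Hypothetical two-term weight vectors for six documents.
vectors = [[0.9, 0.1], [0.8, 0.2], [1.0, 0.0], [0.1, 0.9], [0.2, 0.8], [0.0, 1.0]]
print(two_means(vectors))
```

Note that the clusters are unlabeled: nothing in the procedure says which cluster is Madison and which is Hamilton.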
Guided Example Step 7

Text Topic node.

Number of Multi-Term Topics = 5

Run.
Guided Example Step 8

Regression node

Defaults are fine

Logistic Regression will be employed

Run.
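
The idea behind a logistic regression with a binary target can be sketched in a few lines of Python. This is a minimal gradient-descent version under stated assumptions, not the Regression node's fitting procedure: the two features, their values, and the learning rate and epoch count are all made up for illustration.

```python
import math

def sigmoid(z):
    """Map a real-valued score to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(X, y, lr=0.5, epochs=2000):
    """Fit weights and a bias by batch gradient descent on the log loss."""
    n_features = len(X[0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / len(X) for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / len(X)
    return w, b

def predict(w, b, x):
    """Return 1 if the modelled probability is at least 0.5, else 0."""
    return 1 if sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b) >= 0.5 else 0

# Made-up training rows: two hypothetical term weights per essay.
# Target coding assumed here: 1 = Hamilton, 0 = Madison.
X_train = [[3.0, 0.0], [2.5, 0.1], [3.2, 0.0], [0.2, 0.5], [0.1, 0.4], [0.0, 0.6]]
y_train = [1, 1, 1, 0, 0, 0]

w, b = fit_logistic(X_train, y_train)
unlabeled = [0.1, 0.5]  # made-up row standing in for a disputed essay
print("predicted class:", predict(w, b, unlabeled))
```

The model is trained only on rows with a known author and then applied to rows without one, which mirrors the Train and Predict split of the corpus.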
Guided Example Step 9

Within the properties of your Regression node:

Exported Data >> TRAIN data >> Explore button.
Click on the plot wizard icon.
In the plot wizard, select Bar, Next.
Roles:
Target = Category
I_Target = Group
Click Finish. Behold the graph.
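
The bar chart from the plot wizard is essentially a count of rows for each combination of actual and predicted class. Below is a hedged Python sketch of that tabulation, assuming I_Target holds the class the model assigned to each row. The rows and the author labels are hypothetical and do not reflect actual model output.

```python
from collections import Counter

# Hypothetical (Target, I_Target) pairs: actual author vs. predicted author.
rows = [
    ("Hamilton", "Hamilton"), ("Hamilton", "Hamilton"), ("Hamilton", "Hamilton"),
    ("Hamilton", "Madison"),
    ("Madison", "Madison"), ("Madison", "Madison"),
    ("Madison", "Hamilton"),
]

counts = Counter(rows)

# One text bar per (category, group) pair, mirroring a grouped bar chart.
for target in sorted({t for t, _ in rows}):
    print(f"Target = {target}")
    for predicted in sorted({p for _, p in rows}):
        n = counts[(target, predicted)]
        print(f"  I_Target = {predicted:<9} {'#' * n} ({n})")
```

Bars where Target and I_Target agree are correct classifications; the others are misclassified training rows.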
