You are on page 1of 2

JAIPURIA INSTITUTE OF MANAGEMENT, NOIDA

PGDM (G-SM-M); TRIMESTER V; ACADEMIC YEAR 2019-20


Course Code and title BA 501: Text Analytics
Credits 3
Term and Year V Term, 2019-20
Course Pre-requisite(s) Basic conceptual knowledge in quantitative
methods and basic analytics algorithms
Course Requirement(s)
Course Schedule (day and time of class) As per Time-Table
Classroom # (Location)
Course Instructor
Course Instructor Email
Course Instructor Phone (Office)
Student Consultation Hours
Office location

Course Objectives

1. To understanding the history, evolution, terminology and perspectives of text analytics


2. To apply various text analytics models and techniques through hands-on assignments in
text analytics techniques

Course Outcomes

On completion of this course, the students will be able to:

CO1. Use basic methods for information extraction and retrieval of textual data
CO2. Apply text processing techniques to prepare documents for statistical modelling
CO3. Apply relevant machine learning models for analyzing textual data and correctly
interpreting the results
CO4. Use machine-learning models for text prediction

Catalog Description

Given the dominance of text information over the Internet, mining high-quality information from
text becomes increasingly critical. The actionable knowledge extracted from text data facilitates
our life in a broad spectrum of areas, including business intelligence, information acquisition,
social behavior analysis and decision-making. In this course, we cover important topics in text
mining including: basic natural language processing techniques, document representation, text
categorization and clustering, document summarization and sentiment analysis.

Course Content

UNIT 1: Overview of Text Analytics 2 lecture hours


Basic organization and major topics of this course, logistic issues and course
requirements.
UNIT 2: Natural language processing 3 lecture hours
The basic techniques in natural language processing, including tokenization,
partof-speech tagging, chunking, syntax parsing and named entity recognition.

UNIT 3: Document representation 5 lecture hours


Representation of unstructured text documents with appropriate format and
structure to support later automated text mining algorithms.

UNIT 4: Text categorization 8 lecture hours


Assigning a text document to one or more classes or categories using several basic
supervised text categorization algorithms, including Naive Bayes, k Nearest
Neighbor (kNN) and Support Vector Machines
UNIT 5: Text clustering 7 lecture hours
Identifying the clustering structure of a corpus of text documents and assigning
documents to the identified cluster(s).

UNIT 6: Sentiment Analysis 5 lecture hours


Extracting subjective information in source materials with sentiment analysis,
sentiment polarity prediction, review mining, and aspect identification.

You might also like