
CSE4022 NATURAL LANGUAGE PROCESSING

L T P J C
3 0 0 4 4

Pre-requisite: Nil
Syllabus version: v1.0
Course Objectives:
1. To introduce the fundamental concepts and techniques of Natural Language Processing for
analyzing words based on morphology and corpora.
2. To examine NLP models and interpret algorithms for the classification of sentences using
both the traditional symbolic approach and the more recent statistical approach.
3. To get acquainted with the algorithmic description of the main language levels, including
morphology, syntax, semantics, and pragmatics, for information retrieval and machine
translation applications.

Expected Course Outcome:


1. Understand the principles of processing human languages, such as English and Indian
languages, using computers.
2. Create corpora for corpus linguistics based on the digestive approach (text-corpus method).
3. Demonstrate understanding of state-of-the-art algorithms and techniques for text-based
processing of natural language with respect to morphology.
4. Perform POS tagging for a given natural language.
5. Select a suitable language modelling technique based on the structure of the language.
6. Check the syntactic and semantic correctness of sentences using grammars and labelling.
7. Develop computational methods for real-world applications and explore deep-learning-based
NLP.

Student Learning Outcomes (SLO): 2, 7, 17


Module:1 INTRODUCTION TO NLP 3 hours
Introduction to the various levels of natural language processing. Ambiguities and computational
challenges in processing natural languages. Introduction to real-life applications of NLP
such as spell and grammar checkers, information extraction, question answering, and machine
translation.

Module:2 TEXT PROCESSING 6 hours


Character Encoding, Word Segmentation, Sentence Segmentation, Introduction to Corpora,
Corpora Analysis.
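
As an illustration of the segmentation topics in this module, the following is a minimal Python sketch of rule-based sentence and word segmentation. The function names `split_sentences` and `tokenize` and the sample text are hypothetical and not part of the syllabus; the regular expressions are deliberately naive and will mis-split abbreviations such as "Dr." or decimal numbers.

```python
import re

def split_sentences(text):
    """Naive sentence segmentation: split at whitespace that follows ., ! or ?."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]

def tokenize(sentence):
    """Naive word segmentation: runs of word characters, or single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)

# Hypothetical sample text used only for illustration.
text = "NLP is fun. Is it hard? Not always!"
sentences = split_sentences(text)
tokens = [tokenize(s) for s in sentences]
```

Practical segmenters usually add abbreviation lists or trained models on top of rules like these.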

Module:3 MORPHOLOGY 6 hours


Inflectional and Derivational Morphology, Morphological Analysis and Generation using finite state
transducers.

Module:4 LEXICAL SYNTAX 6 hours


Introduction to word types, POS Tagging, Maximum Entropy Models for POS tagging,
Multi-word Expressions.

Module:5 LANGUAGE MODELING 6 hours


The role of language models. Simple N-gram models. Estimating parameters and smoothing.
Evaluating language models.
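
The topics in this module (N-gram models, parameter estimation, smoothing, and evaluation) can be sketched together in a few lines of Python. The example below assumes a bigram model with add-one (Laplace) smoothing, chosen only because it is the simplest smoothing scheme, and evaluates it with perplexity. The toy corpus and the names `train_bigram`, `bigram_prob`, and `perplexity` are hypothetical.

```python
import math
from collections import Counter

def train_bigram(sentences):
    """Count bigrams and their left contexts over tokenized sentences."""
    contexts, bigrams = Counter(), Counter()
    for sent in sentences:
        tokens = ["<s>"] + sent + ["</s>"]
        contexts.update(tokens[:-1])             # every token that has a successor
        bigrams.update(zip(tokens, tokens[1:]))  # adjacent token pairs
    # Words the model can predict: all corpus words plus the end marker.
    vocab = {w for s in sentences for w in s} | {"</s>"}
    return contexts, bigrams, vocab

def bigram_prob(prev, word, contexts, bigrams, vocab):
    """Add-one smoothed estimate of P(word | prev)."""
    return (bigrams[(prev, word)] + 1) / (contexts[prev] + len(vocab))

def perplexity(sentences, contexts, bigrams, vocab):
    """exp of the average negative log-probability per predicted token."""
    log_sum, n = 0.0, 0
    for sent in sentences:
        tokens = ["<s>"] + sent + ["</s>"]
        for prev, word in zip(tokens, tokens[1:]):
            log_sum += math.log(bigram_prob(prev, word, contexts, bigrams, vocab))
            n += 1
    return math.exp(-log_sum / n)

# Hypothetical toy corpus used only for illustration.
corpus = [["the", "cat", "sat"], ["the", "dog", "sat"]]
contexts, bigrams, vocab = train_bigram(corpus)
p_cat_given_the = bigram_prob("the", "cat", contexts, bigrams, vocab)
ppl = perplexity(corpus, contexts, bigrams, vocab)
```

Lower perplexity on held-out text indicates a better model. This sketch does not handle out-of-vocabulary words, which would need an explicit unknown-word token to keep the distribution normalized.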

Module:6 SYNTAX & SEMANTICS 10 hours


Introduction to phrases, clauses and sentence structure, Shallow Parsing and Chunking, Shallow
Parsing with Conditional Random Fields (CRF), Lexical Semantics, Word Sense
Disambiguation, WordNet, Thematic Roles, Semantic Role Labelling with CRFs.

Module:7 APPLICATIONS OF NLP 6 hours


NL Interfaces, Text Summarization, Sentiment Analysis, Machine Translation, Question
Answering.

Module:8 RECENT TRENDS 2 hours


Recent Trends in NLP

Total Lecture hours: 45 hours

Text Book(s)
1. Daniel Jurafsky and James H. Martin, “Speech and Language Processing”, 2nd edition,
Prentice Hall, 2009.
Reference Books
1. Chris Manning and Hinrich Schütze, “Foundations of Statistical Natural Language
Processing”, 2nd edition, MIT Press, Cambridge, MA, 2003.
2. Nitin Indurkhya and Fred J. Damerau, “Handbook of Natural Language Processing”, Second
Edition, CRC Press, 2010.
3. James Allen, “Natural Language Understanding”, 8th Edition, Pearson Publication, 2012.

Mode of Evaluation: Continuous Assessment Test-I (CAT-I), Continuous Assessment Test-II
(CAT-II), Digital Assignments / Quiz / Completion of MOOC, Final Assessment Test (FAT).
Recommended by Board of Studies: 04-04-2014
Approved by Academic Council: No. 37, Date 16-06-2015
