
AI Health Chatbot Project Tasks

The document gives students at Delhi Public School Bangalore - East instructions for an artificial intelligence project on health chatbots. It provides a corpus of 3 sample documents and asks students to apply preprocessing steps to it: sentence segmentation, tokenisation, stopwords removal, lowercase conversion, stemming, and lemmatisation. Students then create a bag of words table, generate TFIDF values, and find the words with the highest and lowest values.

Uploaded by

vaishnavi ravuri
Copyright: © All Rights Reserved

DELHI PUBLIC SCHOOL BANGALORE - EAST

ARTIFICIAL INTELLIGENCE
Project (To be done in the Practical Journal)

The Corpus

Document 1: We can use health chatbots for treating stress.
Document 2: We can use NLP to create chatbots and we will be making health chatbots now.
Document 3: Health Chatbots cannot replace human counsellors now.

Accomplish the following challenges on the basis of the corpus given above.

1. Sentence Segmentation
2. Tokenisation
3. Stopwords removal
4. Lowercase conversion
5. Stemming
6. Lemmatisation
7. Bag of Words: Create a document vector table for all documents.
8. Generate TFIDF values for all the words.
9. Find the words having the highest TFIDF value.
10. Find the words having the lowest TFIDF value.
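
As a starting point for checking journal work, the steps listed above can be sketched in plain Python. This is only one possible approach and rests on several assumptions that the worksheet does not state: the stopword list is a small hand-picked set, sentences are split after ".", "!" or "?", and TFIDF is taken as the word count in a document multiplied by log10(total documents / documents containing the word). The function and variable names (segment, tokenise, normalise, STOPWORDS) are illustrative. Stemming and lemmatisation (steps 5 and 6) are left out because they normally rely on a library such as NLTK.

```python
import math
import re
from collections import Counter

corpus = [
    "We can use health chatbots for treating stress.",
    "We can use NLP to create chatbots and we will be making health chatbots now.",
    "Health Chatbots cannot replace human counsellors now.",
]

# Assumption: a tiny hand-picked stopword list, for illustration only.
STOPWORDS = {"we", "can", "for", "to", "and", "will", "be", "now"}


def segment(text):
    """Step 1: split text into sentences after '.', '!' or '?'."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def tokenise(sentence):
    """Step 2: keep runs of letters as tokens, dropping punctuation."""
    return re.findall(r"[A-Za-z]+", sentence)


def normalise(text):
    """Steps 1 to 4 for one document; returns a list of lowercase tokens."""
    tokens = [tok for sent in segment(text) for tok in tokenise(sent)]
    kept = [tok for tok in tokens if tok.lower() not in STOPWORDS]  # step 3
    return [tok.lower() for tok in kept]  # step 4


docs = [normalise(text) for text in corpus]

# Step 7: bag of words, one row per document and one column per vocabulary word.
vocab = sorted({tok for doc in docs for tok in doc})
counts = [Counter(doc) for doc in docs]
bow = [[c[word] for word in vocab] for c in counts]

# Step 8: TFIDF under the assumed formula count * log10(N / document frequency).
n_docs = len(docs)
df = {word: sum(1 for c in counts if c[word] > 0) for word in vocab}
tfidf = [
    [c[word] * math.log10(n_docs / df[word]) for word in vocab]
    for c in counts
]

# Steps 9 and 10: score each word by its largest TFIDF value in any document.
scores = {word: max(row[i] for row in tfidf) for i, word in enumerate(vocab)}
top, bottom = max(scores.values()), min(scores.values())
highest = [w for w in vocab if scores[w] == top]
lowest = [w for w in vocab if scores[w] == bottom]

print(vocab)
for row in bow:
    print(row)
print("highest:", highest)
print("lowest:", lowest)  # with these assumptions: ['chatbots', 'health']
```

Under these assumptions, words that occur in all three documents get a TFIDF value of 0, since log10(3/3) = 0, while words that occur in only one document get the largest value. A different stopword list or TFIDF formula will change the table, so the output should be compared against the definitions used in class.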

**********
