Welcome to Scribd!

A2 Language Model

Uploaded by

0% found this document useful (0 votes)

7 views1 page

This document outlines an assignment to build a language model that can generate coherent text based on a given input. Students are tasked with acquiring a text dataset, preprocessing the data, training a model on the data, and developing a web application to demonstrate the model. The deliverables are a Jupyter notebook documenting the work, a README file, and the web app folder. The assignment is divided into three main tasks: 1) choosing a dataset, 2) training a model on the data, and 3) developing a web app to interface with the trained model.

Original Description:

Original Title

A2_Language_Model

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

7 views1 page

A2 Language Model

Uploaded by

Munthitra Thadthapong

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 1

Search inside document

Chaklam Silpasuwanchai, Todsavad Tangtortan

AT82.05 Artificial Intelligence: Natural Language Understanding (NLU)

A2: Language Model
In this assignment, we will focus on building a language model using a text dataset of your choice. The
objective is to train a model that can generate coherent and contextually relevant text based on a given
input. Additionally, you will develop a simple web application to demonstrate the capabilities of your
language model interactively.
Note: You are ENCOURAGED to work with your friends, but DISCOURAGED to blindly copy other’s
work. Both parties will be given 0.
Note: Comments should be provided sufficiently so we know you understand. Failure to do so can raise
suspicion of possible copying/plagiarism.
Note: You will be graded upon (1) documentation, (2) experiment, (3) implementation.
Note: This is a one-weeks assignment, but start early.
Deliverables: The GitHub link containing the jupyter notebook, a README.md of the github, and
the folder of your web application called ‘app’.
Task 1. Dataset Acquisition - Your first task is to find a suitable text dataset. (1 points)
1) Choose your dataset and provide a brief description. Ensure to source this dataset from reputable
public databases or repositories. It is imperative to give proper credit to the dataset source in your
documentation.
Note: The dataset can be based on any theme such as Harry Potter, Star Wars, jokes, Isaac Asimov’s
works, Thai stories, etc. The key requirement is that the dataset should be text-rich and suitable for
language modeling.
Task 2. Model Training - Incorporate the chosen dataset into our existing code framework. Train a
language model that can understand the context and style of the text.
1) Detail the steps taken to preprocess the text data. (1 points)
2) Describe the model architecture and the training process. (1 points)
Task 3. Text Generation - Web Application Development - Develop a simple web application that
demonstrates the capabilities of your language model. (2 points)
1) The application should include an input box where users can type in a text prompt.
2) Based on the input, the model should generate and display a continuation of the text. For example,
if the input is ”Harry Potter is”, the model might generate ”a wizard in the world of Hogwarts”.
3) Provide documentation on how the web application interfaces with the language model.

Best of luck in developing your text-generation model!

1.A Review of Programming Paradigms Throughout The History - With A Suggestion Toward A Future Approach
Document87 pages
1.A Review of Programming Paradigms Throughout The History - With A Suggestion Toward A Future Approach
ShivaJps
100% (1)
Spp2 Siemens TXP HW Manual
Document293 pages
Spp2 Siemens TXP HW Manual
Diego Armando Vanegas Rodriguez
No ratings yet
Authorized Licensed Use Limited To: 802 Member. Downloaded On October 14,2010 at 01:47:30 UTC From IEEE Xplore. Restrictions Apply
Document77 pages
Authorized Licensed Use Limited To: 802 Member. Downloaded On October 14,2010 at 01:47:30 UTC From IEEE Xplore. Restrictions Apply
Bart
No ratings yet
Oscilloscope Tutorial
Document23 pages
Oscilloscope Tutorial
mrana_56
No ratings yet
OOPS Interview Questions
Document58 pages
OOPS Interview Questions
aditya miet
No ratings yet
Topical Chat Towards Knowledge Grounded Open Domain Conversations
Document5 pages
Topical Chat Towards Knowledge Grounded Open Domain Conversations
Com Digful
No ratings yet
Object-Oriented Programming With Swift 2 - Sample Chapter
Document38 pages
Object-Oriented Programming With Swift 2 - Sample Chapter
Packt Publishing
No ratings yet
Designing and Implementing Conversationa
Document12 pages
Designing and Implementing Conversationa
stevenkanyanjua1
No ratings yet
A1 Engine Search
Document2 pages
A1 Engine Search
Munthitra Thadthapong
No ratings yet
PHP OOPS Interview Questions
Document117 pages
PHP OOPS Interview Questions
Nadeem Shaikh
No ratings yet
Deep Learning in Practice Project Two: NLP of The Holy Quran in Python
Document11 pages
Deep Learning in Practice Project Two: NLP of The Holy Quran in Python
shoaib riaz
No ratings yet
How To Create A Private ChatGPT With Your Own Data
Document11 pages
How To Create A Private ChatGPT With Your Own Data
Jackall
No ratings yet
Fake News Detection Using LSTM
Document67 pages
Fake News Detection Using LSTM
shoaib arshath
No ratings yet
A3 Machine Translation
Document2 pages
A3 Machine Translation
Munthitra Thadthapong
No ratings yet
Lab 03 CMP7202
Document7 pages
Lab 03 CMP7202
ALAO
No ratings yet
Learning Object-Oriented Programming - Sample Chapter
Document29 pages
Learning Object-Oriented Programming - Sample Chapter
Packt Publishing
No ratings yet
Ch01 - Introduction To OOP - 2020 PDF
Document15 pages
Ch01 - Introduction To OOP - 2020 PDF
Kaleb
No ratings yet
Assignment 4 - Comp8547
Document2 pages
Assignment 4 - Comp8547
Jay Saavn
No ratings yet
Why The Lucky Stif
Document14 pages
Why The Lucky Stif
joshua lim
No ratings yet
Worksheet 4.8 SQ LITE .
Document7 pages
Worksheet 4.8 SQ LITE .
Oscarinho Ortiz
No ratings yet
Mad ct2 Part 1 PDF
Document40 pages
Mad ct2 Part 1 PDF
550-Shivendra Tiwari
No ratings yet
Prompt Egineering Techniques
Document31 pages
Prompt Egineering Techniques
Kanishk Ghonge
100% (1)
Osdc Lua 20230211
Document51 pages
Osdc Lua 20230211
doudoustaff
No ratings yet
Lecture 1.1 - Introducing Java
Document21 pages
Lecture 1.1 - Introducing Java
ayan
No ratings yet
CSC514-CSC528 Group Term Project
Document2 pages
CSC514-CSC528 Group Term Project
VictorAyoJimoh
No ratings yet
Java History
Document31 pages
Java History
H-Kati
No ratings yet
Course: COMP1649 Interaction Design Contribution: 100% of Course
Document6 pages
Course: COMP1649 Interaction Design Contribution: 100% of Course
FAID HN Nguyen Ngoc Trung
No ratings yet
Minor Project Synopsis 2021 Group Aab
Document3 pages
Minor Project Synopsis 2021 Group Aab
Abhijeet Shastri
No ratings yet
Foundation of Programing: Object
Document50 pages
Foundation of Programing: Object
Ganesh
No ratings yet
ISE 121 OOP Lab Manual 2018
Document3 pages
ISE 121 OOP Lab Manual 2018
Ashley Tanaka Mutenha
No ratings yet
A Pattern Language For Reverse Engineering: Serge Demeyer, Stéphane Ducasse, Oscar Nierstrasz
Document20 pages
A Pattern Language For Reverse Engineering: Serge Demeyer, Stéphane Ducasse, Oscar Nierstrasz
mysticsoul
No ratings yet
Irvine Syllabus CCTP 711 2020
Document10 pages
Irvine Syllabus CCTP 711 2020
Orlando Fernández
No ratings yet
Learn Kotlin for Android Development: The Next Generation Language for Modern Android Apps Programming
From Everand
Learn Kotlin for Android Development: The Next Generation Language for Modern Android Apps Programming
Peter Späth
No ratings yet
Text Mining - Analytics
Document35 pages
Text Mining - Analytics
Sukeshan R
No ratings yet
Lesson Plan Mail Merge
Document4 pages
Lesson Plan Mail Merge
Rye Enriquez
No ratings yet
Assignment 1
Document2 pages
Assignment 1
ashish
No ratings yet
Python Chat Bot Project
Document6 pages
Python Chat Bot Project
Gagana
100% (1)
Introducing Object-Oriented Programming (OOP)
Document21 pages
Introducing Object-Oriented Programming (OOP)
Junrie Banz
No ratings yet
Learning Dart - Second Edition - Sample Chapter
Document30 pages
Learning Dart - Second Edition - Sample Chapter
Packt Publishing
100% (2)
A4 Resume Parser
Document1 page
A4 Resume Parser
Munthitra Thadthapong
No ratings yet
Developing A Language For Spoken Programming: Benjamin M. Gordon Department of Computer Science University of New Mexico
Document7 pages
Developing A Language For Spoken Programming: Benjamin M. Gordon Department of Computer Science University of New Mexico
John Watson
No ratings yet
Thesis Proposal
Document4 pages
Thesis Proposal
beki4
No ratings yet
R18 B.TECH CSE IIIYear Labs Syllabus
Document15 pages
R18 B.TECH CSE IIIYear Labs Syllabus
NANUVALA TIRUPATHI
No ratings yet
A Reader's Guide To Teaching Beginners To Code
Document10 pages
A Reader's Guide To Teaching Beginners To Code
api-438276283
No ratings yet
Online Stock Management
Document28 pages
Online Stock Management
palraj20021121
No ratings yet
Application Development With MapObjects and Visual Basic PDF
Document9 pages
Application Development With MapObjects and Visual Basic PDF
Wanly Pereira
No ratings yet
Chatbot
Document3 pages
Chatbot
shilpa
No ratings yet
Python Chatbot Project: January 2022
Document6 pages
Python Chatbot Project: January 2022
VISHNU PRASHANTH REDDY S
No ratings yet
Getting Started: Android Developer Website
Document20 pages
Getting Started: Android Developer Website
SolisterADV
No ratings yet
File Handling
Document3 pages
File Handling
PollarOpposite
No ratings yet
An Object Oriented Programming Language - Practical Go Lessons-39
Document12 pages
An Object Oriented Programming Language - Practical Go Lessons-39
looklo
No ratings yet
Python Chatbot Project
Document6 pages
Python Chatbot Project
Bhavani Reddy
No ratings yet
Unit-1: 1. Introduction To Object Oriented Programming
Document12 pages
Unit-1: 1. Introduction To Object Oriented Programming
ssrkm gupta
No ratings yet
18 Thesis
Document4 pages
18 Thesis
bsh6df70
100% (2)
Javandroid
Document44 pages
Javandroid
engmik2010
No ratings yet
Cs101 Solved Finalterm Papers Mega File
Document27 pages
Cs101 Solved Finalterm Papers Mega File
Intisar Ahmad
50% (2)
Python Chatbot Project: January 2022
Document6 pages
Python Chatbot Project: January 2022
my pc
No ratings yet
Educational Quiz (Eduquizz) : Name-Harsh Patel Class and Section-12B Roll No - 9
Document70 pages
Educational Quiz (Eduquizz) : Name-Harsh Patel Class and Section-12B Roll No - 9
Harsh Patel
No ratings yet
Twitter Sentiment Analysis Using Python TweetX
Document3 pages
Twitter Sentiment Analysis Using Python TweetX
Alexander Didenko
No ratings yet
Pro C# 8 with .NET Core 3: Foundational Principles and Practices in Programming
From Everand
Pro C# 8 with .NET Core 3: Foundational Principles and Practices in Programming
Andrew Troelsen
No ratings yet
Msbte Board Question Paper
Document162 pages
Msbte Board Question Paper
Mr. Omkar
No ratings yet
Core Objective-C in 24 Hours
From Everand
Core Objective-C in 24 Hours
Keith Lee
Rating: 5 out of 5 stars
5/5 (1)
Basic Guide to Programming Languages Python, JavaScript, and Ruby
From Everand
Basic Guide to Programming Languages Python, JavaScript, and Ruby
Kiet Huynh
No ratings yet
At Commands Lte 3GPP TS 27 007
Document214 pages
At Commands Lte 3GPP TS 27 007
Ravi SV
No ratings yet
Installation Instructions: CD200D Digital Ignition System Form CD200D II 9-10
Document25 pages
Installation Instructions: CD200D Digital Ignition System Form CD200D II 9-10
wcuevasm
No ratings yet
PPT2-Simplex Method
Document118 pages
PPT2-Simplex Method
Adeksem Marta
No ratings yet
प्याज खेती-Onion production technology in Nepal
Document69 pages
प्याज खेती-Onion production technology in Nepal
Ramindra Suwal
83% (6)
Filemaker Versions
Document1 page
Filemaker Versions
dit_marzio
No ratings yet
Understanding of Mipi I3c White Paper v0.95
Document13 pages
Understanding of Mipi I3c White Paper v0.95
Nhan
No ratings yet
Week 3 - Introduction To The Profession - Presentation
Document16 pages
Week 3 - Introduction To The Profession - Presentation
fhuck mheel
No ratings yet
WAP Push Messages
Document10 pages
WAP Push Messages
kartik
No ratings yet
Baby Shoe Template Instructions
Document5 pages
Baby Shoe Template Instructions
kawaibaby
No ratings yet
Design & Analysis of Algorithms - 88 MCQs With Answers - Part 1 - Department of Computer Engineers PDF
Document29 pages
Design & Analysis of Algorithms - 88 MCQs With Answers - Part 1 - Department of Computer Engineers PDF
mainak
No ratings yet
Jobs in Mumbai
Document4 pages
Jobs in Mumbai
abhinav agarwal
No ratings yet
Object Oriented Programming - Online Notes
Document66 pages
Object Oriented Programming - Online Notes
vaibhav agarwal
No ratings yet
Jntua - M Tech - r17 - Jntua M.tech Regulation r17 Eee Electrical Power Engineering Course Structure Syllabus
Document53 pages
Jntua - M Tech - r17 - Jntua M.tech Regulation r17 Eee Electrical Power Engineering Course Structure Syllabus
Raghu
No ratings yet
MSCBSC Threads295785
Document26 pages
MSCBSC Threads295785
Rafania Lintang
No ratings yet
Oppo Tablet Xiaomi Resmi Tam: HP Bebas Sinyal - Pulsa
Document1 page
Oppo Tablet Xiaomi Resmi Tam: HP Bebas Sinyal - Pulsa
Ghozali
No ratings yet
Applying Learning Theories To Online Instructional Design
Document24 pages
Applying Learning Theories To Online Instructional Design
sah108_pk796
No ratings yet
ECE 307 - Lecture 1 DC Circuit Components, Connections, and KCL
Document45 pages
ECE 307 - Lecture 1 DC Circuit Components, Connections, and KCL
Carl Fitzpatrick
No ratings yet
6ED10521MD080BA0 Datasheet en PDF
Document2 pages
6ED10521MD080BA0 Datasheet en PDF
Juan Valladares
No ratings yet
Plagiarism and Its Detection in Programming Languages
Document8 pages
Plagiarism and Its Detection in Programming Languages
Glen E. Enaje
No ratings yet
Module 8
Document1 page
Module 8
Cath Lapuz
No ratings yet
Cs Worksheet
Document2 pages
Cs Worksheet
Darkwisp
No ratings yet
Tatenda Muzenda Attachment Report
Document27 pages
Tatenda Muzenda Attachment Report
Blessed Majongwe
No ratings yet
Attachment 2 - AHIMS PDF
Document2 pages
Attachment 2 - AHIMS PDF
BIKASH MEHTA
No ratings yet
PLC - Dcs Notes
Document154 pages
PLC - Dcs Notes
Secret Service
No ratings yet
A Proposal On Machine Learning Via Dynamical Systems
Document11 pages
A Proposal On Machine Learning Via Dynamical Systems
Pol Mestres
No ratings yet
The Demise of SiteWatch
Document58 pages
The Demise of SiteWatch
Rainlad Cummings
No ratings yet