Welcome to Scribd!

AIMESOFT20220304 AnhNT Kỹ Sư Xử Lý Ngôn Ngữ Tự Nhiên

Uploaded by

0% found this document useful (0 votes)

1 views2 pages

Once you upload an approved document, you will be able to download the document Once you upload an approved document, you will be able to download the document Once you upload an approved document, you will be able to download the document

Original Title

AIMESOFT20220304_AnhNT_Kỹ_sư_Xử_lý_Ngôn_ngữ_Tự_nhiên

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

1 views2 pages

AIMESOFT20220304 AnhNT Kỹ Sư Xử Lý Ngôn Ngữ Tự Nhiên

Uploaded by

Trâm Thùy

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

BÀI TEST VÒNG 1

Kỹ sư Xử lý Ngôn ngữ Tự nhiên

Số: 20220304

Một số lưu ý dành cho bạn:

- Viết chương trình bài test bằng ngôn ngữ bạn sử dụng thành thạo nhất
- Thời hạn gửi bài: muộn nhất vào 23.59 (ICT) thứ Tư, ngày 09/03/2022.
- Gửi bài và mọi thắc mắc vui lòng liên hệ theo địa chỉ:
Email: jobs@aimesoft.com
Số điện thoại: 0989558851 (Mr Huy)

Câu 1.
Trong bộ gõ tiếng Nhật, để chuyển một dãy các kí tự Hiragana (chữ mềm) sang chữ Kanji, người
ta thường dùng một từ điển như sau

Cách đọc Từ Kanji

かんじ感じ

かんじ漢字

かんじ幹事

へんかん変換

へんかん返還

Khi user nhập vào chuỗi ký tự かんじへんかん, chúng ta cần liệt kê các trường hợp có thể
convert được từ chuỗi ký tự này sang chữ Kanji tương ứng.

1) Để lưu trữ từ điển dạng như trên một cách hiệu quả, người ta thường dùng loại cấu trúc dữ
liệu nào?

2) Download file từ điển dict.txt (cách đọc Hiragana và Hán tự tương ứng cách nhau bằng dấu
cách), dùng cấu trúc dữ liệu ở câu 1 để lưu từ điển vào máy tính. (cho phép sử dụng thư viện)

Câu 2.
Pseudo code dưới đây mô tả thuật toán Naive Bayes cho bài toán phân loại văn bản. (Nguồn:
https://web.stanford.edu/~jurafsky/slp3/4.pdf)
Yêu cầu:

1) Sử dụng data sentiment analysis ở link:

https://raw.githubusercontent.com/minhpqn/nlp_100_drill_exercises/master/data/sentiment.txt

Chia dữ liệu theo tỷ lệ 80/20 trong đó 80% dùng để train mô hình và 20% dùng để đánh giá mô
hình.

2) Cài đặt hai function train/test mô tả trong pseudo-code trên bằng ngôn ngữ Python sau đó
huấn luyện trên tập train và đưa ra accuracy trên tập test. Tập train/test sinh ra ở phần 1).

The Art of War: A New Translation
From Everand
The Art of War: A New Translation
Sun Tzu
Rating: 4 out of 5 stars
4/5 (3044)
Art of War: The Definitive Interpretation of Sun Tzu's Classic Book of Strategy
From Everand
Art of War: The Definitive Interpretation of Sun Tzu's Classic Book of Strategy
Stephen F. Kaufman
Rating: 4 out of 5 stars
4/5 (3321)
How to Win Friends and Influence People: Updated For the Next Generation of Leaders
From Everand
How to Win Friends and Influence People: Updated For the Next Generation of Leaders
Dale Carnegie
Rating: 4 out of 5 stars
4/5 (2315)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Rating: 4 out of 5 stars
4/5 (5807)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Rating: 4.5 out of 5 stars
4.5/5 (3285)
The 7 Habits of Highly Effective People
From Everand
The 7 Habits of Highly Effective People
Stephen R. Covey
Rating: 4 out of 5 stars
4/5 (353)
The 7 Habits of Highly Effective People Personal Workbook
From Everand
The 7 Habits of Highly Effective People Personal Workbook
Stephen R. Covey
Rating: 4 out of 5 stars
4/5 (2515)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Rating: 4.5 out of 5 stars
4.5/5 (20045)
The Handmaid's Tale
From Everand
The Handmaid's Tale
Margaret Atwood
Rating: 4 out of 5 stars
4/5 (13226)
The Iliad: The Fitzgerald Translation
From Everand
The Iliad: The Fitzgerald Translation
Robert Fitzgerald
Rating: 4 out of 5 stars
4/5 (5646)
Freakonomics Rev Ed
From Everand
Freakonomics Rev Ed
Steven D. Levitt
Rating: 4 out of 5 stars
4/5 (7863)
Life of Pi: A Novel
From Everand
Life of Pi: A Novel
Yann Martel
Rating: 4 out of 5 stars
4/5 (13473)
Pride and Prejudice: Bestsellers and famous Books
From Everand
Pride and Prejudice: Bestsellers and famous Books
Jane Austen
Rating: 4.5 out of 5 stars
4.5/5 (20479)
Habit 3 Put First Things First: The Habit of Integrity and Execution
From Everand
Habit 3 Put First Things First: The Habit of Integrity and Execution
Stephen R. Covey
Rating: 4 out of 5 stars
4/5 (2507)
How To Win Friends And Influence People
From Everand
How To Win Friends And Influence People
Dale Carnegie
Rating: 4.5 out of 5 stars
4.5/5 (6527)
Habit 1 Be Proactive: The Habit of Choice
From Everand
Habit 1 Be Proactive: The Habit of Choice
Stephen R. Covey
Rating: 4 out of 5 stars
4/5 (2556)
Anna Karenina: Bestsellers and famous Books
From Everand
Anna Karenina: Bestsellers and famous Books
Leo Tolstoy
Rating: 4 out of 5 stars
4/5 (7503)
The Odyssey: (The Stephen Mitchell Translation)
From Everand
The Odyssey: (The Stephen Mitchell Translation)
Stephen Mitchell
Rating: 4 out of 5 stars
4/5 (7771)
Habit 6 Synergize: The Habit of Creative Cooperation
From Everand
Habit 6 Synergize: The Habit of Creative Cooperation
Stephen R. Covey
Rating: 4 out of 5 stars
4/5 (2499)
The 7 Habits of Highly Effective People
From Everand
The 7 Habits of Highly Effective People
Stephen R. Covey
Rating: 4 out of 5 stars
4/5 (2568)
Wuthering Heights Complete Text with Extras
From Everand
Wuthering Heights Complete Text with Extras
Emily Bronte
Rating: 4 out of 5 stars
4/5 (9955)
The Iliad: A New Translation by Caroline Alexander
From Everand
The Iliad: A New Translation by Caroline Alexander
Homer
Rating: 4 out of 5 stars
4/5 (5719)
The Picture of Dorian Gray (The Original 1890 Uncensored Edition + The Expanded and Revised 1891 Edition)
From Everand
The Picture of Dorian Gray (The Original 1890 Uncensored Edition + The Expanded and Revised 1891 Edition)
Oscar Wilde
Rating: 4 out of 5 stars
4/5 (9054)
Oscar Wilde: The Unrepentant Years
From Everand
Oscar Wilde: The Unrepentant Years
Nicholas Frankel
Rating: 4 out of 5 stars
4/5 (10242)
American Gods: The Tenth Anniversary Edition
From Everand
American Gods: The Tenth Anniversary Edition
Neil Gaiman
Rating: 4 out of 5 stars
4/5 (12949)
Stardust
From Everand
Stardust
Neil Gaiman
Rating: 4 out of 5 stars
4/5 (8139)
Wuthering Heights (Seasons Edition -- Winter)
From Everand
Wuthering Heights (Seasons Edition -- Winter)
Emily Bronte
Rating: 4 out of 5 stars
4/5 (9486)
The Picture of Dorian Gray: Classic Tales Edition
From Everand
The Picture of Dorian Gray: Classic Tales Edition
Oscar Wilde
Rating: 4 out of 5 stars
4/5 (9758)