Course Title:自然語言處理實作

(Natural Language Processing Lab)
Course number:CS 563200 3 credits
Instructor: Jason S. Chang 張俊盛

The course consists of a set of small exercises on natural language processing based on statistical
approach. The purpose is to give students opportunity to work with real problems and data in
natural
language processing. Each session will start with explanation of background, experiental data,
and
snippets of code. The students are required to do the assignment in class. The instuctor and
teaching
assistants will be on hand to help students. 每週準備一個題目與實驗資料,題目為基本的研究任務(通
常由某
一篇論文取材)。老師會先講解背景與題目(30─60分鐘),之後同學開始寫程式解決問題。過程中,如果
有問題
可以互相討論,或詢問助教、老師。我們要求當天做完題目,將結果展示給助教,助教根據結果的正確性與完
成時間
打分數,並隨時公布最新的累積分數資料。學期的分數就是每週分數的總和。

Note: (1) 第一週再原訂時間(9/27 星期五上課),之後預計改為星期三下午第7,8,9堂上課。
(2) 上課地點為舊資電館 326 室(使用 iMac 電腦)。

The list of topics planned for the Fall, 2013 is as follows:

Topics

2014
09/16 1. Python introduction (file handling, n grams)

09/23 2. Language model and ngrams
09/30 3. Extracting collocations from ngram
10/07 4. Using MapReduce to handle big data
10/14 5. Using MapReduce to extract collocations
10/21 6. Using MapReduce to extract collocations (II)


10/28 7. Pig
11/04 8. Using MapReduce to extract GDEX
11/11 9. Tiny search engine
11/18 10. Using Maximum Entropy Model to correct grammatical errors
11/25 11. EM Algorithm
12/02 12. Project Announcement
12/09 13. Project discussion
12/16 14. Project discussion
12/23 15. Project discussion
12/30 16. Project presentation

2015
01/06 17. Project presentation
01/13 18. Project presentation