Course Title: Natural Language Processing Lab (自然語言處理實作)
Course number: CS 563200 (3 credits)
Instructor: Jason S. Chang 張俊盛

The course consists of a set of small exercises on natural language processing based on statistical
approaches. The purpose is to give students the opportunity to work with real problems and data in natural
language processing. Each session will start with an explanation of the background, the experimental data, and
snippets of code. Students are required to do the assignment in class. The instructor and teaching
assistants will be on hand to help students. The list of topics planned for Fall 2012 is as follows.

Topics

2012

9/17 1. Python introduction (file handling, n-grams)
9/24 2. Corpus Processing, N-grams, and N-gram Precision (BLEU)
10/1 3. Hidden Markov Model and Dynamic Programming
10/8 4. POS Tagging and Brill's tagger
10/15 5. NLTK and WordNet
10/22 6. Extracting Collocations

10/29 Midterm

11/5 7. Classifying AN-VN Collocations
11/12 8. Clustering Collocations
11/19 9. Class-based Bayesian Model
11/26 10. Finding Best Example Sentences
12/3 11. EM Algorithm
12/10 12. Word Alignment with IBM Model 1
12/17 13. Conditional Random Fields
12/24 14. Chinese Word Segmentation

12/31 New Year Holiday

2013

1/7 Final Exam