Course Title:自然語言處理實作

(Natural Language Processing Lab)
Course number:CS 563200 3 credits
Instructor: Jason S. Chang 張俊盛

The course consists of a set of small exercises on natural language processing based on
statistical
approach. The purpose is to give students opportunity to work with real problems and data
in natural
language processing. Each session will start with explanation of background, experiental
data, and
snippets of code. The students are required to do the assignment in class. The instuctor and
teaching
assistants will be on hand to help students. The list of topics planned for the Fall, 2006 is
as follows.

Topics
9/24 1. Python introduction (file handling, n grams)
10/1 2. Corpus Processing and Ngrams and Ngram Precision (BLEU)
10/8 3. Edit Distance and Dynamic Programming
10/15 4. Sequence Labeling (POS tagger)
10/22 5. Natural language processing using NLTK and WordNet
10/29 6. Collocations (in Python)
11/5 Midterm
11/12 7. Collocations (Using MapReduce and Web 1T)
11/19 8. Using MapReduce to build VerbOcean
11/26 9. Statistical Classifier (ME Model) - Sentence Matching
12/3 10. Sentence Alignment
12/10 11. IBM Model 1
12/17 12. Constructing Bilingual Phrases Based on Consistent Block
12/24 13. Machine Translation Decoder (I)
12/31 14 Machine Translation Decoder (II)
1/7 Final Exam