CS-332-NLP
A repo for materials relating to the tutorial of CS-332 NLP
Contents
-
Tutorial 1:
- Introduction
- Corpus
- Regular expression
- Tokenization
-
Tutorial 2:
- Normalization
- Parsing
- Morpheme
- Stemming
- Lemmatization
Acknowledgements
- Speech and Language Processing. Daniel Jurafsky & James H. Martin. (Edition 2 & 3)
- Marcinkiewicz, M. A. (1994). Building a large annotated corpus of English: The Penn Treebank. Using Large Corpora, 273.
- http://su.diva-portal.org/smash/record.jsf?pid=diva2%3A686162&dswid=9114