October 22, 2009 · slides · (No comments)

Here are the slides from today’s class covering python and shell integration, and homework 7 difficulties.

Also note that I have extended homework 8 until this Sunday at midnight.

pdf iconling5200-nltk3-4-slides.pdf

October 15, 2009 · slides · (No comments)

Here are the slides from today’s class covering normalization and tokenization using regular expressions and the nltk. We did not get to the last section on tokenization, so we will postpone that until Tuesday. Have a nice weekend.

pdf iconling5200-nltk3-3-slides.pdf

October 13, 2009 · slides · (No comments)

Here are the slides from today’s class covering unicode and regular expressions in python. I corrected the problem with the codecs.open example.

pdf iconling5200-nltk3-2-slides.pdf

October 8, 2009 · slides · (No comments)

Here are the slides from today’s class covering string basics and methods in python.

pdf iconling5200-nltk-3-1-slides.pdf

October 6, 2009 · slides · (No comments)

Here are the slides from today’s class covering file input and output, reading from stdin, and command line arguments and options. Please also look at the args.py and opts.py files under resources/py, which have some examples. Two other things to note:

  1. The combined course notes file now has an appendix with solutions to practice problems
  2. I updated the celex.txt file in resources/texts. Please run svn update to get the latest version

pdf iconling5200-nltk3-slides.pdf

October 1, 2009 · slides · (No comments)

Here are the slides from today’s class covering semantic relations. We also talked about automatic historical linguistics. You can try out the handy script from the repository in resources/py/auto_histling.py

Martha Palmer also gave a brief introduction to some of the corpora and databases that are available for use. Please look at the list on the linguistics website. If you are interested in using one of these, you will need to ask Martha for an account on the verbs or babel server.

pdf iconling5200-nltk2-2-slides.pdf

September 29, 2009 · slides · (No comments)

Here are the slides for today’s class covering nltk lexical resources, python functions and modules.

We did not get to the last part about automatically doing historical linguistics. We will cover that on Thursday.

pdf iconling5200-nltk2-1-slides.pdf

September 24, 2009 · slides · (No comments)

Here are the slides from today’s class, covering an introduction to various corpora in the NLTK, and a discussion of calculating conditional frequency distributions.

pdf iconling5200-nltk2-slides.pdf

September 22, 2009 · slides · (No comments)

Here are the slides from today’s class covering control structures (conditionals and loops) in python.

pdf iconling5200-nltk1-2-slides.pdf

September 17, 2009 · slides · (No comments)

Here are the slides from today, which give examples of how to use lists in python, and how to calculate word frequency using the NLTK.

pdf iconling5200-nltk1-1-slides.pdf