Slides from Oct. 22nd

October 22, 2009 · slides · (No comments)

Here are the slides from today’s class covering Python and shell integration, and the difficulties with homework 7.

Also note that I have extended the deadline for homework 8 to this Sunday at midnight.

ling5200-nltk3-4-slides.pdf

Slides from Oct. 15th

October 15, 2009 · slides · (No comments)

Here are the slides from today’s class covering normalization and tokenization using regular expressions and the NLTK. We did not get to the last section on tokenization, so we will postpone it until Tuesday. Have a nice weekend.

ling5200-nltk3-3-slides.pdf
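
As a rough illustration of normalization and tokenization with regular expressions, here is a minimal sketch using only Python's standard `re` module. It is not taken from the slides: the pattern, the `TOKEN_PATTERN` name, and the `tokenize` function are assumptions made for this example, and the NLTK provides its own tokenizers that handle many more cases.

```python
import re

# Two alternatives, tried left to right:
#   1. a run of word characters, optionally joined by apostrophes or hyphens
#   2. any single character that is neither a word character nor whitespace
TOKEN_PATTERN = re.compile(r"\w+(?:['’-]\w+)*|[^\w\s]")


def tokenize(text):
    """Lowercase the text (a simple normalization step) and split it into tokens."""
    return TOKEN_PATTERN.findall(text.lower())


print(tokenize("Don't panic, it's a well-known fact."))
```

Lowercasing before matching is only one possible normalization choice; it merges "The" and "the" but also loses the distinction between names and common nouns.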

Slides from Oct. 13th

October 13, 2009 · slides · (No comments)

Here are the slides from today’s class covering Unicode and regular expressions in Python. I corrected the problem with the codecs.open example.

ling5200-nltk3-2-slides.pdf

Slides from Oct. 8th

October 8, 2009 · slides · (No comments)

Here are the slides from today’s class covering string basics and methods in Python.

ling5200-nltk-3-1-slides.pdf

Slides for Oct. 6th

October 6, 2009 · slides · (No comments)

Here are the slides from today’s class covering file input and output, reading from stdin, and command line arguments and options. Please also look at the args.py and opts.py files under resources/py, which have some examples. Two other things to note:

The combined course notes file now has an appendix with solutions to the practice problems.
I updated the celex.txt file in resources/texts. Please run svn update to get the latest version.

ling5200-nltk3-slides.pdf
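
For a quick reminder of how command line options and stdin fit together, here is a minimal sketch. It does not reproduce args.py or opts.py from the repository; the function names and the `--ignore-blank` option are hypothetical, and it assumes the standard `argparse` module from current Python versions.

```python
import argparse
import sys


def build_parser():
    """Build a parser with one option and one optional file argument."""
    parser = argparse.ArgumentParser(description="Count lines in a file or on stdin.")
    # --ignore-blank is a hypothetical option, chosen only for illustration
    parser.add_argument("-i", "--ignore-blank", action="store_true",
                        help="skip lines that contain only whitespace")
    # With no file argument, path stays None and we read standard input
    parser.add_argument("path", nargs="?", default=None,
                        help="file to read (default: stdin)")
    return parser


def count_lines(stream, ignore_blank=False):
    """Return the number of lines in an open text stream (or any iterable of lines)."""
    total = 0
    for line in stream:
        if ignore_blank and not line.strip():
            continue
        total += 1
    return total


def main(argv=None):
    """Parse argv (sys.argv[1:] when None) and count lines in the chosen input."""
    args = build_parser().parse_args(argv)
    if args.path is None:
        return count_lines(sys.stdin, args.ignore_blank)
    with open(args.path, encoding="utf-8") as handle:
        return count_lines(handle, args.ignore_blank)


# Parse an explicit argument list instead of sys.argv to see what the parser produces
args = build_parser().parse_args(["--ignore-blank"])
print(args.ignore_blank, args.path)
```

To use this as a script, call `print(main())` at the bottom of the file; then something like `cat celex.txt | python count.py -i` would read from stdin (the script name `count.py` is made up for this example).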

Slides from Oct. 1

October 1, 2009 · slides · (No comments)

Here are the slides from today’s class covering semantic relations. We also talked about automatic historical linguistics. You can try out the handy script in the repository at resources/py/auto_histling.py.

Martha Palmer also gave a brief introduction to some of the corpora and databases that are available for use. Please look at the list on the linguistics website. If you are interested in using one of these, you will need to ask Martha for an account on the verbs or babel server.

ling5200-nltk2-2-slides.pdf

Slides for Sep. 29th

September 29, 2009 · slides · (No comments)

Here are the slides for today’s class covering NLTK lexical resources and Python functions and modules.

We did not get to the last part about automatically doing historical linguistics. We will cover that on Thursday.

ling5200-nltk2-1-slides.pdf

Slides from Sep 24th

September 24, 2009 · slides · (No comments)

Here are the slides from today’s class, covering an introduction to various corpora in the NLTK, and a discussion of calculating conditional frequency distributions.

ling5200-nltk2-slides.pdf
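
To make the idea of a conditional frequency distribution concrete, here is a minimal sketch built from the standard library alone. It stands in for the NLTK's `ConditionalFreqDist` rather than using it, and the `conditional_freq` function and the toy (genre, word) pairs are invented for illustration.

```python
from collections import Counter, defaultdict


def conditional_freq(pairs):
    """Map each condition to a Counter of the samples seen under that condition."""
    table = defaultdict(Counter)
    for condition, sample in pairs:
        table[condition][sample] += 1
    return table


# Toy data, invented for illustration: (genre, word) pairs
pairs = [
    ("news", "the"), ("news", "said"), ("news", "the"),
    ("romance", "the"), ("romance", "love"),
]
cfd = conditional_freq(pairs)
print(cfd["news"]["the"])      # 2
print(cfd["romance"]["love"])  # 1
print(cfd["romance"]["said"])  # 0, since a Counter returns 0 for unseen samples
```

The point is that there is one ordinary frequency distribution per condition, so comparing genres comes down to looking up the same word under different conditions.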

Slides for Sep. 22nd

September 22, 2009 · slides · (No comments)

Here are the slides from today’s class covering control structures (conditionals and loops) in Python.

ling5200-nltk1-2-slides.pdf

Slides on Python lists and word frequency

September 17, 2009 · slides · (No comments)

Here are the slides from today, which give examples of how to use lists in Python and how to calculate word frequency using the NLTK.

ling5200-nltk1-1-slides.pdf
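
As a small illustration of counting word frequency over a Python list, here is a minimal sketch. It uses `collections.Counter` from the standard library as a stand-in for the NLTK's `FreqDist`, and the `word_frequencies` function and the word list are invented for this example.

```python
from collections import Counter


def word_frequencies(words):
    """Count how often each word occurs in a list, ignoring case."""
    return Counter(word.lower() for word in words)


# Toy word list, invented for illustration
words = ["The", "cat", "saw", "the", "dog", "and", "the", "bird"]
freq = word_frequencies(words)
print(freq["the"])          # 3
print(freq.most_common(1))  # [('the', 3)]
```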

Linguistics 5200 Fall 2009

Introduction to computational corpus linguistics