Here are the slides from today’s class covering python and shell integration, and homework 7 difficulties.
Also note that I have extended homework 8 until this Sunday at midnight.
Here are the slides from today’s class covering python and shell integration, and homework 7 difficulties.
Also note that I have extended homework 8 until this Sunday at midnight.
Here are the slides from today’s class covering normalization and tokenization using regular expressions and the nltk. We did not get to the last section on tokenization, so we will postpone that until Tuesday. Have a nice weekend.
Here are the slides from today’s class covering unicode and regular expressions in python. I corrected the problem with the codecs.open example.
Here are the slides from today’s class covering string basics and methods in python.
Here are the slides from today’s class covering file input and output, reading from stdin, and command line arguments and options. Please also look at the args.py and opts.py files under resources/py, which have some examples. Two other things to note:
Here are the slides from today’s class covering semantic relations. We also talked about automatic historical linguistics. You can try out the handy script from the repository in resources/py/auto_histling.py
Martha Palmer also gave a brief introduction to some of the corpora and databases that are available for use. Please look at the list on the linguistics website. If you are interested in using one of these, you will need to ask Martha for an account on the verbs or babel server.
Here are the slides for today’s class covering nltk lexical resources, python functions and modules.
We did not get to the last part about automatically doing historical linguistics. We will cover that on Thursday.
Here are the slides from today’s class, covering an introduction to various corpora in the NLTK, and a discussion of calculating conditional frequency distributions.
Here are the slides from today’s class covering control structures (conditionals and loops) in python.
Here are the slides from today, which give examples of how to use lists in python, and how to calculate word frequency using the NLTK.