December 17, 2009 · News · (No comments)

I have made comments in your final papers and committed them to the Subversion repository. I made the comments directly in the PDF (if you submitted in something other than PDF, I converted it to PDF, with the same basename). You should be able to view the comments in your favorite PDF viewer (Adobe Reader, Skim, Okular, Apple’s Preview, etc.).

I also added a file called course_grades.txt to each of your directories, which lists all your grades for the course. It is a tab-delimited file. For pretty viewing, you might want to open it with a spreadsheet program.
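If you would rather look at the file from Python than from a spreadsheet, here is a minimal sketch using the standard csv module. The function name read_tab_delimited and the sample rows are made up for illustration; the actual columns in course_grades.txt may look different.

```python
import csv
import io


def read_tab_delimited(handle):
    """Return the rows of a tab-delimited file as lists of strings."""
    return [row for row in csv.reader(handle, delimiter="\t")]


# Hypothetical sample content; the real layout of course_grades.txt may differ.
sample = io.StringIO("assignment\tscore\nhomework 1\t10\n")
for row in read_tab_delimited(sample):
    print(" | ".join(row))
```

To read the real file, pass an open file handle instead of the sample, for example open("course_grades.txt", newline="").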

I also wanted to say thank you for an exciting course. As always, I end up learning quite a bit from teaching. I hope that you learned quite a bit as well, and that the topics we covered are useful to you in future endeavors.

Enjoy your well-deserved break.

December 1, 2009 · News · (No comments)

Here is the final presentation schedule for next week:

Tuesday

  • 12:40 Matt Cecil
  • 12:55 Keith Mertz
  • 13:10 Ashwini Vaidya

Thursday

  • 12:40 Anwen Fredriksen
  • 12:55 Steve Vihel
  • 13:10 Calvin Pohawpotchoko
  • 13:25 Sam Perdue

Please be sure to follow these guidelines:

  • 5-10 minutes long
  • Should prepare handouts or slides
    • slides in pdf or ppt format please
    • E-mail me slides by 10 a.m.
    • You can use (preferably) my computer or yours
  • Use your classmates as resources for ideas

November 12, 2009 · News · 2 comments

Sam found a nice program to automatically identify the language of a text using trigrams. You might find it of interest.
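To give a rough idea of how trigram-based language identification can work, here is a toy sketch. It is not the program Sam found, and the two training sentences, the function names, and the overlap score are all assumptions chosen to keep the example short. A real identifier would be trained on much more text.

```python
from collections import Counter


def char_trigrams(text):
    """Return the list of character trigrams in text, padded with spaces."""
    padded = " " + text.lower() + " "
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def profile(text):
    """Count how often each character trigram occurs in text."""
    return Counter(char_trigrams(text))


def overlap(p, q):
    """Score two profiles by the number of trigram occurrences they share."""
    return sum(min(count, q[tri]) for tri, count in p.items())


def identify(text, profiles):
    """Return the language whose profile shares the most trigrams with text."""
    target = profile(text)
    return max(profiles, key=lambda lang: overlap(target, profiles[lang]))


# Tiny, made-up training samples; far too small for real use.
profiles = {
    "english": profile("the quick brown fox jumps over the lazy dog "
                       "and then the dog runs after the fox"),
    "spanish": profile("el zorro salta sobre el perro perezoso "
                       "y luego el perro corre tras el zorro"),
}

print(identify("the dog and the fox", profiles))
print(identify("el perro y el zorro", profiles))
```

The idea is simply that short character sequences such as "the" or " el" are much more frequent in one language than another, so counting them is often enough to tell languages apart.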

October 29, 2009 · News · (No comments)

I added some more tips to homework 9 in the comments section. I also corrected the get_wiki() function. Sorry about the inconvenience there.

October 19, 2009 · News · (No comments)

Unfortunately my son is sick and cannot go to daycare tomorrow, so I have to cancel class. We will shift the syllabus back one day. Thursday we will talk more about strings, go over homework 7, and discuss shell integration.

October 1, 2009 · News · (No comments)

I have made several changes to the syllabus, including:

  • Added a reading for Tue, Oct. 6th
  • Changed homework schedule, so that final project proposals are due Nov. 6th

In addition, I have added a file which contains all of the notes from the class so far. You can get it from the svn repository under slides.

October 1, 2009 · News, notes · (No comments)

Here are the notes for today’s class on Semantic relations.

ling5200-nltk2-2-notes.pdf

September 19, 2009 · News · (No comments)

Several people have still been having problems installing all the Python packages required for doing the exercises in the NLTK book, especially Windows users. After digging around a bit more, I have discovered another possible solution. Try installing the Python distribution from Enthought. It comes pre-packaged with a bunch of different packages like matplotlib, numpy, and scipy. It is a big download, but it seemed to work OK for me. After installing that, I re-installed NLTK using the Windows .msi installer from the NLTK website. Once you have these installed, you can run Python using the IDLE program, which should be in your Start menu.

If you want to use this version of python through cygwin, you need to add it to your path.
From cygwin, type:

nano ~/.bash_profile

And edit the file to include:

export PATH="/cygdrive/c/Python25:${PATH}"

Then quit and restart cygwin.
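One way to check that cygwin is now picking up the right installation is to start python and ask it where it lives. This is only a small sketch; the function name interpreter_info is made up, and the exact path you see depends on where the distribution was installed.

```python
import sys


def interpreter_info():
    """Return the path and version number of the running Python interpreter."""
    return sys.executable, sys.version.split()[0]


path, version = interpreter_info()
print("Running Python %s from %s" % (version, path))
```

If the printed path does not point into the directory you added to PATH, the export line in ~/.bash_profile is probably not being read.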

I hope this works for those people who have still been having problems.

September 15, 2009 · News · 3 comments

I figured out the adj question. We were indeed being led astray by the similarity between “adj” and “adv”. We should have been looking at the similarity between “adv” and “v”. When we use a regular expression like (adj|v). on the string “quickly, adv.”, we see that the string does contain “v.” (the last two characters of “adv.”). However, the string does not contain “ v.” with a leading space, so requiring the space rules out the false match.
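Here is a small sketch of the difference using Python's re module. I am assuming the period is meant literally, so it is escaped as \. in the patterns; the variable names are just for illustration.

```python
import re

entry = "quickly, adv."

# Without a leading space, the "v." at the end of "adv." matches.
loose = re.search(r"(adj|v)\.", entry)
print(loose.group(0))

# Requiring a space before the abbreviation rules out the match inside "adv.".
strict = re.search(r" (adj|v)\.", entry)
print(strict)
```

The first search finds a match and the second returns None, which is why adverb entries were showing up in the results.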

September 15, 2009 · News · 1 comment

I would like to know a bit about how you think the course is going. Please answer the following questions to let me know what you think. You must be logged in to fill out the questionnaire. Note that you have to click vote for each question (but it won’t take you to a new page, so it shouldn’t be too annoying).

The homework questions reflect the material covered in class.

  • 4 (strongly agree) (60%, 3 Votes)
  • 3 (somewhat agree) (40%, 2 Votes)
  • 2 (somewhat disagree) (0%, 0 Votes)
  • 1 (strongly disagree) (0%, 0 Votes)

Total Voters: 5

Lectures cover

  • 3 (a little too much material) (100%, 5 Votes)
  • 4 (much too much material) (0%, 0 Votes)
  • 2 (not quite enough material) (0%, 0 Votes)
  • 1 (much too little material) (0%, 0 Votes)

Total Voters: 5

The instructor is enthusiastic

  • 3 (somewhat agree) (75%, 3 Votes)
  • 4 (strongly agree) (25%, 1 Votes)
  • 2 (somewhat disagree) (0%, 0 Votes)
  • 1 (strongly disagree) (0%, 0 Votes)

Total Voters: 4

I am learning a lot in this course

  • 4 (strongly agree) (75%, 3 Votes)
  • 3 (somewhat agree) (25%, 1 Votes)
  • 2 (somewhat disagree) (0%, 0 Votes)
  • 1 (strongly disagree) (0%, 0 Votes)

Total Voters: 4

I would describe the difficulty of this course as:

  • 3 (More difficult than average) (100%, 3 Votes)
  • 4 (One of the most difficult courses I have ever taken) (0%, 0 Votes)
  • 2 (Less difficult than average) (0%, 0 Votes)
  • 1 (One of the easiest courses I have taken) (0%, 0 Votes)

Total Voters: 3

The instructor is easy to talk to

  • 4 (strongly agree) (100%, 5 Votes)
  • 3 (somewhat agree) (0%, 0 Votes)
  • 2 (somewhat disagree) (0%, 0 Votes)
  • 1 (strongly disagree) (0%, 0 Votes)

Total Voters: 5

Finally, are there any topics not currently in the syllabus that you would like to cover, or do you have any other suggestions? Please leave a comment (your name will appear with your comment).