Linguistics 5200 Fall 2009
http://robfelty.com/teaching/ling5200Fall2009
Introduction to computational corpus linguistics
Fri, 18 Dec 2009 00:07:43 +0000

Final grades / happy holidays
I have made comments in your final papers and committed them to the subversion repository. I made the comments directly in the PDF (if you submitted in something other than PDF, I converted it to PDF, with the same basename). You should be able to view the comments in your ...
http://robfelty.com/teaching/ling5200Fall2009/2009/12/final-grades-happy-holidays/

Final papers
This is just a reminder that final papers are due next Wednesday, Dec. 16th, by 5 p.m. The papers should be 5-15 pages long. Please commit them to the subversion repository, along with all code you wrote, and any additional resources I need to run your code, such as texts, ...
http://robfelty.com/teaching/ling5200Fall2009/2009/12/final-papers/

Notes for Dec. 3rd
Here are today's notes on advanced regular expressions in Python and Perl.
ling5200-grep2-notes.pdf
http://robfelty.com/teaching/ling5200Fall2009/2009/12/notes-for-dec-3rd/

Final presentation schedule
Here is the final presentation schedule for next week:
Tuesday
12:40 Matt Cecil
12:55 Keith Mertz
13:10 Ashwini Vaidya
Thursday
12:40 Anwen Fredriksen
12:55 Steve Vihel
13:10 Calvin Pohawpotchoko
13:25 Sam Perdue
Please be sure to follow these guidelines:
5-10 minutes long
Should ...
http://robfelty.com/teaching/ling5200Fall2009/2009/12/final-presentation-schedule/

Notes for Dec. 1st
Here are today's notes on Bayesian and maximum entropy classifiers.
ling5200-nltk6-2-notes.pdf
http://robfelty.com/teaching/ling5200Fall2009/2009/12/notes-for-dec-1st/

Homework 11 solution
Most students did very well on this assignment. The only consistent shortcoming was unnecessary loops in the tag_errors function, which increased execution time by about 10%.
Class statistics for Homework 11
mean: 56.71
standard deviation: 8.58
In this homework you will practice part of speech tagging, and evaluating part ...
http://robfelty.com/teaching/ling5200Fall2009/2009/11/homework-11-solution/

Notes for Nov. 19th
Here are today's notes on classifier evaluation and decision trees.
ling5200-nltk6-notes.pdf
http://robfelty.com/teaching/ling5200Fall2009/2009/11/notes-for-nov-19th/

Notes for Nov. 17th
Here are today's notes on supervised classification.
ling5200-nltk6-notes.pdf
http://robfelty.com/teaching/ling5200Fall2009/2009/11/notes-for-nov-17th/

Homework 10 Solution
Most students did well on this assignment. Please take a detailed look at my solution in resources/hmwk/hmwk10.py
Class statistics for Homework 10
mean: 51.67
standard deviation: 7.28
Use svn to copy my solution to homework 8 from resources/py into your personal directory as hmwk10.py (5 points)
    svn cp resources/py/hmwk8.py students/robfelty/hmwk10.py
Modify the mean_word_len and mean_sent_len functions to accept ...
http://robfelty.com/teaching/ling5200Fall2009/2009/11/homework-10-solution/

Homework 11 – Part of speech tagging
In this homework you will practice part-of-speech tagging and evaluating part-of-speech taggers. The homework covers material up to Nov. 12 and is due Nov. 20th. Use the unigram tagger to evaluate the accuracy of tagging of the romance and the adventure ...
http://robfelty.com/teaching/ling5200Fall2009/2009/11/homework-11-part-of-speech-tagging/
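
For readers unfamiliar with the idea behind Homework 11, here is a minimal sketch of what a unigram tagger does and how its accuracy can be measured. It is not the course solution: the course presumably used NLTK's UnigramTagger on real corpus data, whereas this sketch uses only the Python standard library and a few made-up sentences so that it runs on its own. The names TRAIN, TEST, train_unigram, tag, and accuracy are all hypothetical.

```python
from collections import Counter, defaultdict

# Toy tagged sentences (made-up data for illustration, not a real corpus).
TRAIN = [
    [("the", "AT"), ("dog", "NN"), ("runs", "VBZ")],
    [("the", "AT"), ("run", "NN"), ("ends", "VBZ")],
    [("dogs", "NNS"), ("run", "VB")],
    [("they", "PPSS"), ("run", "VB")],
]
TEST = [
    [("the", "AT"), ("dog", "NN"), ("ends", "VBZ")],
    [("cats", "NNS"), ("run", "VB")],
]


def train_unigram(tagged_sents):
    """Map each word to the tag it carries most often in the training data."""
    counts = defaultdict(Counter)
    for sent in tagged_sents:
        for word, word_tag in sent:
            counts[word][word_tag] += 1
    return {word: tags.most_common(1)[0][0] for word, tags in counts.items()}


def tag(model, words, default=None):
    """Tag each word independently; unseen words receive the default tag."""
    return [(word, model.get(word, default)) for word in words]


def accuracy(model, gold_sents, default=None):
    """Fraction of tokens whose predicted tag matches the gold tag."""
    correct = total = 0
    for sent in gold_sents:
        words = [word for word, _ in sent]
        predicted = tag(model, words, default)
        for (_, guess), (_, gold) in zip(predicted, sent):
            correct += guess == gold
            total += 1
    return correct / total


model = train_unigram(TRAIN)
score = accuracy(model, TEST)
print(f"accuracy on toy test set: {score:.2f}")
```

Because a unigram tagger looks at each word in isolation, a single pass over the tokens is enough for both tagging and counting errors, and words never seen in training ("cats" here) can only be handled by a default tag.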