Assignment 3: Parsing
Assigned: February 27th, 2006
Due: March 5 , 2006
Specifications:
Goal: This is an exercise
on NLP parsing.
1) Obtain a CFG Parser.
Here are some pointers (there are many more resources out there):
Stanford
Parser (Manning and Klein)
Collins Parser
Statistical Lexicalized Grammar.
Charniak's Parser
Statistical Context Free Grammar
NLTK Natural
Language tools in python, it has the CFG model and Parser. You may
want/have to specify your
own grammar.
You are free to use also non NLP parsers (deterministic or non
deterministic, e.g. Earley's, Lex & Yacc), however this will amount
probably to more work (the idea is that you do as much experimentation
as you want).
In your submission, briefly describe what kind parser it
is.
2) Parse half of your corpus from assignment 2 (e.g. aprox 250
words).
Submit the parsing output.
3) Write a script to recognize Explicit Subjects and
Objects of a sentence (main or embedded sentence) from the
parser output, all the other arguments are ignored. (If there is a
relative clause, leave the relative clause out of the argument).
Examples: <subject John> visited
<object England>.
< subject Mary> bought <object a book> for John.
<subject Mary > bought <object John > a
book. (note we select here just one object).
<subject I > found < object the book>
that <subject Mary> bought
Submit the output of your script.
4) Score how well the recognition does, in terms of
Precision/Recall (Exact Match for each of the Arguments).