HLTy logo

NAACL-HLT 2003 Tutorial:


T6: Annotation of Temporal and Event Expressions
James Pustejovsky, Brandeis University and Inderjeet Mani, MITRE
Tuesday May 27 afternoon

Humans live in a dynamic world, where actions bring about consequences, and the facts and properties associated with entities change over time. Without a robust ability to identify events in NL data and temporally situate them, the real 91aboutness92 of the information can be missed. In appreciation of this need, there has recently been a renewed interest in temporal and event-based reasoning for NLP, aimed at addressing challenges in areas such as information extraction, question-answering, and summarization.

This tutorial will begin with an overview of theoretical work on tense, aspect, and event structure in natural language, as well as the fundamentals of temporal reasoning. It will then go on to discuss the annotation of temporal and event expressions in corpora, including the TimeML specification language and other results from the ARDA/NRRC Workshop on Temporal and Event Recognition for Question Answering Systems (TERQAS). The tutorial will examine how to formally distinguish events and their temporal anchoring in documents, and will discuss algorithms for ordering events mentioned in a document relative to each other and for computing closure over an entire discourse of events.

Tutorial attendees can expect to learn about current methodologies and computational resources, the outstanding problems in the area, as well as obtain follow-up pointers to the research literature. Attendees should be familiar with information extraction and the notion of corpus annotation. The course should appeal to those with an interest in leveraging robust semantic analysis for tasks like question-answering, information extraction, and summarization.

James Pustejovsky is Professor of Computer Science at Brandeis University where he is Director of the Laboratory for Linguistics and Com putation. Pustejovsky conducts research in the areas of computational linguistics, lexical semantics, knowledge representation, bioinformatics, and informat ion retrieval and extraction. He was organizer and PI for the ARDA-sponsored research workshop that created the metadata markup language TimeML. He has participated in numerous DARPA and NSF efforts in Knowledge Extractio n and Natural Language Engineering, including the MUC and TIPSTER projects. His publications include numerous books on semantics and corpus processing.

Inderjeet Mani is a Senior Principal Scientist at the MITRE Corporation in McLean, Virginia, and an adjunct faculty in Computational Linguistics at Georgetown University. Mani92s research, funded by MITRE, NSF, DARPA, and others, includes information extraction, automatic summarization, and bioinformatics. Mani helped develop the TIMEX2 annotation scheme for representing aspects of the meaning of temporal expressions in natural languages under the DARPA TIDES research program. He has worked (with Georgetown University) to develop TIMEX2-annotated corpora and taggers for different languages, and has also (with Columbia University) investigated methods for ordering events in news. His publications include two books on automatic summarization.


James Pustejovsky