Marc Verhagen

Computational Linguist, Programmer & Toolbuilder

location: Medford, Massachusetts
contact: marc@cs.brandeis.edu



Education

Skills

Work Experience

2007 - 2008 Senior Research Scientist, Computer Science Department, Brandeis University.
2004 - 2006 Postdoctoral Research Fellow, Computer Science Department, Brandeis University.
2002 - 2003 Consultant for TERQAS and TANGO, two ARDA-AQUAINT workshops on advanced question-answering technology.
1997 - 2002 Co-founder of LingoMotors Inc, Cambridge MA. Senior Programmer, Director of Tool Development and Master Toolbuilder.
1995 - 1996 Summer intern at Apple Computer, Intelligent Systems Group.
1994 - 1997 Teaching Assistant for Computer Science, Brandeis University.
1992 - 1994 Research Officer, CL/MT group (Computational Linguistics and Machine Translation), Dept of Language and Linguistics, Essex University, England.
1989 - 1991 Teaching Assistant for Computational Linguistics, OTS, University of Utrecht, Netherlands.

Work Details

Brandeis University

Project manager for TARSQI, a two-year project that aims to (i) develop tools for automatic extraction of temporal information, and (ii) create a corpus annotated with temporal information.

Project manager of a two-year follow-up to the TARSQI project, aimed at integrating temporal processing tools into an extensible and adaptable environment.

Project manager of NURI/NGA-funded research into combining natural language parsing of texts with image processing.

Project manager of NIH-funded research on relation extraction for the biomedical domain.

TERQAS and TANGO

Participated in the definition of TimeML, an annotation language for temporal information. Created extensions to the Alembic Workbench, aimed at semi-graphical annotation of temporal relations. Implemented a temporal closure algorithm and embedded it in an annotation tool. Defined specifications for a fully graphical annotation tool.

LingoMotors

Designed and implemented the first prototype of LingoMotors' NLP software, mainly aimed at extracting relations from free text. The prototype was implemented in Perl.

Created a Smalltalk tool suite for knowledge representation. The suite included various browsers for viewing and editing a hierarchy of types and lexical items. The knowledge resources were implemented as Smalltalk objects and stored in an Oracle database.

Managed a group of 4-8 engineers, including Smalltalkers, C/Tcl/Tk programmers and Oracle DBAs.

Apple Computer

Designed and implemented an NLP system that extracts the relevant terms in a document, using Perl, LISP and the Macintosh Toolserver. Implemented concept clustering for document spaces.

Brandeis University (as a graduate student)

Designed and implemented Textract1, an automatic text indexing system, in Perl.

Participated in Medstract, a project aimed at automatically extracting up-to-date information from biological sequence databases and Medline abstracts. Created various NLP tools in Python, including a pattern matcher and an acronym generalizer.

CL/MT group

Participated in several European Community-funded projects. Researched discourse grammars, the NP specifier system, large-scale grammars and collocations. Implemented the results on various natural language processing platforms.

Was an exchange student at Essex prior to joining the CL/MT group.

Other Experience

Selected Publications

Marc Verhagen (2005, forthcoming). T-BOX: Drawing TimeML Relations. In: Proceedings of Dagstuhl Seminar 05151: Annotating, Extracting and Reasoning about Time and Events (Pustejovsky, Katz and Schilder, eds). Dagstuhl Seminar Proceedings, Germany.

Marc Verhagen (2005, forthcoming). Temporal Closure in an Annotation Environment. In: Language Resources and Evaluation, volume 39. Kluwer, Netherlands.

Marc Verhagen, Inderjeet Mani, Roser Sauri, Jessica Littman, Robert Knippen, Seok Bae Jang, Anna Rumshisky, John Phillips, James Pustejovsky (2005). Automating Temporal Annotation with TARSQI. Short paper. In: Proceedings of the 43rd Annual Meeting of the ACL. Ann Arbor, USA.

Roser Sauri and Marc Verhagen (2005). Temporal Information in Intensional Contexts. Short paper. In: Proceedings of the Sixth International Workshop on Computational Semantics (IWCS-6). Tilburg, Netherlands.

Marc Verhagen (2004). Times Between the Lines, PhD thesis. Brandeis University, Waltham, USA.

James Pustejovsky, Bran Boguraev, Marc Verhagen, Paul Buitelaar and Michael Johnston (1997). Semantic Indexing and Typed Hyperlinking. In: Proceedings of the AAAI Spring Symposium Series, Stanford University, California.

Josef van Genabith, Stella Markantonatou, Louisa Sadler and Marc Verhagen (1994). HPSG. In: Grammatical Formalisms: Issues in Migration (Markantonatou & Sadler, eds). Volume 4 of Studies in Machine Translation and Natural Language Processing, Commission of the European Communities.

Dirk Heylen, Kerry Maxwell and Marc Verhagen (1994). Lexical Functions and Machine Translation. In: Proceedings of the 13th International Conference on Computational Linguistics (COLING). Kyoto, Japan.

Dirk Heylen, Andre Schenk and Marc Verhagen (1993). A Constraint-based Representation Scheme of Collocational Structures. In: 6th Conference of the European Chapter of the Association for Computational Linguistics (EACL). OTS, Utrecht, The Netherlands.

Marc Verhagen (1990). Support Verbs in Disneyworld. In: Papers from the first CLIN-meeting. OTS, Utrecht, The Netherlands.