The 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)

CONFERENCE PROGRAM OVERVIEW

Thursday, June 28, 2007

9:00–10:45 Session 1: Plenary Session
10:45–11:15 Morning Break
11:15–12:30 Sessions 2a and 2b
12:30–14:00 Lunch
14:00–15:40 Sessions 3a and 3b
15:40–16:00 Afternoon Break
16:00–18:30 Session 4: All Posters

Friday, June 29, 2007

9:00–10:40 Sessions 5a and 5b
10:40–11:15 Morning Break
11:15–12:30 Sessions 6a and 6b
12:30–14:00 Lunch
14:00–15:40 Sessions 7a and 7b
15:40–16:00 Afternoon Break
16:00–18:30 Session 8: All Posters

Saturday, June 30, 2007

9:00–10:00 Session 9: Plenary Session
10:00–10:50 Sessions 10a, 10b, and 10c
10:50–11:15 Morning Break
11:15–12:30 Sessions 11a, 11b, and 11c
12:30–14:00 Lunch
14:00–15:40 Sessions 12a, 12b, and 12c
15:40–16:15 Afternoon Break
16:15–17:30 Sessions 13a, 13b, and 13c

CONFERENCE PROGRAM

Thursday, June 28, 2007

 Session 1: Plenary Session
9:00–9:10 Opening Remarks
9:10–10:10 Invited Talk: Baby Bayesians? Evidence for Statistical Hypothesis Selection in Infant Language Learning
LouAnn Gerken, University of Arizona
10:15–10:45 Modelling Compression with Discourse Constraints
James Clarke and Mirella Lapata
 Session 2a: Question Answering
11:15–11:40 Using Semantic Roles to Improve Question Answering
Dan Shen and Mirella Lapata
11:40–12:05 What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA
Mengqiu Wang, Noah A. Smith and Teruko Mitamura
12:05–12:30 Learning Unsupervised SVM Classifier for Answer Selection in Web Question Answering
Youzheng Wu, Ruiqiang Zhang, Xinhui Hu and Hideki Kashioka
 Session 2b: Machine Translation
11:15–11:40 Improving Word Alignment with Bridge Languages
Shankar Kumar, Franz J. Och and Wolfgang Macherey
11:40–12:05 Getting the Structure Right for Word Alignment: LEAF
Alexander Fraser and Daniel Marcu
12:05–12:30 Improving Statistical Machine Translation Using Word Sense Disambiguation
Marine Carpuat and Dekai Wu
 Session 3a: Generation, Summarization, and Discourse
14:00–14:25 Large Margin Synchronous Generation and its Application to Sentence Compression
Trevor Cohn and Mirella Lapata
14:25–14:50 Incremental Text Structuring with Online Hierarchical Ranking
Erdong Chen, Benjamin Snyder and Regina Barzilay
14:50–15:15 Automatically Identifying the Arguments of Discourse Connectives
Ben Wellner and James Pustejovsky
15:15–15:40 Incremental Generation of Plural Descriptions: Similarity and Partitioning
Albert Gatt and Kees van Deemter
 Session 3b: Parsing
14:00–14:25 A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors
Joachim Wagner, Jennifer Foster and Josef van Genabith
14:25–14:50 Characterizing the Errors of Data-Driven Dependency Parsing Models
Ryan McDonald and Joakim Nivre
14:50–15:15 Probabilistic Models of Nonprojective Dependency Trees
David A. Smith and Noah A. Smith
15:15–15:40 Structured Prediction Models via the Matrix-Tree Theorem
Terry Koo, Amir Globerson, Xavier Carreras and Michael Collins
 Session 4: All Posters (16:00–18:30)
 Using Foreign Inclusion Detection to Improve Parsing Performance
Beatrice Alex, Amit Dubey and Frank Keller
 LEDIR: An Unsupervised Algorithm for Learning Directionality of Inference Rules
Rahul Bhagat, Patrick Pantel and Eduard Hovy
 Modelling Polysemy in Adjective Classes by Multi-Label Classification
Gemma Boleda, Sabine Schulte im Walde and Toni Badia
 Improving Query Spelling Correction Using Web Search Results
Qing Chen, Mu Li and Ming Zhou
 Towards Robust Unsupervised Personal Name Disambiguation
Ying Chen and James Martin
 Compressing Trigram Language Models With Golomb Coding
Kenneth Church, Ted Hart and Jianfeng Gao
 Joint Morphological and Syntactic Disambiguation
Shay B. Cohen and Noah A. Smith
 Unsupervised Part-of-Speech Acquisition for Resource-Scarce Languages
Sajib Dasgupta and Vincent Ng
 Semi-Supervised Classification for Extracting Protein Interaction Sentences using Dependency Parsing
Gunes Erkan, Arzucan Ozgur and Dragomir R. Radev
 A Sequence Alignment Model Based on the Averaged Perceptron
Dayne Freitag and Shahram Khadivi
 Instance Based Lexical Entailment for Ontology Population
Claudio Giuliano and Alfio Gliozzo
 Recovering Non-Local Dependencies for Chinese
Yuqing Guo, Haifeng Wang and Josef van Genabith
 Exploiting Multi-Word Units in History-Based Probabilistic Generation
Deirdre Hogan, Conor Cafferkey, Aoife Cahill and Josef van Genabith
 Hierarchical System Combination for Machine Translation
Fei Huang and Kishore Papineni
 Using RBMT Systems to Produce Bilingual Corpus for SMT
Xiaoguang Hu, Haifeng Wang and Hua Wu
 Why Doesn’t EM Find Good HMM POS-Taggers?
Mark Johnson
 Probabilistic Coordination Disambiguation in a Fully-Lexicalized Japanese Parser
Daisuke Kawahara and Sadao Kurohashi
 A New Perceptron Algorithm for Sequence Labeling with Non-Local Features
Jun’ichi Kazama and Kentaro Torisawa
 Extending a Thesaurus in the Pan-Chinese Context
Oi Yee Kwong and Benjamin K. Tsou
 Low-Quality Product Review Detection in Opinion Summarization
Jingjing Liu, Yunbo Cao, Chin-Yew Lin, Yalou Huang and Ming Zhou
 Improving Statistical Machine Translation Performance by Training Data Selection and Optimization
Yajuan Lu, Jin Huang and Qun Liu
 Topic Segmentation with Hybrid Document Indexing
Irina Matveeva and Gina-Anne Levow
 Syntactic Re-Alignment Models for Machine Translation
Jonathan May and Kevin Knight
 Detecting Compositionality of Verb-Object Combinations using Selectional Preferences
Diana McCarthy, Sriram Venkatapathy and Aravind Joshi
 Explorations in Automatic Book Summarization
Rada Mihalcea and Hakan Ceylan
 Part-of-Speech Tagging for Middle English through Alignment and Projection of Parallel Diachronic Texts
Taesun Moon and Jason Baldridge
 Flexible, Corpus-Based Modelling of Human Plausibility Judgements
Sebastian Padó, Ulrike Padó and Katrin Erk
 V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure
Andrew Rosenberg and Julia Hirschberg
 Bayesian Document Generative Model with Explicit Multiple Topics
Issei Sato and Hiroshi Nakagawa
 Smooth Bilingual N-Gram Translation
Holger Schwenk, Marta R. Costa-jussa and Jose A. R. Fonollosa
 Morphological Disambiguation of Hebrew: A Case Study in Classifier Combination
Danny Shacham and Shuly Wintner
 Enhancing Single-Document Summarization by Combining RankNet and Third-Party Sources
Krysta Svore, Lucy Vanderwende and Christopher Burges
 Automatic Identification of Important Segments and Expressions for Mining of Business-Oriented Conversations at Contact Centers
Hironori Takeuchi, L Venkata Subramaniam, Tetsuya Nasukawa and Shourya Roy
 Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap
David Talbot and Miles Osborne
 Word Sense Disambiguation Incorporating Lexical and Structural Semantic Information
Takaaki Tanaka, Francis Bond, Timothy Baldwin, Sanae Fujita and Chikara Hashimoto
 An Approach to Text Corpus Construction which Cuts Annotation Costs and Maintains Reusability of Annotated Data
Katrin Tomanek, Joachim Wermter and Udo Hahn
 Antecedent Selection Techniques for High-Recall Coreference Resolution
Yannick Versley
 Methods to Integrate a Language Model with Semantic Information for a Word Prediction Component
Tonio Wandmacher and Jean-Yves Antoine
 Bilingual Cluster Based Models for Statistical Machine Translation
Hirofumi Yamamoto and Eiichiro Sumita
 A Systematic Comparison of Training Criteria for Statistical Machine Translation
Richard Zens, Sasa Hasan and Hermann Ney
 Phrase Reordering Model Integrating Syntactic Knowledge for SMT
Dongdong Zhang, Mu Li, Chi-Ho Li and Ming Zhou
 Identification and Resolution of Chinese Zero Pronouns: A Machine Learning Approach
Shanheng Zhao and Hwee Tou Ng
 Parsimonious Data-Oriented Parsing
Willem Zuidema

Friday, June 29, 2007

 Session 5a: Semantics
9:00–9:25 Generating Lexical Analogies Using Dependency Relations
Andy Chiu, Pascal Poupart and Chrysanne DiMarco
9:25–9:50 Cross-Lingual Distributional Profiles of Concepts for Measuring Semantic Distance
Saif Mohammad, Iryna Gurevych, Graeme Hirst and Torsten Zesch
9:50–10:15 Lexical Semantic Relatedness with Random Graph Walks
Thad Hughes and Daniel Ramage
10:15–10:40 Experimental Evaluation of LTAG-Based Features for Semantic Role Labeling
Yudong Liu and Anoop Sarkar
 Session 5b: Parsing
9:00–9:25 Japanese Dependency Analysis Using the Ancestor-Descendant Relation
Akihiro Tamura, Hiroya Takamura and Manabu Okumura
9:25–9:50 A Discriminative Learning Model for Coordinate Conjunctions
Masashi Shimbo and Kazuo Hara
9:50–10:15 Recovery of Empty Nodes in Parse Structures
Denis Filimonov and Mary Harper
10:15–10:40 Treebank Annotation Schemes and Parser Evaluation for German
Ines Rehbein and Josef van Genabith
 Session 6a: Document Analysis
11:15–11:40 Semi-Markov Models for Sequence Segmentation
Qinfeng Shi, Yasemin Altun, Alex Smola and S.V.N. Vishwanathan
11:40–12:05 A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields
Yotaro Watanabe, Masayuki Asahara and Yuji Matsumoto
12:05–12:30 MavenRank: Identifying Influential Members of the US Senate Using Lexical Centrality
Anthony Fader, Dragomir R. Radev, Michael H. Crespin, Burt L. Monroe, Kevin M. Quinn and Michael Colaresi
 Session 6b: Grammar Learning
11:15–11:40 Bootstrapping Feature-Rich Dependency Parsers with Entropic Priors
David A. Smith and Jason Eisner
11:40–12:05 Online Learning of Relaxed CCG Grammars for Parsing to Logical Form
Luke Zettlemoyer and Michael Collins
12:05–12:30 The Infinite PCFG Using Hierarchical Dirichlet Processes
Percy Liang, Slav Petrov, Michael Jordan and Dan Klein
 Session 7a: Information Extraction
14:00–14:25 Exploiting Wikipedia as External Knowledge for Named Entity Recognition
Jun’ichi Kazama and Kentaro Torisawa
14:25–14:50 Large-Scale Named Entity Disambiguation Based on Wikipedia Data
Silviu Cucerzan
14:50–15:15 Effective Information Extraction with Semantic Affinity Patterns and Relevant Regions
Siddharth Patwardhan and Ellen Riloff
15:15–15:40 Tree Kernel-Based Relation Extraction with Context-Sensitive Structured Parse Tree Information
GuoDong Zhou, Min Zhang, DongHong Ji and QiaoMing Zhu
 Session 7b: Machine Translation
14:00–14:25 Chinese Syntactic Reordering for Statistical Machine Translation
Chao Wang, Michael Collins and Philipp Koehn
14:25–14:50 Binarizing Syntax Trees to Improve Syntax-Based Machine Translation Accuracy
Wei Wang, Kevin Knight and Daniel Marcu
14:50–15:15 What Can Syntax-Based MT Learn from Phrase-Based MT?
Steve DeNeefe, Kevin Knight, Wei Wang and Daniel Marcu
15:15–15:40 Online Large-Margin Training for Statistical Machine Translation
Taro Watanabe, Jun Suzuki, Hajime Tsukada and Hideki Isozaki
 Session 8: All Posters (16:00–18:30)
 Consult the list of poster titles under Session 4.

Saturday, June 30, 2007

 Session 9: Plenary Session
9:00–10:00 Invited Talk: Hashing, Sketching, and Other Approximate Algorithms for High-Dimensional Data
Piotr Indyk, Massachusetts Institute of Technology
 Session 10a: Machine Learning (supervised classifiers)
10:00–10:25 Scalable Term Selection for Text Categorization
Jingyang Li and Maosong Sun
10:25–10:50 Active Learning for Word Sense Disambiguation with Methods for Addressing the Class Imbalance Problem
Jingbo Zhu and Eduard Hovy
 Session 10b: Machine Learning (sequential models)
10:00–10:25 Semi-Supervised Structured Output Learning Based on a Hybrid Generative and Discriminative Approach
Jun Suzuki, Akinori Fujino and Hideki Isozaki
10:25–10:50 Finding Good Sequential Model Structures using Output Transformations
Edward Loper
 Session 10c: Information Retrieval
10:00–10:25 A Statistical Language Modeling Approach to Lattice-Based Spoken Document Retrieval
Tee Kiah Chia, Haizhou Li and Hwee Tou Ng
10:25–10:50 Learning Noun Phrase Query Segmentation
Shane Bergsma and Qin Iris Wang
 Session 11a: Information Extraction
11:15–11:40 Bootstrapping Information Extraction from Field Books
Sander Canisius and Caroline Sporleder
11:40–12:05 Extracting Data Records from Unstructured Biomedical Full Text
Donghui Feng, Gully Burns and Eduard Hovy
12:05–12:30 Multiple Alignment of Citation Sentences with Conditional Random Fields and Posterior Decoding
Ariel Schwartz, Anna Divoli and Marti Hearst
 Session 11b: Machine Translation
11:15–11:40 Large Language Models in Machine Translation
Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och and Jeffrey Dean
11:40–12:05 Factored Translation Models
Philipp Koehn and Hieu Hoang
12:05–12:30 Translating Unknown Words by Analogical Learning
Philippe Langlais and Alexandre Patry
 Session 11c: Phonetics and Phonology
11:15–11:40 A Probabilistic Approach to Diachronic Phonology
Alexandre Bouchard, Percy Liang, Thomas Griffiths and Dan Klein
11:40–12:05 Learning Structured Models for Phone Recognition
Slav Petrov, Adam Pauls and Dan Klein
12:05–12:30 Inducing Search Keys for Name Filtering
L. Karl Branting
 Session 12a: CoNLL Shared Task Session (dependency parsing)
14:00–14:15 The CoNLL 2007 Shared Task on Dependency Parsing
Joakim Nivre, Johan Hall, Sandra Kübler, Ryan McDonald, Jens Nilsson, Sebastian Riedel and Deniz Yuret
14:15–14:30 Single Malt or Blended? A Study in Multilingual Parser Optimization
Johan Hall, Jens Nilsson, Joakim Nivre, Gülsen Eryigit, Beáta Megyesi, Mattias Nilsson and Markus Saers
14:30–14:45 Probabilistic Parsing Action Models for Multi-Lingual Dependency Parsing
Xiangyu Duan, Jun Zhao and Bo Xu
14:45–15:00 Fast and Robust Multilingual Dependency Parsing with a Generative Latent Variable Model
Ivan Titov and James Henderson
15:00–15:15 Multilingual Dependency Parsing Using Global Features
Tetsuji Nakagawa
15:15–15:30 Experiments with a Higher-Order Projective Dependency Parser
Xavier Carreras
15:30–15:45 Log-Linear Models of Non-Projective Trees, k-best MST Parsing and Tree-Ranking
Keith Hall, Jiri Havelka and David A. Smith
 Session 12b: Machine Translation
14:00–14:25 Improving Translation Quality by Discarding Most of the Phrasetable
Howard Johnson, Joel Martin, George Foster and Roland Kuhn
14:25–14:50 Hierarchical Phrase-Based Translation with Suffix Arrays
Adam Lopez
14:50–15:15 An Empirical Study on Computing Consensus Translations from Multiple Machine Translation Systems
Wolfgang Macherey and Franz J. Och
15:15–15:40 Learning to Find English to Chinese Transliterations on the Web
Jian-Cheng Wu and Jason S. Chang
 Session 12c: Word Senses
14:00–14:25 Learning to Merge Word Senses
Rion Snow, Sushant Prakash, Daniel Jurafsky and Andrew Y. Ng
14:25–14:50 Improving Word Sense Disambiguation Using Topic Features
Junfu Cai, Wee Sun Lee and Yee Whye Teh
14:50–15:15 A Topic Model for Word Sense Disambiguation
Jordan Boyd-Graber, David Blei and Xiaojin Zhu
15:15–15:40 Validation and Evaluation of Automatically Acquired Multiword Expressions for Grammar Engineering
Aline Villavicencio, Valia Kordoni, Yi Zhang, Marco Idiart and Carlos Ramisch
 Session 13a: CoNLL Shared Task Session (dependency parsing)
16:15–16:30 Dependency Parsing and Domain Adaptation with LR Models and Parser Ensembles
Kenji Sagae and Jun’ichi Tsujii
16:30–16:45 Frustratingly Hard Domain Adaptation for Dependency Parsing
Mark Dredze, John Blitzer, Partha Pratim Talukdar, Kuzman Ganchev, João Graca and Fernando Pereira
16:45–17:15 Analysis: Sandra Kübler, Ryan McDonald
17:15–17:30 Discussion
 Session 13b: Sentiment
16:15–16:40 Crystal: Analyzing Predictive Opinions on the Web
Soo-Min Kim and Eduard Hovy
16:40–17:05 Extracting Aspect-Evaluation and Aspect-Of Relations in Opinion Mining
Nozomi Kobayashi, Kentaro Inui and Yuji Matsumoto
17:05–17:30 Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents
Nobuhiro Kaji and Masaru Kitsuregawa
 Session 13c: Tagging
16:15–16:40 Determining Case in Arabic: Learning Complex Linguistic Behavior Requires Complex Linguistic Features
Nizar Habash, Ryan Gabbard, Owen Rambow, Seth Kulick and Mitch Marcus
16:40–17:05 Mandarin Part-of-Speech Tagging and Discriminative Reranking
Zhongqiang Huang, Mary Harper and Wen Wang
17:05–17:30 Building Domain-Specific Taggers without Annotated (Domain) Data
John Miller, Manabu Torii and K. Vijay-Shanker
 Concluding Session
17:30 Closing Remarks
 Additional CoNLL Shared Task Papers (dependency parsing)
 Multilingual Dependency Parsing and Domain Adaptation using DeSR
Giuseppe Attardi, Felice Dell’Orletta, Maria Simi, Atanas Chanev and Massimiliano Ciaramita
 Hybrid Ways to Improve Domain Independence in an ML Dependency Parser
Eckhard Bick
 A Constraint Satisfaction Approach to Dependency Parsing
Sander Canisius and Erik Tjong Kim Sang
 A Two-Stage Parser for Multilingual Dependency Parsing
Wenliang Chen, Yujie Zhang and Hitoshi Isahara
 Incremental Dependency Parsing Using Online Learning
Richard Johansson and Pierre Nugues
 Online Learning for Deterministic Dependency Parsing
Prashanth Reddy Mannem
 Covington Variations
Svetoslav Marinov
 A Multilingual Dependency Analysis System Using Online Passive-Aggressive Learning
Le-Minh Nguyen, Akira Shimazu, Phuong-Thai Nguyen and Xuan-Hieu Phan
 Global Learning of Labeled Dependency Trees
Michael Schiehlen and Kristina Spranger
 Pro3Gres Parser in the CoNLL Domain Adaptation Shared Task
Gerold Schneider, Kaarel Kaljurand, Fabio Rinaldi and Tobias Kuhn
 Structural Correspondence Learning for Dependency Parsing
Nobuyuki Shimizu and Hiroshi Nakagawa
 Adapting the RASP System for the CoNLL07 Domain-Adaptation Task
Rebecca Watson and Ted Briscoe
 Multilingual Deterministic Dependency Parsing Framework using Modified Finite Newton Method Support Vector Machines
Yu-Chieh Wu, Jie-Chi Yang and Yue-Shi Lee