Thursday, June 28, 2007 |
| Session 1: Plenary Session |
9:00–9:10 | Opening Remarks |
9:10–10:10 | Invited Talk:
Baby Bayesians? Evidence for Statistical Hypothesis Selection in
Infant Language Learning LouAnn Gerken, University of Arizona |
10:15–10:45 | Modelling Compression with Discourse Constraints
James Clarke and Mirella Lapata |
| Session 2a: Question Answering |
11:15–11:40 | Using Semantic Roles to Improve Question Answering
Dan Shen and Mirella Lapata |
11:40–12:05 | What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA
Mengqiu Wang, Noah A. Smith and Teruko Mitamura |
12:05–12:30 | Learning Unsupervised SVM Classifier for Answer Selection in Web Question Answering
Youzheng Wu, Ruiqiang Zhang, Xinhui Hu and Hideki Kashioka |
| Session 2b: Machine Translation |
11:15–11:40 | Improving Word Alignment with Bridge Languages
Shankar Kumar, Franz J. Och and Wolfgang Macherey |
11:40–12:05 | Getting the Structure Right for Word Alignment: LEAF
Alexander Fraser and Daniel Marcu |
12:05–12:30 | Improving Statistical Machine Translation Using Word Sense Disambiguation
Marine Carpuat and Dekai Wu |
| Session 3a: Generation, Summarization, and Discourse |
14:00–14:25 | Large Margin Synchronous Generation and its Application to Sentence Compression
Trevor Cohn and Mirella Lapata |
14:25–14:50 | Incremental Text Structuring with Online Hierarchical Ranking
Erdong Chen, Benjamin Snyder and Regina Barzilay |
14:50–15:15 | Automatically Identifying the Arguments of Discourse Connectives
Ben Wellner and James Pustejovsky |
15:15–15:40 | Incremental Generation of Plural Descriptions: Similarity and Partitioning
Albert Gatt and Kees van Deemter |
| Session 3b: Parsing |
14:00–14:25 | A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors
Joachim Wagner, Jennifer Foster and Josef van Genabith |
14:25–14:50 | Characterizing the Errors of Data-Driven Dependency Parsing Models
Ryan McDonald and Joakim Nivre |
14:50–15:15 | Probabilistic Models of Nonprojective Dependency Trees
David A. Smith and Noah A. Smith |
15:15–15:40 | Structured Prediction Models via the Matrix-Tree Theorem
Terry Koo, Amir Globerson, Xavier Carreras and Michael Collins |
| Session 4: All Posters (16:00–18:30) |
| Using Foreign Inclusion Detection to Improve Parsing Performance
Beatrice Alex, Amit Dubey and Frank Keller |
| LEDIR: An Unsupervised Algorithm for Learning Directionality of Inference Rules
Rahul Bhagat, Patrick Pantel and Eduard Hovy |
| Modelling Polysemy in Adjective Classes by Multi-Label Classification
Gemma Boleda, Sabine Schulte im Walde and Toni Badia |
| Improving Query Spelling Correction Using Web Search Results
Qing Chen, Mu Li and Ming Zhou |
| Towards Robust Unsupervised Personal Name Disambiguation
Ying Chen and James Martin |
| Compressing Trigram Language Models With Golomb Coding
Kenneth Church, Ted Hart and Jianfeng Gao |
| Joint Morphological and Syntactic Disambiguation
Shay B. Cohen and Noah A. Smith |
| Unsupervised Part-of-Speech Acquisition for Resource-Scarce Languages
Sajib Dasgupta and Vincent Ng |
| Semi-Supervised Classification for Extracting Protein Interaction Sentences using Dependency Parsing
Gunes Erkan, Arzucan Ozgur and Dragomir R. Radev |
| A Sequence Alignment Model Based on the Averaged Perceptron
Dayne Freitag and Shahram Khadivi |
| Instance Based Lexical Entailment for Ontology Population
Claudio Giuliano and Alfio Gliozzo |
| Recovering Non-Local Dependencies for Chinese
Yuqing Guo, Haifeng Wang and Josef van Genabith |
| Exploiting Multi-Word Units in History-Based Probabilistic Generation
Deirdre Hogan, Conor Cafferkey, Aoife Cahill and Josef van Genabith |
| Hierarchical System Combination for Machine Translation
Fei Huang and Kishore Papineni |
| Using RBMT Systems to Produce Bilingual Corpus for SMT
Xiaoguang Hu, Haifeng Wang and Hua Wu |
| Why Doesn’t EM Find Good HMM POS-Taggers?
Mark Johnson |
| Probabilistic Coordination Disambiguation in a Fully-Lexicalized Japanese Parser
Daisuke Kawahara and Sadao Kurohashi |
| A New Perceptron Algorithm for Sequence Labeling with Non-Local Features
Jun’ichi Kazama and Kentaro Torisawa |
| Extending a Thesaurus in the Pan-Chinese Context
Oi Yee Kwong and Benjamin K. Tsou |
| Low-Quality Product Review Detection in Opinion Summarization
Jingjing Liu, Yunbo Cao, Chin-Yew Lin, Yalou Huang and Ming Zhou |
| Improving Statistical Machine Translation Performance by Training Data Selection and Optimization
Yajuan Lu, Jin Huang and Qun Liu |
| Topic Segmentation with Hybrid Document Indexing
Irina Matveeva and Gina-Anne Levow |
| Syntactic Re-Alignment Models for Machine Translation
Jonathan May and Kevin Knight |
| Detecting Compositionality of Verb-Object Combinations using Selectional Preferences
Diana McCarthy, Sriram Venkatapathy and Aravind Joshi |
| Explorations in Automatic Book Summarization
Rada Mihalcea and Hakan Ceylan |
| Part-of-Speech Tagging for Middle English through Alignment and Projection of Parallel Diachronic Texts
Taesun Moon and Jason Baldridge |
| Flexible, Corpus-Based Modelling of Human Plausibility Judgements
Sebastian Padó, Ulrike Padó and Katrin Erk |
| V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure
Andrew Rosenberg and Julia Hirschberg |
| Bayesian Document Generative Model with Explicit Multiple Topics
Issei Sato and Hiroshi Nakagawa |
| Smooth Bilingual N-Gram Translation
Holger Schwenk, Marta R. Costa-jussa and Jose A. R. Fonollosa |
| Morphological Disambiguation of Hebrew: A Case Study in Classifier Combination
Danny Shacham and Shuly Wintner |
| Enhancing Single-Document Summarization by Combining RankNet and Third-Party Sources
Krysta Svore, Lucy Vanderwende and Christopher Burges |
| Automatic Identification of Important Segments and Expressions for Mining of Business-Oriented Conversations at Contact Centers
Hironori Takeuchi, L Venkata Subramaniam, Tetsuya Nasukawa and Shourya Roy |
| Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap
David Talbot and Miles Osborne |
| Word Sense Disambiguation Incorporating Lexical and Structural Semantic Information
Takaaki Tanaka, Francis Bond, Timothy Baldwin, Sanae Fujita and Chikara Hashimoto |
| An Approach to Text Corpus Construction which Cuts Annotation Costs and Maintains Reusability of Annotated Data
Katrin Tomanek, Joachim Wermter and Udo Hahn |
| Antecedent Selection Techniques for High-Recall Coreference Resolution
Yannick Versley |
| Methods to Integrate a Language Model with Semantic Information for a Word Prediction Component
Tonio Wandmacher and Jean-Yves Antoine |
| Bilingual Cluster Based Models for Statistical Machine Translation
Hirofumi Yamamoto and Eiichiro Sumita |
| A Systematic Comparison of Training Criteria for Statistical Machine Translation
Richard Zens, Sasa Hasan and Hermann Ney |
| Phrase Reordering Model Integrating Syntactic Knowledge for SMT
Dongdong Zhang, Mu Li, Chi-Ho Li and Ming Zhou |
| Identification and Resolution of Chinese Zero Pronouns: A Machine Learning Approach
Shanheng Zhao and Hwee Tou Ng |
| Parsimonious Data-Oriented Parsing
Willem Zuidema |
Friday, June 29, 2007 |
| Session 5a: Semantics |
9:00–9:25 | Generating Lexical Analogies Using Dependency Relations
Andy Chiu, Pascal Poupart and Chrysanne DiMarco |
9:25–9:50 | Cross-Lingual Distributional Profiles of Concepts for Measuring Semantic Distance
Saif Mohammad, Iryna Gurevych, Graeme Hirst and Torsten Zesch |
9:50–10:15 | Lexical Semantic Relatedness with Random Graph Walks
Thad Hughes and Daniel Ramage |
10:15–10:40 | Experimental Evaluation of LTAG-Based Features for Semantic Role Labeling
Yudong Liu and Anoop Sarkar |
| Session 5b: Parsing |
9:00–9:25 | Japanese Dependency Analysis Using the Ancestor-Descendant Relation
Akihiro Tamura, Hiroya Takamura and Manabu Okumura |
9:25–9:50 | A Discriminative Learning Model for Coordinate Conjunctions
Masashi Shimbo and Kazuo Hara |
9:50–10:15 | Recovery of Empty Nodes in Parse Structures
Denis Filimonov and Mary Harper |
10:15–10:40 | Treebank Annotation Schemes and Parser Evaluation for German
Ines Rehbein and Josef van Genabith |
| Session 6a: Document Analysis |
11:15–11:40 | Semi-Markov Models for Sequence Segmentation
Qinfeng Shi, Yasemin Altun, Alex Smola and S.V.N. Vishwanathan |
11:40–12:05 | A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields
Yotaro Watanabe, Masayuki Asahara and Yuji Matsumoto |
12:05–12:30 | MavenRank: Identifying Influential Members of the US Senate Using Lexical Centrality
Anthony Fader, Dragomir R. Radev, Michael H. Crespin, Burt L. Monroe, Kevin M. Quinn and Michael Colaresi |
| Session 6b: Grammar Learning |
11:15–11:40 | Bootstrapping Feature-Rich Dependency Parsers with Entropic Priors
David A. Smith and Jason Eisner |
11:40–12:05 | Online Learning of Relaxed CCG Grammars for Parsing to Logical Form
Luke Zettlemoyer and Michael Collins |
12:05–12:30 | The Infinite PCFG Using Hierarchical Dirichlet Processes
Percy Liang, Slav Petrov, Michael Jordan and Dan Klein |
| Session 7a: Information Extraction |
14:00–14:25 | Exploiting Wikipedia as External Knowledge for Named Entity Recognition
Jun’ichi Kazama and Kentaro Torisawa |
14:25–14:50 | Large-Scale Named Entity Disambiguation Based on Wikipedia Data
Silviu Cucerzan |
14:50–15:15 | Effective Information Extraction with Semantic Affinity Patterns and Relevant Regions
Siddharth Patwardhan and Ellen Riloff |
15:15–15:40 | Tree Kernel-Based Relation Extraction with Context-Sensitive Structured Parse Tree Information
GuoDong Zhou, Min Zhang, DongHong Ji and QiaoMing Zhu |
| Session 7b: Machine Translation |
14:00–14:25 | Chinese Syntactic Reordering for Statistical Machine Translation
Chao Wang, Michael Collins and Philipp Koehn |
14:25–14:50 | Binarizing Syntax Trees to Improve Syntax-Based Machine Translation Accuracy
Wei Wang, Kevin Knight and Daniel Marcu |
14:50–15:15 | What Can Syntax-Based MT Learn from Phrase-Based MT?
Steve DeNeefe, Kevin Knight, Wei Wang and Daniel Marcu |
15:15–15:40 | Online Large-Margin Training for Statistical Machine Translation
Taro Watanabe, Jun Suzuki, Hajime Tsukada and Hideki Isozaki |
|
Session 8: All Posters (16:00–18:30) |
| Consult the list of poster titles under Session 4. |
Saturday, June 30, 2007 |
| Session 9: Plenary Session |
9:00–10:00 | Invited Talk:
Hashing, Sketching, and Other Approximate Algorithms for
High-Dimensional Data Piotr Indyk, Massachusetts Institute of Technology |
| Session 10a: Machine Learning (supervised classifiers) |
10:00–10:25 | Scalable Term Selection for Text Categorization
Jingyang Li and Maosong Sun |
10:25–10:50 | Active Learning for Word Sense Disambiguation with Methods for Addressing the Class Imbalance Problem
Jingbo Zhu and Eduard Hovy |
| Session 10b: Machine Learning (sequential models) |
10:00–10:25 | Semi-Supervised Structured Output Learning Based on a Hybrid Generative and Discriminative Approach
Jun Suzuki, Akinori Fujino and Hideki Isozaki |
10:25–10:50 | Finding Good Sequential Model Structures using Output Transformations
Edward Loper |
| Session 10c: Information Retrieval |
10:00–10:25 | A Statistical Language Modeling Approach to Lattice-Based Spoken Document Retrieval
Tee Kiah Chia, Haizhou Li and Hwee Tou Ng |
10:25–10:50 | Learning Noun Phrase Query Segmentation
Shane Bergsma and Qin Iris Wang |
| Session 11a: Information Extraction |
11:15–11:40 | Bootstrapping Information Extraction from Field Books
Sander Canisius and Caroline Sporleder |
11:40–12:05 | Extracting Data Records from Unstructured Biomedical Full Text
Donghui Feng, Gully Burns and Eduard Hovy |
12:05–12:30 | Multiple Alignment of Citation Sentences with Conditional Random Fields and Posterior Decoding
Ariel Schwartz, Anna Divoli and Marti Hearst |
| Session 11b: Machine Translation |
11:15–11:40 | Large Language Models in Machine Translation
Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och and Jeffrey Dean |
11:40–12:05 | Factored Translation Models
Philipp Koehn and Hieu Hoang |
12:05–12:30 | Translating Unknown Words by Analogical Learning
Philippe Langlais and Alexandre Patry |
| Session 11c: Phonetics and Phonology |
11:15–11:40 | A Probabilistic Approach to Diachronic Phonology
Alexandre Bouchard, Percy Liang, Thomas Griffiths and Dan Klein |
11:40–12:05 | Learning Structured Models for Phone Recognition
Slav Petrov, Adam Pauls and Dan Klein |
12:05–12:30 | Inducing Search Keys for Name Filtering
L. Karl Branting |
| Session 12a: CoNLL Shared Task Session (dependency parsing) |
14:00–14:15 | The CoNLL 2007 Shared Task on Dependency Parsing
Joakim Nivre, Johan Hall, Sandra Kübler, Ryan McDonald, Jens Nilsson, Sebastian Riedel and Deniz Yuret |
14:15–14:30 | Single Malt or Blended? A Study in Multilingual Parser Optimization
Johan Hall, Jens Nilsson, Joakim Nivre, Gülsen Eryigit, Beáta Megyesi, Mattias Nilsson and Markus Saers |
14:30–14:45 | Probabilistic Parsing Action Models for Multi-Lingual Dependency Parsing
Xiangyu Duan, Jun Zhao and Bo Xu |
14:45–15:00 | Fast and Robust Multilingual Dependency Parsing with a Generative Latent Variable Model
Ivan Titov and James Henderson |
15:00–15:15 | Multilingual Dependency Parsing Using Global Features
Tetsuji Nakagawa |
15:15–15:30 | Experiments with a Higher-Order Projective Dependency Parser
Xavier Carreras |
15:30–15:45 | Log-Linear Models of Non-Projective Trees, k-best MST Parsing and Tree-Ranking
Keith Hall, Jiri Havelka and David A. Smith |
| Session 12b: Machine Translation |
14:00–14:25 | Improving Translation Quality by Discarding Most of the Phrasetable
Howard Johnson, Joel Martin, George Foster and Roland Kuhn |
14:25–14:50 | Hierarchical Phrase-Based Translation with Suffix Arrays
Adam Lopez |
14:50–15:15 | An Empirical Study on Computing Consensus Translations from Multiple Machine Translation Systems
Wolfgang Macherey and Franz J. Och |
15:15–15:40 | Learning to Find English to Chinese Transliterations on the Web
Jian-Cheng Wu and Jason S. Chang |
| Session 12c: Word Senses |
14:00–14:25 | Learning to Merge Word Senses
Rion Snow, Sushant Prakash, Daniel Jurafsky and Andrew Y. Ng |
14:25–14:50 | Improving Word Sense Disambiguation Using Topic Features
Junfu Cai, Wee Sun Lee and Yee Whye Teh |
14:50–15:15 | A Topic Model for Word Sense Disambiguation
Jordan Boyd-Graber, David Blei and Xiaojin Zhu |
15:15–15:40 | Validation and Evaluation of Automatically Acquired Multiword Expressions for Grammar Engineering
Aline Villavicencio, Valia Kordoni, Yi Zhang, Marco Idiart and Carlos Ramisch |
| Session 13a: CoNLL Shared Task Session (dependency parsing) |
16:15–16:30 | Dependency Parsing and Domain Adaptation with LR Models and Parser Ensembles
Kenji Sagae and Jun’ichi Tsujii |
16:30–16:45 | Frustratingly Hard Domain Adaptation for Dependency Parsing
Mark Dredze, John Blitzer, Partha Pratim Talukdar, Kuzman Ganchev, João Graca and Fernando Pereira |
16:45–17:15 | Analysis: Sandra Kübler, Ryan McDonald |
17:15–17:30 | Discussion |
| Session 13b: Sentiment |
16:15–16:40 | Crystal: Analyzing Predictive Opinions on the Web
Soo-Min Kim and Eduard Hovy |
16:40–17:05 | Extracting Aspect-Evaluation and Aspect-Of Relations in Opinion Mining
Nozomi Kobayashi, Kentaro Inui and Yuji Matsumoto |
17:05–17:30 | Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents
Nobuhiro Kaji and Masaru Kitsuregawa |
| Session 13c: Tagging |
16:15–16:40 | Determining Case in Arabic: Learning Complex Linguistic Behavior Requires Complex Linguistic Features
Nizar Habash, Ryan Gabbard, Owen Rambow, Seth Kulick and Mitch Marcus |
16:40–17:05 | Mandarin Part-of-Speech Tagging and Discriminative Reranking
Zhongqiang Huang, Mary Harper and Wen Wang |
17:05–17:30 | Building Domain-Specific Taggers without Annotated (Domain) Data
John Miller, Manabu Torii and K. Vijay-Shanker |
| Concluding Session |
17:30 | Closing Remarks |
| Additional CoNLL Shared Task Papers (dependency parsing) |
| Multilingual Dependency Parsing and Domain Adaptation using DeSR
Giuseppe Attardi, Felice Dell’Orletta, Maria Simi, Atanas Chanev and Massimiliano Ciaramita |
| Hybrid Ways to Improve Domain Independence in an ML Dependency Parser
Eckhard Bick |
| A Constraint Satisfaction Approach to Dependency Parsing
Sander Canisius and Erik Tjong Kim Sang |
| A Two-Stage Parser for Multilingual Dependency Parsing
Wenliang Chen, Yujie Zhang and Hitoshi Isahara |
| Incremental Dependency Parsing Using Online Learning
Richard Johansson and Pierre Nugues |
| Online Learning for Deterministic Dependency Parsing
Prashanth Reddy Mannem |
| Covington Variations
Svetoslav Marinov |
| A Multilingual Dependency Analysis System Using Online Passive-Aggressive Learning
Le-Minh Nguyen, Akira Shimazu, Phuong-Thai Nguyen and Xuan-Hieu Phan |
| Global Learning of Labeled Dependency Trees
Michael Schiehlen and Kristina Spranger |
| Pro3Gres Parser in the CoNLL Domain Adaptation Shared Task
Gerold Schneider, Kaarel Kaljurand, Fabio Rinaldi and Tobias Kuhn |
| Structural Correspondence Learning for Dependency Parsing
Nobuyuki Shimizu and Hiroshi Nakagawa |
| Adapting the RASP System for the CoNLL07 Domain-Adaptation Task
Rebecca Watson and Ted Briscoe |
| Multilingual Deterministic Dependency Parsing Framework using Modified Finite Newton Method Support Vector Machines
Yu-Chieh Wu, Jie-Chi Yang and Yue-Shi Lee |