WORKSHOP PROGRAM
Wednesday, June 29, 2005 | |
8:45–8:50 | Opening Remarks |
Session 1: Summarization Metrics I | |
8:50–9:15 | A Methodology for Extrinsic Evaluation of Text Summarization: Does ROUGE Correlate? by Bonnie Dorr, Christof Monz, Stacy President, Richard Schwartz and David Zajic |
9:15–9:40 | On the Subjectivity of Human Authored Summaries by BalaKrishna Kolluru and Yoshihiko Gotoh |
Session 2: MT Metrics I | |
9:40–10:05 | Preprocessing and Normalization for Automatic Evaluation of Machine Translation by Gregor Leusch, Nicola Ueffing, David Vilar and Hermann Ney |
10:05–10:30 | Syntactic Features for Evaluation of Machine Translation by Ding Liu and Daniel Gildea |
10:30–11:00 | Break |
Session 3: Invited Talk | |
11:00–12:00 | Kathy McKeown on Results of the Multilingual Summarization Evaluation |
Session 4: Student Session - Work in Progress | |
12:00–12:15 | Evaluation of Sentence Selection on Spoken Dialogue by Xiaodan Zhu |
12:15–12:30 | Toward a Predictive Statistical Model of Task-based Performance by Callandra R. Tate |
12:30–2:15 | Lunch |
Session 5: Summarization Metrics II | |
2:15–2:40 | Evaluating Automatic Summaries of Meeting Recordings by Gabriel Murray, Steve Renals, Jean Carletta and Johanna Moore |
2:40–3:05 | Evaluating Summaries and Answers: Two Sides of the Same Coin? by Jimmy Lin and Dina Demner-Fushman |
3:05–3:30 | Evaluating DUC 2004 Tasks with the QARLA Framework by Enrique Amigó, Julio Gonzalo, Anselmo Peñas and Felisa Verdejo |
Session 6: MT Metrics II | |
4:00–4:25 | On Some Pitfalls in Automatic Evaluation and Significance Testing for MT by Stefan Riezler and John T. Maxwell |
4:25–4:50 | METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments by Satanjeev Banerjee and Alon Lavie |
Session 7: Panel Discussion and Open Forum on Future Plans | |
4:50–5:50 | Panel Discussion |
5:50–6:00 | Future Plans |