Conference Program Home | NAACL 2021 Proceedings | NAACL 2021 WEBSITE | ACL WEBSITE |
PROGRAM
Mon 07 Jun 2021 (all times PDT, UTC-7) | |
08:00–09:00 Keynote | |
Session chair: Anna Rumshisky (University of Massachusetts Lowell) | |
Humans Learn From Task Descriptions and So Should Our Models Hinrich Schuetze | |
09:00–10:20 Session 1 (click to expand/collapse) | |
09:00–10:20 1A: Information Extraction | |
Session chair: Ni Lao (Apple) | |
Knowledge Router: Learning Disentangled Representations for Knowledge Graphs Shuai Zhang, Xi Rao, Yi Tay and Ce Zhang | |
Distantly Supervised Relation Extraction with Sentence Reconstruction and Knowledge Base Priors Fenia Christopoulou, Makoto Miwa and Sophia Ananiadou | |
Cross-Task Instance Representation Interactions and Label Dependencies for Joint Information Extraction with Graph Convolutional Networks Minh Van Nguyen, Viet Lai and Thien Huu Nguyen | |
Abstract Meaning Representation Guided Graph Encoding and Decoding for Joint Information Extraction Zixuan Zhang and Heng Ji | |
A Frustratingly Easy Approach for Entity and Relation Extraction Zexuan Zhong and Danqi Chen | |
Event Time Extraction and Propagation via Graph Attention Networks Haoyang Wen, Yanru Qu, Heng Ji, Qiang Ning, Jiawei Han, Avi Sil, Hanghang Tong and Dan Roth | |
09:00–10:20 1B: Interpretability and Analysis of Models for NLP | |
Session chair: Svitlana Volkova (Pacific Northwest National Laboratory) | |
Probing Word Translations in the Transformer and Trading Decoder for Encoder Layers Hongfei Xu, Josef van Genabith, Qiuhui Liu and Deyi Xiong | |
Mediators in Determining what Processing BERT Performs First Aviv Slobodkin, Leshem Choshen and Omri Abend | |
Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA Yonatan Bitton, Gabriel Stanovsky, Roy Schwartz and Michael Elhadad | |
Multilingual Language Models Predict Human Reading Behavior Nora Hollenstein, Federico Pirovano, Ce Zhang, Lena Jäger and Lisa Beinborn | |
Do Syntactic Probes Probe Syntax? Experiments with Jabberwocky Probing Rowan Hall Maudslay and Ryan Cotterell | |
A Non-Linear Structural Probe Jennifer C. White, Tiago Pimentel, Naomi Saphra and Ryan Cotterell | |
Concealed Data Poisoning Attacks on NLP Models Eric Wallace, Tony Zhao, Shi Feng and Sameer Singh | |
09:00–10:20 1C: Machine Translation | |
Session chair: Valia Kordoni (Humboldt-Universitaet zu Berlin, Germany) | |
Backtranslation Feedback Improves User Confidence in MT, Not Quality Vilém Zouhar, Michal Novák, Matúš Žilinec, Ondřej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia and Lisa Yankovskaya | |
Data Filtering using Cross-Lingual Word Embeddings Christian Herold, Jan Rosendahl, Joris Vanvinckenroye and Hermann Ney | |
Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Translation Alexandra Chronopoulou, Dario Stojanovski and Alexander Fraser | |
Neural Machine Translation without Embeddings Uri Shaham and Omer Levy | |
Counterfactual Data Augmentation for Neural Machine Translation Qi Liu, Matt Kusner and Phil Blunsom | |
Cultural and Geographical Influences on Image Translatability of Words across Languages Nikzad Khani, Isidora Tourni, Mohammad Sadegh Rasooli, Chris Callison-Burch and Derry Tanti Wijaya | |
Multilingual BERT Post-Pretraining Alignment Lin Pan, Chung-Wei Hang, Haode Qi, Abhishek Shah, Saloni Potdar and Mo Yu | |
09:00–10:20 1D: NLP Applications | |
Session chair: Yuval Pinter (Georgia Tech) | |
A Million Tweets Are Worth a Few Points: Tuning Transformers for Customer Service Tasks Amir Hadifar, Sofie Labat, Veronique Hoste, Chris Develder and Thomas Demeester | |
Paragraph-level Rationale Extraction through Regularization: A case study on European Court of Human Rights Cases Ilias Chalkidis, Manos Fergadiotis, Dimitrios Tsarapatsanis, Nikolaos Aletras, Ion Androutsopoulos and Prodromos Malakasiotis | |
Answering Product-Questions by Utilizing Questions from Other Contextually Similar Products Ohad Rozen, David Carmel, Avihai Mejer, Vitaly Mirkis and Yftah Ziser | |
EnSidNet: Enhanced Hybrid Siamese-Deep Network for grouping clinical trials into drug-development pathways Lucia Pagani | |
DATE: Detecting Anomalies in Text via Self-Supervision of Transformers Andrei Manolache, Florin Brad and Elena Burceanu | |
A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code Nadezhda Chirkova and Sergey Troshin | |
Fast and Scalable Dialogue State Tracking with Explicit Modular Decomposition Dingmin Wang, Chenghua Lin, Qi Liu and Kam-Fai Wong | |
09:00–10:20 1E: Sentence-level Semantics and Textual Inference | |
Session chair: Roy Bar-Haim (IBM Research AI) | |
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks Nandan Thakur, Nils Reimers, Johannes Daxenberger and Iryna Gurevych | |
SmBoP: Semi-autoregressive Bottom-up Semantic Parsing Ohad Rubin and Jonathan Berant | |
SGL: Speaking the Graph Languages of Semantic Parsing via Multilingual Translation Luigi Procopio, Rocco Tripodi and Roberto Navigli | |
Fool Me Twice: Entailment from Wikipedia Gamification Julian Eisenschlos, Bhuwan Dhingra, Jannis Bulian, Benjamin Börschinger and Jordan Boyd-Graber | |
Meta-Learning for Domain Generalization in Semantic Parsing Bailin Wang, Mirella Lapata and Ivan Titov | |
10:20–11:40 Session 2 (click to expand/collapse) | |
10:20–11:40 2A: Language Generation | |
Session chair: Peng Qi (JD AI Research) | |
Aspect-Controlled Neural Argument Generation Benjamin Schiller, Johannes Daxenberger and Iryna Gurevych | |
Text Generation from Discourse Representation Structures Jiangming Liu, Shay B. Cohen and Mirella Lapata | |
APo-VAE: Text Generation in Hyperbolic Space Shuyang Dai, Zhe Gan, Yu Cheng, Chenyang Tao, Lawrence Carin and Jingjing Liu | |
DART: Open-Domain Structured Data Record to Text Generation Linyong Nan, Dragomir Radev, Rui Zhang, Amrit Rau, Abhinand Sivaprasad, Chiachun Hsieh, Xiangru Tang, Aadit Vyas, Neha Verma, Pranav Krishna, Yangxiaokang Liu, Nadia Irwanto, Jessica Pan, Faiaz Rahman, Ahmad Zaidi, Mutethia Mutuma, Yasin Tarabar, Ankit Gupta, Tao Yu, Yi Chern Tan, Xi Victoria Lin, Caiming Xiong, Richard Socher and Nazneen Fatema Rajani | |
[TACL] An Error Analysis Framework for Shallow Surface Realisation Shimorina, Anastasia, Parmentier, Yannick, Gardent, Claire | |
TuringAdvice: A Generative and Dynamic Evaluation of Language Use Rowan Zellers, Ari Holtzman, Elizabeth Clark, Lianhui Qin, Ali Farhadi and Yejin Choi | |
10:20–11:40 2B: Multilinguality | |
Session chair: Jonathan Clark (Google Research) | |
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models Benjamin Muller, Antonios Anastasopoulos, Benoît Sagot and Djamé Seddah | |
Multi-Adversarial Learning for Cross-Lingual Word Embeddings Haozhou Wang, James Henderson and Paola Merlo | |
Multi-view Subword Regularization Xinyi Wang, Sebastian Ruder and Graham Neubig | |
mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua and Colin Raffel | |
MetaXL: Meta Representation Transformation for Low-resource Cross-lingual Learning Mengzhou Xia, Guoqing Zheng, Subhabrata Mukherjee, Milad Shokouhi, Graham Neubig and Ahmed Hassan Awadallah | |
[TACL] Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages Edoardo M. Ponti, Ivan Vulić, Ryan Cotterell, Marinela Parovic, Roi Reichart, Anna Korhonen | |
10:20–11:40 2C: Question Answering | |
Session chair: Sara Rosenthal (IBM Research) | |
Open Domain Question Answering over Tables via Dense Retrieval Jonathan Herzig, Thomas Müller, Syrine Krichene and Julian Eisenschlos | |
Open-Domain Question Answering Goes Conversational via Question Rewriting Raviteja Anantha, Svitlana Vakulenko, Zhucheng Tu, Shayne Longpre, Stephen Pulman and Srinivas Chappidi | |
QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering Michihiro Yasunaga, Hongyu Ren, Antoine Bosselut, Percy Liang and Jure Leskovec | |
XOR QA: Cross-lingual Open-Retrieval Question Answering Akari Asai, Jungo Kasai, Jonathan Clark, Kenton Lee, Eunsol Choi and Hannaneh Hajishirzi | |
SPARTA: Efficient Open-Domain Question Answering via Sparse Transformer Matching Retrieval Tiancheng Zhao, Xiaopeng Lu and Kyusong Lee | |
[TACL] Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies Mor Geva, Daniel Khashabi, Elad Segal, Tushar Khot, Dan Roth, Jonathan Berant | |
10:20–11:40 2D: Special Theme: New Challenges in NLP | |
Session chair: Ahmed Awadallah (Microsoft Research ) | |
Implicitly Abusive Language – What does it actually look like and why are we not getting there? Michael Wiegand, Josef Ruppenhofer and Elisabeth Eder | |
The Importance of Modeling Social Factors of Language: Theory and Practice Dirk Hovy and Diyi Yang | |
On learning and representing social meaning in NLP: a sociolinguistic perspective Dong Nguyen, Laura Rosseel and Jack Grieve | |
Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence Tal Schuster, Adam Fisch and Regina Barzilay | |
Representing Numbers in NLP: a Survey and a Vision Avijit Thawani, Jay Pujara, Filip Ilievski and Pedro Szekely | |
What Will it Take to Fix Benchmarking in Natural Language Understanding? Samuel R. Bowman and George Dahl | |
10:20–11:40 2E: Summarization | |
Session chair: Fei Liu (University of Central Florida) | |
Extending Multi-Document Summarization Evaluation to the Interactive Setting Ori Shapira, Ramakanth Pasunuru, Hadar Ronen, Mohit Bansal, Yael Amsterdamer and Ido Dagan | |
Identifying Helpful Sentences in Product Reviews Iftah Gamzu, Hila Gonen, Gilad Kutiel, Ran Levy and Eugene Agichtein | |
Noisy Self-Knowledge Distillation for Text Summarization Yang Liu, Sheng Shen and Mirella Lapata | |
Improving Zero and Few-Shot Abstractive Summarization with Intermediate Fine-tuning and Data Augmentation Alexander Fabbri, Simeng Han, Haoyuan Li, Haoran Li, Marjan Ghazvininejad, Shafiq Joty, Dragomir Radev and Yashar Mehdad | |
Enhancing Factual Consistency of Abstractive Summarization Chenguang Zhu, William Hinthorn, Ruochen Xu, Qingkai Zeng, Michael Zeng, Xuedong Huang and Meng Jiang | |
[TACL] Extractive Opinion Summarization in Quantized Transformer Spaces Stefanos Angelidis, Reinald Kim Amplayo, Yoshihiko Suhara, Xiaolan Wang, Mirella Lapata | |
11:40–13:00 Session 3 (click to expand/collapse) | |
11:40–13:00 3A: Dialogue and Interactive Systems | |
Session chair: Nigel Ward (University of Texas at El Paso) | |
Few-shot Intent Classification and Slot Filling with Retrieved Examples Dian Yu, Luheng He, Yuan Zhang, Xinya Du, Panupong Pasupat and Qi Li | |
"Nice Try, Kiddo": Investigating Ad Hominems in Dialogue Responses Emily Sheng, Kai-Wei Chang, Prem Natarajan and Nanyun Peng | |
Human-like informative conversations: Better acknowledgements using conditional mutual information Ashwin Paranjape and Christopher Manning | |
A Comparative Study on Schema-Guided Dialogue State Tracking Jie Cao and Yi Zhang | |
Spoken Language Understanding for Task-oriented Dialogue Systems with Augmented Memory Networks Jie Wu, Ian Harris and Hongzhi Zhao | |
How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds Prithviraj Ammanabrolu, Jack Urbanek, Margaret Li, Arthur Szlam, Tim Rocktäschel and Jason Weston | |
11:40–13:00 3B: Information Extraction | |
Session chair: Yuhao Zhang (Amazon, AWS AI) | |
Linking Entities to Unseen Knowledge Bases with Arbitrary Schemas Yogarshi Vyas and Miguel Ballesteros | |
Self-Training with Weak Supervision Giannis Karamanolakis, Subhabrata Mukherjee, Guoqing Zheng and Ahmed Hassan Awadallah | |
Neural Language Modeling for Contextualized Temporal Graph Generation Aman Madaan and Yiming Yang | |
Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning Xuelu Chen, Michael Boratko, Muhao Chen, Shib Sankar Dasgupta, Xiang Lorraine Li and Andrew McCallum | |
Document-Level Event Argument Extraction by Conditional Generation Sha Li, Heng Ji and Jiawei Han | |
Template Filling with Generative Transformers Xinya Du, Alexander Rush and Claire Cardie | |
11:40–13:00 3C: Interpretability and Analysis of Models for NLP | |
Session chair: Siva Reddy (McGill/Mila) | |
Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU models Mengnan Du, Varun Manjunatha, Rajiv Jain, Ruchi Deshpande, Franck Dernoncourt, Jiuxiang Gu, Tong Sun and Xia Hu | |
On Attention Redundancy: A Comprehensive Study Yuchen Bian, Jiaji Huang, Xingyu Cai, Jiahong Yuan and Kenneth Church | |
Does BERT Pretrained on Clinical Notes Reveal Sensitive Data? Eric Lehman, Sarthak Jain, Karl Pichotta, Yoav Goldberg and Byron Wallace | |
Low-Complexity Probing via Finding Subnetworks Victor Sanh and Alexander Rush | |
An Empirical Comparison of Instance Attribution Methods for NLP Pouya Pezeshkpour, Sarthak Jain, Byron Wallace and Sameer Singh | |
Generalization in Instruction Following Systems Soham Dan, Michael Zhou and Dan Roth | |
[CL] Interpretability Analysis for Named Entity Recognition to Understand System Predictions and How They Can Improve Oshin Agarwal, Yinfei Yang, Byron C. Wallace, Ani Nenkova | |
11:40–13:00 3D: Language Grounding to Vision, Robotics and Beyond | |
Session chair: Karthik Narasimhan (Princeton University) | |
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval Siqi Sun, Yen-Chun Chen, Linjie Li, Shuohang Wang, Yuwei Fang and Jingjing Liu | |
Measuring Social Biases in Grounded Vision and Language Embeddings Candace Ross, Boris Katz and Andrei Barbu | |
MTAG: Modal-Temporal Attention Graph for Unaligned Human Multimodal Language Sequences Jianing Yang, Yongxin Wang, Ruitao Yi, Yuying Zhu, Azaan Rehman, Amir Zadeh, Soujanya Poria and Louis-Philippe Morency | |
Grounding Open-Domain Instructions to Automate Web Support Tasks Nancy Xu, Sam Masling, Michael Du, Giovanni Campagna, Larry Heck, James Landay and Monica Lam | |
Modular Networks for Compositional Instruction Following Rodolfo Corona, Daniel Fried, Coline Devin, Dan Klein and trevor darrell | |
Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information Jialu Li, Hao Tan and Mohit Bansal | |
11:40–13:00 3E: Machine Learning for NLP: Classification and Structured Prediction Models | |
Session chair: Qiang Ning (Amazon) | |
Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning Hui Liu, Danqing Zhang, Bing Yin and Xiaodan Zhu | |
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach Yue Yu, Simiao Zuo, Haoming Jiang, Wendi Ren, Tuo Zhao and Chao Zhang | |
Posterior Differential Regularization with f-divergence for Improving Model Robustness Hao Cheng, Xiaodong Liu, Lis Pereira, Yaoliang Yu and Jianfeng Gao | |
Understanding Hard Negatives in Noise Contrastive Estimation Wenzheng Zhang and Karl Stratos | |
Certified Robustness to Word Substitution Attack with Differential Privacy Wenjie Wang, Pengfei Tang, Jian Lou and Li Xiong | |
DReCa: A General Task Augmentation Strategy for Few-Shot Natural Language Inference Shikhar Murty, Tatsunori Hashimoto and Christopher Manning | |
16:00–17:00 Keynote | |
Session chair: Dilek Hakkani-Tur (Amazon Alexa AI) | |
From Disembodied to Embodied Multimodal Learning Dhruv Batra | |
17:00–18:20 Session 4 (click to expand/collapse) | |
17:00–18:20 4A: Machine Translation | |
Session chair: Wenhu Chen (UC Santa Barbara/Google AI) | |
Harnessing Multilinguality in Unsupervised Machine Translation for Rare Languages Xavier Garcia, Aditya Siddhant, Orhan Firat and Ankur Parikh | |
Macro-Average: Rare Types Are Important Too Thamme Gowda, Weiqiu You, Constantine Lignos and Jonathan May | |
Assessing Reference-Free Peer Evaluation for Machine Translation Sweta Agrawal, George Foster, Markus Freitag and Colin Cherry | |
The Curious Case of Hallucinations in Neural Machine Translation Vikas Raunak, Arul Menezes and Marcin Junczys-Dowmunt | |
Towards Continual Learning for Multilingual Machine Translation via Vocabulary Substitution Xavier Garcia, Noah Constant, Ankur Parikh and Orhan Firat | |
Towards Modeling the Style of Translators in Neural Machine Translation Yue Wang, Cuong Hoang and Marcello Federico | |
[TACL] Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings Phillip Keung, Julian Salazar, Yichao Lu, Noah A. Smith | |
17:00–18:20 4B: Question Answering | |
Session chair: Bhuwan Dhingra (Google AI) | |
Self-Supervised Test-Time Learning for Reading Comprehension Pratyay Banerjee, Tejas Gokhale and Chitta Baral | |
Capturing Row and Column Semantics in Transformer Based Question Answering over Tables Michael Glass, Mustafa Canim, Alfio Gliozzo, Saneem Chemmengath, Vishwajeet Kumar, Rishav Chakravarti, Avi Sil, Feifei Pan, Samarth Bharadwaj and Nicolas Rodolfo Fauceglia | |
Explainable Multi-hop Verbal Reasoning Through Internal Monologue Zhengzhong Liang, Steven Bethard and Mihai Surdeanu | |
Robust Question Answering Through Sub-part Alignment Jifan Chen and Greg Durrett | |
Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models Tushar Khot, Daniel Khashabi, Kyle Richardson, Peter Clark and Ashish Sabharwal | |
RECONSIDER: Improved Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering Srinivasan Iyer, Sewon Min, Yashar Mehdad and Wen-tau Yih | |
On the Transferability of Minimal Prediction Preserving Inputs in Question Answering Shayne Longpre, Yi Lu and Chris DuBois | |
17:00–18:20 4C: Sentence-level Semantics and Textual Inference | |
Session chair: Ves Stoyanov (Facebook AI) | |
Understanding by Understanding Not: Modeling Negation in Language Models Arian Hosseini, Siva Reddy, Dzmitry Bahdanau, R Devon Hjelm, Alessandro Sordoni and Aaron Courville | |
DuoRAT: Towards Simpler Text-to-SQL Models Torsten Scholak, Raymond Li, Dzmitry Bahdanau, Harm de Vries and Chris Pal | |
Looking Beyond Sentence-Level Natural Language Inference for Question Answering and Text Summarization Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Lorraine Li, Pavan Kapanipathi and Kartik Talamadupula | |
Structure-Grounded Pretraining for Text-to-SQL Xiang Deng, Ahmed Hassan Awadallah, Christopher Meek, Oleksandr Polozov, Huan Sun and Matthew Richardson | |
Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System Congying Xia, Wenpeng Yin, Yihao Feng and Philip Yu | |
Temporal Reasoning on Implicit Events from Distant Supervision Ben Zhou, Kyle Richardson, Qiang Ning, Tushar Khot, Ashish Sabharwal and Dan Roth | |
Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models James Y. Huang, Kuan-Hao Huang and Kai-Wei Chang | |
17:00–18:20 4D: Summarization | |
Session chair: Rui Zhang (Penn State University) | |
Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs Jiaao Chen and Diyi Yang | |
A New Approach to Overgenerating and Scoring Abstractive Summaries Kaiqiang Song, Bingqing Wang, Zhe Feng and Fei Liu | |
D2S: Document-to-Slide Generation Via Query-Based Text Summarization Edward Sun, Yufang Hou, Dakuo Wang, Yunfeng Zhang and Nancy X. R. Wang | |
Efficient Attentions for Long Document Summarization Luyang Huang, Shuyang Cao, Nikolaus Parulian, Heng Ji and Lu Wang | |
RefSum: Refactoring Neural Summarization Yixin Liu, Zi-Yi Dou and Pengfei Liu | |
Annotating and Modeling Fine-grained Factuality in Summarization Tanya Goyal and Greg Durrett | |
17:00–18:20 4E: Syntax: Tagging, Chunking, and Parsing | |
Session chair: Sheng Zhang (Microsoft Research ) | |
Larger-Context Tagging: When and Why Does It Work? Jinlan Fu, Liangjing Feng, Qi Zhang, Xuanjing Huang and Pengfei Liu | |
Neural Sequence Segmentation as Determining the Leftmost Segments Yangming Li, Lemao Liu and Kaisheng Yao | |
PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols Songlin Yang, Yanpeng Zhao and Kewei Tu | |
GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input Tao Meng, Anjie Fang, Oleg Rokhlenko and Shervin Malmasi | |
[CL] Universal Dependencies Marie-Catherine de Marneffe, Christopher D. Manning, Joakim Nivre, Daniel Zeman | |
18:20–19:40 Session 5 (click to expand/collapse) | |
18:20–19:40 5A: Dialogue and Interactive Systems | |
Session chair: Huan Sun (The Ohio State University) | |
Generating Negative Samples by Manipulating Golden Responses for Unsupervised Learning of a Response Evaluation Model ChaeHun Park, Eugene Jang, Wonsuk Yang and Jong Park | |
How Robust are Fact Checking Systems on Colloquial Claims? Byeongchang Kim, Hyunwoo Kim, Seokhee Hong and Gunhee Kim | |
Fine-grained Post-training for Improving Retrieval-based Dialogue Systems Janghoon Han, Taesuk Hong, Byoungjae Kim, Youngjoong Ko and Jungyun Seo | |
Put Chatbot into Its Interlocutor’s Shoes: New Framework to Learn Chatbot Responding with Intention Hsuan Su, Jiun-Hao Jhan, Fan-yun Sun, Saurav Sahay and Hung-yi Lee | |
Adding Chit-Chat to Enhance Task-Oriented Dialogues Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho and Claire Cardie | |
18:20–19:40 5B: Discourse and Pragmatics | |
Session chair: Yangfeng Ji (University of Virginia) | |
Incorporating Syntax and Semantics in Coreference Resolution with Heterogeneous Graph Attention Network Fan Jiang and Trevor Cohn | |
Context Tracking Network: Graph-based Context Modeling for Implicit Discourse Relation Recognition Yingxue Zhang, Fandong Meng, Peng Li, Ping Jian and Jie Zhou | |
Improving Neural RST Parsing Model with Silver Agreement Subtrees Naoki Kobayashi, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura and Masaaki Nagata | |
RST Parsing from Scratch Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li | |
Did they answer? Subjective acts and intents in conversational discourse Elisa Ferracane, Greg Durrett, Junyi Jessy Li and Katrin Erk | |
Evaluating the Impact of a Hierarchical Discourse Representation on Entity Coreference Resolution Performance Sopan Khosla, James Fiacco and Carolyn Rosé | |
Bridging Resolution: Making Sense of the State of the Art Hideo Kobayashi and Vincent Ng | |
18:20–19:40 5C: Machine Learning for NLP: Language Modeling and Sequence to Sequence Models | |
Session chair: Lei Yu (DeepMind) | |
Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle Yikang Shen, Shawn Tan, Alessandro Sordoni, Siva Reddy and Aaron Courville | |
Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation Samuel Kiegeland and Julia Kreutzer | |
Learning to Organize a Bag of Words into Sentences with Neural Networks: An Empirical Study Chongyang Tao, Shen Gao, Juntao Li, Yansong Feng, Dongyan Zhao and Rui Yan | |
Mask Attention Networks: Rethinking and Strengthen Transformer Zhihao Fan, Yeyun Gong, Dayiheng Liu, Zhongyu Wei, Siyuan Wang, Jian Jiao, Nan Duan, Ruofei Zhang and Xuanjing Huang | |
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding Dongling Xiao, Yu-Kun Li, Han Zhang, Yu Sun, Hao Tian, Hua Wu and Haifeng Wang | |
Lattice-BERT: Leveraging Multi-Granularity Representations in Chinese Pre-trained Language Models Yuxuan Lai, Yijia Liu, Yansong Feng, Songfang Huang and Dongyan Zhao | |
18:20–19:40 5D: Lexical Semantics | |
Session chair: Ken Church (Baidu) | |
Modeling Event Plausibility with Consistent Conceptual Abstraction Ian Porada, Kaheer Suleman, Adam Trischler and Jackie Chi Kit Cheung | |
UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus George Michalopoulos, Yuanxin Wang, Hussam Kaka, Helen Chen and Alexander Wong | |
Field Embedding: A Unified Grain-Based Framework for Word Representation Junjie Luo, Xi Chen, Jichao Sun, Yuejia Xiang, Ningyu Zhang and Xiang Wan | |
MelBERT: Metaphor Detection via Contextualized Late Interaction using Metaphorical Identification Theories Minjin Choi, Sunkyung Lee, Eunseong Choi, Heesoo Park, Junhyuk Lee, Dongwon Lee and Jongwuk Lee | |
Non-Parametric Few-Shot Learning for Word Sense Disambiguation Howard Chen, Mengzhou Xia and Danqi Chen | |
18:20–19:40 5E: Sentiment Analysis and Stylistic Analysis | |
Session chair: Shi Zong (Nanjing University) | |
Why Do Document-Level Polarity Classifiers Fail? Karen Martins, Pedro O.S Vaz-de-Melo and Rodrygo Santos | |
A Unified Span-Based Approach for Opinion Mining with Syntactic Constituents Qingrong Xia, Bo Zhang, Rui Wang, Zhenghua Li, Yue Zhang, Fei Huang, Luo Si and Min Zhang | |
Target-specified Sequence Labeling with Multi-head Self-attention for Target-oriented Opinion Words Extraction Yuhao Feng, Yanghui Rao, Yuyao Tang, Ninghua Wang and He Liu | |
Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa Junqi Dai, Hang Yan, Tianxiang Sun, Pengfei Liu and Xipeng Qiu | |
Domain Divergences: A Survey and Empirical Analysis Abhinav Ramesh Kashyap, Devamanyu Hazarika, Min-Yen Kan and Roger Zimmermann | |
Target-Aware Data Augmentation for Stance Detection Yingjie Li and Cornelia Caragea | |
19:40–21:00 Session 6 (click to expand/collapse) | |
19:40–21:00 6A: Speech | |
Session chair: Yao Qian (Microsoft) | |
End-to-end ASR to jointly predict transcriptions and linguistic annotations Motoi Omachi, Yuya Fujita, Shinji Watanabe and Matthew Wiesner | |
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation Hirofumi Inaguma, Tatsuya Kawahara and Shinji Watanabe | |
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks Siddharth Dalmia, Brian Yan, Vikas Raunak, Florian Metze and Shinji Watanabe | |
SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding Yu-An Chung, Chenguang Zhu and Michael Zeng | |
Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering Kiran Ramnath, Leda Sari, Mark Hasegawa-Johnson and Chang Yoo | |
Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment Ethan A. Chi, Julian Salazar and Katrin Kirchhoff | |
19:40–21:00 6B: NLP Applications | |
Session chair: Wenhan Xiong (Facebook AI) | |
Everything Has a Cause: Leveraging Causal Inference in Legal Text Analysis Xiao Liu, Da Yin, Yansong Feng, Yuting Wu and Dongyan Zhao | |
Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network Haoran Wu, Wei Chen, Shuang Xu and Bo Xu | |
Personalized Response Generation via Generative Split Memory Network Yuwei Wu, Xuezhe Ma and Diyi Yang | |
Towards Few-shot Fact-Checking via Perplexity Nayeon Lee, Yejin Bang, Andrea Madotto and Pascale Fung | |
Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation Rishi Hazra, Parag Dutta, Shubham Gupta, Mohammed Abdul Qaathir and Ambedkar Dukkipati | |
Generating An Optimal Interview Question Plan Using A Knowledge Graph And Integer Linear Programming Soham Datta, Prabir Mallick, Sangameshwar Patil, Indrajit Bhattacharya and Girish Palshikar | |
19:40–21:00 6C: Machine Learning for NLP: Classification and Structured Prediction Models | |
Session chair: Lingpeng Kong (The University of Hong Kong) | |
Model Extraction and Adversarial Transferability, Your BERT is Vulnerable! Xuanli He, Lingjuan Lyu, Lichao Sun and Qiongkai Xu | |
A Global Past-Future Early Exit Method for Accelerating Inference of Pre-trained Language Models Kaiyuan Liao, Yi Zhang, Xuancheng Ren, Qi Su, Xu Sun and Bin He | |
Masked Conditional Random Fields for Sequence Labeling Tianwen Wei, Jianwei Qi, Shenghuan He and Songtao Sun | |
Heterogeneous Graph Neural Networks for Concept Prerequisite Relation Learning in Educational Data Chenghao Jia, Yongliang Shen, Yechun Tang, Lu Sun and Weiming Lu | |
Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models Wenkai Yang, Lei Li, Zhiyuan Zhang, Xuancheng Ren, Xu Sun and Bin He | |
DA-Transformer: Distance-aware Transformer Chuhan Wu, Fangzhao Wu and Yongfeng Huang | |
19:40–21:00 6D: Language Resources and Evaluation | |
Session chair: Alexandros Papangelis (Amazon, Alexa AI) | |
ASAP: A Chinese Review Dataset Towards Aspect Category Sentiment Analysis and Rating Prediction Jiahao Bu, Lei Ren, Shuang Zheng, Yang Yang, Jingang Wang, Fuzheng Zhang and Wei Wu | |
Are NLP Models really able to Solve Simple Math Word Problems? Arkil Patel, Satwik Bhattamishra and Navin Goyal | |
WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations Tomoyuki Kajiwara, Chenhui Chu, Noriko Takemura, Yuta Nakashima and Hajime Nagahara | |
KPQA: A Metric for Generative Question Answering Using Keyphrase Weights Hwanhee Lee, Seunghyun Yoon, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Joongbo Shin and Kyomin Jung | |
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer Yiwei Lyu, Paul Pu Liang, Hai Pham, Eduard Hovy, Barnabás Póczos, Ruslan Salakhutdinov and Louis-Philippe Morency | |
Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge Canwen Xu, Wangchunshu Zhou, Tao Ge, Ke Xu, Julian McAuley and Furu Wei | |
COVID-19 Named Entity Recognition for Vietnamese Thinh Hung Truong, Mai Hoang Dao and Dat Quoc Nguyen | |
19:40–21:00 6E: Computational Social Science and Cultural Analytics | |
Session chair: Vivek Kulkarni (Twitter) | |
Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames Shima Khanehzar, Trevor Cohn, Gosia Mikolajczak, Andrew Turpin and Lea Frermann | |
Automatic Classification of Neutralization Techniques in the Narrative of Climate Change Scepticism Shraey Bhatia, Jey Han Lau and Timothy Baldwin | |
Suicide Ideation Detection via Social and Temporal User Representations using Hyperbolic Learning Ramit Sawhney, Harshit Joshi, Rajiv Ratn Shah and Lucie Flek | |
WikiTalkEdit: A Dataset for modeling Editors’ behaviors on Wikipedia Kokil Jaidka, Andrea Ceolin, Iknoor Singh, Niyati Chhaya and Lyle Ungar | |
The structure of online social networks modulates the rate of lexical change Jian Zhu and David Jurgens | |
Modeling Framing in Immigration Discourse on Social Media Julia Mendelsohn, Ceren Budak and David Jurgens | |
Tue 08 Jun 2021 (all times PDT, UTC-7) | |
08:00–09:00 Keynote | |
Session chair: Luke Zettlemoyer (University of Washington & Facebook) | |
Generating Reality: Technical and Social Explorations in Generative Machine Learning Research Shakir Mohamed | |
09:00–10:20 Session 7 (click to expand/collapse) | |
09:00–10:20 7A: Computational Social Science and Cultural Analytics | |
Session chair: Dallas Card (Stanford) | |
Modeling the Severity of Complaints in Social Media Mali Jin and Nikolaos Aletras | |
What About the Precedent: An Information-Theoretic Analysis of Common Law Josef Valvoda, Tiago Pimentel, Niklas Stoehr, Ryan Cotterell and Simone Teufel | |
Introducing CAD: the Contextual Abuse Dataset Bertie Vidgen, Dong Nguyen, Helen Margetts, Patricia Rossini and Rebekah Tromble | |
Lifelong Learning of Hate Speech Classification on Social Media Jing Qian, Hong Wang, Mai ElSherief and Xifeng Yan | |
Learning to Recognize Dialect Features Dorottya Demszky, Devyani Sharma, Jonathan Clark, Vinodkumar Prabhakaran and Jacob Eisenstein | |
[TACL] Characterizing English Variation across Social Media Communities with BERT Lucy Li, David Bamman | |
09:00–10:20 7B: Green NLP | |
Session chair: Roy Schwartz (The Hebrew University of Jerusalem) | |
Static Embeddings as Efficient Knowledge Bases? Philipp Dufter, Nora Kassner and Hinrich Schütze | |
Highly Efficient Knowledge Graph Embedding Learning with Orthogonal Procrustes Analysis Xutan Peng, Guanyi Chen, Chenghua Lin and Mark Stevenson | |
Rethinking Network Pruning – under the Pre-train and Fine-tune Paradigm Dongkuan Xu, Ian En-Hsu Yen, Jinxi Zhao and Zhibin Xiao | |
Towards a Comprehensive Understanding and Accurate Evaluation of Societal Biases in Pre-Trained Transformers Andrew Silva, Pradyumna Tambwekar and Matthew Gombolay | |
Detoxifying Language Models Risks Marginalizing Minority Voices Albert Xu, Eshaan Pathak, Eric Wallace, Suchin Gururangan, Maarten Sap and Dan Klein | |
HONEST: Measuring Hurtful Sentence Completion in Language Models Debora Nozza, Federico Bianchi and Dirk Hovy | |
09:00–10:20 7C: Language Grounding to Vision, Robotics and Beyond | |
Session chair: Xin Eric Wang (UC Santa Cruz) | |
EaSe: A Diagnostic Tool for VQA based on Answer Diversity Shailza Jolly, Sandro Pezzelle and Moin Nabi | |
DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization Zineng Tang, Jie Lei and Mohit Bansal | |
Improving Generation and Evaluation of Visual Stories via Semantic Consistency Adyasha Maharana, Darryl Hannan and Mohit Bansal | |
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models Po-Yao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze and Alexander Hauptmann | |
Video Question Answering with Phrases via Semantic Roles Arka Sadhu, Kan Chen and Ram Nevatia | |
[TACL] Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering Ben Bogin: ben.bogin@, Jonathan Berant, Sanjay Subramanian, Matt Gardner | |
09:00–10:20 7D: Language Resources and Evaluation | |
Session chair: Sowmya Vajjala (National Research Council, Canada) | |
From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding Rob van der Goot, Ibrahim Sharaf, Aizhan Imankulova, Ahmet Üstün, Marija Stepanović, Alan Ramponi, Siti Oryza Khairunnisa, Mamoru Komachi and Barbara Plank | |
WEC: Deriving a Large-scale Cross-document Event Coreference dataset from Wikipedia Alon Eirew, Arie Cattan and Ido Dagan | |
Challenging distributional models with a conceptual network of philosophical terms Yvette Oortwijn, Jelke Bloem, Pia Sommerauer, Francois Meyer, Wei Zhou and Antske Fokkens | |
KILT: a Benchmark for Knowledge Intensive Language Tasks Fabio Petroni, Aleksandra Piktus, Angela Fan, Patrick Lewis, Majid Yazdani, Nicola De Cao, James Thorne, Yacine Jernite, Vladimir Karpukhin, Jean Maillard, Vassilis Plachouras, Tim Rocktäschel and Sebastian Riedel | |
[TACL] AMR Similarity Metrics from Principles Juri Opitz, Letitia Parcalabescu, Anette Frank | |
[TACL] Evaluating Document Coherence Modelling Aili Shen, Meladel Mistica, Bahar Salehi, Hang Li, Timothy Baldwin, Jianzhong Qi | |
09:00–10:20 7E: Machine Learning for NLP: Classification and Structured Prediction Models | |
Session chair: Paul Michel (Carnegie Mellon University) | |
A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios Michael A. Hedderich, Lukas Lange, Heike Adel, Jannik Strötgen and Dietrich Klakow | |
Temporal Knowledge Graph Completion using a Linear Temporal Regularizer and Multivector Embeddings Chengjin Xu, Yung-Yu Chen, Mojtaba Nayyeri and Jens Lehmann | |
UDALM: Unsupervised Domain Adaptation through Language Modeling Constantinos Karouzos, Georgios Paraskevopoulos and Alexandros Potamianos | |
Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning Tommaso Fornaciari, Alexandra Uma, Silviu Paun, Barbara Plank, Dirk Hovy and Massimo Poesio | |
Clustering-based Inference for Biomedical Entity Linking Rico Angell, Nicholas Monath, Sunil Mohan, Nishant Yadav and Andrew McCallum | |
Variance-reduced First-order Meta-learning for Natural Language Processing Tasks Lingxiao Wang, Kevin Huang, Tengyu Ma, Quanquan Gu and Jing Huang | |
Diversity-Aware Batch Active Learning for Dependency Parsing Tianze Shi, Adrian Benton, Igor Malioutov and Ozan İrsoy | |
10:20–11:40 Session 8 (click to expand/collapse) | |
10:20–11:40 8A: Machine Learning for NLP: Language Modeling and Sequence to Sequence Models | |
Session chair: Srini Iyer (Facebook AI Research) | |
Can Latent Alignments Improve Autoregressive Machine Translation? Adi Haviv, Lior Vassertail and Omer Levy | |
Smoothing and Shrinking the Sparse Seq2Seq Search Space Ben Peters and André F. T. Martins | |
Unified Pre-training for Program Understanding and Generation Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray and Kai-Wei Chang | |
Hyperparameter-free Continuous Learning for Domain Classification in Natural Language Understanding Ting Hua, Yilin Shen, Changsheng Zhao, Yen-Chang Hsu and Hongxia Jin | |
[TACL] A Primer in BERTology: What We Know About How BERT Works Anna Rogers, Olga Kovaleva, Anna Rumshisky | |
10:20–11:40 8B: NLP Applications | |
Session chair: Emily Prud'hommeaux (Boston College) | |
On the Embeddings of Variables in Recurrent Neural Networks for Source Code Nadezhda Chirkova | |
Cross-Lingual Word Embedding Refinement by $_1$ Norm Optimisation Xutan Peng, Chenghua Lin and Mark Stevenson | |
Semantic Frame Forecast Chieh-Yang Huang and Ting-Hao Huang | |
MUSER: MUltimodal Stress detection using Emotion Recognition as an Auxiliary Task Yiqun Yao, Michalis Papakostas, Mihai Burzo, Mohamed Abouelenien and Rada Mihalcea | |
Learning to Decompose and Organize Complex Tasks Yi Zhang, Sujay Kumar Jauhar, Julia Kiseleva, Ryen White and Dan Roth | |
Continual Learning for Text Classification with Information Disentanglement Based Regularization Yufan Huang, Yanzhe Zhang, Jiaao Chen, Xuezhi Wang and Diyi Yang | |
10:20–11:40 8C: Sentence-level Semantics and Textual Inference | |
Session chair: Mrinmaya Sachan (ETH Zurich) | |
Learning from Executions for Semantic Parsing Bailin Wang, Mirella Lapata and Ivan Titov | |
Learning to Synthesize Data for Semantic Parsing Bailin Wang, Wenpeng Yin, Xi Victoria Lin and Caiming Xiong | |
Edge: Enriching Knowledge Graph Embeddings with External Text Saed Rezayi, Handong Zhao, Sungchul Kim, Ryan Rossi, Nedim Lipka and Sheng Li | |
FLIN: A Flexible Natural Language Interface for Web Navigation Sahisnu Mazumder and Oriana Riva | |
Game-theoretic Vocabulary Selection via the Shapley Value and Banzhaf Index Roma Patel, Marta Garnelo, Ian Gemp, Chris Dyer and Yoram Bachrach | |
Incorporating External Knowledge to Enhance Tabular Reasoning J. Neeraja, Vivek Gupta and Vivek Srikumar | |
Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention Pengcheng Yin, Hao Fang, Graham Neubig, Adam Pauls, Emmanouil Antonios Platanios, Yu Su, Sam Thomson and Jacob Andreas | |
10:20–11:40 8D: Sentiment Analysis and Stylistic Analysis | |
Session chair: Preslav Nakov (Qatar Computing Research Institute, HBKU) | |
Domain Adaptation for Arabic Cross-Domain and Cross-Dialect Sentiment Analysis from Contextualized Word Embedding Abdellah El Mekki, Abdelkader El Mahdaouy, Ismail Berrada and Ahmed Khoumsi | |
Multi-task Learning of Negation and Speculation for Targeted Sentiment Classification Andrew Moore and Jeremy Barnes | |
A Disentangled Adversarial Neural Topic Model for Separating Opinions from Plots in User Reviews Gabriele Pergola, Lin Gui and Yulan He | |
Graph Ensemble Learning over Multiple Dependency Trees for Aspect-level Sentiment Classification Xiaochen Hou, Peng Qi, Guangtao Wang, Rex Ying, Jing Huang, Xiaodong He and Bowen Zhou | |
Emotion-Infused Models for Explainable Psychological Stress Detection Elsbeth Turcan, Smaranda Muresan and Kathleen McKeown | |
Aspect-based Sentiment Analysis with Type-aware Graph Convolutional Networks and Layer Ensemble Yuanhe Tian, Guimin Chen and Yan Song | |
10:20–11:40 8E: Syntax: Tagging, Chunking, and Parsing | |
Session chair: Mike Lewis (Facebook AI) | |
Supertagging-based Parsing with Linear Context-free Rewriting Systems Thomas Ruprecht and Richard Mörbitz | |
Outside Computation with Superior Functions Parker Riley and Daniel Gildea | |
Learning Syntax from Naturally-Occurring Bracketings Tianze Shi, Ozan İrsoy, Igor Malioutov and Lillian Lee | |
[CL] What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions? Miryam de Lhoneux, Sara Stymne, Joakim Nivre | |
[TACL] Reducing Confusion in Active Learning for Part-Of-Speech Tagging Aditi Chaudhary, Antonios Anastasopoulos, Zaid Sheikh, Graham Neubig | |
11:40–13:00 Business Meeting | |
16:00–17:00 Keynote | |
Session chair: Luke Zettlemoyer (University of Washington & Facebook) | |
Moving the Needle in NLP Technology for the Processing of Code-Switched Language Thamar Solorio | |
17:00–18:20 Session 9 (click to expand/collapse) | |
17:00–18:20 9A: Dialogue and Interactive Systems | |
Session chair: Yang Liu (Amazon, Alexa AI) | |
Bot-Adversarial Dialogue for Safe Conversational Agents Jing Xu, Da Ju, Margaret Li, Y-Lan Boureau, Jason Weston and Emily Dinan | |
Non-Autoregressive Semantic Parsing for Compositional Task-Oriented Dialog Arun Babu, Akshat Shrivastava, Armen Aghajanyan, Ahmed Aly, Angela Fan and Marjan Ghazvininejad | |
Example-Driven Intent Prediction with Observers Shikib Mehri and Mihail Eric | |
Imperfect also Deserves Reward: Multi-Level and Sequential Reward Modeling for Better Dialog Management Zhengxu Hou, Bang Liu, Ruihui Zhao, Zijing Ou, Yafei Liu, Xi Chen and Yefeng Zheng | |
Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems Derek Chen, Howard Chen, Yi Yang, Alexander Lin and Zhou Yu | |
Controlling Dialogue Generation with Semantic Exemplars Prakhar Gupta, Jeffrey Bigham, Yulia Tsvetkov and Amy Pavel | |
17:00–18:20 9B: Information Retrieval and Text Mining | |
Session chair: Qingyao Ai (University of Utah) | |
COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List Luyu Gao, Zhuyun Dai and Jamie Callan | |
X-Class: Text Classification with Extremely Weak Supervision Zihan Wang, Dheeraj Mekala and Jingbo Shang | |
Fine-tuning Encoders for Improved Monolingual and Zero-shot Polylingual Neural Topic Modeling Aaron Mueller and Mark Dredze | |
Exploring the Relationship Between Algorithm Performance, Vocabulary, and Run-Time in Text Classification Wilson Fearn, Orion Weller and Kevin Seppi | |
Faithfully Explainable Recommendation via Neural Logic Reasoning Yaxin Zhu, Yikun Xian, Zuohui Fu, Gerard de Melo and Yongfeng Zhang | |
You Sound Like Someone Who Watches Drama Movies: Towards Predicting Movie Preferences from Conversational Interactions Sergey Volokhin, Joyce Ho, Oleg Rokhlenko and Eugene Agichtein | |
[TACL] Sparse, Dense, and Attentional Representations for Text Retrieval Yi Luan, Jacob Eisenstein, Kristina Toutanova, Michael Collins | |
17:00–18:20 9C: Language Grounding to Vision, Robotics and Beyond | |
Session chair: John Lalor (University of Notre Dame) | |
Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents Shunyu Yao, Karthik Narasimhan and Matthew Hausknecht | |
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency Sameer Dharur, Purva Tendulkar, Dhruv Batra, Devi Parikh and Ramprasaath R. Selvaraju | |
Semi-Supervised Policy Initialization for Playing Games with Language Hints Tsu-Jui Fu and William Yang Wang | |
Revisiting Document Representations for Large-Scale Zero-Shot Learning Jihyung Kil and Wei-Lun Chao | |
17:00–18:20 9D: Language Resources and Evaluation | |
Session chair: Pradeep Dasigi (Allen Institute for AI) | |
Negative language transfer in learner English: A new dataset Leticia Farias Wanderley, Nicole Zhao and Carrie Demmans Epp | |
SentSim: Crosslingual Semantic Evaluation of Machine Translation Yurun Song, Junchen Zhao and Lucia Specia | |
Quality Estimation for Image Captions Based on Large-scale Human Evaluations Tomer Levinboim, Ashish V. Thapliyal, Piyush Sharma and Radu Soricut | |
CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems Kushal Chawla, Jaysa Ramirez, Rene Clever, Gale Lucas, Jonathan May and Jonathan Gratch | |
News Headline Grouping as a Challenging NLU Task Philippe Laban, Lucas Bandarkar and Marti A. Hearst | |
Olá, Bonjour, Salve! XFORMAL: A Benchmark for Multilingual Formality Style Transfer Eleftheria Briakou, Di Lu, Ke Zhang and Joel Tetreault | |
17:00–18:20 9E: Machine Learning for NLP: Classification and Structured Prediction Models | |
Session chair: Guangtao Wang (JD AI Research) | |
Grouping Words with Semantic Diversity Karine Chubarian, Abdul Rafae Khan, Anastasios Sidiropoulos and Jia Xu | |
Noise Stability Regularization for Improving BERT Fine-tuning Hang Hua, Xingjian Li, Dejing Dou, Chengzhong Xu and Jiebo Luo | |
FlowPrior: Learning Expressive Priors for Latent Variable Sentence Models Xiaoan Ding and Kevin Gimpel | |
HTCInfoMax: A Global Model for Hierarchical Text Classification via Information Maximization Zhongfen Deng, Hao Peng, Dongxiao He, Jianxin Li and Philip Yu | |
[TACL] Modeling Content and Context with Deep Relational Learning Maria Leonor Pacheco, Dan Goldwasser | |
Knowledge Guided Metric Learning for Few-Shot Text Classification Dianbo Sui, Yubo Chen, Binjie Mao, Delai Qiu, Kang Liu and Jun Zhao | |
18:20–19:40 Session 10 (click to expand/collapse) | |
18:20–19:40 10A: Dialogue and Interactive Systems | |
Session chair: Ramesh Manuvinakurike (Intel labs) | |
Ensemble of MRR and NDCG models for Visual Dialog Idan Schwartz | |
Supervised Neural Clustering via Latent Structured Output Learning: Application to Question Intents Iryna Haponchyk and Alessandro Moschitti | |
ConVEx: Data-Efficient and Few-Shot Slot Labeling Matthew Henderson and Ivan Vulić | |
CREAD: Combined Resolution of Ellipses and Anaphora in Dialogues Bo-Hsiang Tseng, Shruti Bhargava, Jiarui Lu, Joel Ruben Antony Moniz, Dhivya Piraviperumal, Lin Li and Hong Yu | |
Knowledge-Driven Slot Constraints for Goal-Oriented Dialogue Systems Piyawat Lertvittayakumjorn, Daniele Bonadiman and Saab Mansour | |
Clipping Loops for Sample-Efficient Dialogue Policy Optimisation Yen-Chen Wu and Carl Edward Rasmussen | |
18:20–19:40 10B: Information Extraction | |
Session chair: Alan Ritter (Georgia Tech) | |
Integrating Lexical Information into Entity Neighbourhood Representations for Relation Prediction Ian Wood, Mark Johnson and Stephen Wan | |
Noisy-Labeled NER with Confidence Estimation Kun Liu, Yao Fu, Chuanqi Tan, Mosha Chen, Ningyu Zhang, Songfang Huang and Sheng Gao | |
TABBIE: Pretrained Representations of Tabular Data Hiroshi Iida, Dung Thai, Varun Manjunatha and Mohit Iyyer | |
Better Feature Integration for Named Entity Recognition Lu Xu, Zhanming Jie, Wei Lu and Lidong Bing | |
ZS-BERT: Towards Zero-Shot Relation Extraction with Attribute Representation Learning Chih-Yao Chen and Cheng-Te Li | |
Graph Convolutional Networks for Event Causality Identification with Rich Document-level Structures Minh Tran Phu and Thien Huu Nguyen | |
A Context-Dependent Gated Module for Incorporating Symbolic Semantics into Event Coreference Resolution Tuan Lai, Heng Ji, Trung Bui, Quan Hung Tran, Franck Dernoncourt and Walter Chang | |
18:20–19:40 10C: Language Generation | |
Session chair: Greg Durrett (UT Austin) | |
Multi-Style Transfer with Discriminative Feedback on Disjoint Corpus Navita Goyal, Balaji Vasan Srinivasan, Anandhavelu N and Abhilasha Sancheti | |
FUDGE: Controlled Text Generation With Future Discriminators Kevin Yang and Dan Klein | |
Controllable Text Simplification with Explicit Paraphrasing Mounica Maddela, Fernando Alva-Manchego and Wei Xu | |
Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training Oshin Agarwal, Heming Ge, Siamak Shakeri and Rami Al-Rfou | |
Choose Your Own Adventure: Paired Suggestions in Collaborative Writing for Evaluating Story Generation Models Elizabeth Clark and Noah A. Smith | |
[TACL] There Once Was a Really Bad Poet, It Was Automated but You Didn’t Know It Jianyou, jw542@duke.edu, Xiaoxuan, zhangxiaoxuanaa@gmail.com, Yuren Zhou, Christopher Suh, Cynthia Rudin | |
18:20–19:40 10D: Multilinguality | |
Session chair: Radu Florian (IBM Research AI) | |
InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training Zewen Chi, Li Dong, Furu Wei, Nan Yang, Saksham Singhal, Wenhui Wang, Xia Song, Xian-Ling Mao, Heyan Huang and Ming Zhou | |
Context-Interactive Pre-Training for Document Machine Translation Pengcheng Yang, Pei Zhang, Boxing Chen, Jun Xie and Weihua Luo | |
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots Samson Tan and Shafiq Joty | |
X-METRA-ADA: Cross-lingual Meta-Transfer learning Adaptation to Natural Language Understanding and Question Answering Meryem M’hamdi, Doo Soon Kim, Franck Dernoncourt, Trung Bui, Xiang Ren and Jonathan May | |
Explicit Alignment Objectives for Multilingual Bidirectional Encoders Junjie Hu, Melvin Johnson, Orhan Firat, Aditya Siddhant and Graham Neubig | |
Cross-lingual Cross-modal Pretraining for Multimodal Retrieval Hongliang Fei, Tan Yu and Ping Li | |
Wikipedia Entities as Rendezvous across Languages: Grounding Multilingual Language Models by Predicting Wikipedia Hyperlinks Iacer Calixto, Alessandro Raganato and Tommaso Pasini | |
18:20–19:40 10E: Question Answering | |
Session chair: Jing Huang (JD AI Research) | |
multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning Swarnadeep Saha, Prateek Yadav and Mohit Bansal | |
Adaptable and Interpretable Neural MemoryOver Symbolic Knowledge Pat Verga, Haitian Sun, Livio Baldini Soares and William Cohen | |
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images Shailaja Keyur Sampat, Akshay Kumar, Yezhou Yang and Chitta Baral | |
Refining Targeted Syntactic Evaluation of Language Models Benjamin Newman, Kai-Siang Ang, Julia Gong and John Hewitt | |
Universal Adversarial Attacks with Natural Triggers for Text Classification Liwei Song, Xinwei Yu, Hsuan-Tung Peng and Karthik Narasimhan | |
QuadrupletBERT: An Efficient Model For Embedding-Based Large-Scale Retrieval Peiyang Liu, Sen Wang, Xi Wang, Wei Ye and Shikun Zhang | |
19:40–21:00 Session 11 (click to expand/collapse) | |
19:40–21:00 11A: Ethics, Bias, and Fairness | |
Session chair: Swabha Swayamdipta (Allen Institute for AI) | |
Dynamically Disentangling Social Bias from Task-Oriented Representations with Adversarial Attack Liwen Wang, Yuanmeng Yan, Keqing He, Yanan Wu and Weiran Xu | |
An Empirical Investigation of Bias in the Multimodal Analysis of Financial Earnings Calls Ramit Sawhney, Arshiya Aggarwal and Rajiv Ratn Shah | |
Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing Boaz Shmueli, Jan Fell, Soumya Ray and Lun-Wei Ku | |
On Transferability of Bias Mitigation Effects in Language Model Fine-Tuning Xisen Jin, Francesco Barbieri, Brendan Kennedy, Aida Mostafazadeh Davani, Leonardo Neves and Xiang Ren | |
Case Study: Deontological Ethics in NLP Shrimai Prabhumoye, Brendon Boldt, Ruslan Salakhutdinov and Alan W Black | |
Privacy Regularization: Joint Privacy-Utility Optimization in LanguageModels Fatemehsadat Mireshghallah, Huseyin Inan, Marcello Hasegawa, Victor Rühle, Taylor Berg-Kirkpatrick and Robert Sim | |
On the Impact of Random Seeds on the Fairness of Clinical Classifiers Silvio Amir, Jan-Willem van de Meent and Byron Wallace | |
19:40–21:00 11B: Interpretability and Analysis of Models for NLP | |
Session chair: Tiancheng Zhao (Zhejiang University) | |
Topic Model or Topic Twaddle? Re-evaluating Semantic Interpretability Measures Caitlin Doogan and Wray Buntine | |
Discourse Probing of Pretrained Language Models Fajri Koto, Jey Han Lau and Timothy Baldwin | |
UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost Zhen Wu, Lijun Wu, Qi Meng, Yingce Xia, Shufang Xie, Tao Qin, Xinyu Dai and Tie-Yan Liu | |
tWT–WT: A Dataset to Assert the Role of Target Entities for Detecting Stance of Tweets Ayush Kaushal, Avirup Saha and Niloy Ganguly | |
Learning to Learn to be Right for the Right Reasons Pride Kavumba, Benjamin Heinzerling, Ana Brassard and Kentaro Inui | |
Double Perturbation: On the Robustness of Robustness and Counterfactual Bias Evaluation Chong Zhang, Jieyu Zhao, Huan Zhang, Kai-Wei Chang and Cho-Jui Hsieh | |
Explaining Neural Network Predictions on Sentence Pairs via Learning Word-Group Masks Hanjie Chen, Song Feng, Jatin Ganhotra, Hui Wan, Chulaka Gunasekara, Sachindra Joshi and Yangfeng Ji | |
19:40–21:00 11C: Machine Translation | |
Session chair: Orhan Firat (Google Research) | |
Almost Free Semantic Draft for Neural Machine Translation Xi Ai and Bin Fang | |
Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation Shuhao Gu, Yang Feng and Wanying Xie | |
Multi-Hop Transformer for Document-Level Machine Translation Long Zhang, Tong Zhang, Haibo Zhang, Baosong Yang, Wei Ye and Shikun Zhang | |
Continual Learning for Neural Machine Translation Yue Cao, Hao-Ran Wei, Boxing Chen and Xiaojun Wan | |
Self-Training for Unsupervised Neural Machine Translation in Unbalanced Training Data Scenarios Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita and Tiejun Zhao | |
Smart-Start Decoding for Neural Machine Translation Jian Yang, Shuming Ma, Dongdong Zhang, Juncheng Wan, Zhoujun Li and Ming Zhou | |
Multi-Task Learning with Shared Encoder for Non-Autoregressive Machine Translation Yongchang Hao, Shilin He, Wenxiang Jiao, Zhaopeng Tu, Michael Lyu and Xing Wang | |
19:40–21:00 11D: NLP Applications | |
Session chair: Minjoon Seo (KAIST) | |
ER-AE: Differentially Private Text Generation for Authorship Anonymization Haohan Bo, Steven H. H. Ding, Benjamin C. M. Fung and Farkhund Iqbal | |
Distantly Supervised Transformers For E-Commerce Product QA Happy Mittal, Aniket Chakrabarti, Belhassen Bayar, Animesh Anant Sharma and Nikhil Rasiwasia | |
Quantitative Day Trading from Natural Language using Reinforcement Learning Ramit Sawhney, Arnav Wadhwa, Shivam Agarwal and Rajiv Ratn Shah | |
Restoring and Mining the Records of the Joseon Dynasty via Neural Language Modeling and Machine Translation Kyeongpil Kang, Kyohoon Jin, Soyoung Yang, Soojin Jang, Jaegul Choo and Youngbin Kim | |
Modeling Diagnostic Label Correlation for Automatic ICD Coding Shang-Chi Tsai, Chao-Wei Huang and Yun-Nung Chen | |
Self-Supervised Contrastive Learning for Efficient User Satisfaction Prediction in Conversational Agents Mohammad Kachuee, Hao Yuan, Young-Bum Kim and Sungjin Lee | |
19:40–21:00 11E: Special Theme: New Challenges in NLP | |
Session chair: Xun Wang (Microsoft) | |
A recipe for annotating grounded clarifications Luciana Benotti and Patrick Blackburn | |
Grey-box Adversarial Attack And Defence For Sentiment Classification Ying Xu, Xu Zhong, Antonio Jimeno Yepes and Jey Han Lau | |
How low is too low? A monolingual take on lemmatisation in Indian languages Kumar Saunack, Kumar Saurav and Pushpak Bhattacharyya | |
Causal Effects of Linguistic Properties Reid Pryzant, Dallas Card, Dan Jurafsky, Victor Veitch and Dhanya Sridhar | |
Dynabench: Rethinking Benchmarking in NLP Douwe Kiela, Max Bartolo, Yixin Nie, Divyansh Kaushik, Atticus Geiger, Zhengxuan Wu, Bertie Vidgen, Grusha Prasad, Amanpreet Singh, Pratik Ringshia, Zhiyi Ma, Tristan Thrush, Sebastian Riedel, Zeerak Waseem, Pontus Stenetorp, Robin Jia, Mohit Bansal, Christopher Potts and Adina Williams | |
Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research Denis Newman-Griffis, Jill Fain Lehman, Carolyn Rosé and Harry Hochheiser | |
Wed 09 Jun 2021 (all times PDT, UTC-7) | |
09:00–10:20 Session 12 (click to expand/collapse) | |
09:00–10:20 12A: Discourse and Pragmatics | |
Session chair: Jessy Li (UT Austin) | |
Predicting Discourse Trees from Transformer-based Neural Summarizers Wen Xiao, Patrick Huber and Giuseppe Carenini | |
Probing for Bridging Inference in Transformer Language Models Onkar Pandit and Yufang Hou | |
Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models Anne Beyer, Sharid Loáiciga and David Schlangen | |
Stay Together: A System for Single and Split-antecedent Anaphora Resolution Juntao Yu, Nafise Sadat Moosavi, Silviu Paun and Massimo Poesio | |
[TACL] Decontextualization: Making Sentences Stand-Alone Eunsol Choi, Jennimaria Palomaki, Matthew Lamm, Tom Kwiatkowski, Dipanjan Das, Michael Collins | |
[CL] Universal Discourse Representation Structure Parsing Jiangming Liu, Shay B. Cohen, Mirella Lapata, Johan Bos | |
09:00–10:20 12B: Information Retrieval and Text Mining | |
Session chair: Thuy Vu (Amazon, Alexa AI) | |
Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness Florian Boudin and Ygor Gallina | |
CoRT: Complementary Rankings from Transformers Marco Wrzalik and Dirk Krechel | |
Multi-source Neural Topic Modeling in Multi-view Embedding Spaces Pankaj Gupta, Yatin Chaudhary and Hinrich Schütze | |
Inductive Topic Variational Graph Auto-Encoder for Text Classification Qianqian Xie, Jimin Huang, Pan Du, Min Peng and Jian-Yun Nie | |
Self-Alignment Pretraining for Biomedical Entity Representations Fangyu Liu, Ehsan Shareghi, Zaiqiao Meng, Marco Basaldella and Nigel Collier | |
TaxoClass: Hierarchical Multi-Label Text Classification Using Only Class Names Jiaming Shen, Wenda Qiu, Yu Meng, Jingbo Shang, Xiang Ren and Jiawei Han | |
09:00–10:20 12C: Language Generation | |
Session chair: Antoine Bosselut (Stanford University) | |
MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding Tuhin Chakrabarty, Xurui Zhang, Smaranda Muresan and Nanyun Peng | |
On Learning Text Style Transfer with Direct Rewards Yixin Liu, Graham Neubig and John Wieting | |
Focused Attention Improves Document-Grounded Generation Shrimai Prabhumoye, Kazuma Hashimoto, Yingbo Zhou, Alan W Black and Ruslan Salakhutdinov | |
NeuroLogic Decoding: (Un)supervised Neural Text Generation with Predicate Logic Constraints Ximing Lu, Peter West, Rowan Zellers, Ronan Le Bras, Chandra Bhagavatula and Yejin Choi | |
Ask what’s missing and what’s useful: Improving Clarification Question Generation using Global Knowledge Bodhisattwa Prasad Majumder, Sudha Rao, Michel Galley and Julian McAuley | |
Progressive Generation of Long Text with Pretrained Language Models Bowen Tan, Zichao Yang, Maruan Al-Shedivat, Eric Xing and Zhiting Hu | |
09:00–10:20 12D: Language Resources and Evaluation | |
Session chair: Seokhwan Kim (Amazon, Alexa AI) | |
SOCCER: An Information-Sparse Discourse State Tracking Collection in the Sports Commentary Domain Ruochen Zhang and Carsten Eickhoff | |
Plot-guided Adversarial Example Construction for Evaluating Open-domain Story Generation Sarik Ghazarian, Zixi Liu, Akash S M, Ralph Weischedel, Aram Galstyan and Nanyun Peng | |
MultiOpEd: A Corpus of Multi-Perspective News Editorials Siyi Liu, Sihao Chen, Xander Uyttendaele and Dan Roth | |
Swords: A Benchmark for Lexical Substitution with Improved Data Coverage and Quality Mina Lee, Chris Donahue, Robin Jia, Alexander Iyabor and Percy Liang | |
"I’m Not Mad": Commonsense Implications of Negation and Contradiction Liwei Jiang, Antoine Bosselut, Chandra Bhagavatula and Yejin Choi | |
Identifying Medical Self-Disclosure in Online Communities Mina Valizadeh, Pardis Ranjbar-Noiey, Cornelia Caragea and Natalie Parde | |
09:00–10:20 12E: Linguistic Theories, Cognitive Modeling and Psycholinguistics | |
Session chair: Costanza Navarretta (University of Copenhagen) | |
Language in a (Search) Box: Grounding Language Learning in Real-World Human-Machine Interaction Federico Bianchi, Ciro Greco and Jacopo Tagliabue | |
Finding Concept-specific Biases in Form–Meaning Associations Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell and Damián Blasi | |
How (Non-)Optimal is the Lexicon? Tiago Pimentel, Irene Nikkarinen, Kyle Mahowald, Ryan Cotterell and Damián Blasi | |
Word Complexity is in the Eye of the Beholder Sian Gooding, Ekaterina Kochmar, Seid Muhie Yimam and Chris Biemann | |
Linguistic Complexity Loss in Text-Based Therapy Jason Wei, Kelly Finn, Emma Templeton, Thalia Wheatley and Soroush Vosoughi | |
Ab Antiquo: Neural Proto-language Reconstruction Carlo Meloni, Shauli Ravfogel and Yoav Goldberg | |
On Biasing Transformer Attention Towards Monotonicity Annette Rios, Chantal Amrhein, Noëmi Aepli and Rico Sennrich | |
10:20–11:40 Session 13 (click to expand/collapse) | |
10:20–11:40 13A: NLP Applications | |
Session chair: Tristan Naumann (Microsoft Research) | |
Extracting a Knowledge Base of Mechanisms from COVID-19 Papers Tom Hope, Aida Amini, David Wadden, Madeleine van Zuylen, Sravanthi Parasa, Eric Horvitz, Daniel Weld, Roy Schwartz and Hannaneh Hajishirzi | |
Constrained Multi-Task Learning for Event Coreference Resolution Jing Lu and Vincent Ng | |
Empirical Evaluation of Pre-trained Transformers for Human-Level NLP: The Role of Sample Size and Dimensionality Adithya V Ganesan, Matthew Matero, Aravind Reddy Ravula, Huy Vu and H. Andrew Schwartz | |
Leveraging Deep Representations of Radiology Reports in Survival Analysis for Predicting Heart Failure Patient Mortality Hyun Gi Lee, Evan Sholle, Ashley Beecy, Subhi Al’Aref and Yifan Peng | |
On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles Rakesh Gosangi, Ravneet Arora, Mohsen Gheisarieha, Debanjan Mahata and Haimin Zhang | |
Data and Model Distillation as a Solution for Domain-transferable Fact Verification Mitch Paul Mithun, Sandeep Suntwal and Mihai Surdeanu | |
Adapting Coreference Resolution for Processing Violent Death Narratives Ankith Uppunda, Susan Cochran, Jacob Foster, Alina Arseniev-Koehler, Vickie Mays and Kai-Wei Chang | |
10:20–11:40 13B: Question Answering | |
Session chair: Marek Rei (Imperial College London) | |
Time-Stamped Language Model: Teaching Language Models to Understand The Flow of Events Hossein Rajaby Faghihi and Parisa Kordjamshidi | |
If You Want to Go Far Go Together: Unsupervised Joint Candidate Evidence Retrieval for Multi-hop Question Answering Vikas Yadav, Steven Bethard and Mihai Surdeanu | |
SPARTQA: A Textual Question Answering Benchmark for Spatial Reasoning Roshanak Mirzaee, Hossein Rajaby Faghihi, Qiang Ning and Parisa Kordjamshidi | |
A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers Pradeep Dasigi, Kyle Lo, Iz Beltagy, Arman Cohan, Noah A. Smith and Matt Gardner | |
Differentiable Open-Ended Commonsense Reasoning Bill Yuchen Lin, Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Xiang Ren and William Cohen | |
Does Structure Matter? Encoding Documents for Machine Reading Comprehension Hui Wan, Song Feng, Chulaka Gunasekara, Siva Sankalp Patel, Sachindra Joshi and Luis Lastras | |
Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval Chen Zhao, Chenyan Xiong, Jordan Boyd-Graber and Hal Daumé III | |
10:20–11:40 13C: Lexical Semantics | |
Session chair: Marzena Karpinska (University of Massachusetts Amherst) | |
Scalable and Interpretable Semantic Change Detection Syrielle Montariol, Matej Martinc and Lidia Pivovarova | |
Scalar Adjective Identification and Multilingual Ranking Aina Garí Soler and Marianna Apidianaki | |
ESC: Redesigning WSD with Extractive Sense Comprehension Edoardo Barba, Tommaso Pasini and Roberto Navigli | |
Recent advances in neural metaphor processing: A linguistic, cognitive and social perspective Xiaoyu Tong, Ekaterina Shutova and Martha Lewis | |
Constructing Taxonomies from Pretrained Language Models Catherine Chen, Kevin Lin and Dan Klein | |
Event Representation with Sequential, Semi-Supervised Discrete Variables Mehdi Rezaee and Francis Ferraro | |
10:20–11:40 13D: Sentiment Analysis and Stylistic Analysis | |
Session chair: Pushkar Mishra (Facebook AI) | |
Seq2Emo: A Sequence to Multi-Label Emotion Classification Model Chenyang Huang, Amine Trabelsi, Xuebin Qin, Nawshad Farruque, Lili Mou and Osmar Zaïane | |
Knowledge Enhanced Masked Language Model for Stance Detection Kornraphop Kawintiranon and Lisa Singh | |
Learning Paralinguistic Features from Audiobooks through Style Voice Conversion Zakaria Aldeneh, Matthew Perez and Emily Mower Provost | |
Adapting BERT for Continual Learning of a Sequence of Aspect Sentiment Classification Tasks Zixuan Ke, Hu Xu and Bing Liu | |
Adversarial Learning for Zero-Shot Stance Detection on Social Media Emily Allaway, Malavika Srikanth and Kathleen McKeown | |
10:20–11:40 13E: Summarization | |
Session chair: Iz Beltagy (Allen Institute for AI) | |
Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters Ramakanth Pasunuru, Mengwen Liu, Mohit Bansal, Sujith Ravi and Markus Dreyer | |
Enriching Transformers with Structured Tensor-Product Representations for Abstractive Summarization Yichen Jiang, Asli Celikyilmaz, Paul Smolensky, Paul Soulos, Sudha Rao, Hamid Palangi, Roland Fernandez, Caitlin Smith, Mohit Bansal and Jianfeng Gao | |
What’s in a Summary? Laying the Groundwork for Advances in Hospital-Course Summarization Griffin Adams, Emily Alsentzer, Mert Ketenci, Jason Zucker and Noémie Elhadad | |
Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics Artidoro Pagnoni, Vidhisha Balachandran and Yulia Tsvetkov | |
GSum: A General Framework for Guided Neural Abstractive Summarization Zi-Yi Dou, Pengfei Liu, Hiroaki Hayashi, Zhengbao Jiang and Graham Neubig | |
[TACL] WikiAsp: A Dataset for Multi-domain Aspect-based Summarization Hiroaki Hayashi, Prashant Budania, Peng Wang, Chris Ackerson, Raj Neervannan, Graham Neubig | |
11:40–13:10 Best Paper Presentations | |
Session chair: Anna Rumshisky (University of Massachusetts Lowell) | |
Video-aided Unsupervised Grammar Induction Songyang Zhang, Linfeng Song, Lifeng Jin, Kun Xu, Dong Yu and Jiebo Luo | |
It’s Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners Timo Schick and Hinrich Schütze | |
Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources Simone Conia, Andrea Bacciu and Roberto Navigli | |
Learning How to Ask: Querying LMs with Mixtures of Soft Prompts Guanghui Qin and Jason Eisner | |
How many data points is a prompt worth? Teven Le Scao and Alexander Rush | |
Preregistering NLP research Emiel van Miltenburg, Chris van der Lee and Emiel Krahmer | |
17:00–18:20 Session 14 (click to expand/collapse) | |
17:00–18:20 14A: Computational Social Science and Cultural Analytics | |
Session chair: Diyi Yang (Georgia Tech) | |
Multitask Learning for Emotionally Analyzing Sexual Abuse Disclosures Ramit Sawhney, Puneet Mathur, Taru Jain, Akash Kumar Gautam and Rajiv Ratn Shah | |
Self Promotion in US Congressional Tweets Jun Wang, Kelly Cui and Bei Yu | |
Profiling of Intertextuality in Latin Literature Using Word Embeddings Patrick J. Burns, James Brofos, Kyle Li, Pramit Chaudhuri and Joseph P. Dexter | |
Identifying inherent disagreement in natural language inference Xinliang Frederick Zhang and Marie-Catherine de Marneffe | |
Modeling Human Mental States with an Entity-based Narrative Graph I-Ta Lee, Maria Leonor Pacheco and Dan Goldwasser | |
17:00–18:20 14B: Generation and Summarization | |
Session chair: Lili Mou (UAlberta; Amii) | |
A Simple and Efficient Multi-Task Learning Approach for Conditioned Dialogue Generation Yan Zeng and Jian-Yun Nie | |
Hurdles to Progress in Long-form Question Answering Kalpesh Krishna, Aurko Roy and Mohit Iyyer | |
ENTRUST: Argument Reframing with Language Models and Entailment Tuhin Chakrabarty, Christopher Hidey and Smaranda Muresan | |
Paragraph-level Simplification of Medical Texts Ashwin Devaraj, Iain Marshall, Byron Wallace and Junyi Jessy Li | |
An Empirical Study on Neural Keyphrase Generation Rui Meng, Xingdi Yuan, Tong Wang, Sanqiang Zhao, Adam Trischler and Daqing He | |
Attention Head Masking for Inference Time Content Selection in Abstractive Summarization Shuyang Cao and Lu Wang | |
17:00–18:20 14C: Interpretability and Analysis of Models for NLP | |
Session chair: Allyson Ettinger (University of Chicago) | |
Factual Probing Is [MASK]: Learning vs. Learning to Recall Zexuan Zhong, Dan Friedman and Danqi Chen | |
Evaluating Saliency Methods for Neural Language Models Shuoyang Ding and Philipp Koehn | |
Contextualized Perturbation for Textual Adversarial Attack Dianqi Li, Yizhe Zhang, Hao Peng, Liqun Chen, Chris Brockett, Ming-Ting Sun and Bill Dolan | |
DirectProbe: Studying Representations without Classifiers Yichu Zhou and Vivek Srikumar | |
Evaluating the Values of Sources in Transfer Learning Md Rizwan Parvez and Kai-Wei Chang | |
Too Much in Common: Shifting of Embeddings in Transformer Language Models and its Implications Daniel Biś, Maksim Podkorytov and Xiuwen Liu | |
17:00–18:20 14D: Machine Learning for NLP: Language Modeling and Sequence to Sequence Models | |
Session chair: Taylor Berg-Kirkpatrick (UC San Diego) | |
On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies Tianyi Zhang and Tatsunori Hashimoto | |
Limitations of Autoregressive Models and Their Alternatives Chu-Cheng Lin, Aaron Jaech, Xin Li, Matthew R. Gormley and Jason Eisner | |
On the Transformer Growth for Progressive BERT Training Xiaotao Gu, Liyuan Liu, Hongkun Yu, Jing Li, Chen Chen and Jiawei Han | |
Revisiting Simple Neural Probabilistic Language Models Simeng Sun and Mohit Iyyer | |
ReadTwice: Reading Very Large Documents with Memories Yury Zemlyanskiy, Joshua Ainslie, Michiel de Jong, Philip Pham, Ilya Eckstein and Fei Sha | |
SCRIPT: Self-Critic PreTraining of Transformers Erik Nijkamp, Bo Pang, Ying Nian Wu and Caiming Xiong | |
17:00–18:20 14E: NLP Applications | |
Session chair: Kevin Small (Amazon) | |
Nutri-bullets Hybrid: Consensual Multi-document Summarization Darsh Shah, Lili Yu, Tao Lei and Regina Barzilay | |
AVA: an Automatic eValuation Approach for Question Answering Systems Thuy Vu and Alessandro Moschitti | |
SpanPredict: Extraction of Predictive Document Spans with Neural Attention Vivek Subramanian, Matthew Engelhard, Sam Berchuck, Liqun Chen, Ricardo Henao and Lawrence Carin | |
Text Editing by Command Felix Faltings, Michel Galley, Gerold Hintz, Chris Brockett, Chris Quirk, Jianfeng Gao and Bill Dolan | |
A Deep Metric Learning Approach to Account Linking Aleem Khan, Elizabeth Fleming, Noah Schofield, Marcus Bishop and Nicholas Andrews | |
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation Yasuhide Miura, Yuhao Zhang, Emily Tsai, Curtis Langlotz and Dan Jurafsky | |
18:20–19:40 Session 15 (click to expand/collapse) | |
18:20–19:40 15A: Language Grounding to Vision, Robotics and Beyond | |
Session chair: Aishwarya Padmakumar (Amazon) | |
Multimodal End-to-End Sparse Model for Emotion Recognition Wenliang Dai, Samuel Cahyawijaya, Zihan Liu and Pascale Fung | |
MIMOQA: Multimodal Input Multimodal Output Question Answering Hrituraj Singh, Anshul Nasery, Denil Mehta, Aishwarya Agarwal, Jatin Lamba and Balaji Vasan Srinivasan | |
OCID-Ref: A 3D Robotic Dataset With Embodied Language For Clutter Scene Grounding Ke-Jyun Wang, Yun-Hsuan Liu, Hung-Ting Su, Jen-Wei Wang, Yu-Siang Wang, Winston Hsu and Wen-Chin Chen | |
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions Liunian Harold Li, Haoxuan You, Zhecan Wang, Alireza Zareian, Shih-Fu Chang and Kai-Wei Chang | |
Multitasking Inhibits Semantic Drift Athul Paul Jacob, Mike Lewis and Jacob Andreas | |
Probing Contextual Language Models for Common Ground with Visual Representations Gabriel Ilharco, Rowan Zellers, Ali Farhadi and Hannaneh Hajishirzi | |
18:20–19:40 15B: Machine Learning for NLP: Classification and Structured Prediction Models | |
Session chair: Arman Cohan (Allen Institute for AI) | |
BBAEG: Towards BERT-based Biomedical Adversarial Example Generation for Text Classification Ishani Mondal | |
Targeted Adversarial Training for Natural Language Understanding Lis Pereira, Xiaodong Liu, Hao Cheng, Hoifung Poon, Jianfeng Gao and Ichiro Kobayashi | |
Latent-Optimized Adversarial Neural Transfer for Sarcasm Detection Xu Guo, Boyang Li, Han Yu and Chunyan Miao | |
Self-training Improves Pre-training for Natural Language Understanding Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Veselin Stoyanov and Alexis Conneau | |
Supporting Clustering with Contrastive Learning Dejiao Zhang, Feng Nan, Xiaokai Wei, Shang-Wen Li, Henghui Zhu, Kathleen McKeown, Ramesh Nallapati, Andrew O. Arnold and Bing Xiang | |
[TACL] Self-supervised Regularization for Text Classification Meng Zhou, Zechen Li, Pengtao Xie | |
18:20–19:40 15C: NLP Applications | |
Session chair: Wei Xu (Georgia Tech) | |
TITA: A Two-stage Interaction and Topic-Aware Text Matching Model Xingwu Sun, Yanling Cui, Hongyin Tang, Qiuyu Zhu, Fuzheng Zhang and Beihong Jin | |
Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction Zhenghao Liu, Xiaoyuan Yi, Maosong Sun, Liner Yang and Tat-Seng Chua | |
Neural Network Surgery: Injecting Data Patterns into Pre-trained Models with Minimal Instance-wise Side Effects Zhiyuan Zhang, Xuancheng Ren, Qi Su, Xu Sun and Bin He | |
Discrete Argument Representation Learning for Interactive Argument Pair Identification Lu Ji, Zhongyu Wei, Jing Li, Qi Zhang and Xuanjing Huang | |
On Unifying Misinformation Detection Nayeon Lee, Belinda Z. Li, Sinong Wang, Pascale Fung, Hao Ma, Wen-tau Yih and Madian Khabsa | |
Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model Honai Ueoka, Yugo Murawaki and Sadao Kurohashi | |
Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning Jason Wei, Chengyu Huang, Soroush Vosoughi, Yu Cheng and Shiqi Xu | |
18:20–19:40 15D: Phonology, Morphology and Word Segmentation | |
Session chair: Ekaterina Vylomova (The University of Melbourne) | |
Do RNN States Encode Abstract Phonological Alternations? Miikka Silfverberg, Francis Tyers, Garrett Nicolai and Mans Hulden | |
Pre-training with Meta Learning for Chinese Word Segmentation Zhen Ke, Liang Shi, Songtao Sun, Erli Meng, Bin Wang and Xipeng Qiu | |
Decompose, Fuse and Generate: A Formation-Informed Method for Chinese Definition Generation Hua Zheng, Damai Dai, Lei Li, Tianyu Liu, Zhifang Sui, Baobao Chang and Yang Liu | |
User-Generated Text Corpus for Evaluating Japanese Morphological Analysis and Lexical Normalization Shohei Higashiyama, Masao Utiyama, Taro Watanabe and Eiichiro Sumita | |
GPT Perdetry Test: Generating new meanings for new words Nikolay Malkin, Sameera Lanka, Pranav Goel, Sudha Rao and Nebojsa Jojic | |
18:20–19:40 15E: Sentence-level Semantics and Textual Inference | |
Session chair: John Wieting (Google Research) | |
Universal Semantic Tagging for English and Mandarin Chinese Wenxi Li, Yiyang Hou, Yajie Ye, Li Liang and Weiwei Sun | |
ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu and Kai Yu | |
Contextualized and Generalized Sentence Representations by Contrastive Self-Supervised Learning: A Case Study on Discourse Relation Analysis Hirokazu Kiyomaru and Sadao Kurohashi | |
AMR Parsing with Action-Pointer Transformer Jiawei Zhou, Tahira Naseem, Ramón Fernandez Astudillo and Radu Florian | |
NL-EDIT: Correcting Semantic Parse Errors through Natural Language Interaction Ahmed Elgohary, Christopher Meek, Matthew Richardson, Adam Fourney, Gonzalo Ramos and Ahmed Hassan Awadallah | |
Unsupervised Concept Representation Learning for Length-Varying Text Similarity Xuchao Zhang, Bo Zong, Wei Cheng, Jingchao Ni, Yanchi Liu and Haifeng Chen | |
19:40–21:00 Session 16 (click to expand/collapse) | |
19:40–21:00 16A: Dialogue and Interactive Systems | |
Session chair: Alexandros Papangelis (Amazon) | |
Augmenting Knowledge-grounded Conversations with Sequential Knowledge Transition Haolan Zhan, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Yongjun Bao and Yanyan Lan | |
Adversarial Self-Supervised Learning for Out-of-Domain Detection Zhiyuan Zeng, Keqing He, Yuanmeng Yan, Hong Xu and Weiran Xu | |
Leveraging Slot Descriptions for Zero-Shot Cross-Domain Dialogue StateTracking Zhaojiang Lin, Bing Liu, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Andrea Madotto, Eunjoon Cho and Rajen Subba | |
Hierarchical Transformer for Task Oriented Dialog Systems Bishal Santra, Potnuru Anusha and Pawan Goyal | |
Measuring the ‘I don’t know’ Problem through the Lens of Gricean Quantity Huda Khayrallah and João Sedoc | |
[TACL] Dialogue State Tracking with Incremental Reasoning Lizi Liao, Le Hong Long, Yunshan Ma, Wenqiang Lei, Tat-Seng Chua | |
19:40–21:00 16B: Information Extraction | |
Session chair: Siliang Tang (Zhejiang University) | |
RTFE: A Recursive Temporal Fact Embedding Framework for Temporal Knowledge Graph Completion Youri Xu, Haihong E, Meina Song, wenyu song, Xiaodong Lv, wang haotian and yang jinrui | |
Open Hierarchical Relation Extraction Kai Zhang, Yuan Yao, Ruobing Xie, Xu Han, Zhiyuan Liu, Fen Lin, Leyu Lin and Maosong Sun | |
Jointly Extracting Explicit and Implicit Relational Triples with Reasoning Pattern Enhanced Binary Pointer Network Yubo Chen, Yunqi Zhang, Changran Hu and Yongfeng Huang | |
Multi-Grained Knowledge Distillation for Named Entity Recognition Xuan Zhou, Xiao Zhang, Chenyang Tao, Junya Chen, Bing Xu, Wei Wang and Jing Xiao | |
SGG: Learning to Select, Guide, and Generate for Keyphrase Generation Jing Zhao, Junwei Bao, Yifan Wang, Youzheng Wu, Xiaodong He and Bowen Zhou | |
Towards Sentiment and Emotion aided Multi-modal Speech Act Classification in Twitter Tulika Saha, Apoorva Upadhyaya, Sriparna Saha and Pushpak Bhattacharyya | |
19:40–21:00 16C: Machine Translation | |
Session chair: Rui Wang (Shanghai Jiao Tong University) | |
Generative Imagination Elevates Machine Translation Quanyu Long, Mingxuan Wang and Lei Li | |
Non-Autoregressive Translation by Learning Target Categorical Codes Yu Bao, Shujian Huang, Tong Xiao, Dongqi Wang, Xinyu Dai and Jiajun CHEN | |
Training Data Augmentation for Code-Mixed Translation Abhirut Gupta, Aditya Vavre and Sunita Sarawagi | |
Rethinking Perturbations in Encoder-Decoders for Fast Training Sho Takase and Shun Kiyono | |
Context-aware Decoder for Neural Machine Translation using a Target-side Document-Level Language Model Amane Sugiyama and Naoki Yoshinaga | |
Machine Translated Text Detection Through Text Similarity with Round-Trip Translation Hoang-Quoc Nguyen-Son, Tran Thao, Seira Hidano, Ishita Gupta and Shinsaku Kiyomoto | |
19:40–21:00 16D: Question Answering | |
Session chair: Hung-yi Lee (National Taiwan University) | |
TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference Deming Ye, Yankai Lin, Yufei Huang and Maosong Sun | |
Breadth First Reasoning Graph for Multi-hop Question Answering Yongjie Huang and Meng Yang | |
Improving Zero-Shot Cross-lingual Transfer for Multilingual Question Answering over Knowledge Graph Yucheng Zhou, Xiubo Geng, Tao Shen, Wenqiang Zhang and Daxin Jiang | |
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering Yingqi Qu, Yuchen Ding, Jing Liu, Kai Liu, Ruiyang Ren, Wayne Xin Zhao, Daxiang Dong, Hua Wu and Haifeng Wang | |
DAGN: Discourse-Aware Graph Network for Logical Reasoning Yinya Huang, Meng Fang, Yu Cao, Liwei Wang and Xiaodan Liang | |
Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering Sohee Yang and Minjoon Seo | |
Unsupervised Multi-hop Question Answering by Question Generation Liangming Pan, Wenhu Chen, Wenhan Xiong, Min-Yen Kan and William Yang Wang | |
19:40–21:00 16E: Summarization | |
Session chair: Yang Liu (Microsoft) | |
Sliding Selector Network with Dynamic Memory for Extractive Summarization of Long Documents Peng Cui and Le Hu | |
AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization Tiezheng Yu, Zihan Liu and Pascale Fung | |
QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization Ming Zhong, Da Yin, Tao Yu, Ahmad Zaidi, Mutethia Mutuma, Rahul Jha, Ahmed Hassan Awadallah, Asli Celikyilmaz, Yang Liu, Xipeng Qiu and Dragomir Radev | |
MM-AVS: A Full-Scale Dataset for Multi-modal Summarization Xiyan Fu, Jun Wang and Zhenglu Yang | |
MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization Chenguang Zhu, Yang Liu, Jie Mei and Michael Zeng | |
Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection Sihao Chen, Fan Zhang, Kazoo Sone and Dan Roth | |
Inference Time Style Control for Summarization Shuyang Cao and Lu Wang |