Conference Program Home | NAACL 2021 Proceedings | NAACL 2021 WEBSITE | ACL WEBSITE


The 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

PROGRAM

Mon 07 Jun 2021 (all times PDT, UTC-7)

 08:00–09:00 Keynote
  Session chair: Anna Rumshisky (University of Massachusetts Lowell)
 Humans Learn From Task Descriptions and So Should Our Models
Hinrich Schuetze
 09:00–10:20 Session 1 (click to expand/collapse)
 09:00–10:20 1A: Information Extraction
  Session chair: Ni Lao (Apple)
 Knowledge Router: Learning Disentangled Representations for Knowledge Graphs
Shuai Zhang, Xi Rao, Yi Tay and Ce Zhang
 Distantly Supervised Relation Extraction with Sentence Reconstruction and Knowledge Base Priors
Fenia Christopoulou, Makoto Miwa and Sophia Ananiadou
 Cross-Task Instance Representation Interactions and Label Dependencies for Joint Information Extraction with Graph Convolutional Networks
Minh Van Nguyen, Viet Lai and Thien Huu Nguyen
 Abstract Meaning Representation Guided Graph Encoding and Decoding for Joint Information Extraction
Zixuan Zhang and Heng Ji
 A Frustratingly Easy Approach for Entity and Relation Extraction
Zexuan Zhong and Danqi Chen
 Event Time Extraction and Propagation via Graph Attention Networks
Haoyang Wen, Yanru Qu, Heng Ji, Qiang Ning, Jiawei Han, Avi Sil, Hanghang Tong and Dan Roth
 09:00–10:20 1B: Interpretability and Analysis of Models for NLP
  Session chair: Svitlana Volkova (Pacific Northwest National Laboratory)
 Probing Word Translations in the Transformer and Trading Decoder for Encoder Layers
Hongfei Xu, Josef van Genabith, Qiuhui Liu and Deyi Xiong
 Mediators in Determining what Processing BERT Performs First
Aviv Slobodkin, Leshem Choshen and Omri Abend
 Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA
Yonatan Bitton, Gabriel Stanovsky, Roy Schwartz and Michael Elhadad
 Multilingual Language Models Predict Human Reading Behavior
Nora Hollenstein, Federico Pirovano, Ce Zhang, Lena Jäger and Lisa Beinborn
 Do Syntactic Probes Probe Syntax? Experiments with Jabberwocky Probing
Rowan Hall Maudslay and Ryan Cotterell
 A Non-Linear Structural Probe
Jennifer C. White, Tiago Pimentel, Naomi Saphra and Ryan Cotterell
 Concealed Data Poisoning Attacks on NLP Models
Eric Wallace, Tony Zhao, Shi Feng and Sameer Singh
 09:00–10:20 1C: Machine Translation
  Session chair: Valia Kordoni (Humboldt-Universitaet zu Berlin, Germany)
 Backtranslation Feedback Improves User Confidence in MT, Not Quality
Vilém Zouhar, Michal Novák, Matúš Žilinec, Ondřej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia and Lisa Yankovskaya
 Data Filtering using Cross-Lingual Word Embeddings
Christian Herold, Jan Rosendahl, Joris Vanvinckenroye and Hermann Ney
 Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Translation
Alexandra Chronopoulou, Dario Stojanovski and Alexander Fraser
 Neural Machine Translation without Embeddings
Uri Shaham and Omer Levy
 Counterfactual Data Augmentation for Neural Machine Translation
Qi Liu, Matt Kusner and Phil Blunsom
 Cultural and Geographical Influences on Image Translatability of Words across Languages
Nikzad Khani, Isidora Tourni, Mohammad Sadegh Rasooli, Chris Callison-Burch and Derry Tanti Wijaya
 Multilingual BERT Post-Pretraining Alignment
Lin Pan, Chung-Wei Hang, Haode Qi, Abhishek Shah, Saloni Potdar and Mo Yu
 09:00–10:20 1D: NLP Applications
  Session chair: Yuval Pinter (Georgia Tech)
 A Million Tweets Are Worth a Few Points: Tuning Transformers for Customer Service Tasks
Amir Hadifar, Sofie Labat, Veronique Hoste, Chris Develder and Thomas Demeester
 Paragraph-level Rationale Extraction through Regularization: A case study on European Court of Human Rights Cases
Ilias Chalkidis, Manos Fergadiotis, Dimitrios Tsarapatsanis, Nikolaos Aletras, Ion Androutsopoulos and Prodromos Malakasiotis
 Answering Product-Questions by Utilizing Questions from Other Contextually Similar Products
Ohad Rozen, David Carmel, Avihai Mejer, Vitaly Mirkis and Yftah Ziser
 EnSidNet: Enhanced Hybrid Siamese-Deep Network for grouping clinical trials into drug-development pathways
Lucia Pagani
 DATE: Detecting Anomalies in Text via Self-Supervision of Transformers
Andrei Manolache, Florin Brad and Elena Burceanu
 A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code
Nadezhda Chirkova and Sergey Troshin
 Fast and Scalable Dialogue State Tracking with Explicit Modular Decomposition
Dingmin Wang, Chenghua Lin, Qi Liu and Kam-Fai Wong
 09:00–10:20 1E: Sentence-level Semantics and Textual Inference
  Session chair: Roy Bar-Haim (IBM Research AI)
 Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks
Nandan Thakur, Nils Reimers, Johannes Daxenberger and Iryna Gurevych
 SmBoP: Semi-autoregressive Bottom-up Semantic Parsing
Ohad Rubin and Jonathan Berant
 SGL: Speaking the Graph Languages of Semantic Parsing via Multilingual Translation
Luigi Procopio, Rocco Tripodi and Roberto Navigli
 Fool Me Twice: Entailment from Wikipedia Gamification
Julian Eisenschlos, Bhuwan Dhingra, Jannis Bulian, Benjamin Börschinger and Jordan Boyd-Graber
 Meta-Learning for Domain Generalization in Semantic Parsing
Bailin Wang, Mirella Lapata and Ivan Titov
 10:20–11:40 Session 2 (click to expand/collapse)
 10:20–11:40 2A: Language Generation
  Session chair: Peng Qi (JD AI Research)
 Aspect-Controlled Neural Argument Generation
Benjamin Schiller, Johannes Daxenberger and Iryna Gurevych
 Text Generation from Discourse Representation Structures
Jiangming Liu, Shay B. Cohen and Mirella Lapata
 APo-VAE: Text Generation in Hyperbolic Space
Shuyang Dai, Zhe Gan, Yu Cheng, Chenyang Tao, Lawrence Carin and Jingjing Liu
 DART: Open-Domain Structured Data Record to Text Generation
Linyong Nan, Dragomir Radev, Rui Zhang, Amrit Rau, Abhinand Sivaprasad, Chiachun Hsieh, Xiangru Tang, Aadit Vyas, Neha Verma, Pranav Krishna, Yangxiaokang Liu, Nadia Irwanto, Jessica Pan, Faiaz Rahman, Ahmad Zaidi, Mutethia Mutuma, Yasin Tarabar, Ankit Gupta, Tao Yu, Yi Chern Tan, Xi Victoria Lin, Caiming Xiong, Richard Socher and Nazneen Fatema Rajani
[TACL] An Error Analysis Framework for Shallow Surface Realisation
Shimorina, Anastasia, Parmentier, Yannick, Gardent, Claire
 TuringAdvice: A Generative and Dynamic Evaluation of Language Use
Rowan Zellers, Ari Holtzman, Elizabeth Clark, Lianhui Qin, Ali Farhadi and Yejin Choi
 10:20–11:40 2B: Multilinguality
  Session chair: Jonathan Clark (Google Research)
 When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
Benjamin Muller, Antonios Anastasopoulos, Benoît Sagot and Djamé Seddah
 Multi-Adversarial Learning for Cross-Lingual Word Embeddings
Haozhou Wang, James Henderson and Paola Merlo
 Multi-view Subword Regularization
Xinyi Wang, Sebastian Ruder and Graham Neubig
 mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer
Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua and Colin Raffel
 MetaXL: Meta Representation Transformation for Low-resource Cross-lingual Learning
Mengzhou Xia, Guoqing Zheng, Subhabrata Mukherjee, Milad Shokouhi, Graham Neubig and Ahmed Hassan Awadallah
[TACL] Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages
Edoardo M. Ponti, Ivan Vulić, Ryan Cotterell, Marinela Parovic, Roi Reichart, Anna Korhonen
 10:20–11:40 2C: Question Answering
  Session chair: Sara Rosenthal (IBM Research)
 Open Domain Question Answering over Tables via Dense Retrieval
Jonathan Herzig, Thomas Müller, Syrine Krichene and Julian Eisenschlos
 Open-Domain Question Answering Goes Conversational via Question Rewriting
Raviteja Anantha, Svitlana Vakulenko, Zhucheng Tu, Shayne Longpre, Stephen Pulman and Srinivas Chappidi
 QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering
Michihiro Yasunaga, Hongyu Ren, Antoine Bosselut, Percy Liang and Jure Leskovec
 XOR QA: Cross-lingual Open-Retrieval Question Answering
Akari Asai, Jungo Kasai, Jonathan Clark, Kenton Lee, Eunsol Choi and Hannaneh Hajishirzi
 SPARTA: Efficient Open-Domain Question Answering via Sparse Transformer Matching Retrieval
Tiancheng Zhao, Xiaopeng Lu and Kyusong Lee
[TACL] Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
Mor Geva, Daniel Khashabi, Elad Segal, Tushar Khot, Dan Roth, Jonathan Berant
 10:20–11:40 2D: Special Theme: New Challenges in NLP
  Session chair: Ahmed Awadallah (Microsoft Research )
 Implicitly Abusive Language – What does it actually look like and why are we not getting there?
Michael Wiegand, Josef Ruppenhofer and Elisabeth Eder
 The Importance of Modeling Social Factors of Language: Theory and Practice
Dirk Hovy and Diyi Yang
 On learning and representing social meaning in NLP: a sociolinguistic perspective
Dong Nguyen, Laura Rosseel and Jack Grieve
 Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence
Tal Schuster, Adam Fisch and Regina Barzilay
 Representing Numbers in NLP: a Survey and a Vision
Avijit Thawani, Jay Pujara, Filip Ilievski and Pedro Szekely
 What Will it Take to Fix Benchmarking in Natural Language Understanding?
Samuel R. Bowman and George Dahl
 10:20–11:40 2E: Summarization
  Session chair: Fei Liu (University of Central Florida)
 Extending Multi-Document Summarization Evaluation to the Interactive Setting
Ori Shapira, Ramakanth Pasunuru, Hadar Ronen, Mohit Bansal, Yael Amsterdamer and Ido Dagan
 Identifying Helpful Sentences in Product Reviews
Iftah Gamzu, Hila Gonen, Gilad Kutiel, Ran Levy and Eugene Agichtein
 Noisy Self-Knowledge Distillation for Text Summarization
Yang Liu, Sheng Shen and Mirella Lapata
 Improving Zero and Few-Shot Abstractive Summarization with Intermediate Fine-tuning and Data Augmentation
Alexander Fabbri, Simeng Han, Haoyuan Li, Haoran Li, Marjan Ghazvininejad, Shafiq Joty, Dragomir Radev and Yashar Mehdad
 Enhancing Factual Consistency of Abstractive Summarization
Chenguang Zhu, William Hinthorn, Ruochen Xu, Qingkai Zeng, Michael Zeng, Xuedong Huang and Meng Jiang
[TACL] Extractive Opinion Summarization in Quantized Transformer Spaces
Stefanos Angelidis, Reinald Kim Amplayo, Yoshihiko Suhara, Xiaolan Wang, Mirella Lapata
 11:40–13:00 Session 3 (click to expand/collapse)
 11:40–13:00 3A: Dialogue and Interactive Systems
  Session chair: Nigel Ward (University of Texas at El Paso)
 Few-shot Intent Classification and Slot Filling with Retrieved Examples
Dian Yu, Luheng He, Yuan Zhang, Xinya Du, Panupong Pasupat and Qi Li
 "Nice Try, Kiddo": Investigating Ad Hominems in Dialogue Responses
Emily Sheng, Kai-Wei Chang, Prem Natarajan and Nanyun Peng
 Human-like informative conversations: Better acknowledgements using conditional mutual information
Ashwin Paranjape and Christopher Manning
 A Comparative Study on Schema-Guided Dialogue State Tracking
Jie Cao and Yi Zhang
 Spoken Language Understanding for Task-oriented Dialogue Systems with Augmented Memory Networks
Jie Wu, Ian Harris and Hongzhi Zhao
 How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds
Prithviraj Ammanabrolu, Jack Urbanek, Margaret Li, Arthur Szlam, Tim Rocktäschel and Jason Weston
 11:40–13:00 3B: Information Extraction
  Session chair: Yuhao Zhang (Amazon, AWS AI)
 Linking Entities to Unseen Knowledge Bases with Arbitrary Schemas
Yogarshi Vyas and Miguel Ballesteros
 Self-Training with Weak Supervision
Giannis Karamanolakis, Subhabrata Mukherjee, Guoqing Zheng and Ahmed Hassan Awadallah
 Neural Language Modeling for Contextualized Temporal Graph Generation
Aman Madaan and Yiming Yang
 Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning
Xuelu Chen, Michael Boratko, Muhao Chen, Shib Sankar Dasgupta, Xiang Lorraine Li and Andrew McCallum
 Document-Level Event Argument Extraction by Conditional Generation
Sha Li, Heng Ji and Jiawei Han
 Template Filling with Generative Transformers
Xinya Du, Alexander Rush and Claire Cardie
 11:40–13:00 3C: Interpretability and Analysis of Models for NLP
  Session chair: Siva Reddy (McGill/Mila)
 Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU models
Mengnan Du, Varun Manjunatha, Rajiv Jain, Ruchi Deshpande, Franck Dernoncourt, Jiuxiang Gu, Tong Sun and Xia Hu
 On Attention Redundancy: A Comprehensive Study
Yuchen Bian, Jiaji Huang, Xingyu Cai, Jiahong Yuan and Kenneth Church
 Does BERT Pretrained on Clinical Notes Reveal Sensitive Data?
Eric Lehman, Sarthak Jain, Karl Pichotta, Yoav Goldberg and Byron Wallace
 Low-Complexity Probing via Finding Subnetworks
Victor Sanh and Alexander Rush
 An Empirical Comparison of Instance Attribution Methods for NLP
Pouya Pezeshkpour, Sarthak Jain, Byron Wallace and Sameer Singh
 Generalization in Instruction Following Systems
Soham Dan, Michael Zhou and Dan Roth
[CL] Interpretability Analysis for Named Entity Recognition to Understand System Predictions and How They Can Improve
Oshin Agarwal, Yinfei Yang, Byron C. Wallace, Ani Nenkova
 11:40–13:00 3D: Language Grounding to Vision, Robotics and Beyond
  Session chair: Karthik Narasimhan (Princeton University)
 LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
Siqi Sun, Yen-Chun Chen, Linjie Li, Shuohang Wang, Yuwei Fang and Jingjing Liu
 Measuring Social Biases in Grounded Vision and Language Embeddings
Candace Ross, Boris Katz and Andrei Barbu
 MTAG: Modal-Temporal Attention Graph for Unaligned Human Multimodal Language Sequences
Jianing Yang, Yongxin Wang, Ruitao Yi, Yuying Zhu, Azaan Rehman, Amir Zadeh, Soujanya Poria and Louis-Philippe Morency
 Grounding Open-Domain Instructions to Automate Web Support Tasks
Nancy Xu, Sam Masling, Michael Du, Giovanni Campagna, Larry Heck, James Landay and Monica Lam
 Modular Networks for Compositional Instruction Following
Rodolfo Corona, Daniel Fried, Coline Devin, Dan Klein and trevor darrell
 Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information
Jialu Li, Hao Tan and Mohit Bansal
 11:40–13:00 3E: Machine Learning for NLP: Classification and Structured Prediction Models
  Session chair: Qiang Ning (Amazon)
 Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning
Hui Liu, Danqing Zhang, Bing Yin and Xiaodan Zhu
 Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
Yue Yu, Simiao Zuo, Haoming Jiang, Wendi Ren, Tuo Zhao and Chao Zhang
 Posterior Differential Regularization with f-divergence for Improving Model Robustness
Hao Cheng, Xiaodong Liu, Lis Pereira, Yaoliang Yu and Jianfeng Gao
 Understanding Hard Negatives in Noise Contrastive Estimation
Wenzheng Zhang and Karl Stratos
 Certified Robustness to Word Substitution Attack with Differential Privacy
Wenjie Wang, Pengfei Tang, Jian Lou and Li Xiong
 DReCa: A General Task Augmentation Strategy for Few-Shot Natural Language Inference
Shikhar Murty, Tatsunori Hashimoto and Christopher Manning
 16:00–17:00 Keynote
  Session chair: Dilek Hakkani-Tur (Amazon Alexa AI)
 From Disembodied to Embodied Multimodal Learning
Dhruv Batra
 17:00–18:20 Session 4 (click to expand/collapse)
 17:00–18:20 4A: Machine Translation
  Session chair: Wenhu Chen (UC Santa Barbara/Google AI)
 Harnessing Multilinguality in Unsupervised Machine Translation for Rare Languages
Xavier Garcia, Aditya Siddhant, Orhan Firat and Ankur Parikh
 Macro-Average: Rare Types Are Important Too
Thamme Gowda, Weiqiu You, Constantine Lignos and Jonathan May
 Assessing Reference-Free Peer Evaluation for Machine Translation
Sweta Agrawal, George Foster, Markus Freitag and Colin Cherry
 The Curious Case of Hallucinations in Neural Machine Translation
Vikas Raunak, Arul Menezes and Marcin Junczys-Dowmunt
 Towards Continual Learning for Multilingual Machine Translation via Vocabulary Substitution
Xavier Garcia, Noah Constant, Ankur Parikh and Orhan Firat
 Towards Modeling the Style of Translators in Neural Machine Translation
Yue Wang, Cuong Hoang and Marcello Federico
[TACL] Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings
Phillip Keung, Julian Salazar, Yichao Lu, Noah A. Smith
 17:00–18:20 4B: Question Answering
  Session chair: Bhuwan Dhingra (Google AI)
 Self-Supervised Test-Time Learning for Reading Comprehension
Pratyay Banerjee, Tejas Gokhale and Chitta Baral
 Capturing Row and Column Semantics in Transformer Based Question Answering over Tables
Michael Glass, Mustafa Canim, Alfio Gliozzo, Saneem Chemmengath, Vishwajeet Kumar, Rishav Chakravarti, Avi Sil, Feifei Pan, Samarth Bharadwaj and Nicolas Rodolfo Fauceglia
 Explainable Multi-hop Verbal Reasoning Through Internal Monologue
Zhengzhong Liang, Steven Bethard and Mihai Surdeanu
 Robust Question Answering Through Sub-part Alignment
Jifan Chen and Greg Durrett
 Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models
Tushar Khot, Daniel Khashabi, Kyle Richardson, Peter Clark and Ashish Sabharwal
 RECONSIDER: Improved Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering
Srinivasan Iyer, Sewon Min, Yashar Mehdad and Wen-tau Yih
 On the Transferability of Minimal Prediction Preserving Inputs in Question Answering
Shayne Longpre, Yi Lu and Chris DuBois
 17:00–18:20 4C: Sentence-level Semantics and Textual Inference
  Session chair: Ves Stoyanov (Facebook AI)
 Understanding by Understanding Not: Modeling Negation in Language Models
Arian Hosseini, Siva Reddy, Dzmitry Bahdanau, R Devon Hjelm, Alessandro Sordoni and Aaron Courville
 DuoRAT: Towards Simpler Text-to-SQL Models
Torsten Scholak, Raymond Li, Dzmitry Bahdanau, Harm de Vries and Chris Pal
 Looking Beyond Sentence-Level Natural Language Inference for Question Answering and Text Summarization
Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Lorraine Li, Pavan Kapanipathi and Kartik Talamadupula
 Structure-Grounded Pretraining for Text-to-SQL
Xiang Deng, Ahmed Hassan Awadallah, Christopher Meek, Oleksandr Polozov, Huan Sun and Matthew Richardson
 Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System
Congying Xia, Wenpeng Yin, Yihao Feng and Philip Yu
 Temporal Reasoning on Implicit Events from Distant Supervision
Ben Zhou, Kyle Richardson, Qiang Ning, Tushar Khot, Ashish Sabharwal and Dan Roth
 Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models
James Y. Huang, Kuan-Hao Huang and Kai-Wei Chang
 17:00–18:20 4D: Summarization
  Session chair: Rui Zhang (Penn State University)
 Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs
Jiaao Chen and Diyi Yang
 A New Approach to Overgenerating and Scoring Abstractive Summaries
Kaiqiang Song, Bingqing Wang, Zhe Feng and Fei Liu
 D2S: Document-to-Slide Generation Via Query-Based Text Summarization
Edward Sun, Yufang Hou, Dakuo Wang, Yunfeng Zhang and Nancy X. R. Wang
 Efficient Attentions for Long Document Summarization
Luyang Huang, Shuyang Cao, Nikolaus Parulian, Heng Ji and Lu Wang
 RefSum: Refactoring Neural Summarization
Yixin Liu, Zi-Yi Dou and Pengfei Liu
 Annotating and Modeling Fine-grained Factuality in Summarization
Tanya Goyal and Greg Durrett
 17:00–18:20 4E: Syntax: Tagging, Chunking, and Parsing
  Session chair: Sheng Zhang (Microsoft Research )
 Larger-Context Tagging: When and Why Does It Work?
Jinlan Fu, Liangjing Feng, Qi Zhang, Xuanjing Huang and Pengfei Liu
 Neural Sequence Segmentation as Determining the Leftmost Segments
Yangming Li, Lemao Liu and Kaisheng Yao
 PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols
Songlin Yang, Yanpeng Zhao and Kewei Tu
 GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input
Tao Meng, Anjie Fang, Oleg Rokhlenko and Shervin Malmasi
[CL] Universal Dependencies
Marie-Catherine de Marneffe, Christopher D. Manning, Joakim Nivre, Daniel Zeman
 18:20–19:40 Session 5 (click to expand/collapse)
 18:20–19:40 5A: Dialogue and Interactive Systems
  Session chair: Huan Sun (The Ohio State University)
 Generating Negative Samples by Manipulating Golden Responses for Unsupervised Learning of a Response Evaluation Model
ChaeHun Park, Eugene Jang, Wonsuk Yang and Jong Park
 How Robust are Fact Checking Systems on Colloquial Claims?
Byeongchang Kim, Hyunwoo Kim, Seokhee Hong and Gunhee Kim
 Fine-grained Post-training for Improving Retrieval-based Dialogue Systems
Janghoon Han, Taesuk Hong, Byoungjae Kim, Youngjoong Ko and Jungyun Seo
 Put Chatbot into Its Interlocutor’s Shoes: New Framework to Learn Chatbot Responding with Intention
Hsuan Su, Jiun-Hao Jhan, Fan-yun Sun, Saurav Sahay and Hung-yi Lee
 Adding Chit-Chat to Enhance Task-Oriented Dialogues
Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho and Claire Cardie
 18:20–19:40 5B: Discourse and Pragmatics
  Session chair: Yangfeng Ji (University of Virginia)
 Incorporating Syntax and Semantics in Coreference Resolution with Heterogeneous Graph Attention Network
Fan Jiang and Trevor Cohn
 Context Tracking Network: Graph-based Context Modeling for Implicit Discourse Relation Recognition
Yingxue Zhang, Fandong Meng, Peng Li, Ping Jian and Jie Zhou
 Improving Neural RST Parsing Model with Silver Agreement Subtrees
Naoki Kobayashi, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura and Masaaki Nagata
 RST Parsing from Scratch
Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li
 Did they answer? Subjective acts and intents in conversational discourse
Elisa Ferracane, Greg Durrett, Junyi Jessy Li and Katrin Erk
 Evaluating the Impact of a Hierarchical Discourse Representation on Entity Coreference Resolution Performance
Sopan Khosla, James Fiacco and Carolyn Rosé
 Bridging Resolution: Making Sense of the State of the Art
Hideo Kobayashi and Vincent Ng
 18:20–19:40 5C: Machine Learning for NLP: Language Modeling and Sequence to Sequence Models
  Session chair: Lei Yu (DeepMind)
 Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle
Yikang Shen, Shawn Tan, Alessandro Sordoni, Siva Reddy and Aaron Courville
 Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation
Samuel Kiegeland and Julia Kreutzer
 Learning to Organize a Bag of Words into Sentences with Neural Networks: An Empirical Study
Chongyang Tao, Shen Gao, Juntao Li, Yansong Feng, Dongyan Zhao and Rui Yan
 Mask Attention Networks: Rethinking and Strengthen Transformer
Zhihao Fan, Yeyun Gong, Dayiheng Liu, Zhongyu Wei, Siyuan Wang, Jian Jiao, Nan Duan, Ruofei Zhang and Xuanjing Huang
 ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao, Yu-Kun Li, Han Zhang, Yu Sun, Hao Tian, Hua Wu and Haifeng Wang
 Lattice-BERT: Leveraging Multi-Granularity Representations in Chinese Pre-trained Language Models
Yuxuan Lai, Yijia Liu, Yansong Feng, Songfang Huang and Dongyan Zhao
 18:20–19:40 5D: Lexical Semantics
  Session chair: Ken Church (Baidu)
 Modeling Event Plausibility with Consistent Conceptual Abstraction
Ian Porada, Kaheer Suleman, Adam Trischler and Jackie Chi Kit Cheung
 UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus
George Michalopoulos, Yuanxin Wang, Hussam Kaka, Helen Chen and Alexander Wong
 Field Embedding: A Unified Grain-Based Framework for Word Representation
Junjie Luo, Xi Chen, Jichao Sun, Yuejia Xiang, Ningyu Zhang and Xiang Wan
 MelBERT: Metaphor Detection via Contextualized Late Interaction using Metaphorical Identification Theories
Minjin Choi, Sunkyung Lee, Eunseong Choi, Heesoo Park, Junhyuk Lee, Dongwon Lee and Jongwuk Lee
 Non-Parametric Few-Shot Learning for Word Sense Disambiguation
Howard Chen, Mengzhou Xia and Danqi Chen
 18:20–19:40 5E: Sentiment Analysis and Stylistic Analysis
  Session chair: Shi Zong (Nanjing University)
 Why Do Document-Level Polarity Classifiers Fail?
Karen Martins, Pedro O.S Vaz-de-Melo and Rodrygo Santos
 A Unified Span-Based Approach for Opinion Mining with Syntactic Constituents
Qingrong Xia, Bo Zhang, Rui Wang, Zhenghua Li, Yue Zhang, Fei Huang, Luo Si and Min Zhang
 Target-specified Sequence Labeling with Multi-head Self-attention for Target-oriented Opinion Words Extraction
Yuhao Feng, Yanghui Rao, Yuyao Tang, Ninghua Wang and He Liu
 Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa
Junqi Dai, Hang Yan, Tianxiang Sun, Pengfei Liu and Xipeng Qiu
 Domain Divergences: A Survey and Empirical Analysis
Abhinav Ramesh Kashyap, Devamanyu Hazarika, Min-Yen Kan and Roger Zimmermann
 Target-Aware Data Augmentation for Stance Detection
Yingjie Li and Cornelia Caragea
 19:40–21:00 Session 6 (click to expand/collapse)
 19:40–21:00 6A: Speech
  Session chair: Yao Qian (Microsoft)
 End-to-end ASR to jointly predict transcriptions and linguistic annotations
Motoi Omachi, Yuya Fujita, Shinji Watanabe and Matthew Wiesner
 Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Hirofumi Inaguma, Tatsuya Kawahara and Shinji Watanabe
 Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Siddharth Dalmia, Brian Yan, Vikas Raunak, Florian Metze and Shinji Watanabe
 SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding
Yu-An Chung, Chenguang Zhu and Michael Zeng
 Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering
Kiran Ramnath, Leda Sari, Mark Hasegawa-Johnson and Chang Yoo
 Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Ethan A. Chi, Julian Salazar and Katrin Kirchhoff
 19:40–21:00 6B: NLP Applications
  Session chair: Wenhan Xiong (Facebook AI)
 Everything Has a Cause: Leveraging Causal Inference in Legal Text Analysis
Xiao Liu, Da Yin, Yansong Feng, Yuting Wu and Dongyan Zhao
 Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network
Haoran Wu, Wei Chen, Shuang Xu and Bo Xu
 Personalized Response Generation via Generative Split Memory Network
Yuwei Wu, Xuezhe Ma and Diyi Yang
 Towards Few-shot Fact-Checking via Perplexity
Nayeon Lee, Yejin Bang, Andrea Madotto and Pascale Fung
 Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation
Rishi Hazra, Parag Dutta, Shubham Gupta, Mohammed Abdul Qaathir and Ambedkar Dukkipati
 Generating An Optimal Interview Question Plan Using A Knowledge Graph And Integer Linear Programming
Soham Datta, Prabir Mallick, Sangameshwar Patil, Indrajit Bhattacharya and Girish Palshikar
 19:40–21:00 6C: Machine Learning for NLP: Classification and Structured Prediction Models
  Session chair: Lingpeng Kong (The University of Hong Kong)
 Model Extraction and Adversarial Transferability, Your BERT is Vulnerable!
Xuanli He, Lingjuan Lyu, Lichao Sun and Qiongkai Xu
 A Global Past-Future Early Exit Method for Accelerating Inference of Pre-trained Language Models
Kaiyuan Liao, Yi Zhang, Xuancheng Ren, Qi Su, Xu Sun and Bin He
 Masked Conditional Random Fields for Sequence Labeling
Tianwen Wei, Jianwei Qi, Shenghuan He and Songtao Sun
 Heterogeneous Graph Neural Networks for Concept Prerequisite Relation Learning in Educational Data
Chenghao Jia, Yongliang Shen, Yechun Tang, Lu Sun and Weiming Lu
 Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models
Wenkai Yang, Lei Li, Zhiyuan Zhang, Xuancheng Ren, Xu Sun and Bin He
 DA-Transformer: Distance-aware Transformer
Chuhan Wu, Fangzhao Wu and Yongfeng Huang
 19:40–21:00 6D: Language Resources and Evaluation
  Session chair: Alexandros Papangelis (Amazon, Alexa AI)
 ASAP: A Chinese Review Dataset Towards Aspect Category Sentiment Analysis and Rating Prediction
Jiahao Bu, Lei Ren, Shuang Zheng, Yang Yang, Jingang Wang, Fuzheng Zhang and Wei Wu
 Are NLP Models really able to Solve Simple Math Word Problems?
Arkil Patel, Satwik Bhattamishra and Navin Goyal
 WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations
Tomoyuki Kajiwara, Chenhui Chu, Noriko Takemura, Yuta Nakashima and Hajime Nagahara
 KPQA: A Metric for Generative Question Answering Using Keyphrase Weights
Hwanhee Lee, Seunghyun Yoon, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Joongbo Shin and Kyomin Jung
 StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer
Yiwei Lyu, Paul Pu Liang, Hai Pham, Eduard Hovy, Barnabás Póczos, Ruslan Salakhutdinov and Louis-Philippe Morency
 Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge
Canwen Xu, Wangchunshu Zhou, Tao Ge, Ke Xu, Julian McAuley and Furu Wei
 COVID-19 Named Entity Recognition for Vietnamese
Thinh Hung Truong, Mai Hoang Dao and Dat Quoc Nguyen
 19:40–21:00 6E: Computational Social Science and Cultural Analytics
  Session chair: Vivek Kulkarni (Twitter)
 Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames
Shima Khanehzar, Trevor Cohn, Gosia Mikolajczak, Andrew Turpin and Lea Frermann
 Automatic Classification of Neutralization Techniques in the Narrative of Climate Change Scepticism
Shraey Bhatia, Jey Han Lau and Timothy Baldwin
 Suicide Ideation Detection via Social and Temporal User Representations using Hyperbolic Learning
Ramit Sawhney, Harshit Joshi, Rajiv Ratn Shah and Lucie Flek
 WikiTalkEdit: A Dataset for modeling Editors’ behaviors on Wikipedia
Kokil Jaidka, Andrea Ceolin, Iknoor Singh, Niyati Chhaya and Lyle Ungar
 The structure of online social networks modulates the rate of lexical change
Jian Zhu and David Jurgens
 Modeling Framing in Immigration Discourse on Social Media
Julia Mendelsohn, Ceren Budak and David Jurgens

Tue 08 Jun 2021 (all times PDT, UTC-7)

 08:00–09:00 Keynote
  Session chair: Luke Zettlemoyer (University of Washington & Facebook)
 Generating Reality: Technical and Social Explorations in Generative Machine Learning Research
Shakir Mohamed
 09:00–10:20 Session 7 (click to expand/collapse)
 09:00–10:20 7A: Computational Social Science and Cultural Analytics
  Session chair: Dallas Card (Stanford)
 Modeling the Severity of Complaints in Social Media
Mali Jin and Nikolaos Aletras
 What About the Precedent: An Information-Theoretic Analysis of Common Law
Josef Valvoda, Tiago Pimentel, Niklas Stoehr, Ryan Cotterell and Simone Teufel
 Introducing CAD: the Contextual Abuse Dataset
Bertie Vidgen, Dong Nguyen, Helen Margetts, Patricia Rossini and Rebekah Tromble
 Lifelong Learning of Hate Speech Classification on Social Media
Jing Qian, Hong Wang, Mai ElSherief and Xifeng Yan
 Learning to Recognize Dialect Features
Dorottya Demszky, Devyani Sharma, Jonathan Clark, Vinodkumar Prabhakaran and Jacob Eisenstein
[TACL] Characterizing English Variation across Social Media Communities with BERT
Lucy Li, David Bamman
 09:00–10:20 7B: Green NLP
  Session chair: Roy Schwartz (The Hebrew University of Jerusalem)
 Static Embeddings as Efficient Knowledge Bases?
Philipp Dufter, Nora Kassner and Hinrich Schütze
 Highly Efficient Knowledge Graph Embedding Learning with Orthogonal Procrustes Analysis
Xutan Peng, Guanyi Chen, Chenghua Lin and Mark Stevenson
 Rethinking Network Pruning – under the Pre-train and Fine-tune Paradigm
Dongkuan Xu, Ian En-Hsu Yen, Jinxi Zhao and Zhibin Xiao
 Towards a Comprehensive Understanding and Accurate Evaluation of Societal Biases in Pre-Trained Transformers
Andrew Silva, Pradyumna Tambwekar and Matthew Gombolay
 Detoxifying Language Models Risks Marginalizing Minority Voices
Albert Xu, Eshaan Pathak, Eric Wallace, Suchin Gururangan, Maarten Sap and Dan Klein
 HONEST: Measuring Hurtful Sentence Completion in Language Models
Debora Nozza, Federico Bianchi and Dirk Hovy
 09:00–10:20 7C: Language Grounding to Vision, Robotics and Beyond
  Session chair: Xin Eric Wang (UC Santa Cruz)
 EaSe: A Diagnostic Tool for VQA based on Answer Diversity
Shailza Jolly, Sandro Pezzelle and Moin Nabi
 DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization
Zineng Tang, Jie Lei and Mohit Bansal
 Improving Generation and Evaluation of Visual Stories via Semantic Consistency
Adyasha Maharana, Darryl Hannan and Mohit Bansal
 Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models
Po-Yao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze and Alexander Hauptmann
 Video Question Answering with Phrases via Semantic Roles
Arka Sadhu, Kan Chen and Ram Nevatia
[TACL] Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering
Ben Bogin: ben.bogin@, Jonathan Berant, Sanjay Subramanian, Matt Gardner
 09:00–10:20 7D: Language Resources and Evaluation
  Session chair: Sowmya Vajjala (National Research Council, Canada)
 From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding
Rob van der Goot, Ibrahim Sharaf, Aizhan Imankulova, Ahmet Üstün, Marija Stepanović, Alan Ramponi, Siti Oryza Khairunnisa, Mamoru Komachi and Barbara Plank
 WEC: Deriving a Large-scale Cross-document Event Coreference dataset from Wikipedia
Alon Eirew, Arie Cattan and Ido Dagan
 Challenging distributional models with a conceptual network of philosophical terms
Yvette Oortwijn, Jelke Bloem, Pia Sommerauer, Francois Meyer, Wei Zhou and Antske Fokkens
 KILT: a Benchmark for Knowledge Intensive Language Tasks
Fabio Petroni, Aleksandra Piktus, Angela Fan, Patrick Lewis, Majid Yazdani, Nicola De Cao, James Thorne, Yacine Jernite, Vladimir Karpukhin, Jean Maillard, Vassilis Plachouras, Tim Rocktäschel and Sebastian Riedel
[TACL] AMR Similarity Metrics from Principles
Juri Opitz, Letitia Parcalabescu, Anette Frank
[TACL] Evaluating Document Coherence Modelling
Aili Shen, Meladel Mistica, Bahar Salehi, Hang Li, Timothy Baldwin, Jianzhong Qi
 09:00–10:20 7E: Machine Learning for NLP: Classification and Structured Prediction Models
  Session chair: Paul Michel (Carnegie Mellon University)
 A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios
Michael A. Hedderich, Lukas Lange, Heike Adel, Jannik Strötgen and Dietrich Klakow
 Temporal Knowledge Graph Completion using a Linear Temporal Regularizer and Multivector Embeddings
Chengjin Xu, Yung-Yu Chen, Mojtaba Nayyeri and Jens Lehmann
 UDALM: Unsupervised Domain Adaptation through Language Modeling
Constantinos Karouzos, Georgios Paraskevopoulos and Alexandros Potamianos
 Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning
Tommaso Fornaciari, Alexandra Uma, Silviu Paun, Barbara Plank, Dirk Hovy and Massimo Poesio
 Clustering-based Inference for Biomedical Entity Linking
Rico Angell, Nicholas Monath, Sunil Mohan, Nishant Yadav and Andrew McCallum
 Variance-reduced First-order Meta-learning for Natural Language Processing Tasks
Lingxiao Wang, Kevin Huang, Tengyu Ma, Quanquan Gu and Jing Huang
 Diversity-Aware Batch Active Learning for Dependency Parsing
Tianze Shi, Adrian Benton, Igor Malioutov and Ozan İrsoy
 10:20–11:40 Session 8 (click to expand/collapse)
 10:20–11:40 8A: Machine Learning for NLP: Language Modeling and Sequence to Sequence Models
  Session chair: Srini Iyer (Facebook AI Research)
 Can Latent Alignments Improve Autoregressive Machine Translation?
Adi Haviv, Lior Vassertail and Omer Levy
 Smoothing and Shrinking the Sparse Seq2Seq Search Space
Ben Peters and André F. T. Martins
 Unified Pre-training for Program Understanding and Generation
Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray and Kai-Wei Chang
 Hyperparameter-free Continuous Learning for Domain Classification in Natural Language Understanding
Ting Hua, Yilin Shen, Changsheng Zhao, Yen-Chang Hsu and Hongxia Jin
[TACL] A Primer in BERTology: What We Know About How BERT Works
Anna Rogers, Olga Kovaleva, Anna Rumshisky
 10:20–11:40 8B: NLP Applications
  Session chair: Emily Prud'hommeaux (Boston College)
 On the Embeddings of Variables in Recurrent Neural Networks for Source Code
Nadezhda Chirkova
 Cross-Lingual Word Embedding Refinement by $_1$ Norm Optimisation
Xutan Peng, Chenghua Lin and Mark Stevenson
 Semantic Frame Forecast
Chieh-Yang Huang and Ting-Hao Huang
 MUSER: MUltimodal Stress detection using Emotion Recognition as an Auxiliary Task
Yiqun Yao, Michalis Papakostas, Mihai Burzo, Mohamed Abouelenien and Rada Mihalcea
 Learning to Decompose and Organize Complex Tasks
Yi Zhang, Sujay Kumar Jauhar, Julia Kiseleva, Ryen White and Dan Roth
 Continual Learning for Text Classification with Information Disentanglement Based Regularization
Yufan Huang, Yanzhe Zhang, Jiaao Chen, Xuezhi Wang and Diyi Yang
 10:20–11:40 8C: Sentence-level Semantics and Textual Inference
  Session chair: Mrinmaya Sachan (ETH Zurich)
 Learning from Executions for Semantic Parsing
Bailin Wang, Mirella Lapata and Ivan Titov
 Learning to Synthesize Data for Semantic Parsing
Bailin Wang, Wenpeng Yin, Xi Victoria Lin and Caiming Xiong
 Edge: Enriching Knowledge Graph Embeddings with External Text
Saed Rezayi, Handong Zhao, Sungchul Kim, Ryan Rossi, Nedim Lipka and Sheng Li
 FLIN: A Flexible Natural Language Interface for Web Navigation
Sahisnu Mazumder and Oriana Riva
 Game-theoretic Vocabulary Selection via the Shapley Value and Banzhaf Index
Roma Patel, Marta Garnelo, Ian Gemp, Chris Dyer and Yoram Bachrach
 Incorporating External Knowledge to Enhance Tabular Reasoning
J. Neeraja, Vivek Gupta and Vivek Srikumar
 Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention
Pengcheng Yin, Hao Fang, Graham Neubig, Adam Pauls, Emmanouil Antonios Platanios, Yu Su, Sam Thomson and Jacob Andreas
 10:20–11:40 8D: Sentiment Analysis and Stylistic Analysis
  Session chair: Preslav Nakov (Qatar Computing Research Institute, HBKU)
 Domain Adaptation for Arabic Cross-Domain and Cross-Dialect Sentiment Analysis from Contextualized Word Embedding
Abdellah El Mekki, Abdelkader El Mahdaouy, Ismail Berrada and Ahmed Khoumsi
 Multi-task Learning of Negation and Speculation for Targeted Sentiment Classification
Andrew Moore and Jeremy Barnes
 A Disentangled Adversarial Neural Topic Model for Separating Opinions from Plots in User Reviews
Gabriele Pergola, Lin Gui and Yulan He
 Graph Ensemble Learning over Multiple Dependency Trees for Aspect-level Sentiment Classification
Xiaochen Hou, Peng Qi, Guangtao Wang, Rex Ying, Jing Huang, Xiaodong He and Bowen Zhou
 Emotion-Infused Models for Explainable Psychological Stress Detection
Elsbeth Turcan, Smaranda Muresan and Kathleen McKeown
 Aspect-based Sentiment Analysis with Type-aware Graph Convolutional Networks and Layer Ensemble
Yuanhe Tian, Guimin Chen and Yan Song
 10:20–11:40 8E: Syntax: Tagging, Chunking, and Parsing
  Session chair: Mike Lewis (Facebook AI)
 Supertagging-based Parsing with Linear Context-free Rewriting Systems
Thomas Ruprecht and Richard Mörbitz
 Outside Computation with Superior Functions
Parker Riley and Daniel Gildea
 Learning Syntax from Naturally-Occurring Bracketings
Tianze Shi, Ozan İrsoy, Igor Malioutov and Lillian Lee
[CL] What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions?
Miryam de Lhoneux, Sara Stymne, Joakim Nivre
[TACL] Reducing Confusion in Active Learning for Part-Of-Speech Tagging
Aditi Chaudhary, Antonios Anastasopoulos, Zaid Sheikh, Graham Neubig
 11:40–13:00 Business Meeting
 16:00–17:00 Keynote
  Session chair: Luke Zettlemoyer (University of Washington & Facebook)
 Moving the Needle in NLP Technology for the Processing of Code-Switched Language
Thamar Solorio
 17:00–18:20 Session 9 (click to expand/collapse)
 17:00–18:20 9A: Dialogue and Interactive Systems
  Session chair: Yang Liu (Amazon, Alexa AI)
 Bot-Adversarial Dialogue for Safe Conversational Agents
Jing Xu, Da Ju, Margaret Li, Y-Lan Boureau, Jason Weston and Emily Dinan
 Non-Autoregressive Semantic Parsing for Compositional Task-Oriented Dialog
Arun Babu, Akshat Shrivastava, Armen Aghajanyan, Ahmed Aly, Angela Fan and Marjan Ghazvininejad
 Example-Driven Intent Prediction with Observers
Shikib Mehri and Mihail Eric
 Imperfect also Deserves Reward: Multi-Level and Sequential Reward Modeling for Better Dialog Management
Zhengxu Hou, Bang Liu, Ruihui Zhao, Zijing Ou, Yafei Liu, Xi Chen and Yefeng Zheng
 Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems
Derek Chen, Howard Chen, Yi Yang, Alexander Lin and Zhou Yu
 Controlling Dialogue Generation with Semantic Exemplars
Prakhar Gupta, Jeffrey Bigham, Yulia Tsvetkov and Amy Pavel
 17:00–18:20 9B: Information Retrieval and Text Mining
  Session chair: Qingyao Ai (University of Utah)
 COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List
Luyu Gao, Zhuyun Dai and Jamie Callan
 X-Class: Text Classification with Extremely Weak Supervision
Zihan Wang, Dheeraj Mekala and Jingbo Shang
 Fine-tuning Encoders for Improved Monolingual and Zero-shot Polylingual Neural Topic Modeling
Aaron Mueller and Mark Dredze
 Exploring the Relationship Between Algorithm Performance, Vocabulary, and Run-Time in Text Classification
Wilson Fearn, Orion Weller and Kevin Seppi
 Faithfully Explainable Recommendation via Neural Logic Reasoning
Yaxin Zhu, Yikun Xian, Zuohui Fu, Gerard de Melo and Yongfeng Zhang
 You Sound Like Someone Who Watches Drama Movies: Towards Predicting Movie Preferences from Conversational Interactions
Sergey Volokhin, Joyce Ho, Oleg Rokhlenko and Eugene Agichtein
[TACL] Sparse, Dense, and Attentional Representations for Text Retrieval
Yi Luan, Jacob Eisenstein, Kristina Toutanova, Michael Collins
 17:00–18:20 9C: Language Grounding to Vision, Robotics and Beyond
  Session chair: John Lalor (University of Notre Dame)
 Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents
Shunyu Yao, Karthik Narasimhan and Matthew Hausknecht
 SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
Sameer Dharur, Purva Tendulkar, Dhruv Batra, Devi Parikh and Ramprasaath R. Selvaraju
 Semi-Supervised Policy Initialization for Playing Games with Language Hints
Tsu-Jui Fu and William Yang Wang
 Revisiting Document Representations for Large-Scale Zero-Shot Learning
Jihyung Kil and Wei-Lun Chao
 17:00–18:20 9D: Language Resources and Evaluation
  Session chair: Pradeep Dasigi (Allen Institute for AI)
 Negative language transfer in learner English: A new dataset
Leticia Farias Wanderley, Nicole Zhao and Carrie Demmans Epp
 SentSim: Crosslingual Semantic Evaluation of Machine Translation
Yurun Song, Junchen Zhao and Lucia Specia
 Quality Estimation for Image Captions Based on Large-scale Human Evaluations
Tomer Levinboim, Ashish V. Thapliyal, Piyush Sharma and Radu Soricut
 CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems
Kushal Chawla, Jaysa Ramirez, Rene Clever, Gale Lucas, Jonathan May and Jonathan Gratch
 News Headline Grouping as a Challenging NLU Task
Philippe Laban, Lucas Bandarkar and Marti A. Hearst
 Olá, Bonjour, Salve! XFORMAL: A Benchmark for Multilingual Formality Style Transfer
Eleftheria Briakou, Di Lu, Ke Zhang and Joel Tetreault
 17:00–18:20 9E: Machine Learning for NLP: Classification and Structured Prediction Models
  Session chair: Guangtao Wang (JD AI Research)
 Grouping Words with Semantic Diversity
Karine Chubarian, Abdul Rafae Khan, Anastasios Sidiropoulos and Jia Xu
 Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua, Xingjian Li, Dejing Dou, Chengzhong Xu and Jiebo Luo
 FlowPrior: Learning Expressive Priors for Latent Variable Sentence Models
Xiaoan Ding and Kevin Gimpel
 HTCInfoMax: A Global Model for Hierarchical Text Classification via Information Maximization
Zhongfen Deng, Hao Peng, Dongxiao He, Jianxin Li and Philip Yu
[TACL] Modeling Content and Context with Deep Relational Learning
Maria Leonor Pacheco, Dan Goldwasser
 Knowledge Guided Metric Learning for Few-Shot Text Classification
Dianbo Sui, Yubo Chen, Binjie Mao, Delai Qiu, Kang Liu and Jun Zhao
 18:20–19:40 Session 10 (click to expand/collapse)
 18:20–19:40 10A: Dialogue and Interactive Systems
  Session chair: Ramesh Manuvinakurike (Intel labs)
 Ensemble of MRR and NDCG models for Visual Dialog
Idan Schwartz
 Supervised Neural Clustering via Latent Structured Output Learning: Application to Question Intents
Iryna Haponchyk and Alessandro Moschitti
 ConVEx: Data-Efficient and Few-Shot Slot Labeling
Matthew Henderson and Ivan Vulić
 CREAD: Combined Resolution of Ellipses and Anaphora in Dialogues
Bo-Hsiang Tseng, Shruti Bhargava, Jiarui Lu, Joel Ruben Antony Moniz, Dhivya Piraviperumal, Lin Li and Hong Yu
 Knowledge-Driven Slot Constraints for Goal-Oriented Dialogue Systems
Piyawat Lertvittayakumjorn, Daniele Bonadiman and Saab Mansour
 Clipping Loops for Sample-Efficient Dialogue Policy Optimisation
Yen-Chen Wu and Carl Edward Rasmussen
 18:20–19:40 10B: Information Extraction
  Session chair: Alan Ritter (Georgia Tech)
 Integrating Lexical Information into Entity Neighbourhood Representations for Relation Prediction
Ian Wood, Mark Johnson and Stephen Wan
 Noisy-Labeled NER with Confidence Estimation
Kun Liu, Yao Fu, Chuanqi Tan, Mosha Chen, Ningyu Zhang, Songfang Huang and Sheng Gao
 TABBIE: Pretrained Representations of Tabular Data
Hiroshi Iida, Dung Thai, Varun Manjunatha and Mohit Iyyer
 Better Feature Integration for Named Entity Recognition
Lu Xu, Zhanming Jie, Wei Lu and Lidong Bing
 ZS-BERT: Towards Zero-Shot Relation Extraction with Attribute Representation Learning
Chih-Yao Chen and Cheng-Te Li
 Graph Convolutional Networks for Event Causality Identification with Rich Document-level Structures
Minh Tran Phu and Thien Huu Nguyen
 A Context-Dependent Gated Module for Incorporating Symbolic Semantics into Event Coreference Resolution
Tuan Lai, Heng Ji, Trung Bui, Quan Hung Tran, Franck Dernoncourt and Walter Chang
 18:20–19:40 10C: Language Generation
  Session chair: Greg Durrett (UT Austin)
 Multi-Style Transfer with Discriminative Feedback on Disjoint Corpus
Navita Goyal, Balaji Vasan Srinivasan, Anandhavelu N and Abhilasha Sancheti
 FUDGE: Controlled Text Generation With Future Discriminators
Kevin Yang and Dan Klein
 Controllable Text Simplification with Explicit Paraphrasing
Mounica Maddela, Fernando Alva-Manchego and Wei Xu
 Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training
Oshin Agarwal, Heming Ge, Siamak Shakeri and Rami Al-Rfou
 Choose Your Own Adventure: Paired Suggestions in Collaborative Writing for Evaluating Story Generation Models
Elizabeth Clark and Noah A. Smith
[TACL] There Once Was a Really Bad Poet, It Was Automated but You Didn’t Know It
Jianyou, jw542@duke.edu, Xiaoxuan, zhangxiaoxuanaa@gmail.com, Yuren Zhou, Christopher Suh, Cynthia Rudin
 18:20–19:40 10D: Multilinguality
  Session chair: Radu Florian (IBM Research AI)
 InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training
Zewen Chi, Li Dong, Furu Wei, Nan Yang, Saksham Singhal, Wenhui Wang, Xia Song, Xian-Ling Mao, Heyan Huang and Ming Zhou
 Context-Interactive Pre-Training for Document Machine Translation
Pengcheng Yang, Pei Zhang, Boxing Chen, Jun Xie and Weihua Luo
 Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots
Samson Tan and Shafiq Joty
 X-METRA-ADA: Cross-lingual Meta-Transfer learning Adaptation to Natural Language Understanding and Question Answering
Meryem M’hamdi, Doo Soon Kim, Franck Dernoncourt, Trung Bui, Xiang Ren and Jonathan May
 Explicit Alignment Objectives for Multilingual Bidirectional Encoders
Junjie Hu, Melvin Johnson, Orhan Firat, Aditya Siddhant and Graham Neubig
 Cross-lingual Cross-modal Pretraining for Multimodal Retrieval
Hongliang Fei, Tan Yu and Ping Li
 Wikipedia Entities as Rendezvous across Languages: Grounding Multilingual Language Models by Predicting Wikipedia Hyperlinks
Iacer Calixto, Alessandro Raganato and Tommaso Pasini
 18:20–19:40 10E: Question Answering
  Session chair: Jing Huang (JD AI Research)
 multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning
Swarnadeep Saha, Prateek Yadav and Mohit Bansal
 Adaptable and Interpretable Neural MemoryOver Symbolic Knowledge
Pat Verga, Haitian Sun, Livio Baldini Soares and William Cohen
 CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images
Shailaja Keyur Sampat, Akshay Kumar, Yezhou Yang and Chitta Baral
 Refining Targeted Syntactic Evaluation of Language Models
Benjamin Newman, Kai-Siang Ang, Julia Gong and John Hewitt
 Universal Adversarial Attacks with Natural Triggers for Text Classification
Liwei Song, Xinwei Yu, Hsuan-Tung Peng and Karthik Narasimhan
 QuadrupletBERT: An Efficient Model For Embedding-Based Large-Scale Retrieval
Peiyang Liu, Sen Wang, Xi Wang, Wei Ye and Shikun Zhang
 19:40–21:00 Session 11 (click to expand/collapse)
 19:40–21:00 11A: Ethics, Bias, and Fairness
  Session chair: Swabha Swayamdipta (Allen Institute for AI)
 Dynamically Disentangling Social Bias from Task-Oriented Representations with Adversarial Attack
Liwen Wang, Yuanmeng Yan, Keqing He, Yanan Wu and Weiran Xu
 An Empirical Investigation of Bias in the Multimodal Analysis of Financial Earnings Calls
Ramit Sawhney, Arshiya Aggarwal and Rajiv Ratn Shah
 Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing
Boaz Shmueli, Jan Fell, Soumya Ray and Lun-Wei Ku
 On Transferability of Bias Mitigation Effects in Language Model Fine-Tuning
Xisen Jin, Francesco Barbieri, Brendan Kennedy, Aida Mostafazadeh Davani, Leonardo Neves and Xiang Ren
 Case Study: Deontological Ethics in NLP
Shrimai Prabhumoye, Brendon Boldt, Ruslan Salakhutdinov and Alan W Black
 Privacy Regularization: Joint Privacy-Utility Optimization in LanguageModels
Fatemehsadat Mireshghallah, Huseyin Inan, Marcello Hasegawa, Victor Rühle, Taylor Berg-Kirkpatrick and Robert Sim
 On the Impact of Random Seeds on the Fairness of Clinical Classifiers
Silvio Amir, Jan-Willem van de Meent and Byron Wallace
 19:40–21:00 11B: Interpretability and Analysis of Models for NLP
  Session chair: Tiancheng Zhao (Zhejiang University)
 Topic Model or Topic Twaddle? Re-evaluating Semantic Interpretability Measures
Caitlin Doogan and Wray Buntine
 Discourse Probing of Pretrained Language Models
Fajri Koto, Jey Han Lau and Timothy Baldwin
 UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost
Zhen Wu, Lijun Wu, Qi Meng, Yingce Xia, Shufang Xie, Tao Qin, Xinyu Dai and Tie-Yan Liu
 tWT–WT: A Dataset to Assert the Role of Target Entities for Detecting Stance of Tweets
Ayush Kaushal, Avirup Saha and Niloy Ganguly
 Learning to Learn to be Right for the Right Reasons
Pride Kavumba, Benjamin Heinzerling, Ana Brassard and Kentaro Inui
 Double Perturbation: On the Robustness of Robustness and Counterfactual Bias Evaluation
Chong Zhang, Jieyu Zhao, Huan Zhang, Kai-Wei Chang and Cho-Jui Hsieh
 Explaining Neural Network Predictions on Sentence Pairs via Learning Word-Group Masks
Hanjie Chen, Song Feng, Jatin Ganhotra, Hui Wan, Chulaka Gunasekara, Sachindra Joshi and Yangfeng Ji
 19:40–21:00 11C: Machine Translation
  Session chair: Orhan Firat (Google Research)
 Almost Free Semantic Draft for Neural Machine Translation
Xi Ai and Bin Fang
 Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation
Shuhao Gu, Yang Feng and Wanying Xie
 Multi-Hop Transformer for Document-Level Machine Translation
Long Zhang, Tong Zhang, Haibo Zhang, Baosong Yang, Wei Ye and Shikun Zhang
 Continual Learning for Neural Machine Translation
Yue Cao, Hao-Ran Wei, Boxing Chen and Xiaojun Wan
 Self-Training for Unsupervised Neural Machine Translation in Unbalanced Training Data Scenarios
Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita and Tiejun Zhao
 Smart-Start Decoding for Neural Machine Translation
Jian Yang, Shuming Ma, Dongdong Zhang, Juncheng Wan, Zhoujun Li and Ming Zhou
 Multi-Task Learning with Shared Encoder for Non-Autoregressive Machine Translation
Yongchang Hao, Shilin He, Wenxiang Jiao, Zhaopeng Tu, Michael Lyu and Xing Wang
 19:40–21:00 11D: NLP Applications
  Session chair: Minjoon Seo (KAIST)
 ER-AE: Differentially Private Text Generation for Authorship Anonymization
Haohan Bo, Steven H. H. Ding, Benjamin C. M. Fung and Farkhund Iqbal
 Distantly Supervised Transformers For E-Commerce Product QA
Happy Mittal, Aniket Chakrabarti, Belhassen Bayar, Animesh Anant Sharma and Nikhil Rasiwasia
 Quantitative Day Trading from Natural Language using Reinforcement Learning
Ramit Sawhney, Arnav Wadhwa, Shivam Agarwal and Rajiv Ratn Shah
 Restoring and Mining the Records of the Joseon Dynasty via Neural Language Modeling and Machine Translation
Kyeongpil Kang, Kyohoon Jin, Soyoung Yang, Soojin Jang, Jaegul Choo and Youngbin Kim
 Modeling Diagnostic Label Correlation for Automatic ICD Coding
Shang-Chi Tsai, Chao-Wei Huang and Yun-Nung Chen
 Self-Supervised Contrastive Learning for Efficient User Satisfaction Prediction in Conversational Agents
Mohammad Kachuee, Hao Yuan, Young-Bum Kim and Sungjin Lee
 19:40–21:00 11E: Special Theme: New Challenges in NLP
  Session chair: Xun Wang (Microsoft)
 A recipe for annotating grounded clarifications
Luciana Benotti and Patrick Blackburn
 Grey-box Adversarial Attack And Defence For Sentiment Classification
Ying Xu, Xu Zhong, Antonio Jimeno Yepes and Jey Han Lau
 How low is too low? A monolingual take on lemmatisation in Indian languages
Kumar Saunack, Kumar Saurav and Pushpak Bhattacharyya
 Causal Effects of Linguistic Properties
Reid Pryzant, Dallas Card, Dan Jurafsky, Victor Veitch and Dhanya Sridhar
 Dynabench: Rethinking Benchmarking in NLP
Douwe Kiela, Max Bartolo, Yixin Nie, Divyansh Kaushik, Atticus Geiger, Zhengxuan Wu, Bertie Vidgen, Grusha Prasad, Amanpreet Singh, Pratik Ringshia, Zhiyi Ma, Tristan Thrush, Sebastian Riedel, Zeerak Waseem, Pontus Stenetorp, Robin Jia, Mohit Bansal, Christopher Potts and Adina Williams
 Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research
Denis Newman-Griffis, Jill Fain Lehman, Carolyn Rosé and Harry Hochheiser

Wed 09 Jun 2021 (all times PDT, UTC-7)

 09:00–10:20 Session 12 (click to expand/collapse)
 09:00–10:20 12A: Discourse and Pragmatics
  Session chair: Jessy Li (UT Austin)
 Predicting Discourse Trees from Transformer-based Neural Summarizers
Wen Xiao, Patrick Huber and Giuseppe Carenini
 Probing for Bridging Inference in Transformer Language Models
Onkar Pandit and Yufang Hou
 Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models
Anne Beyer, Sharid Loáiciga and David Schlangen
 Stay Together: A System for Single and Split-antecedent Anaphora Resolution
Juntao Yu, Nafise Sadat Moosavi, Silviu Paun and Massimo Poesio
[TACL] Decontextualization: Making Sentences Stand-Alone
Eunsol Choi, Jennimaria Palomaki, Matthew Lamm, Tom Kwiatkowski, Dipanjan Das, Michael Collins
[CL] Universal Discourse Representation Structure Parsing
Jiangming Liu, Shay B. Cohen, Mirella Lapata, Johan Bos
 09:00–10:20 12B: Information Retrieval and Text Mining
  Session chair: Thuy Vu (Amazon, Alexa AI)
 Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness
Florian Boudin and Ygor Gallina
 CoRT: Complementary Rankings from Transformers
Marco Wrzalik and Dirk Krechel
 Multi-source Neural Topic Modeling in Multi-view Embedding Spaces
Pankaj Gupta, Yatin Chaudhary and Hinrich Schütze
 Inductive Topic Variational Graph Auto-Encoder for Text Classification
Qianqian Xie, Jimin Huang, Pan Du, Min Peng and Jian-Yun Nie
 Self-Alignment Pretraining for Biomedical Entity Representations
Fangyu Liu, Ehsan Shareghi, Zaiqiao Meng, Marco Basaldella and Nigel Collier
 TaxoClass: Hierarchical Multi-Label Text Classification Using Only Class Names
Jiaming Shen, Wenda Qiu, Yu Meng, Jingbo Shang, Xiang Ren and Jiawei Han
 09:00–10:20 12C: Language Generation
  Session chair: Antoine Bosselut (Stanford University)
 MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding
Tuhin Chakrabarty, Xurui Zhang, Smaranda Muresan and Nanyun Peng
 On Learning Text Style Transfer with Direct Rewards
Yixin Liu, Graham Neubig and John Wieting
 Focused Attention Improves Document-Grounded Generation
Shrimai Prabhumoye, Kazuma Hashimoto, Yingbo Zhou, Alan W Black and Ruslan Salakhutdinov
 NeuroLogic Decoding: (Un)supervised Neural Text Generation with Predicate Logic Constraints
Ximing Lu, Peter West, Rowan Zellers, Ronan Le Bras, Chandra Bhagavatula and Yejin Choi
 Ask what’s missing and what’s useful: Improving Clarification Question Generation using Global Knowledge
Bodhisattwa Prasad Majumder, Sudha Rao, Michel Galley and Julian McAuley
 Progressive Generation of Long Text with Pretrained Language Models
Bowen Tan, Zichao Yang, Maruan Al-Shedivat, Eric Xing and Zhiting Hu
 09:00–10:20 12D: Language Resources and Evaluation
  Session chair: Seokhwan Kim (Amazon, Alexa AI)
 SOCCER: An Information-Sparse Discourse State Tracking Collection in the Sports Commentary Domain
Ruochen Zhang and Carsten Eickhoff
 Plot-guided Adversarial Example Construction for Evaluating Open-domain Story Generation
Sarik Ghazarian, Zixi Liu, Akash S M, Ralph Weischedel, Aram Galstyan and Nanyun Peng
 MultiOpEd: A Corpus of Multi-Perspective News Editorials
Siyi Liu, Sihao Chen, Xander Uyttendaele and Dan Roth
 Swords: A Benchmark for Lexical Substitution with Improved Data Coverage and Quality
Mina Lee, Chris Donahue, Robin Jia, Alexander Iyabor and Percy Liang
 "I’m Not Mad": Commonsense Implications of Negation and Contradiction
Liwei Jiang, Antoine Bosselut, Chandra Bhagavatula and Yejin Choi
 Identifying Medical Self-Disclosure in Online Communities
Mina Valizadeh, Pardis Ranjbar-Noiey, Cornelia Caragea and Natalie Parde
 09:00–10:20 12E: Linguistic Theories, Cognitive Modeling and Psycholinguistics
  Session chair: Costanza Navarretta (University of Copenhagen)
 Language in a (Search) Box: Grounding Language Learning in Real-World Human-Machine Interaction
Federico Bianchi, Ciro Greco and Jacopo Tagliabue
 Finding Concept-specific Biases in Form–Meaning Associations
Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell and Damián Blasi
 How (Non-)Optimal is the Lexicon?
Tiago Pimentel, Irene Nikkarinen, Kyle Mahowald, Ryan Cotterell and Damián Blasi
 Word Complexity is in the Eye of the Beholder
Sian Gooding, Ekaterina Kochmar, Seid Muhie Yimam and Chris Biemann
 Linguistic Complexity Loss in Text-Based Therapy
Jason Wei, Kelly Finn, Emma Templeton, Thalia Wheatley and Soroush Vosoughi
 Ab Antiquo: Neural Proto-language Reconstruction
Carlo Meloni, Shauli Ravfogel and Yoav Goldberg
 On Biasing Transformer Attention Towards Monotonicity
Annette Rios, Chantal Amrhein, Noëmi Aepli and Rico Sennrich
 10:20–11:40 Session 13 (click to expand/collapse)
 10:20–11:40 13A: NLP Applications
  Session chair: Tristan Naumann (Microsoft Research)
 Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
Tom Hope, Aida Amini, David Wadden, Madeleine van Zuylen, Sravanthi Parasa, Eric Horvitz, Daniel Weld, Roy Schwartz and Hannaneh Hajishirzi
 Constrained Multi-Task Learning for Event Coreference Resolution
Jing Lu and Vincent Ng
 Empirical Evaluation of Pre-trained Transformers for Human-Level NLP: The Role of Sample Size and Dimensionality
Adithya V Ganesan, Matthew Matero, Aravind Reddy Ravula, Huy Vu and H. Andrew Schwartz
 Leveraging Deep Representations of Radiology Reports in Survival Analysis for Predicting Heart Failure Patient Mortality
Hyun Gi Lee, Evan Sholle, Ashley Beecy, Subhi Al’Aref and Yifan Peng
 On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles
Rakesh Gosangi, Ravneet Arora, Mohsen Gheisarieha, Debanjan Mahata and Haimin Zhang
 Data and Model Distillation as a Solution for Domain-transferable Fact Verification
Mitch Paul Mithun, Sandeep Suntwal and Mihai Surdeanu
 Adapting Coreference Resolution for Processing Violent Death Narratives
Ankith Uppunda, Susan Cochran, Jacob Foster, Alina Arseniev-Koehler, Vickie Mays and Kai-Wei Chang
 10:20–11:40 13B: Question Answering
  Session chair: Marek Rei (Imperial College London)
 Time-Stamped Language Model: Teaching Language Models to Understand The Flow of Events
Hossein Rajaby Faghihi and Parisa Kordjamshidi
 If You Want to Go Far Go Together: Unsupervised Joint Candidate Evidence Retrieval for Multi-hop Question Answering
Vikas Yadav, Steven Bethard and Mihai Surdeanu
 SPARTQA: A Textual Question Answering Benchmark for Spatial Reasoning
Roshanak Mirzaee, Hossein Rajaby Faghihi, Qiang Ning and Parisa Kordjamshidi
 A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers
Pradeep Dasigi, Kyle Lo, Iz Beltagy, Arman Cohan, Noah A. Smith and Matt Gardner
 Differentiable Open-Ended Commonsense Reasoning
Bill Yuchen Lin, Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Xiang Ren and William Cohen
 Does Structure Matter? Encoding Documents for Machine Reading Comprehension
Hui Wan, Song Feng, Chulaka Gunasekara, Siva Sankalp Patel, Sachindra Joshi and Luis Lastras
 Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval
Chen Zhao, Chenyan Xiong, Jordan Boyd-Graber and Hal Daumé III
 10:20–11:40 13C: Lexical Semantics
  Session chair: Marzena Karpinska (University of Massachusetts Amherst)
 Scalable and Interpretable Semantic Change Detection
Syrielle Montariol, Matej Martinc and Lidia Pivovarova
 Scalar Adjective Identification and Multilingual Ranking
Aina Garí Soler and Marianna Apidianaki
 ESC: Redesigning WSD with Extractive Sense Comprehension
Edoardo Barba, Tommaso Pasini and Roberto Navigli
 Recent advances in neural metaphor processing: A linguistic, cognitive and social perspective
Xiaoyu Tong, Ekaterina Shutova and Martha Lewis
 Constructing Taxonomies from Pretrained Language Models
Catherine Chen, Kevin Lin and Dan Klein
 Event Representation with Sequential, Semi-Supervised Discrete Variables
Mehdi Rezaee and Francis Ferraro
 10:20–11:40 13D: Sentiment Analysis and Stylistic Analysis
  Session chair: Pushkar Mishra (Facebook AI)
 Seq2Emo: A Sequence to Multi-Label Emotion Classification Model
Chenyang Huang, Amine Trabelsi, Xuebin Qin, Nawshad Farruque, Lili Mou and Osmar Zaïane
 Knowledge Enhanced Masked Language Model for Stance Detection
Kornraphop Kawintiranon and Lisa Singh
 Learning Paralinguistic Features from Audiobooks through Style Voice Conversion
Zakaria Aldeneh, Matthew Perez and Emily Mower Provost
 Adapting BERT for Continual Learning of a Sequence of Aspect Sentiment Classification Tasks
Zixuan Ke, Hu Xu and Bing Liu
 Adversarial Learning for Zero-Shot Stance Detection on Social Media
Emily Allaway, Malavika Srikanth and Kathleen McKeown
 10:20–11:40 13E: Summarization
  Session chair: Iz Beltagy (Allen Institute for AI)
 Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters
Ramakanth Pasunuru, Mengwen Liu, Mohit Bansal, Sujith Ravi and Markus Dreyer
 Enriching Transformers with Structured Tensor-Product Representations for Abstractive Summarization
Yichen Jiang, Asli Celikyilmaz, Paul Smolensky, Paul Soulos, Sudha Rao, Hamid Palangi, Roland Fernandez, Caitlin Smith, Mohit Bansal and Jianfeng Gao
 What’s in a Summary? Laying the Groundwork for Advances in Hospital-Course Summarization
Griffin Adams, Emily Alsentzer, Mert Ketenci, Jason Zucker and Noémie Elhadad
 Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics
Artidoro Pagnoni, Vidhisha Balachandran and Yulia Tsvetkov
 GSum: A General Framework for Guided Neural Abstractive Summarization
Zi-Yi Dou, Pengfei Liu, Hiroaki Hayashi, Zhengbao Jiang and Graham Neubig
[TACL] WikiAsp: A Dataset for Multi-domain Aspect-based Summarization
Hiroaki Hayashi, Prashant Budania, Peng Wang, Chris Ackerson, Raj Neervannan, Graham Neubig
 11:40–13:10 Best Paper Presentations
  Session chair: Anna Rumshisky (University of Massachusetts Lowell)
 Video-aided Unsupervised Grammar Induction
Songyang Zhang, Linfeng Song, Lifeng Jin, Kun Xu, Dong Yu and Jiebo Luo
 It’s Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo Schick and Hinrich Schütze
 Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources
Simone Conia, Andrea Bacciu and Roberto Navigli
 Learning How to Ask: Querying LMs with Mixtures of Soft Prompts
Guanghui Qin and Jason Eisner
 How many data points is a prompt worth?
Teven Le Scao and Alexander Rush
 Preregistering NLP research
Emiel van Miltenburg, Chris van der Lee and Emiel Krahmer
 17:00–18:20 Session 14 (click to expand/collapse)
 17:00–18:20 14A: Computational Social Science and Cultural Analytics
  Session chair: Diyi Yang (Georgia Tech)
 Multitask Learning for Emotionally Analyzing Sexual Abuse Disclosures
Ramit Sawhney, Puneet Mathur, Taru Jain, Akash Kumar Gautam and Rajiv Ratn Shah
 Self Promotion in US Congressional Tweets
Jun Wang, Kelly Cui and Bei Yu
 Profiling of Intertextuality in Latin Literature Using Word Embeddings
Patrick J. Burns, James Brofos, Kyle Li, Pramit Chaudhuri and Joseph P. Dexter
 Identifying inherent disagreement in natural language inference
Xinliang Frederick Zhang and Marie-Catherine de Marneffe
 Modeling Human Mental States with an Entity-based Narrative Graph
I-Ta Lee, Maria Leonor Pacheco and Dan Goldwasser
 17:00–18:20 14B: Generation and Summarization
  Session chair: Lili Mou (UAlberta; Amii)
 A Simple and Efficient Multi-Task Learning Approach for Conditioned Dialogue Generation
Yan Zeng and Jian-Yun Nie
 Hurdles to Progress in Long-form Question Answering
Kalpesh Krishna, Aurko Roy and Mohit Iyyer
 ENTRUST: Argument Reframing with Language Models and Entailment
Tuhin Chakrabarty, Christopher Hidey and Smaranda Muresan
 Paragraph-level Simplification of Medical Texts
Ashwin Devaraj, Iain Marshall, Byron Wallace and Junyi Jessy Li
 An Empirical Study on Neural Keyphrase Generation
Rui Meng, Xingdi Yuan, Tong Wang, Sanqiang Zhao, Adam Trischler and Daqing He
 Attention Head Masking for Inference Time Content Selection in Abstractive Summarization
Shuyang Cao and Lu Wang
 17:00–18:20 14C: Interpretability and Analysis of Models for NLP
  Session chair: Allyson Ettinger (University of Chicago)
 Factual Probing Is [MASK]: Learning vs. Learning to Recall
Zexuan Zhong, Dan Friedman and Danqi Chen
 Evaluating Saliency Methods for Neural Language Models
Shuoyang Ding and Philipp Koehn
 Contextualized Perturbation for Textual Adversarial Attack
Dianqi Li, Yizhe Zhang, Hao Peng, Liqun Chen, Chris Brockett, Ming-Ting Sun and Bill Dolan
 DirectProbe: Studying Representations without Classifiers
Yichu Zhou and Vivek Srikumar
 Evaluating the Values of Sources in Transfer Learning
Md Rizwan Parvez and Kai-Wei Chang
 Too Much in Common: Shifting of Embeddings in Transformer Language Models and its Implications
Daniel Biś, Maksim Podkorytov and Xiuwen Liu
 17:00–18:20 14D: Machine Learning for NLP: Language Modeling and Sequence to Sequence Models
  Session chair: Taylor Berg-Kirkpatrick (UC San Diego)
 On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies
Tianyi Zhang and Tatsunori Hashimoto
 Limitations of Autoregressive Models and Their Alternatives
Chu-Cheng Lin, Aaron Jaech, Xin Li, Matthew R. Gormley and Jason Eisner
 On the Transformer Growth for Progressive BERT Training
Xiaotao Gu, Liyuan Liu, Hongkun Yu, Jing Li, Chen Chen and Jiawei Han
 Revisiting Simple Neural Probabilistic Language Models
Simeng Sun and Mohit Iyyer
 ReadTwice: Reading Very Large Documents with Memories
Yury Zemlyanskiy, Joshua Ainslie, Michiel de Jong, Philip Pham, Ilya Eckstein and Fei Sha
 SCRIPT: Self-Critic PreTraining of Transformers
Erik Nijkamp, Bo Pang, Ying Nian Wu and Caiming Xiong
 17:00–18:20 14E: NLP Applications
  Session chair: Kevin Small (Amazon)
 Nutri-bullets Hybrid: Consensual Multi-document Summarization
Darsh Shah, Lili Yu, Tao Lei and Regina Barzilay
 AVA: an Automatic eValuation Approach for Question Answering Systems
Thuy Vu and Alessandro Moschitti
 SpanPredict: Extraction of Predictive Document Spans with Neural Attention
Vivek Subramanian, Matthew Engelhard, Sam Berchuck, Liqun Chen, Ricardo Henao and Lawrence Carin
 Text Editing by Command
Felix Faltings, Michel Galley, Gerold Hintz, Chris Brockett, Chris Quirk, Jianfeng Gao and Bill Dolan
 A Deep Metric Learning Approach to Account Linking
Aleem Khan, Elizabeth Fleming, Noah Schofield, Marcus Bishop and Nicholas Andrews
 Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation
Yasuhide Miura, Yuhao Zhang, Emily Tsai, Curtis Langlotz and Dan Jurafsky
 18:20–19:40 Session 15 (click to expand/collapse)
 18:20–19:40 15A: Language Grounding to Vision, Robotics and Beyond
  Session chair: Aishwarya Padmakumar (Amazon)
 Multimodal End-to-End Sparse Model for Emotion Recognition
Wenliang Dai, Samuel Cahyawijaya, Zihan Liu and Pascale Fung
 MIMOQA: Multimodal Input Multimodal Output Question Answering
Hrituraj Singh, Anshul Nasery, Denil Mehta, Aishwarya Agarwal, Jatin Lamba and Balaji Vasan Srinivasan
 OCID-Ref: A 3D Robotic Dataset With Embodied Language For Clutter Scene Grounding
Ke-Jyun Wang, Yun-Hsuan Liu, Hung-Ting Su, Jen-Wei Wang, Yu-Siang Wang, Winston Hsu and Wen-Chin Chen
 Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions
Liunian Harold Li, Haoxuan You, Zhecan Wang, Alireza Zareian, Shih-Fu Chang and Kai-Wei Chang
 Multitasking Inhibits Semantic Drift
Athul Paul Jacob, Mike Lewis and Jacob Andreas
 Probing Contextual Language Models for Common Ground with Visual Representations
Gabriel Ilharco, Rowan Zellers, Ali Farhadi and Hannaneh Hajishirzi
 18:20–19:40 15B: Machine Learning for NLP: Classification and Structured Prediction Models
  Session chair: Arman Cohan (Allen Institute for AI)
 BBAEG: Towards BERT-based Biomedical Adversarial Example Generation for Text Classification
Ishani Mondal
 Targeted Adversarial Training for Natural Language Understanding
Lis Pereira, Xiaodong Liu, Hao Cheng, Hoifung Poon, Jianfeng Gao and Ichiro Kobayashi
 Latent-Optimized Adversarial Neural Transfer for Sarcasm Detection
Xu Guo, Boyang Li, Han Yu and Chunyan Miao
 Self-training Improves Pre-training for Natural Language Understanding
Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Veselin Stoyanov and Alexis Conneau
 Supporting Clustering with Contrastive Learning
Dejiao Zhang, Feng Nan, Xiaokai Wei, Shang-Wen Li, Henghui Zhu, Kathleen McKeown, Ramesh Nallapati, Andrew O. Arnold and Bing Xiang
[TACL] Self-supervised Regularization for Text Classification
Meng Zhou, Zechen Li, Pengtao Xie
 18:20–19:40 15C: NLP Applications
  Session chair: Wei Xu (Georgia Tech)
 TITA: A Two-stage Interaction and Topic-Aware Text Matching Model
Xingwu Sun, Yanling Cui, Hongyin Tang, Qiuyu Zhu, Fuzheng Zhang and Beihong Jin
 Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
Zhenghao Liu, Xiaoyuan Yi, Maosong Sun, Liner Yang and Tat-Seng Chua
 Neural Network Surgery: Injecting Data Patterns into Pre-trained Models with Minimal Instance-wise Side Effects
Zhiyuan Zhang, Xuancheng Ren, Qi Su, Xu Sun and Bin He
 Discrete Argument Representation Learning for Interactive Argument Pair Identification
Lu Ji, Zhongyu Wei, Jing Li, Qi Zhang and Xuanjing Huang
 On Unifying Misinformation Detection
Nayeon Lee, Belinda Z. Li, Sinong Wang, Pascale Fung, Hao Ma, Wen-tau Yih and Madian Khabsa
 Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model
Honai Ueoka, Yugo Murawaki and Sadao Kurohashi
 Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning
Jason Wei, Chengyu Huang, Soroush Vosoughi, Yu Cheng and Shiqi Xu
 18:20–19:40 15D: Phonology, Morphology and Word Segmentation
  Session chair: Ekaterina Vylomova (The University of Melbourne)
 Do RNN States Encode Abstract Phonological Alternations?
Miikka Silfverberg, Francis Tyers, Garrett Nicolai and Mans Hulden
 Pre-training with Meta Learning for Chinese Word Segmentation
Zhen Ke, Liang Shi, Songtao Sun, Erli Meng, Bin Wang and Xipeng Qiu
 Decompose, Fuse and Generate: A Formation-Informed Method for Chinese Definition Generation
Hua Zheng, Damai Dai, Lei Li, Tianyu Liu, Zhifang Sui, Baobao Chang and Yang Liu
 User-Generated Text Corpus for Evaluating Japanese Morphological Analysis and Lexical Normalization
Shohei Higashiyama, Masao Utiyama, Taro Watanabe and Eiichiro Sumita
 GPT Perdetry Test: Generating new meanings for new words
Nikolay Malkin, Sameera Lanka, Pranav Goel, Sudha Rao and Nebojsa Jojic
 18:20–19:40 15E: Sentence-level Semantics and Textual Inference
  Session chair: John Wieting (Google Research)
 Universal Semantic Tagging for English and Mandarin Chinese
Wenxi Li, Yiyang Hou, Yajie Ye, Li Liang and Weiwei Sun
 ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser
Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu and Kai Yu
 Contextualized and Generalized Sentence Representations by Contrastive Self-Supervised Learning: A Case Study on Discourse Relation Analysis
Hirokazu Kiyomaru and Sadao Kurohashi
 AMR Parsing with Action-Pointer Transformer
Jiawei Zhou, Tahira Naseem, Ramón Fernandez Astudillo and Radu Florian
 NL-EDIT: Correcting Semantic Parse Errors through Natural Language Interaction
Ahmed Elgohary, Christopher Meek, Matthew Richardson, Adam Fourney, Gonzalo Ramos and Ahmed Hassan Awadallah
 Unsupervised Concept Representation Learning for Length-Varying Text Similarity
Xuchao Zhang, Bo Zong, Wei Cheng, Jingchao Ni, Yanchi Liu and Haifeng Chen
 19:40–21:00 Session 16 (click to expand/collapse)
 19:40–21:00 16A: Dialogue and Interactive Systems
  Session chair: Alexandros Papangelis (Amazon)
 Augmenting Knowledge-grounded Conversations with Sequential Knowledge Transition
Haolan Zhan, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Yongjun Bao and Yanyan Lan
 Adversarial Self-Supervised Learning for Out-of-Domain Detection
Zhiyuan Zeng, Keqing He, Yuanmeng Yan, Hong Xu and Weiran Xu
 Leveraging Slot Descriptions for Zero-Shot Cross-Domain Dialogue StateTracking
Zhaojiang Lin, Bing Liu, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Andrea Madotto, Eunjoon Cho and Rajen Subba
 Hierarchical Transformer for Task Oriented Dialog Systems
Bishal Santra, Potnuru Anusha and Pawan Goyal
 Measuring the ‘I don’t know’ Problem through the Lens of Gricean Quantity
Huda Khayrallah and João Sedoc
[TACL] Dialogue State Tracking with Incremental Reasoning
Lizi Liao, Le Hong Long, Yunshan Ma, Wenqiang Lei, Tat-Seng Chua
 19:40–21:00 16B: Information Extraction
  Session chair: Siliang Tang (Zhejiang University)
 RTFE: A Recursive Temporal Fact Embedding Framework for Temporal Knowledge Graph Completion
Youri Xu, Haihong E, Meina Song, wenyu song, Xiaodong Lv, wang haotian and yang jinrui
 Open Hierarchical Relation Extraction
Kai Zhang, Yuan Yao, Ruobing Xie, Xu Han, Zhiyuan Liu, Fen Lin, Leyu Lin and Maosong Sun
 Jointly Extracting Explicit and Implicit Relational Triples with Reasoning Pattern Enhanced Binary Pointer Network
Yubo Chen, Yunqi Zhang, Changran Hu and Yongfeng Huang
 Multi-Grained Knowledge Distillation for Named Entity Recognition
Xuan Zhou, Xiao Zhang, Chenyang Tao, Junya Chen, Bing Xu, Wei Wang and Jing Xiao
 SGG: Learning to Select, Guide, and Generate for Keyphrase Generation
Jing Zhao, Junwei Bao, Yifan Wang, Youzheng Wu, Xiaodong He and Bowen Zhou
 Towards Sentiment and Emotion aided Multi-modal Speech Act Classification in Twitter
Tulika Saha, Apoorva Upadhyaya, Sriparna Saha and Pushpak Bhattacharyya
 19:40–21:00 16C: Machine Translation
  Session chair: Rui Wang (Shanghai Jiao Tong University)
 Generative Imagination Elevates Machine Translation
Quanyu Long, Mingxuan Wang and Lei Li
 Non-Autoregressive Translation by Learning Target Categorical Codes
Yu Bao, Shujian Huang, Tong Xiao, Dongqi Wang, Xinyu Dai and Jiajun CHEN
 Training Data Augmentation for Code-Mixed Translation
Abhirut Gupta, Aditya Vavre and Sunita Sarawagi
 Rethinking Perturbations in Encoder-Decoders for Fast Training
Sho Takase and Shun Kiyono
 Context-aware Decoder for Neural Machine Translation using a Target-side Document-Level Language Model
Amane Sugiyama and Naoki Yoshinaga
 Machine Translated Text Detection Through Text Similarity with Round-Trip Translation
Hoang-Quoc Nguyen-Son, Tran Thao, Seira Hidano, Ishita Gupta and Shinsaku Kiyomoto
 19:40–21:00 16D: Question Answering
  Session chair: Hung-yi Lee (National Taiwan University)
 TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference
Deming Ye, Yankai Lin, Yufei Huang and Maosong Sun
 Breadth First Reasoning Graph for Multi-hop Question Answering
Yongjie Huang and Meng Yang
 Improving Zero-Shot Cross-lingual Transfer for Multilingual Question Answering over Knowledge Graph
Yucheng Zhou, Xiubo Geng, Tao Shen, Wenqiang Zhang and Daxin Jiang
 RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
Yingqi Qu, Yuchen Ding, Jing Liu, Kai Liu, Ruiyang Ren, Wayne Xin Zhao, Daxiang Dong, Hua Wu and Haifeng Wang
 DAGN: Discourse-Aware Graph Network for Logical Reasoning
Yinya Huang, Meng Fang, Yu Cao, Liwei Wang and Xiaodan Liang
 Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering
Sohee Yang and Minjoon Seo
 Unsupervised Multi-hop Question Answering by Question Generation
Liangming Pan, Wenhu Chen, Wenhan Xiong, Min-Yen Kan and William Yang Wang
 19:40–21:00 16E: Summarization
  Session chair: Yang Liu (Microsoft)
 Sliding Selector Network with Dynamic Memory for Extractive Summarization of Long Documents
Peng Cui and Le Hu
 AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization
Tiezheng Yu, Zihan Liu and Pascale Fung
 QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization
Ming Zhong, Da Yin, Tao Yu, Ahmad Zaidi, Mutethia Mutuma, Rahul Jha, Ahmed Hassan Awadallah, Asli Celikyilmaz, Yang Liu, Xipeng Qiu and Dragomir Radev
 MM-AVS: A Full-Scale Dataset for Multi-modal Summarization
Xiyan Fu, Jun Wang and Zhenglu Yang
 MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization
Chenguang Zhu, Yang Liu, Jie Mei and Michael Zeng
 Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection
Sihao Chen, Fan Zhang, Kazoo Sone and Dan Roth
 Inference Time Style Control for Summarization
Shuyang Cao and Lu Wang