Anisotropic span embeddings and the negative impact of higher-order inference for coreference resolution: An empirical analysis

Feng Hou; Ruili Wang; See Kiong Ng; Fangyi Zhu; Michael Witbrock; Steven F. Cahan; Lily Chen; Xiaoyun Jia

doi:10.1017/S1351324924000019

Research output: Journal Publication › Article › peer-review

Abstract

Coreference resolution is the task of identifying and clustering mentions that refer to the same entity in a document. Based on state-of-the-art deep learning approaches, end-to-end coreference resolution considers all spans as candidate mentions and tackles mention detection and coreference resolution simultaneously. Recently, researchers have attempted to incorporate document-level context using higher-order inference (HOI) to improve end-to-end coreference resolution. However, HOI methods have been shown to have a marginal or even negative impact on coreference resolution. In this paper, we reveal the reasons for the negative impact of HOI on coreference resolution. Contextualized representations (e.g., those produced by BERT) for building span embeddings have been shown to be highly anisotropic. We show that HOI actually increases and thus worsens the anisotropy of span embeddings and makes it difficult to distinguish between related but distinct entities (e.g., pilots and flight attendants). Instead of using HOI, we propose two methods, Less-Anisotropic Internal Representations (LAIR) and Data Augmentation with Document Synthesis and Mention Swap (DSMS), to learn less-anisotropic span embeddings for coreference resolution. LAIR uses a linear aggregation of the first layer and the topmost layer of contextualized embeddings. DSMS generates more diversified examples of related but distinct entities by synthesizing documents and by mention swapping. Our experiments show that less-anisotropic span embeddings improve the performance significantly (+2.8 F1 gain on the OntoNotes benchmark), reaching new state-of-the-art performance on the GAP dataset.
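
The idea behind LAIR, as the abstract describes it, can be illustrated with a small toy sketch. This is not the authors' implementation. The mixing weight `alpha`, the use of mean pairwise cosine similarity as a proxy for anisotropy, and the synthetic "first layer" and "top layer" vectors are all assumptions made for illustration. The abstract does not state how the aggregation weights are chosen or how anisotropy is measured.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_pairwise_cosine(vectors):
    """Average cosine similarity over all distinct pairs.

    Used here as an assumed proxy for anisotropy: values near 1 mean
    the vectors crowd into a narrow cone of the embedding space.
    """
    pairs = [(i, j) for i in range(len(vectors)) for j in range(i + 1, len(vectors))]
    return sum(cosine(vectors[i], vectors[j]) for i, j in pairs) / len(pairs)


def aggregate_layers(first_layer, top_layer, alpha=0.5):
    """Linear mix alpha * first + (1 - alpha) * top, per vector.

    `alpha` is a hypothetical hyperparameter, not a value from the paper.
    """
    return [
        [alpha * f + (1 - alpha) * t for f, t in zip(fv, tv)]
        for fv, tv in zip(first_layer, top_layer)
    ]


# Synthetic stand-ins for real encoder outputs (assumption): the "top layer"
# vectors share a large common offset, the "first layer" vectors do not.
rng = random.Random(0)
dim, n = 32, 20
first = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n)]
top = [[3.0 + rng.gauss(0, 1) for _ in range(dim)] for _ in range(n)]
mixed = aggregate_layers(first, top, alpha=0.5)

print("top layer only:", round(mean_pairwise_cosine(top), 3))
print("first + top mix:", round(mean_pairwise_cosine(mixed), 3))
```

In this toy setting the mixed vectors have a lower mean pairwise cosine similarity than the top-layer vectors alone, because the shared offset is diluted. Whether the same holds for real contextualized embeddings is the empirical question the paper addresses.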

Original language	English
Journal	Natural Language Engineering
DOIs	https://doi.org/10.1017/S1351324924000019
Publication status	Accepted/In press - 2024
Externally published	Yes

Keywords

anisotropic span embeddings
contextualized representations
Coreference resolution
higher-order inference

ASJC Scopus subject areas

Software
Language and Linguistics
Linguistics and Language
Artificial Intelligence


Cite this

@article{5bcefa235ff84a4383446cc54d3130cd,
  title = "Anisotropic span embeddings and the negative impact of higher-order inference for coreference resolution: An empirical analysis",
  abstract = "Coreference resolution is the task of identifying and clustering mentions that refer to the same entity in a document. Based on state-of-the-art deep learning approaches, end-to-end coreference resolution considers all spans as candidate mentions and tackles mention detection and coreference resolution simultaneously. Recently, researchers have attempted to incorporate document-level context using higher-order inference (HOI) to improve end-to-end coreference resolution. However, HOI methods have been shown to have marginal or even negative impact on coreference resolution. In this paper, we reveal the reasons for the negative impact of HOI coreference resolution. Contextualized representations (e.g., those produced by BERT) for building span embeddings have been shown to be highly anisotropic. We show that HOI actually increases and thus worsens the anisotropy of span embeddings and makes it difficult to distinguish between related but distinct entities (e.g., pilots and flight attendants). Instead of using HOI, we propose two methods, Less-Anisotropic Internal Representations (LAIR) and Data Augmentation with Document Synthesis and Mention Swap (DSMS), to learn less-anisotropic span embeddings for coreference resolution. LAIR uses a linear aggregation of the first layer and the topmost layer of contextualized embeddings. DSMS generates more diversified examples of related but distinct entities by synthesizing documents and by mention swapping. Our experiments show that less-anisotropic span embeddings improve the performance significantly (+2.8 F1 gain on the OntoNotes benchmark) reaching new state-of-the-art performance on the GAP dataset.",
  keywords = "anisotropic span embeddings, contextualized representations, Coreference resolution, higher-order inference",
  author = "Hou, Feng and Wang, Ruili and Ng, {See Kiong} and Zhu, Fangyi and Witbrock, Michael and Cahan, {Steven F.} and Chen, Lily and Jia, Xiaoyun",
  note = "Publisher Copyright: {\textcopyright} The Author(s), 2024. Published by Cambridge University Press.",
  year = "2024",
  doi = "10.1017/S1351324924000019",
  language = "English",
  journal = "Natural Language Engineering",
  issn = "1351-3249",
  publisher = "Cambridge University Press",
}

TY - JOUR
T1 - Anisotropic span embeddings and the negative impact of higher-order inference for coreference resolution
T2 - An empirical analysis
AU - Hou, Feng
AU - Wang, Ruili
AU - Ng, See Kiong
AU - Zhu, Fangyi
AU - Witbrock, Michael
AU - Cahan, Steven F.
AU - Chen, Lily
AU - Jia, Xiaoyun
PY - 2024
Y1 - 2024
N2 - Coreference resolution is the task of identifying and clustering mentions that refer to the same entity in a document. Based on state-of-the-art deep learning approaches, end-to-end coreference resolution considers all spans as candidate mentions and tackles mention detection and coreference resolution simultaneously. Recently, researchers have attempted to incorporate document-level context using higher-order inference (HOI) to improve end-to-end coreference resolution. However, HOI methods have been shown to have marginal or even negative impact on coreference resolution. In this paper, we reveal the reasons for the negative impact of HOI coreference resolution. Contextualized representations (e.g., those produced by BERT) for building span embeddings have been shown to be highly anisotropic. We show that HOI actually increases and thus worsens the anisotropy of span embeddings and makes it difficult to distinguish between related but distinct entities (e.g., pilots and flight attendants). Instead of using HOI, we propose two methods, Less-Anisotropic Internal Representations (LAIR) and Data Augmentation with Document Synthesis and Mention Swap (DSMS), to learn less-anisotropic span embeddings for coreference resolution. LAIR uses a linear aggregation of the first layer and the topmost layer of contextualized embeddings. DSMS generates more diversified examples of related but distinct entities by synthesizing documents and by mention swapping. Our experiments show that less-anisotropic span embeddings improve the performance significantly (+2.8 F1 gain on the OntoNotes benchmark) reaching new state-of-the-art performance on the GAP dataset.

KW - anisotropic span embeddings
KW - contextualized representations
KW - Coreference resolution
KW - higher-order inference
UR - http://www.scopus.com/inward/record.url?scp=85183522061&partnerID=8YFLogxK
U2 - 10.1017/S1351324924000019
DO - 10.1017/S1351324924000019
M3 - Article
AN - SCOPUS:85183522061
SN - 1351-3249
JO - Natural Language Engineering
JF - Natural Language Engineering
ER -
