Article
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition
ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference
Document Type
Conference Proceeding
Abstract
We study the problem of learning a named entity recognition (NER) tagger using noisy labels from multiple weak supervision sources. Though cheap to obtain, the labels from weak supervision sources are often incomplete, inaccurate, and contradictory, making it difficult to learn an accurate NER model. To address this challenge, we propose a conditional hidden Markov model (CHMM), which can effectively infer true labels from multi-source noisy labels in an unsupervised way. CHMM enhances the classic hidden Markov model with the contextual representation power of pretrained language models. Specifically, CHMM learns token-wise transition and emission probabilities from the BERT embeddings of the input tokens to infer the latent true labels from noisy observations. We further refine CHMM with an alternate-training approach (CHMM-ALT). It fine-tunes a BERT-NER model with the labels inferred by CHMM, and this BERT-NER's output is regarded as an additional weak source to train the CHMM in return. Experiments on four NER benchmarks from various domains show that our method outperforms state-of-the-art weakly supervised NER models by wide margins.
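The abstract's core mechanism, predicting token-wise transition and emission probabilities from BERT embeddings and running a forward pass over the multi-source noisy labels, can be illustrated with the short PyTorch sketch below. It is not the authors' released implementation: the class name `ConditionalHMMSketch`, the heads `trans_head` and `emit_head`, and the dimensions (`emb_dim=768`, nine labels, three weak sources) are illustrative assumptions, and the unsupervised training loop and the CHMM-ALT alternation with a BERT-NER model are omitted.

```python
# Minimal sketch (assumptions, not the paper's code): token-wise HMM parameters
# predicted from BERT embeddings, scored against multi-source noisy labels.
import torch
import torch.nn as nn


class ConditionalHMMSketch(nn.Module):
    """Hypothetical sketch: token-wise transition/emission matrices from embeddings."""

    def __init__(self, emb_dim=768, num_labels=9, num_sources=3):
        super().__init__()
        self.num_labels = num_labels
        self.num_sources = num_sources
        # Token-wise transition logits: one K x K matrix per token
        self.trans_head = nn.Linear(emb_dim, num_labels * num_labels)
        # Token-wise emission logits: one K x K matrix per token and weak source
        self.emit_head = nn.Linear(emb_dim, num_sources * num_labels * num_labels)
        self.init_logits = nn.Parameter(torch.zeros(num_labels))

    def forward(self, embs, noisy_labels):
        """embs: [T, emb_dim] BERT token embeddings for one sentence.
        noisy_labels: [num_sources, T] LongTensor of label ids from the weak sources.
        Returns the log-likelihood of the noisy observations (forward algorithm)."""
        T, K, S = embs.size(0), self.num_labels, self.num_sources
        trans = self.trans_head(embs).view(T, K, K).log_softmax(-1)   # log p(z_t | z_{t-1}, x_t)
        emit = self.emit_head(embs).view(T, S, K, K).log_softmax(-1)  # log p(y_t^s | z_t, x_t)
        # Log-probability of each source's observed label under every latent label
        idx = noisy_labels.t().reshape(T, S, 1, 1).expand(-1, -1, K, 1)
        obs = emit.gather(-1, idx).squeeze(-1).sum(dim=1)             # [T, K]
        # Forward recursion over latent label sequences
        alpha = self.init_logits.log_softmax(-1) + obs[0]
        for t in range(1, T):
            alpha = torch.logsumexp(alpha.unsqueeze(-1) + trans[t], dim=0) + obs[t]
        return torch.logsumexp(alpha, dim=0)


# Toy usage with random "BERT" embeddings and noisy labels from three sources
model = ConditionalHMMSketch()
embs = torch.randn(12, 768)            # 12 tokens
noisy = torch.randint(0, 9, (3, 12))   # 3 weak sources
loss = -model(embs, noisy)             # maximize marginal likelihood of noisy labels
loss.backward()
```

Unlike a standard HMM with one fixed transition matrix and fixed emission matrices, the (hypothetical) heads above condition both sets of probabilities on each token's contextual embedding, which is the property the abstract attributes to CHMM.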
Publication Date
1-1-2021
Citation Information
Y. Li, P. Shetty, L. Liu, C. Zhang, and L. Song, "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition," in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pages 6178–6190, August 1–6, 2021.
IR deposit conditions: none described
ACL proceedings are hosted and available to the public at aclanthology.org