"PRBOOST: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning" by Rongzhi Zhang

Selected Works of Le Song

Article

PRBOOST: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning

arXiv

Rongzhi Zhang, Georgia Tech, United States
Yue Yu, Georgia Tech, United States
Pranav Shetty, Georgia Tech, United States
Le Song, Mohamed bin Zayed University of Artificial Intelligence
Chao Zhang, Georgia Tech, United States

Download

Document Type

Article

Abstract

Weakly-supervised learning (WSL) has shown promising results in addressing label scarcity on many NLP tasks, but manually designing a comprehensive, high-quality labeling rule set is tedious and difficult. We study interactive weakly-supervised learning-the problem of iteratively and automatically discovering novel labeling rules from data to improve the WSL model. Our proposed model, named PRBOOST, achieves this goal via iterative prompt-based rule discovery and model boosting. It uses boosting to identify large-error instances and then discovers candidate rules from them by prompting pre-trained LMs with rule templates. The candidate rules are judged by human experts, and the accepted rules are used to generate complementary weak labels and strengthen the current model. Experiments on four tasks show PRBOOST outperforms state-of-the-art WSL baselines up to 7.1%, and bridges the gaps with fully supervised models.Our Implementation is available at https://github.com/rz-zhang/PRBoost. © 2022, CC BY-NC-ND.

DOI

doi.org/10.48550/arXiv.2203.09735

Publication Date

3-18-2022

Keywords

Weakly-supervised learning,
natural language processing

Comments

Preprint: arXiv

Archived with thanks to arXiv

Preprint License: CC by NC-ND 4.0

Uploaded May 18, 2022

Citation Information

R. Zhang, Y. Yu, P. Shetty, L. Song, and C. Zhang, "PRBOOST: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning", 2022, arXiv, doi: 10.48550/arXiv.2203.09735