Apr 06, 2021

Non-Parallel Text Style Transfer with Self-Parallel Supervision

ICLR 2022

The performance of existing text style transfer models is severely limited by the non-parallel datasets on which the models are trained. In non-parallel datasets, no direct mapping exists between sentences of the source and target style; the style transfer models thus receive only weak supervision from the target sentences during training, which often leads the model to discard too much style-independent information or to fail to transfer the style altogether. In this work, we propose LaMer, a novel text style transfer framework based on large-scale language models. LaMer first mines the roughly parallel expressions in the non-parallel datasets with scene graphs, and then employs MLE training, followed by imitation learning refinement, to leverage the intrinsic parallelism within the data.
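To make the mining step more concrete, the following is a minimal sketch of pairing "roughly parallel" sentences across two style corpora. It is not the LaMer implementation: the abstract says the pairs are mined with scene graphs, whereas this sketch substitutes a much cruder proxy, Jaccard overlap of content words. The function names (`content_tokens`, `jaccard`, `mine_pairs`), the stop-word list, and the `threshold` value are all hypothetical choices made for illustration.

```python
import re
from typing import List, Set, Tuple

# Assumption: a tiny hand-picked stop-word list stands in for real
# content extraction (the paper uses scene graphs instead).
STOP_WORDS = {"the", "a", "an", "was", "were", "is", "are", "and"}


def content_tokens(sentence: str) -> Set[str]:
    """Return the lowercase content words of a sentence."""
    words = re.findall(r"[a-z']+", sentence.lower())
    return {w for w in words if w not in STOP_WORDS}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def mine_pairs(
    source: List[str], target: List[str], threshold: float = 0.15
) -> List[Tuple[str, str]]:
    """Pair each source sentence with its most similar target sentence.

    A pair is kept only if its similarity reaches `threshold`
    (a hypothetical cut-off, not a value from the paper).
    """
    target_tokens = [content_tokens(t) for t in target]
    pairs = []
    for s in source:
        s_tokens = content_tokens(s)
        scores = [jaccard(s_tokens, t) for t in target_tokens]
        if not scores:
            continue
        best = max(range(len(scores)), key=scores.__getitem__)
        if scores[best] >= threshold:
            pairs.append((s, target[best]))
    return pairs


if __name__ == "__main__":
    negative = ["The pizza was cold and awful.", "The staff were rude."]
    positive = [
        "The pizza was hot and delicious.",
        "The staff were friendly.",
        "Great parking.",
    ]
    for src, tgt in mine_pairs(negative, positive):
        print(src, "->", tgt)
```

Pairs mined this way could then serve as pseudo-parallel targets for ordinary MLE fine-tuning of a sequence-to-sequence model; the imitation learning refinement mentioned in the abstract is outside the scope of this sketch.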

Similar thoughts

Training Socially Aligned Language Models in Simulated Human Society

Aligning Generative Language Models with Human Values

Mind's Eye: Grounded Language Model Reasoning Through Simulation

Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits

Knowledge Infused Decoding