Posts by Tag

policy gradient

expected reward

sampling

MRT

REINFORCE

neural machine translation

semantic parsing

exposure bias

reinforcement learning

score function gradient estimator

monolingual data

future challenges

BLEU
