Tag
1 article
This paper explores using weak rewards from retrieval-augmented interaction to model user preferences in chat agents.