Loading...
Off-Policy Recommendation System Without Exploration
Recommendation System (RS) can be treated as an intelligent agent which aims to generate policy maximizing customers’ long term satisfaction. Off-policy reinforcement learning methods based on Q-learning and actor-critic methods are commonly used to train RS. Though these methods can leverage previo...
Na minha lista:
| Udgivet i: | Advances in Knowledge Discovery and Data Mining |
|---|---|
| Main Authors: | , , , , |
| Format: | Artigo |
| Sprog: | Inglês |
| Udgivet: |
2020
|
| Fag: | |
| Online adgang: | https://ncbi.nlm.nih.gov/pmc/articles/PMC7206175/ https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1007/978-3-030-47426-3_2 |
| Tags: |
Tilføj Tag
Ingen Tags, Vær først til at tagge denne postø!
|