Explore-Exploit Trade-off

We call experimentation exploration and we call profit-maximization exploitation 在 Bandit.Algorithms.for.Website.Optimization 書中看到關於 Explore-Exploit trade-off 的定義，簡單來說在解決問題的成本和獲得的利潤上取得平衡，書中開頭的案例:為網站更換 LOGO，傳統的做法是使用 AB TEST但是花大量的時間，所以從這邊我可以慢慢猜得出來如何跟 Bandit Algorithms 有關。

另外記錄一篇跟Explore-Exploit trade-off相關的有趣論文:Web-Scale Bayesian Click-Through Rate Prediction for Sponsored Search Advertising in Microsoft’s Bing Search Engine

另外有幾篇關於A/B Testing 和 Bandit.Algorithms的比較的文章可以作為參考。

Why Multi-armed Bandit Algorithm is Not “Better” than A/B Testing
20 lines of code that will beat A/B testing every time
Why Multi-armed Bandit algorithms are superior to A/B testing

Learning Blog

預定調和

Explore-Exploit Trade-off

Comments