EX3: Explainable Attribute-aware Item-set Recommendations

実験
- オフラインテスト
  - Top-N Recommendation Performance (Expect-Step)
  - Model Robustness to Missing Attributes
- オンラインテスト

論文：
https://dl.acm.org/doi/10.1145/3460231.3474240
論文（Amazon scienceの方）：EX³: Explainable attribute-aware item-set recommendations - Amazon Science

手法についてはWantedlyの記事が詳しいので、こちらを読んだほうが良い。pdfには載ってない図も掲載されていてわかりやすい。
www.wantedly.com

気になる点の幾つかがWantedlyの記事には掲載されてなかったので、その部分だけ読んだ。

用語

pivot item : レコメンドの元になるアイテム。query itemやseed itemと呼ばれたりするやつ。

$B_{cp}$ : co-purchase
$B_{cv}$ : co-view
$B_{pv}$ : purchase-after-view

実験

オフラインテスト

Amazon.comの7つのサブドメインのデータを用いてオフライン評価を実施。オフラインテストについては、論文中にAmazon.comと明記されている。

f:id:wwacky:20211024061059p:plain — ※ Table 1. EX³: Explainable attribute-aware item-set recommendationsより引用

Top-N Recommendation Performance (Expect-Step)

提案手法のExpect-stepが既存手法と同程度のrelevanceを出力できるかの確認。提案手法ではこの後にExplain-stepがあるのが、既存手法と比較するためにExpect-stepで評価する。
検証では提案手法のEX³が最もよい。

f:id:wwacky:20211024061425p:plain — ※ Table 2. EX³: Explainable attribute-aware item-set recommendationsより引用

比較手法

Relevance：提案手法のExtract-stepで学習されたitem embeddings を用いた類似度
BPR*1：Bayesian personalized ranking
ACCM*2：アイテムの分散表現を充実させるために属性を利用したCF-based and CB-based recommendation approach
A2CF*3：SOTA. attribute-based recommendation model
EX³：提案手法のExpect-step

Model Robustness to Missing Attributes

商品属性の欠損値の影響を評価する。評価時のテストセットの属性にランダムで欠損値を 10%, 20% と増やして評価する。比較対象は提案手法と、提案手法（Figure 3. の青線）からAttention機構を取り除いたもの（Figure 3. の赤線）。

f:id:wwacky:20211025055755p:plain:w500 — ※ Figure 3. EX³: Explainable attribute-aware item-set recommendationsより引用

オンラインテスト

オンラインA/Bテストを行い、CVRが +0.080%、revenue +0.105％だったと記載されている。実験を行ったのはAmazonとは明言されず、large-scale e-commerce websiteとのみ記載。

*1:Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback. In UAI.

*2:Shaoyun Shi, Min Zhang, Yiqun Liu, and Shaoping Ma. 2018. Attention-based Adaptive Model to Unify Warm and Cold Starts Recommendation. CIKM (2018).

*3:Tong Chen, Hongzhi Yin, Guanhua Ye, Zi Huang, Yang Wang, and Meng Wang. 2020. Try This Instead: Personalized and Interpretable Substitute Recommendation. SIGIR.