A Reinforcement Learning Framework For Explainable Recommendation

A Reinforcement Learning Framework for
Explainable Recommendation
Xiting Wang, Yiru Chen, Jie Yang, Le Wu, Zhengtao Wu, Xing Xie
Microsoft Research Asia, Peking University, Tsinghua University, Hefei University of Technology, USTC
Recommender Systems
• Help billions of users to make decisions related to their personal lives

Choose a restaurant Buy products
• A growing need to ensure that the users understand and trust the system
2
• A growing need to ensure that the users understand and trust the system
Explanations: why the items
• Explanations serve as a bridge between are recommended
recommender systems and users
• Increase user trust
• Help users make better decisions
• Persuade users to try or buy an item
3
Existing Methods – Post Hoc
Users
Recommendation Recommended Explanation Explanation

model items Method
Items
Diversity limited by the number of predefined templates
Ignore the working mechanism of the recommendation model
Existing Methods - Embedded
Users
… …
Explanation
Explanation Recommended
Method items
Items
Difficult to ensure readability and consistency
Different explanation methods for different recommendation models
Desirable Properties for
• Model-agnostic: can be used to explain any recommendation model

• Model explainability: reveal the working mechanism of the model
• Explanation quality control: ensure the quality of the explanations (e.g.,
their readability, consistency, and diversity)
6
Desirable Properties for
Model Model Presentation

agnostic explainability quality
Post-hoc
Embedded
7
Desirable Properties for Explainable
Recommendation
Model Model Presentation

agnostic explainability quality
Post-hoc
Embedded
Wrapper
(RL)
8
Contribution
• Design a reinforcement learning framework for explainable

recommendation
• Model-agnostic
• Has good model explainability
• Can flexibly control the presentation quality
• Instantiate the agents with personalized-attention-based neural
networks
• Evaluate the effectiveness of our method by using both offline
experiments and evaluation with human subjects
9
Problem Definition
• Input
• User set , is a user user ID and/or some side information
• Item set , is an item

• A recommendation model to be item ID : interpretable component
explained
• Output Attributes like “price - low”;
user review; product image;
• Explanation
The th interpretable component is selected
The th interpretable component is not selected
Reinforcement Learning Framework
• Advantages: model-agnostic, model-explainability, presentation quality
11
Independent 12
Agent 2 can predict

Model-explainability reward: 13
will increase if is good given

14
Couple Agents
Sentence-level Explanation 15
Optimization Goal
• Maximizing expected reward Reward
Model-explainability Presentation quality
Readability (Sentiment) Consistency
Conciseness Coherence returns the sentiment score of

:desirable sentence length 16
Optimization Method
• Doubly Stochastic Policy Gradient
Agent 1
Agent 2
17
Offline Evaluation
Each training sample: user, item, rating, review text
18
Different recommendation
Offline Evaluation models to be explained
Presentation quality Model explainability
19
Offline Evaluation Different parameter settings
: desirable explanation length
20
Evaluation with Human Subjects
• Ask the participants to choose the explanations that are most useful
in helping them decide whether they will go to the restaurants
21
Evaluation with Human Subjects
P3
Frequent words in reviews:
P4
Words related to food Words related to services

22
Conclusion
• Design a reinforcement learning framework for explainable

recommendation
• Model-agnostic
• Has good model explainability
• Can flexibly control the presentation quality
• Instantiate the agents with personalized-attention-based neural
networks
• Evaluate the effectiveness of our method by using both offline
experiments and evaluation with human subjects
23
http://xitingwang.net
Our Works on Explainable Recommendation

Choose a restaurant
Textual
[AAAI2019] [ICDM2018]
Daily life Buy products
Determine research direction Visualization

[TVCG2018], Spotlight [TVCG2018]
Professional
Recomposite photos
activities
Thanks!
25

A Reinforcement Learning Framework For Explainable Recommendation

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

A Reinforcement Learning Framework For Explainable Recommendation

Uploaded by

Copyright:

Available Formats

A Reinforcement Learning Framework for

• Help billions of users to make decisions related to their personal lives

Recommendation Recommended Explanation Explanation

• Model-agnostic: can be used to explain any recommendation model

Model Model Presentation

Model Model Presentation

• Design a reinforcement learning framework for explainable

• Item set , is an item

• Advantages: model-agnostic, model-explainability, presentation quality

• Advantages: model-agnostic, model-explainability, presentation quality

• Advantages: model-agnostic, model-explainability, presentation quality

Agent 2 can predict

• Advantages: model-agnostic, model-explainability, presentation quality

will increase if is good given

• Maximizing expected reward Reward

Model-explainability Presentation quality

Readability (Sentiment) Consistency

Conciseness Coherence returns the sentiment score of

• Doubly Stochastic Policy Gradient

Each training sample: user, item, rating, review text

Presentation quality Model explainability

Presentation quality Model explainability

Presentation quality Model explainability

Presentation quality Model explainability

Words related to food Words related to services

• Design a reinforcement learning framework for explainable

Our Works on Explainable Recommendation

Determine research direction Visualization

You might also like