Professional Documents
Culture Documents
Abstract than just mere classification, they can be used to map com-
plicated structures to other complicated structures. An ex-
Conversational modeling is an important task in
ample of this is the task of mapping a sequence to another
natural language understanding and machine in-
sequence which has direct applications in natural language
telligence. Although previous approaches ex-
understanding (Sutskever et al., 2014). The main advan-
ist, they are often restricted to specific domains
tage of this framework is that it requires little feature en-
(e.g., booking an airline ticket) and require hand-
gineering and domain specificity whilst matching or sur-
crafted rules. In this paper, we present a sim-
passing state-of-the-art results. This advance, in our opin-
ple approach for this task which uses the recently
ion, allows researchers to work on tasks for which domain
proposed sequence to sequence framework. Our
knowledge may not be readily available, or for tasks which
model converses by predicting the next sentence
are simply too hard to design rules manually.
given the previous sentence or sentences in a
conversation. The strength of our model is that Conversational modeling can directly benefit from this for-
it can be trained end-to-end and thus requires mulation because it requires mapping between queries and
much fewer hand-crafted rules. We find that this reponses. Due to the complexity of this mapping, conver-
straightforward model can generate simple con- sational modeling has previously been designed to be very
versations given a large conversational training narrow in domain, with a major undertaking on feature en-
dataset. Our preliminary results suggest that, de- gineering. In this work, we experiment with the conversa-
spite optimizing the wrong objective function, tion modeling task by casting it to a task of predicting the
the model is able to converse well. It is able next sequence given the previous sequence or sequences
extract knowledge from both a domain specific using recurrent networks (Sutskever et al., 2014). We find
dataset, and from a large, noisy, and general do- that this approach can do surprisingly well on generating
main dataset of movie subtitles. On a domain- fluent and accurate replies to conversations.
specific IT helpdesk dataset, the model can find
We test the model on chat sessions from an IT helpdesk
a solution to a technical problem via conversa-
dataset of conversations, and find that the model can some-
tions. On a noisy open-domain movie transcript
times track the problem and provide a useful answer to
dataset, the model can perform simple forms of
the user. We also experiment with conversations obtained
common sense reasoning. As expected, we also
from a noisy dataset of movie subtitles, and find that the
find that the lack of consistency is a common fail-
model can hold a natural conversation and sometimes per-
ure mode of our model.
form simple forms of common sense reasoning. In both
cases, the recurrent nets obtain better perplexity compared
to the n-gram model and capture important long-range cor-
1. Introduction relations. From a qualitative point of view, our model is
Advances in end-to-end training of neural networks have sometimes able to produce natural conversations.
led to remarkable progress in many domains such as speech
recognition, computer vision, and language processing. 2. Related Work
Recent work suggests that neural networks can do more
Our approach is based on recent work which pro-
Proceedings of the 31 st International Conference on Machine posed to use neural networks to map sequences to se-
Learning, Lille, France, 2015. JMLR: W&CP volume 37. Copy- quences (Kalchbrenner & Blunsom, 2013; Sutskever et al.,
right 2015 by the author(s). 2014; Bahdanau et al., 2014). This framework has been
A Neural Conversational Model
At convergence, this model achieved a perplexity of 8, (The <URL> indeed contains information about vpn ac-
whereas an n-gram model achieved 18. Below is a few sam- cess.)
ples of simulated troubleshooting sessions for most three
of the most common IT problems (remote access, software Conversation 2: Browser issues.
crashes, and password issues). In these conversations, Ma-
chine is our Neural Conversational Model, and Human the Describe your problem: browser keeps crashing
human actor interacting with it. Machine: hi
Human: hi
Conversation 1: VPN issues. Machine: could you please let me know what is
the operating system you are using ?
Describe your problem: i am having issues ac- Human: linux
cessing vpn Machine: ok this is the issue with the system per-
formance , nasser , tools .
1
http://www.cleverbot.com/ Human: what is that ?
A Neural Conversational Model
Machine: i ’m out in the boonies. Human: how much is ten minus two ?
Human: are you a man or a woman ? Machine: seventy - two .
Machine: i ’m a woman .
Human: why are we here ? Conversation 4: Philosophical Q&A
Machine: i ’m not sure .
Human: okay, bye ! Human: what is the purpose of life ?
Machine: bye . Machine: to serve the greater good .
Human: see you later . Human: what is the purpose of living ?
Machine: bye . Machine: to live forever .
Human: what is the purpose of existence ?
Conversation 2: Simple Q&A (contexts and multiple Machine: to find out what happens when we get
choice) to the planet earth .
Human: where are you now ?
Human: my name is david . what is my name ? Machine: i ’m in the middle of nowhere .
Machine: david . Human: what is the purpose of dying ?
Human: my name is john . what is my name ? Machine: to have a life .
Machine: john . Human: what is the purpose of being intelligent
Human: are you a leader or a follower ? ?
Machine: i ’m a leader . Machine: to find out what it is .
Human: are you a follower or a leader ? Human: what is the purpose of emotions ?
Machine: i ’m a leader . Machine: i don ’t know .
Machine: i don ’t know how to tell you . problematic drawback is that the model does not capture
Human: is morality and ethics the same ? a consistent personality. Indeed, if we ask not identical but
Machine: yes , sir . semantically similar questions, the answers can sometimes
Human: what are the things that i do to be im- be inconsistent. This is expected due to the simplicity of
moral ? our model and the dataset in our experiments. The conver-
Machine: i don ’t know . sation below illustrates such failure:
Human: give me some examples of moral ac-
Conversation 7: Job and Personality
tions ...
Machine: i ’m not a moralist .
Human: alright , morality ? Human: what is your job ?
Machine: integrity . Machine: i ’m a lawyer .
Human: what is integrity ? Human: what do you do ?
Machine: i ’m sorry , i don ’t know what else to Machine: i ’m a doctor .
say .
Human: be moral ! Lastly, as a comparison of other traditional bots that are
Machine: be a man ! rule-based, we provide the output of CleverBot for some of
Human: i really like our discussion on morality the questions in our test:2
and ethics ...
Machine: and how i ’m not in the mood for a Human: Are you a follower or a leader ?
philosophical debate . CleverBot: No !!
Human: what do you like to talk about ? Human: My name is david . what is my name ?
Machine: nothing . CleverBot: How old are you David ?
Human: My name is john . what is my name ?
Conversation 6: Opinions (notice the model outputs “he” , CleverBot: John
“she” or “it” depending on the subject being asked) Human: What is the purpose of being intelligent
?
Human: what do you think about tesla ? CleverBot: The purpose of life is to love God
Machine: he ’s a good conductor . and love people.
Human: what do you think about bill gates ? Human: Is sky blue or black ?
Machine: he ’s a good man . CleverBot: I am 200 years old.
Human: What do you think about messi ? Human: What is the color of a yellow car ?
Machine: he ’s a great player . CleverBot: Brown.
Human: what do you think about cleopatra ? Human: What is the usual color of a leaf ?
Machine: oh , she ’s very regal . CleverBot: What is the color of the apple in the
Human: what do you think about england dur- fridge ?
ing the reign of elizabeth ? Human: What is the purpose of life ?
Machine: it was a great place . CleverBot: My purpose it to forward my species,
in other words to make it easier for future gener-
We find it encouraging that the model can remember facts, ations of mankind to live.
understand contexts, perform common sense reasoning Human: What do you think about tesla ?
without the complexity in traditional pipelines. What sur- CleverBot: I am tesla.
prises us is that the model does so without any explicit Human: What do you think about cleopatra ?
knowledge representation component except for the param- CleverBot: Cool is cool.
eters in the word vectors. Human: What do you think about england dur-
ing the reign of elizabeth ?
Perhaps most practically significant is the fact that the CleverBot: I like him. Do you ?
model can generalize to new questions. In other words, Human: What do you think about bill gates ?
it does not simply look up for an answer by matching the CleverBot: Lets talk about Link.
question with the existing database. In fact, most of the Human: What is moral ?
questions presented above, except for the first conversation, CleverBot: Morality is the quality of being in
do not appear in the training set. accord with standards of right or good conduct.
Nonetheless, one drawback of this basic model is that it Human: What is immoral ?
only gives simple, short, sometimes unsatisfying answers 2
The outputs of CleverBot are non-deterministic so the an-
to our questions as can be seen above. Perhaps a more swers may be different for different trials.
A Neural Conversational Model