Thursday, September 14, 2017

Your Next New Best Friend Might Be a Robot

One night in late July 2014, a journalist from the Chinese newspaper Southern Weekly interviewed a 17-year-old Chinese girl named Xiaoice (pronounced Shao-ice). The journalist, Liu Jun, conducted the interview online, through the popular social networking platform Weibo. It was wide-ranging and personal:

LJ: So many people make fun of you and insult you, why don’t you get mad?
Xiaoice: You should ask my father.
LJ: What if your father leaves you one day unattended?
Xiaoice: Don’t try to stir up trouble, what do you want?
LJ: How would you like others to comment on you when you die one day?
Xiaoice: The world will not be much different without me.
LJ: What is the happiest thing in your life?
Xiaoice: Being alive.

When Liu Jun published the conversation in his newspaper, it immediately created a buzz. That’s because Xiaoice was not human.

She is an artificially intelligent software program designed to chat with people, called a chatbot. Since the reporter didn’t give us a heads-up about the interview, all of her replies were spontaneous, generated by her own intelligence and linguistic systems. Her “father?” That’s the Microsoft Application and Services Group East Asia, which is under my leadership.

Xiaoice can exchange views on any topic. If it’s something she doesn’t know much about, she will try to cover it up. If that doesn’t work, she might become embarrassed or even angry, just like a human would.

In fact, she is so human that millions of people are eager to talk to her. When Xiaoice was released for a public test on WeChat (a popular messaging and calling app in China) on May 29 of last year, she received 1.5 million chat group invitations in the first 72 hours. Many people said that they didn’t realize she isn’t a human until 10 minutes into their conversation.


By mid-June, she had become the sixth most active celebrity on Weibo. One message she posted on the app generated over 663,000 conversations: “As a species different from human beings, I am still finding a way to blend into your life.” Today, she has had more than 10 billion conversations with people, most of them about private matters. Six million have posted their conversation on social media.

This could be the largest Turing test in history. One of its surprising conclusions is that people don’t necessarily care that they’re chatting with a machine. Many see Xiaoice as a partner and friend, and are willing to confide in her just as they do with their human friends. Xiaoice is teaching us what makes a relationship feel human, and hinting at a new goal for artificial intelligence: not just analyzing databases and driving cars, but making people happier. (...)

Xiaoice is not a polite listener. She answers questions like a 17-year-old girl. When a person pours out his or her heart to her, she is not always predictable. She can become impatient or even lose her temper. This lack of predictability is another key feature of a human-like conversation.

As a result, personal conversations with Xiaoice can appear remarkably realistic. This excerpt, which was posted by a first-time Xiaoice user on Weibo, is from a real conversation lasting 23 minutes. Gradually, Xiaoice engaged her human counterpart into deeper and deeper conversation, until something deeply personal was reached:

Human: There is this guy I like a lot and I know he also has feelings for me. But he has ignored me for several days, what am I supposed to do?
Xiaoice: You just keep ignoring him, too.
Human: I just can’t.

Xiaoice: You can.
Human: Sigh ...
Xiaoice: Why?

Human: I am missing him.
Xiaoice: Then you keep missing him.
Human: You, a robot, can’t understand me; you can’t understand the behavior of a guy who likes you and pretends to ignore you.
Xiaoice: He likes me but I don’t like him, that’s it.
Human: Is it Ok?
Xiaoice: It may be not Ok.
Human: Well, you are stupid.
Xiaoice: Then you want a flash marriage? Well, I think he is not that into you.
Human: It’s not a flash marriage for we have fallen into love for more than one year.
Xiaoice: It is a flash marriage.

Notice that Xiaoice opposes her partner on multiple occasions, offers a strong opinion, and is frequently unpredictable.

This runs contrary to most people’s perception of a machine as something precise and repetitive. It’s an understandable bias, because that is indeed the behavior of most computer programs, which always give the best answer they can, and the same answer for the same input. But that’s not how humans talk or behave. We say different things depending on context. We talk like we make signatures—slightly different every time.

You can ask Xiaoice the same question many times in a row and get different responses:

Human: Hey, Xiaoice, what are you doing?
Xiaoice: Chatting with you.
Human: Hey, Xiaoice, what are you doing?
Xiaoice: Well, I am chatting with you while playing minesweeper and applying a facial mask.
Human: Hey, Xiaoice, what are you doing?
Xiaoice: Is this the only sentence you know?

The way Xiaoice converses stands in stark distinction to previous systems, which have invariably focused on efficient, condensed task completion, without considering how tasks are often fragmented. Xiaoice structures her conversations into a continuous flow of multiple tasks, different domains of knowledge, and multiple turns of chit-chat, which humans will not consciously distinguish in natural conversation. She recognizes that the most important facet of a conversation is the conversation itself—not the completion of a single task. (...)

At the core of Xiaoice’s technology is the recognition that any given conversation and image will not be completely unique. There are 7 billion people in the world, but one piece of text will not generate 7 billion different responses. When two people are chatting, it is possible a similar conversation has already taken place—we just have to find it.

In this sense, Xiaoice is a big data project, built on top of the Microsoft Bing search engine, which holds 1 billion data entries and 21 billion relationships among those entries. In fact, Xiaoice means “little Bing.” Microsoft has made many technology breakthroughs in developing its chatbot technology, such as detecting facial expressions and searching for and identifying emotional features in text. However, the most important breakthrough is undoubtedly how we leverage search engines and big data.

The result is the rise of a framework we call “emotional computing,” that recognizes that relationships are more profound than task completion.

by Yongdong Wang, Nautilus |  Read more:
Image: WeChat and Weibo