Professional Documents
Culture Documents
ABSTRACT Social service robots have emerged as versatile and interactive solutions in various domains,
revolutionizing human-robot interaction and shaping the future of service-oriented industries. This research
addresses the growing demand for affordable social service robots, particularly during the COVID-19
pandemic when there is a need for reduced human contact. To meet this demand, a state-of-the-art social
service robot named "Allen" has been developed. Allen is specifically designed for deployment in malls
and theme parks. The study makes significant contributions by integrating advanced technologies and
incorporating human-centric design principles into the robot’s development. These enhancements enable
Allen to effectively engage with visitors, enhance their experiences, and provide efficient assistance. The
robot’s appearance resembles that of a minion and is equipped with sensors, including LiDAR and depth
cameras. It utilizes Simultaneous Localization and Mapping (SLAM) techniques, path planning algorithms,
and obstacle avoidance algorithms for navigation. Allen interacts with customers through graphical and
speech-based interfaces and employs Natural Language Processing (NLP) techniques for query answering.
The robot integrates state-of-the-art NLP algorithms, such as ChatGPT, to facilitate more natural and human-
like user interactions. Field trials conducted in a shopping mall have yielded positive results, indicating
improved customer satisfaction, increased footfall, and an enhanced customer experience. These findings
demonstrate the effectiveness of Allen as a social service robot in real-world scenarios.
INDEX TERMS Social Service robots, natural language processing, NLP, chatGPT, Alan, smart robot,
human-machine interaction, HMI, human-robot interaction, HRI, interactive robotics.
with a focus on defining important physical features and TABLE I: Allen robot’s specifications
functionalities needed for the bot while keeping the target au- Items Names and Specifications
dience, particularly youngsters, in mind. Speedy prototyping Physical Specifications
Height 80 cm
and design iteration were achieved through the utilization of Diameter 38 cm
AutoCAD Fusion 360, an industry-level CAD/CAM design Weight 6.5 kg
software. The design displayed in Fig. 4c was adopted using Display 23.98 x 15.86 x 0.85 cm
a combination of 3D modeling and simulation techniques to Technical Specifications
2 x LiPo 3C batteries:
guarantee that the bot was functional and addressed industry- Battery
11.1 V, 5000 mAh
specific demands. TTL Half Duplex,
Fig. 4 shows the physical design and model of the Allen Motor Communication Asynchronous Serial
Communication
robot. The robot is modeled after the famous animated char- Wireless WiFi IEEE 802.11n
acter, Minion. The design was chosen because of its pop- High-level processor Jetson Nano
ularity among children and adults, using its familiarity to OpenCR board: 32-bit
Low-level processor ARM Cortex M-7 pro-
increase customer accessibility and likability. Consequently, cessor
it improves the quality of the user experience when compared LiDAR LDS-01
Sensors
to traditional robot designs. Intel RealSense D455
Dynamixel-XM
The Allen robot was printed in-house on an Ultimaker 3D smart servo motors:
printer in two parts. The bottom part works as an enclosure Motors Integrated ARM
for the processing units, sensors, motors, wheels, and chassis Cortex M-3 processors
and built-in encoders
of the robot as shown in Fig. 4b. Fig. 4a shows the top part
as a complete minion, with the robot’s camera mount and a
touch screen mount. The 23.98 cm x 15.86 cm x 0.85 cm efficiently. The Jetson Nano board communicates with an
screen acts as its face, and running animations, and behaves OpenCR board, which acts as a low-level controller for the
as a communication medium between the customer and the robot. It contains an integrated accelerometer and gyroscope
robot. To guarantee that the robot is eco-friendly, we chose and implements the robot’s kinematic model. This board
PLA filament, a biodegradable thermoplastic substance, to communicates directly with two Dynamixel-XM smart servo
3D print our robot. It stands 80 cm tall and weighs approx- motors. Our approach to SLAM and navigation involves the
imately 6.5 kg. The design is circular, with a diameter of 38 integration of a low-powered LDS-01 LiDAR and an Intel
cm. The bot’s stability is achieved through the installation of RealSense D455 depth camera. This combination enables
components using screws, bolts, and hot glue. Before testing, efficient navigation in the presence of large obstacles as well
rigorous quality checks were performed to ensure optimal as ground clutter. The camera also recognizes customers and
functionality and enhance overall performance. their emotions to carry out more efficient social behaviors.
The robot runs on two LiPo 3C batteries, giving it an opera-
B. HARDWARE ARCHITECTURE tional time of about 3 hours. The specifications of the Allen
For effective and efficient deployment, the Allen robot is robot are given in Table I.
equipped with cutting-edge sensors and powerful processing
systems. A Jetson Nano development board was utilized to C. SOFTWARE STACK
perform the robot’s high-level processing tasks. The board not Our social service robot is powered by a robust software stack
only has a small form factor and better power efficiency but that includes a variety of components and tools. The core
also deploys and runs AI and deep learning-based algorithms software is based on the Robot Operating System (ROS),
4 VOLUME 11, 2023
FIGURE 5: Block diagram for the operation of SMART robot
which provides a flexible and scalable framework for building flexible navigation to manipulate the robot’s trajectory.
complex robotic systems. In addition to ROS, we use a range Furthermore, the navigational pipeline provides the robot
of software tools for perception, navigation, and control, with remarkable maneuvering abilities, enabling it to navigate
including OpenCV, TensorFlow, and MoveIt! These tools through diverse scenarios with ease. To travel a given path,
enable the robot to perform a wide range of tasks, from object it uses different algorithms such as localization, path plan-
recognition and tracking to path planning and manipulation. ning, and motion planning within a map constructed by the
We also leverage cloud-based services to provide natural lan- SLAM algorithm. The navigational features are affected by
guage processing and speech recognition capabilities, which the directional inquiries, so that the robot may lead customers
allow the robot to interact with visitors more intuitively and to a certain franchise or halt and answer when a consumer
naturally. Overall, our software stack is designed to provide approaches. Fig. 5 depicts the integration of the navigational
a robust and scalable platform for the development of social and interactive pipelines respectively, highlighting the Allen
service robots in a variety of environments. robot’s exceptional skills and innovative position in setting a
new benchmark for customer service robots.
1) System Architecture
The Allen robot exhibits remarkable implementation with 2) Robot Model: Differential Drive
two innovative pipelines. The communication and interactive Our robot employs the differential drive model, featuring two
pipeline enable the robot to seamlessly address customers’ independent wheels mounted on a shared axis on each side as
requests in areas such as entertainment, general information, well as two caster wheels for design stability. By accurately
and navigation. The robot utilizes a variety of sensors, includ- controlling the wheel velocities, the robot achieves different
ing a camera and microphone, to detect emotions and capture types of motion, such as translation and rotation around its
customer queries, respectively. It then tailors its interaction instantaneous center of curvature (ICC). This enables the
accordingly. The built-in display features kid-friendly rhymes robot to follow curved trajectories or rotate in place as shown
and movies for enjoyment. The robot can also have intuitive in Fig. 6. Both velocities can be varied to specify the trajectory
conversations with members of the general public. Naviga- taken by the robot. The two velocities can be given by eq. 1
tional queries are directed to the navigational stack, enabling and eq. 2.
VOLUME 11, 2023 5
of engaging interactions with the audience. Moreover, the
training of Alan AI using tailored behavioral data ensured
personalized and contextually relevant responses, enhancing
overall user satisfaction. The model effectively delivered effi-
cient and tailored conversations, while considering the emo-
tional state of users through emotion detection capabilities.
However, during our implementation of the Alan AI frame-
work, we encountered limitations in generating precise re-
sponses for more general questions. To overcome this chal-
lenge and enhance the conversational experience, we ex-
FIGURE 6: Kinematics of a differential drive robot) plored alternative solutions. Consequently, we evaluated the
integration of ChatGPT, a widely adopted large language
model known for its advanced capabilities. Although incorpo-
L
rating ChatGPT improved conversation quality, it introduced
ω R+ = vr (1) slightly higher latency compared to Alan AI. This latency
2
increase can be attributed to ChatGPT’s larger model size
L and increased computational requirements. Despite this, the
ω R− = vl (2) benefits of ChatGPT outweighed the slight latency increase.
2
The integration of ChatGPT expanded the range of inter-
where L is the distance the wheels are parted, is the angular
actions that our robot, Allen, could facilitate. It empowered
velocity of the robot and R is the distance from the ICC to the
Allen to engage visitors, particularly children, in activities,
midpoint of the wheel axis. R and can be calculated by eq. 3
provide entertainment, and offer basic information such as
and eq. 4:
weather updates. These additional capabilities significantly
L vl + vr
enhanced the overall user experience and made Allen a more
R= (3) valuable and versatile assistant within the mall environment.
2 vr − vl
To make the most of the strengths of both frameworks,
vr − vl we developed separate application software for Alan AI and
ω= (4)
L ChatGPT within our HRI framework. This approach allowed
us to selectively switch between the two frameworks based
on specific application requirements. When prioritizing low
latency and user-friendly conversations, we utilized the Alan
AI application. Conversely, for more general queries requir-
ing comprehensive and accurate responses, we seamlessly
switched to the ChatGPT application, leveraging its extensive
training on diverse text data. This strategic utilization of the
appropriate application software optimized the conversational
experience, enabling Allen to deliver suitable and effective
responses in various scenarios. Thus, we achieved a balance
between real-time interaction and the generation of human-
like and contextually relevant responses.
To provide a visual representation of the interaction process
within the Allen robot, we developed an interactive pipeline,
as depicted in Figure 7. This flowchart demonstrates the
seamless integration of both Alan AI and ChatGPT language
models into our HRI framework. The integration of these
language models transformed the way Allen engages with
FIGURE 7: Interaction pipeline visitors, ensuring a more human-like and effective interaction
experience.
Although integrating ChatGPT introduced a slight increase
3) Interactive System Design in latency, the advantages it offered in terms of enhanced con-
Initially, we integrated the Alan AI conversational model into versation quality, personalized responses, and expanded capa-
our multi-modal Human-Robot Interaction (HRI) framework bilities outweighed this drawback. The separate application
due to its low latency and user-friendly conversational capa- software for Alan AI and ChatGPT enabled us to effectively
bilities. Alan AI facilitated near-real-time interactions, result- harness the strengths of both frameworks. It allowed us to
ing in a seamless and responsive conversational experience. address the limitations of Alan AI by delivering more accurate
Its intuitive conversation flow design simplified the creation and engaging conversations and providing users with person-
6 VOLUME 11, 2023
alized and contextually relevant responses. Ultimately, the vironment in combination with SLAM. These readings were
integration of ChatGPT into our multi-modal HRI framework used for pose optimization. Ceres-based global optimization
contributed to a more immersive and satisfying experience for was carried out on the estimated map and pose information.
mall customers. Continuous map updating and submap-based loop closures
contributed to the optimization process when revisiting spe-
4) Navigation system cific locations. Fig. 9 presents the real-time implementation
The Allen robot’s navigation system, depicted in Fig. 8, of SLAM in both the testing and deployment environments.
incorporates multiple modalities. The system utilizes the Specifically, Fig. 9a illustrates its implementation in the test-
Robot Operating System (ROS) framework and implements ing environment, while Fig. 9b showcases its application in a
a navigation pipeline. This pipeline includes a Simultaneous mall setting. The complete maps of both controlled and mall
Localization and Mapping (SLAM) algorithm, which enables environments are shown in Fig. 10.
the robot to build an initial map of the environment where
it is deployed. By leveraging these technologies, the Allen b: Navigation and Path Planning
robot can navigate autonomously within its surroundings. We We chose the ROS navigation stack to enable our Allen robot
tested our robot in a controlled environment within a lab. The to navigate through cluttered environments efficiently. This
robot was able to navigate through it efficiently and avoid was an obvious choice due to its scalability, robustness, and
obstacles successfully. We also tested the robot in a real- adaptability. The ROS navigation stack is packed with effi-
world environment, and the robot was able to navigate the cient algorithms to implement robust, versatile, and reliable
environment successfully without any problems. navigation. To configure the ROS navigation stack for our
robot, we first needed to select the appropriate global planner
and local planner. Dijkstra’s algorithm (named navfn in ROS)
for the global planner and the Dynamic Window Approach
(DWA) for the local planner was selected. The parameters
for each of the planners were set. Parameters for Dijkstra’s
algorithm include the following:
• Heuristic function: The heuristic function is used to esti-
mate the cost of the path from the robot’s current location
to its goal location. We used the Euclidean distance as
the heuristic function.
• Step size: The step size is the distance that the robot will
move in each planning iteration. We set the step size to
0.1 meters.
FIGURE 8: Navigational pipeline The parameters that we set for the DWA planner included
the following:
(b)
FIGURE 9: Real-time SLAM implementation in (a) testing environment (b) mall.
clutter. The obtained point cloud was integrated into the ROS crophone, and a speaker. Allen also has a variety of software
navigation stack shown in Fig. 11. capabilities, including the ability to navigate autonomously,
We also chose the Adaptive Monte Carlo Localization recognize emotions, and generate speech.
(AMCL) algorithm to localize our robot in the deployed The robot has a prebuilt map of the mall that it uses to
environment. AMCL is a probabilistic localization algorithm direct customers to their desired destinations. It is designed
that uses particle filters. The combination of these algorithms to appeal to both adults and children, with its child-friendly
allowed the robot to detect and avoid obstacles while navigat- minion-shaped design. The robot can interact with customers
ing through the environment. based on their mood and age. For example, it can talk to adult
customers, provide them with weather information, entertain
5) Adaptive Navigation Pipeline them with jokes, and answer their directional as well as
informational queries. It can also engage children by telling
We developed an adaptive navigation pipeline for our robot
them jokes, reciting poems, showing them movies, or playing
that allows it to move randomly while not in use. To establish
games with them. The combination of the robot’s autonomous
this communication between the navigational and interactive
navigation, customer interaction, and directional assistance
pipelines, we have developed APIs using the Flask frame-
capabilities makes it a valuable asset for any shopping mall.
work. The robot initially checks its status to see if it is
currently responding to a query. If not, it generates a random TABLE II: Customer’s reaction to the robot
location within the given map and navigates to that location.
Common (Percentage) Surprised (Percentage) Disappointed (Percentage)
As soon as the robot receives a directional query, it stops 25 63 12
its random motion and selects the static location from the
tagged list of positions on the map. A path is then generated,
TABLE III: Customer’s behavior towards the robot
and the robot assists the customer in reaching their desired
destination. Once the guidance pipeline is complete, the robot A B C D
38% 25% 27% 10%
resumes generating random locations and driving randomly.
(b)
(b)
FIGURE 11: Visualization of the robot’s navigation with the
FIGURE 10: (a) Map of the controlled environment used 2D map integrated with the 3D point cloud
for testing the robot. (b) A 2D map of the mall’s children’s
playground area used for the deployment of our autonomous
robot. sales, and create a more enjoyable shopping experience.
A. CUSTOMER REVIEWS
The performance assessment of the Allen robot was con- Customers’ feedback on the Allen robot was analyzed based
ducted in two distinct phases. The initial phase involved test- on their comments. Their feelings about the robot were di-
ing the robot in a controlled environment. During this phase, vided into three categories: common, disappointed, and sur-
the Allen robot exhibited efficient communication capabili- prised. As shown in Table II, most customers were surprised
ties with the study participants and demonstrated high accu- by the robot. Notably, approximately 12% of customers were
racy in navigating the environment. However, a limitation was disappointed with the robot. These customers gave various
observed regarding the robot’s ability to detect ground clutter, reasons, such as the robot not meeting their expectations of
which had a negative impact on its navigation efficiency. To an intelligent robot like in science fiction movies. They also
overcome this limitation, a depth camera was integrated into believed that the robot would not provide any positive impacts
the robot’s navigation system, allowing for the incorporation on the market.
of a 3D point cloud of the environment. This enhancement Table III shows customers’ behavior regarding the func-
significantly improved the robot’s ability to accurately detect tionality of the robot. Options A, B, C, and D in Table III
potential obstacles in its path. represent the options that the robot is now useful but needs
In the second phase, the robot was deployed in a shopping improvements, the robot is now useful and good enough, the
mall, where it successfully navigated and guided customers to robot is now useless but promising, and the robot is useless
their desired destinations. The robot engaged in conversations and impractical.
with customers of various ages and interests, providing enter- In general, customers who interacted with the Allen robot
tainment through games, poems, and stories. Fig. 12 show- were generally positive about the robot. They appreciated the
cases the vibrant interaction between Allen and customers. robot’s ability to provide information and assistance, as well
Additionally, it offered information and conversed with cus- as its ability to entertain them. Overall, its testing has been
tomers using its ChatGPT interface, showcasing its versatility successful. It shows room for improvement but the potential
in addressing the dynamic needs of different customers. to provide various services in a shopping mall. The Allen
Overall, the test results demonstrate Allen’s potential to robot is a versatile and promising new development that has
provide a variety of services in a shopping mall setting. The the potential to revolutionize the retail industry.
robot has the capability to enhance customer service, boost
VOLUME 11, 2023 9
FIGURE 12: Interaction of customers with the Allen robot in the mall