
International Conference on Innovative Researches in Engineering & Technology (IRET-2022)

SMART ASSISTANT USING MACHINE LEARNING

Presented By:
Anirudh Agrahari
Ravita Prajapati
Abhishek Singh
Pooja Singh

Under the Guidance of:
Mr. Abhishek Shahi (Assistant Professor, CSE Department)

Department of Computer Science and Engineering
Buddha Institute of Technology, GIDA, Gorakhpur

Contents

 Introduction
 Literature Survey
 Problem Statement
 Proposed Methodology
 Conclusions
 Future Scope
 References

BIT
Introduction
 Today, the development of artificial intelligence (AI) systems that can organize natural
human-machine interaction (through voice, communication, gestures, facial expressions,
etc.) is gaining in popularity.
 One of the most studied and popular directions is interaction based on the machine’s
understanding of natural human language.
 It is no longer the human who learns to communicate with a machine; instead, the machine
learns to communicate with the human, exploring their actions, habits and behaviour and
trying to become their personalized assistant.
 Virtual assistants are software programs that help ease day-to-day tasks, such as
showing weather reports, creating reminders and making shopping lists.

Literature Survey

Paper No.: [1]
Description: Study of smart assistant using machine learning concept
Methodology: The study uses an isolated character dictionary based on speech recognition to deliver the precise meaning of spoken words with great accuracy.
Limitation: The approach requires further analysis for the improvement in the performance of the technique.

Problem Statement

 We are all well aware of Cortana, Siri, Google Assistant and many other virtual
assistants, which are designed to aid users with tasks on the Windows, Android and iOS
platforms.

 As a personal assistant, Jarvis assists the end user with day-to-day activities such as
general human conversation, searching queries in search engines such as Google, Bing or
Yahoo, searching for videos, retrieving images, reporting live weather conditions, looking
up word meanings, searching for medicine details, giving health recommendations based on
symptoms and reminding the user about scheduled events and tasks.

Proposed Methodology

 There are 3 steps involved in the processing of the user’s speech input.

1. Speech Recognition module
2. API calls
3. Text-to-speech module
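
The three modules listed above can be read as a simple pipeline. The following is only a minimal sketch under stated assumptions: the function name handle_utterance and the three type aliases are hypothetical and do not come from the project, and each stage is passed in as a plain function so the flow can be followed without a microphone, network access or speakers.

```python
from typing import Callable

# Hypothetical type aliases for the three stages (not from the original project).
Transcriber = Callable[[bytes], str]   # step 1: audio -> text
Responder = Callable[[str], str]       # step 2: text -> answer (via API calls)
Speaker = Callable[[str], None]        # step 3: answer -> spoken output


def handle_utterance(audio: bytes,
                     transcribe: Transcriber,
                     respond: Responder,
                     speak: Speaker) -> str:
    """Run one utterance through the three steps and return the answer text."""
    text = transcribe(audio)   # 1. speech recognition
    answer = respond(text)     # 2. API call(s) produce the answer
    speak(answer)              # 3. text-to-speech
    return answer
```

For a dry run, stand-in stages can be supplied, for example handle_utterance(b"audio", lambda a: "hello", str.upper, print). Keeping the stages separate means any one of them could be swapped without touching the other two.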

1. Speech Recognition module

 The system uses Google’s online speech recognition system for converting
speech input to text.
 The speech input from the microphone is temporarily stored in the system and is
then sent to Google cloud for speech recognition.
 The equivalent text is then received and fed to the central processor.
 Users can obtain texts from the special corpora organized on the computer
network server at the information centre.
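
One possible way to realise this module in Python is sketched below. It assumes the third-party SpeechRecognition package (with PyAudio for microphone access), which the slides do not name; the helper names listen_once and normalize_transcript are hypothetical. The package is imported inside the function so the rest of the sketch loads even where it is not installed.

```python
def listen_once(timeout_s: float = 5.0) -> str:
    """Record one phrase from the microphone and return Google's transcript.

    Returns an empty string when nothing was heard or understood.
    """
    # Assumption: `pip install SpeechRecognition pyaudio` has been run.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    try:
        with sr.Microphone() as source:
            recognizer.adjust_for_ambient_noise(source, duration=0.5)
            # The recorded audio is held in memory before being uploaded.
            audio = recognizer.listen(source, timeout=timeout_s)
        # Sends the audio to Google's web speech service and returns the text.
        return recognizer.recognize_google(audio)
    except (sr.WaitTimeoutError, sr.UnknownValueError):
        return ""
    except sr.RequestError as exc:
        raise RuntimeError(f"speech service unavailable: {exc}") from exc


def normalize_transcript(text: str) -> str:
    """Lower-case and collapse whitespace before the text is processed further."""
    return " ".join(text.lower().split())
```

Normalising the transcript before it reaches the central processor is a design choice of this sketch, not something the slides describe; it simply makes later keyword matching less sensitive to capitalisation and spacing.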


2. API calls

 An API is a software intermediary that allows two applications to talk
to each other.
 API stands for Application Programming Interface.
 In other words, an API is a messenger that delivers your request to
the provider that you’re requesting it from and then delivers the
response back to you.
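
The request-and-response round trip described above might look like the following sketch, which uses only the Python standard library. The endpoint URL, the parameter names and the helper names build_request_url and call_api are hypothetical and chosen purely for illustration; a real provider defines its own URL, parameters and authentication.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Hypothetical endpoint, used only for illustration.
WEATHER_ENDPOINT = "https://api.example.com/v1/weather"


def build_request_url(endpoint: str, **params: str) -> str:
    """Encode the request parameters into the query string of the URL."""
    return f"{endpoint}?{urlencode(params)}"


def call_api(url: str, opener=urlopen, timeout_s: float = 10.0) -> dict:
    """Deliver the request to the provider and decode its JSON response."""
    with opener(url, timeout=timeout_s) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call would then read call_api(build_request_url(WEATHER_ENDPOINT, city="Gorakhpur")). The opener parameter exists so that the network call can be replaced by a stand-in during testing.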


3. Text-to-speech module

 Text-to-Speech (TTS) refers to the ability of computers to read text
aloud.
 A TTS engine converts written text to a phonemic representation, then
converts the phonemic representation to waveforms that can be output
as sound.
 TTS engines with different languages, dialects and specialized
vocabularies are available through third-party publishers.
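
As an illustration of this module, the sketch below assumes the third-party pyttsx3 package, which the slides do not name. pyttsx3 hands the text to the speech engine installed on the operating system, and that engine performs the text-to-phoneme and phoneme-to-waveform steps internally. The helper names speak and split_sentences are hypothetical.

```python
import re
from typing import List


def speak(text: str, rate_wpm: int = 150) -> None:
    """Read `text` aloud through the default audio device."""
    # Assumption: `pip install pyttsx3` has been run.
    import pyttsx3

    engine = pyttsx3.init()
    engine.setProperty("rate", rate_wpm)  # speaking rate in words per minute
    engine.say(text)
    engine.runAndWait()  # block until playback has finished


def split_sentences(text: str) -> List[str]:
    """Split a reply into sentences so long answers can be spoken piece by piece."""
    parts = re.split(r"(?<=[.!?])\s+", text)
    return [part.strip() for part in parts if part.strip()]
```

Speaking a long reply sentence by sentence, for example with a loop over split_sentences(reply), is one way to let the assistant be interrupted between sentences.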

Conclusions

 In this paper, “Virtual Assistant Using Python”, we discussed the design and
implementation of a digital assistant.
 The project is built using open-source software modules with PyCharm community
backing, which can accommodate updates in the near future.
 The modular nature of this project makes it flexible and makes it easy to add
features without disturbing current system functionality.
 It not only works on human commands but also gives responses to the user based on the
query being asked or the words spoken by the user, such as opening tasks and operations.
 It greets the user in a way that makes the user feel comfortable and free to interact
with the voice assistant.
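
The modular design mentioned in the conclusions could be approached with a small registry of command handlers. This is a sketch under assumptions, not the project’s actual structure: the decorator skill, the function dispatch and the two example handlers with their placeholder replies are all hypothetical.

```python
from typing import Callable, Dict

Skill = Callable[[str], str]
_SKILLS: Dict[str, Skill] = {}  # keyword -> handler, in registration order


def skill(keyword: str) -> Callable[[Skill], Skill]:
    """Register a handler for commands that contain `keyword`."""
    def register(handler: Skill) -> Skill:
        _SKILLS[keyword] = handler
        return handler
    return register


@skill("weather")
def weather_skill(command: str) -> str:
    return "Fetching the weather report."  # placeholder reply


@skill("remind")
def reminder_skill(command: str) -> str:
    return "Reminder created."  # placeholder reply


def dispatch(command: str) -> str:
    """Route a recognised command to the first skill whose keyword it contains."""
    lowered = command.lower()
    for keyword, handler in _SKILLS.items():
        if keyword in lowered:
            return handler(command)
    return "Sorry, I cannot help with that yet."
```

With this layout, adding a feature means writing one more decorated function; the existing handlers and the dispatch loop stay unchanged.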

Future Scope

 Finally, the work opens interesting future directions for researchers. Firstly, the proposed
technique can be integrated into any high-resolution video surveillance device and is not
limited to mask detection only. Secondly, the model can be extended to detect facial
landmarks with a facemask for biometric purposes.

 In future, this model can also be used to identify authorized persons in an
organization and to detect unauthorized persons.

References

 [1] R. Belvin, R. Burns, and C. Hein, “Development of the HRL route
navigation dialogue system,” in Proceedings of ACL-HLT, 2001.
 [2] V. Zue, S. Seneff, J. R. Glass, J. Polifroni, C. Pao, T. J. Hazen, and
L. Hetherington, “JUPITER: A Telephone Based Conversational Interface for
Weather Information,” IEEE Transactions on Speech and Audio Processing, vol.
8, no. 1, pp. 85–96, 2000.
 [3] M. Kolss, D. Bernreuther, M. Paulik, S. Stücker, S. Vogel, and A. Waibel,
“Open Domain Speech Recognition & Translation: Lectures and Speeches,” in
Proceedings of ICASSP, 2006.
 [4] D. R. S. Caon, T. Simonnet, P. Sendorek, J. Boudy, and G. Chollet, “vAssist:
The Virtual Interactive Assistant for Daily Home-Care,” in Proceedings of
pHealth, 2011.

Thank You
Q/A
