You are on page 1of 5

Department of Computer Science

M.Sc. Computer Science – II Year 2021-22


Project Name: Cognitive Services(Speech)

The project is based on Microsoft Azure Cognitive Services. Azure Cognitive Services are
cloud-based services with REST APIs and client library SDKs available to help you build cognitive
intelligence into your applications. You can add cognitive features to your applications without
having artificial intelligence (AI) or data science skills. Azure Cognitive Services comprise various
AI services that enable you to build cognitive solutions that can see, hear, speak, understand, and
even make decisions.

Objective of the project


It is a product based on the services provided by Microsoft under Cognitive Services

 Speech to Text
 Conversation Transcription(Batch Transcription)
 Speech to Text
 Text Analytics
 Key phrase extraction

The aim of conversation transcription in the project is a speech-to-text solution that combines
speech recognition, speaker identification, and sentence attribution to each speaker (also known as
diarization) to provide real-time and/or asynchronous transcription of any conversation and the
aim of speech to text to enable speech-to-text translation of audio streams.

It gives the customer to focus more the core aspect of the meetings, conversation, interview

Technical discussion, rather then focusing on the key points to be noted. Also removes the barrier
of the language.
Scope

1. Conversation Transcription

Convert the audio conversation uploaded by the user to text format by using the
SDK provide by Microsoft Azure Cognitive AI services and stored the result into
the database.

2. Speech to Text
Convert the audio conversation uploaded by the user to the languages the user
requested. (like German, Spanish , French etc) and store the result into the
database.

Hardware

Microsoft Windows Server.

Language / Framework

C# language and .NET Web API framework.

Angular.

Testing Technologies

Unit Testing .

Manual Testing.

Bibliography/References
1. https://docs.microsoft.com/en-in/azure/cognitive-services/speech-service/conversation-
transcription
2. https://docs.microsoft.com/en-in/azure/cognitive-services/speech-service/speech-
translation
3. https://docs.microsoft.com/en-in/azure/cognitive-services/text-analytics/quickstarts/
client-libraries-rest-api?tabs=version-3-1&pivots=programming-language-csharp

You might also like