You are on page 1of 3

AI Assignment 1 - Deep Blue Track

Data Platform Architecture

1. What do we mean by the trust-worthiness of data?

Data needs follow veracity. It should be validated from a single source of truth and be free from
imputed poisonous values that create bias in results. Hence, data must be trustworthy.

2. What is information retrieval? How is it different than structured query language (SQL)?

Information Retrieval is the process of extraction of data in form of text, video, images, sound
and so forth. It could mean searching these items in a document, searching the document itself
or even use the metadata to search info from databases.
Structured Query Language or SQL is used to retrieve information in form of tables comprising
of rows and columns from a relational database management system. Each record is subject to
follow the syntax of Select, From, Where and other commands that filter or aggregate the data.

3. Please mention three examples of how cloud technology can be used with data architecture
for Cognitive Enterprise.

1. Platform as a Service
2. Infrastructure as a Service
3. Hybrid cloud

4. What are typical steps in a text processing pipeline?

● Sentence Segmentation,
● Word Tokenization,
● Semantics and Syntax Predictions,
● Lemmatization,
● Stop Words,
● Dependencies Parsing,
● Extracting Facts.

5. Please give one example of use of NLP in data governance process.

Yapı Kredi bank wanted to define common business terms to collaborate data managers and
functional units. They created a business glossary to support business metadata management
processes. The bank can now store “golden source” data, up-to-date data definitions and data
quality rules.
Artificial Intelligence Architecture

1. How do you differentiate data from information?

Data is an unstructured collection of facts. Information is processed data - It is a collection of


facts that is structured and organized to the interpretability of the user.

2. Which layer of AI ladder does perform data governance?

Data Governance is performed by the ‘Organize’ layer.

3. What is the difference between Analyse and Infuse layers n terms of their key characteristics?

The Analyze layer focuses on building scalable AI models and solutions using organized data.
The Infuse layer, however, is about able to understand and comprehend how AI can be applied
in an enterprise or an organization.

4. How does data consolidation happen through the federation and virtualization mechanism?

Data Virtualization refers to providing a singular point of access to a form of source data, in an
abstract manner by hiding its details and nature.
Data Federation is a virtual data storage point that deploys a common data model and acts as an
access point for various data sources that may be heterogeneous and distributed in nature.
Using both the techniques, we integrate & design data, so as to enable simplified access to front
end applications.

5. What are non-functional requirements for performance, accuracy and stability important in AI
Architecture?
Non Functional requirements are used to judge the performance of overall operations rather
than judging individual-specific behaviors. These include attributes such as downtime, security,
performance and reliability. Performance specifically is very important as a non-functional
requirement that is measured across Response time, accuracy and stability.

Internet of Things and Edge Architecture

1. What is Internet of Things?

The Internet of Things (IoT) describes the network of physical objects—“things”—that are
embedded with sensors, software, and other technologies for the purpose of connecting and
exchanging data with other devices and systems over the internet. These devices range from
ordinary household objects to sophisticated industrial tools.
2. “Edge Computing replaces Cloud Computing”: myth or fact?

It is a myth as both are used to handle different types of computing concepts. Cloud computing
is used to deal with storage and processing of data whereas Edge computing is about process
data at the edge or source. Hence, both can be used to combine and reduce the latency induced

3. What are the advantages of running Inference on IoT Devices?

IOT data can be used as the input to a trained machine learning model, enabling predictions
that can guide decision logic on the device, at the edge gateway or elsewhere in the IOT
systems. Inference takes a lot less computing power and is often done in real-time when new
data is available.

4. Please give one or more use cases of IoT for Manufacturing Industry.

Sensors can be placed to educate and optimally use a machine’s runtime to increase longevity
and reduce wear and tear due to overuse/overheating.

5. Please provide one or more advancements of technology in the network and platform
domains.

Affordable and efficient connectivity is a fundamental component of digitalization and has


become as important as clean water and electricity in creating a sustainable society of the
future. Recognition of this fact is of critical importance as we enter a new era that is defined by
the combinatorial effects of a multitude of transformative technologies in areas such as
mobility, the Internet of Things (IoT), distributed computing and artificial intelligence (AI). Virtual
Meetings are a good example of network and platform domains

You might also like