You are on page 1of 5

· Week 5 Discussion 1

Review of Association Analysis and the Advanced Concepts

Fundamental affiliation examination manages the event of one thing with another. An

affiliation calculation restricts the examination to the most often happening things, so the last

rule set extricated in the following stage is more significant. First, we should consider a media

site, such as BBC or Yahoo News, with classes like news, governmental issues, finance,

diversion, sports, and expressions.

In this model, a meeting or exchange is one visit to the site, where a similar client gets content

from various classifications within a specific meeting period. In web-based news destinations,

things are visits to classes like News, Finance, Entertainment, Sports, and Arts. We can gather

the information as displayed, with a rundown of meetings and media classes during a given

meeting. Our objective in this information mining task is to track the relationship between media

classifications.

1)

What are the techniques in handling categorical attributes?

Categorical data is a collection of information split into categories. For example, if a

company or organization attempts to get biodata about its personnel, the resultant information is

said to as categorical. This data is called categorical since it may be categorized based on the

characteristics available in the biodata, such as gender and state of residence.


Nominal Data

This data type is used to name variables without providing any numerical value. Coined from the

Latin nomenclature Nomen, this data type is a subcategory of categorical data.

Nominal data is sometimes called labeled or named data.

Ordinal Data

A data type with a predefined order or scale. This hierarchy, however, lacks a common scale for

measuring the difference in variables between scales.

Categories

These are divided into two types of categorical data: nominal data and ordinal data.

Categorical data is qualitative.

Analysis

Categorical data is analyzed using mode and median distributions, where nominal data is

analyzed with mode while ordinal data uses both.

Graphical analysis

It can also be analyzed graphically using a bar chart and a pie chart.

Numeric values

Although categorical data is qualitative, it may sometimes take numerical values.

Nature

Depending on its nature, categorical data may also be classified into binary and non-binary.
2)

How do continuous attributes differ from categorical attributes?

Data is classified into several classes, which dictate which forms of mapping may be

utilized for it. The most fundamental distinction is between continuous (or quantitative) data and

categorical data, which significantly influences the sorts of visualizations that may be utilized.

The fundamental contrast is straightforward, but it has far-reaching implications. Quantitative

data is data in which the values fluctuate continually, and the number of various values cannot

be counted. Anything that can be measured or counted is quantitative. Weight, price, earnings,

and counts are all examples. Categorical data, on the other hand, is for those parts of data that

distinguish between various groups and can often specify a small number of categories.

3)

What is a concept hierarchy?

A concept hierarchy is a series of mappings from a collection of low-level concepts to

higher-level, more general concepts. Consider the dimension location as a thought hierarchy.

Vancouver, Toronto, New York, and Chicago are city values for location.

Concept Hierarchy reduces data by gathering and replacing low-level concepts (for example,

numerical values for the attribute age) with higher-level concepts (such as young, middle-aged,

or senior). (What Is a Concept Hierarchies?, 2021)


4)

Note the major patterns of data and how they work.

Data mining creates models using the most relevant data to detect trends among the

attributes in a data collection. Models are mathematical representations of the connections

between the qualities of the objects described in the data collection. For example, to increase

dependability, the relevant phrases may be listed as guessing, predicting, and forecasting.

Prediction and forecasting are equivalent in data mining language, and the word prediction is

used as the general representation of the act. Prediction can be classified as classification or

regression depending on what is being forecasted. (Delen, 2020)

Association rule mining is also known as market-basket analysis in the retail business. Link

analysis and sequence mining are two popular association rule mining variants. Link analysis

automatically discovers ties between assorted items of interest, such as links between web pages

and referential relationships among groups of academic publishing authors. (Delen, 2020)
References

Blog, F. (2022). Categorical Data: Definition + [Examples, Variables & Analysis]. Https.

https://www.formpl.us/blog/categorical-data

Data: Continuous vs. Categorical. (2013, April 18). Eagereyes. https://eagereyes.org/basics/data-

continuous-vs-categorical

Delen, D. (2020, December 21). Introduction to Predictive Analytics and Data Mining. InformIT

Database. https://www.informit.com/articles/article.aspx?p=3100071&seqNum=4

What is a Concept Hierarchies? (2021). Www.tutorialspoint.com.

https://www.tutorialspoint.com/what-is-a-concept-hierarchies

You might also like