(IJCSIS) International Journal of Computer Science and Information Security,Vol. 9, No. 2, February 2011
In [2], Rossi and Ale extended the well-known A-priorialgorithm for mining association rules to temporal data anddescribed a technique to find interesting patterns on the data thatare time bounded.In [5], the problem of mining maximal frequent intervals isdiscussed. They define a maximal frequent interval as an intervalthat is frequent which means that it is present in sufficientnumber of transactions and no other frequent interval contains it.Using a pre-fix traversal algorithm, the maximal frequentintervals have been found and it was also found experimentallythat pre-order traversal algorithm outperforms the A-priori basedalgorithm.Our approach is different from the above approaches. We aretaking into account the fact that the intervals of time are of fuzzynature. By calculating density of the fuzzy intervals in aparticular transactional dataset where transactions are associatedwith time intervals (non-fuzzy) as mentioned in the next section,we first compute the dense fuzzy time intervals by using someuser defined minimum density value and then apply a joinoperation to join neighboring intervals to find maximal densefuzzy intervals. The fuzzy intervals and their membershipfunctions are provided by domain experts.III PROBLEM DEFINITION
A. Some basic definitions related to fuzziness
Let
E
be the universe of discourse. A fuzzy set
A
in
E
ischaracterized by a membership function
A
(x) lying in [0,1].
A
(x)for x
∈
E
represents the grade of membership of x in
A
. Thus afuzzy set
A
is defined as
A
={(x,
A
(x)), x
∈
E
}A Fuzzy set
A
is said to be normal if
A
(x) =1 for at least one
x
∈
E
.An
α
-cut of a fuzzy set is an ordinary set of elements withmembership grade greater than or equal to a threshold
α
, 0
≤α≤
1.Thus an
α
-cut
A
α
of a fuzzy set
A
is characterized by
A
α
={x
∈
E
;
A
(x)
≥
α
} [see
e.g.
[3]]A fuzzy set is said to be convex if all its
α
-cuts are convexsets.A fuzzy number is a convex normalized fuzzy set
A
definedon the real line
R
such that1.
there exists an
x
0
∈
R
such that
A
(x
0
) =1, and2.
A
(x)
is piecewise continuous.Thus a fuzzy number can be thought of as containing the realnumbers within some interval to varying degrees.Fuzzy intervals are special fuzzy numbers satisfying thefollowing.1.
there exists an interval
[a, b]
⊂
R
such that
A
(x
0
) =1 forall x
0
∈
[a, b], and2.
A
(x)
is piecewise continuous.A fuzzy interval can be thought of as a fuzzy number with a flatregion. A fuzzy interval
A
is denoted by
A
= [a, b, c, d] with a <b < c < d where
A
(a) =
A
(d) = 0 and
A
(x) = 1 for all x
∈
[b, c].
A
(x) for all x
∈
[a, b] is known as left reference function and
A
(x)for
x
∈
[c, d] is known as the right reference function. The leftreference function is non-decreasing and the right referencefunction is non-increasing [see
e.g.
[4]]. The area of a fuzzyinterval is defined as the area bounded by the membershipfunction of the fuzzy interval and the real line.
B. Contribution (vote) of a transaction to a fuzzy interval
We define vote of a transaction t associated with the timeinterval [t
/
, t
//
] for the fuzzy interval
A
= [a, b, c, d] as follows:
∫ ∫
=
d at t t
dxxAdxxAAvote
)()(
// /
where
A
(x) is the membership function associated with the fuzzyinterval.Here
∫
// /
)(
t t
dxxA
is the portion of the area bounded by
A
(x) andthe real line included in the time interval [t
/
, t
//
].
∫
d a
dxxA
)(
isthe total area bounded by
A
(x) and the real line.Obviously
Avote
t
lies in [0,1] and if
A
⊆
[t
/
, t
//
], then
Avote
t
=1 and if
A
∩
[t
/
, t
//
] =
Φ
, then
Avote
t
=0.
C. Density of a fuzzy time interval in a data set
The density of a fuzzy interval over a given temporal intervaldataset
D
is computed by summing up the votes of all thetransactions of
D
for the corresponding fuzzy time interval anddividing it by the total number of transactions in
D
. Each recordcontributes a vote, which falls in [0, 1].
||/
DAvoteAdensity
Dt t D
∑
∈
=
A fuzzy interval is dense if its density is more than a userspecified threshold called
min_density
.
D. Join of two fuzzy intervals
The fuzzy intervals are given by the user as input. Two fuzzyintervals A and B are called neighbors or adjacent to each otherif supp(
A
∩
B
)
≠Φ
where supp(
A
∩
B
) ={x; (
A
∩
B
)(x) > 0 }[see
e.g.
[4]]. We assume that the input fuzzy intervals are such that if the intervals are arranged in the ascending order according totheir starting time then each fuzzy interval has a unique leftneighbor and a unique right neighbor. Let
A
= [a
1
, b
1
, c
1
, d
1
] and
B
= [a
2
, b
2
, c
2
, d
2
] be two adjacent fuzzy intervals. Without lossof generality we can assume that a
1
< a
2
. Also we assume that forany two adjacent fuzzy intervals such as
A
and
B
above c
1
= a
2
and d
1
= b
2
and for c
1
≤
x
≤
d
1
A
(x) = 1 –
B
(x). Our assumption isnatural since otherwise some points will be given more emphasisand some less emphasis. We define the join of
A
and
B
denotedby
A
∧
B
is defined as
A
∧
B
= [a
1
, b
1
, c
2
, d
2
]
103 http://sites.google.com/site/ijcsis/ISSN 1947-5500