Professional Documents
Culture Documents
Mining Weighted Sequential Patterns in A Sequence
Mining Weighted Sequential Patterns in A Sequence
Author:
Joong Hyuk Chang
DCIT, Daegu University,
1
ABSTRACT
The weighted sequential pattern mining aims to find more interesting
sequential patterns, considering the different significance of each data
element in a sequence database.
2
OUTLINE
ABSTRACT
KEYWORDS
APPLICATIONS
INTRODUCTION
RELATED WORK
PROBLEM DEFINITION
TiWS Patterns
Mining TiWS patterns in a large database
EXPERIMENTAL RESULTS
CONCLUSION
3
KEY WORDS
Sequence database
Time-interval sequence database
Sequential pattern mining
Weighted sequential pattern
TiWS pattern
TiWS support
4
KEY WORDS
5
APPLICATIONS
6
INTRODUCTION
[Customer_B]:
Laser printer______________Jan
Scanner_________________Jun
CD Burner_______________Sep.
7
RELATED WORK
8
PROBLEM DEFINITION
10
TiWS-patterns
Definition 1:
A time-interval between pair of items:
S=<s1,s2,s3...sn> is a sequence.
TS(S)=<t1,t2,t3...tn>be the time stamp list
The time interval between si and sj is
Tiij=tj-ti where(1<i<j<n)
There exists
n×(n-1) pairs of items
2
n :no of itemsets in the sequence.Possible pairs of itemsets for SID10
1st 2nd Time-
itemset temset interval
a (abc) 1
a (ac) 2
a d 3
(abc) (ac) 1
(abc) d 2
(ac) d 1 12
Time interval weight of a pair of itemsets.
3 weight functions as
WF_1:General scale weighting: Wg(TIij)=δ( T Iij )/u=δ (tj–ti)/u
13
Time ineterval weight of the sequence.
Definition 2:
Strength of a pair of itemsets
STij=length(si)xlength(sj).
Time-interval weight of a sequence.
14
TiWS-Support
Definition3:(TiW-support of a sequence)
The TiW-support of a sequence X in SDB,TiW-
Supp(X), is defined as follows
15
TiWS-Support
Definition 6:TiWS-patterns
Given a support threshold minSupport(0<minSupport≤1),
a sequence X is Time interval weighted sequential pattern
if TiW-Supp(X) is no less than the threshold ie TiWS-
Supp(X)≥minSupport.
16
Anti-monotone property of TiWS-support
since A⊆B, the weight of the A is always greater than the weight of
Accordingly the following holds.
Mining TiWS patterns in large sequence database
Input
SDB,Minsup,& time-interval weighting function
psTiWS S=<s1,s2,s3...sn>
TS(S)=<t1,t2,t3...tn>
•Scan SDB once,
•For each sequence S call GetWeight(S).
•Find each time-interval frequet item α Procedure GetWeight(S)
such that TiWS-Supp(α)>minSupp.
W(S)
α,l,Slα
For each time-interval weighted frequent
item α, output α and call Span(α,l,Slα)
Procedure Span(α,l,Sl α
)
TiWS pattern TiWS patterns
(Single item)
Output ἀ,l+1,slἀ
21
22