You are on page 1of 495
Bites MoKA Jiawei Han Micheline Kamber Jian Pei % Tom) ne Data Mining Concepts and Techniques Third Edition G) oh tt Tae i at af China Machine Press ARCBS DMV ORS, Hi, RRARMTRME. AAMT SHG, MRAEFAARTSRORAAS, BART RT, MAIER. RARKSOAB, WSMV T OLAP MABE, ORY T ISS BARRA REEMA. APAARERAAR LARA NAB, RAR, FRAR AAA OS SH, R-AEAT RAT, AUER RRAREN REAM, VERA AERC FRR TEA: HOSE EH PEALE 8 Jiswei Han, Micheline Kamber and Jian Pei: Data Mining: Concepts and Techniques, ‘Third Edition (ISBN 978-0-12-381479-1) Copyright © 2012 by Elsevier Inc. All rights reserved. Authorized Simplified Chinese translation edition published by the Proprietor. Copyright © 2012 by Elsevier (Singapore) Pte Lid. All rights reserved. Printed in China by Machine Press under special arrangement with Elsevier ( Singapore ) Pte Lid. This edition is authorized for sale in China only, excluding Hong Kong SAR and Tai- wan, Unauthorized export of this edition is a violation of the Copyright Act. Violation of this Law is sub- ject to Civil and Criminal Penalties. ARTS RHP CH eh DUAR Th a WRAL AG Elsevier (Singapore) Pte Lid. 7E" BLK Bis NAHE HA. ACRE PA (RP PETRA RA PR) NRO. ABSA, WHERE, SRL HR. ERA eR ALR, BALE APACHE JR RARIR SH ABMALEIAS: AF: 01-2012-0225 BENE (CIP) Sie BASIS: MESSER (HZ NO / (98) PhaRHE (Han, 1.) 535; WR bE: ‘PUR LAe ts eat, 2012.7 CHL EH) BAIRX: Data Mining: Concepts and Techniques, Third Edition ISBN 978-7-111-39140-1 1. Be UD @yw-- ML RGB WV. TP274 op ASIA CIP McA (2012) 3B 157938 S ‘SLR RRL Ct TE: CUR AC us A A ET BA BRS ED 2012 48 FAS 1 ARB 1 ACUI 185mm x 260mm + 31 EDIE SRHEBS. ISBN 978-7-111-39140-1 IROL AA 22 AALS 100087) SEB: 79.00 Te FURS, ATL BLT, ABET, HAL AT RBA &l (010) 88378991 ; 88361066 WIPE. (010) 68326294; 88379649, 68995259 (010) 88379604 EMEA; hzjsj@bzbook.com | aneen Date Mining: Concepis and Techniques, Tied Editon RERMAGE, BORK NDE RAE, (EP SE REY AT ARIUE TATE; IERIE, REAR EMA TS ale FE, SUR, ERAGE, MPU ST RAR eA, HSU SPRL DYE eC ALE SA AE AER, TTP AE ORR EE, ARE SAT FNM, AR TARE, PERAERE, MBA, HOMERS AEA NEAT WE WE, ESP R AMAT, REM eR, Ae AA ER FAA, OMA A RE, ALAR, TO A LEPERBE, EREABRAAEN MRM RRT, REGRET AOE BROIL EAREMRE NARA ASG EES Alte, Sk EIME AT FALE HERE: ROLE TL RAMEE, A STL BEAL ECIE AHA AE ZB PRT RA ARIA) “MENUS”. 8 1998 RIFE, RTT RE LIRR ATE TRB. BEIM Lb. BASE RMI, RAS Pearson, McGraw-Hill, Elsevier, MIT, John Wiley & Sons, Cengage St 43% HMR He 57 T BAT AY GHP, MTR RC BCH PALE Andrew S, Tanenbaum, Bjarne Stroustrup, Brain W. Kernighan, Dennis Ritchie, Jim Gray, Afted V. Aho, John E. Hoperoft, Jeffrey D. Ullman, Abraham Silberschatz, William Stallings, Donald E. Knuth, John L. Hennessy, Larry L. Peterson SAIMZ ROBIE ah, VA TSU” ER, GER EA. ER ik, ACG SOARS A, HEA TAA SAL “TOLER” AO LPR BT ANE OS EB, ARAN HE Hy AO UERLGE Se, ERNEST AP ALABAE T MAPE A BRON LHe; MTR A ee ARS EE EPR, ANREEAH BOTHER. 2S. “AR OPEAR BRT BATH, RAPER PHT ROM, SRR RA Ene AS SB Bo HANI “BREE” PER RS REAR TR BURLAIIER . SRLS. TL. PRET BE AIOE, SRD RE EIR TT AY FEAT ASR TE, BERT POLARS BEANS WA BES RT RE TB BE I EIR Me, SPATE ER A AE A — EE, FET AA ES HX, HLA RELRAAR AAPOR. PRA KOE MRE MA WOT EME DLR PATE, TRAM RTT ww, hzbook. com 7)sj@ hzbook. com z 010) 88379604 a ae 7 AHAB SH np *PSCHEE | Data Mining: Concepts and Teeiques, Thied Eaton We are pleased to see that our third edition has been translated into Chinese by Professor Fan and Meng. The first two editions were translated by them several years ago and have been well re~ ceived among Chinese readers, In recent years, we have witnessed tremendous progress in the field of data mining research and applications internationally. As a promising new technology, data mining has attracted tremendous interest in the Far East as well. Numerous international and regional confer- ences on data mining and applications have appeared or held in this region. Many Chinese research- ers have been playing an active role, contributing in both research and applications to the advances of this young field. In this third edition, we have carefully selected and tailored the technical materials to be cov- ered for the courses on data mining at both the undergraduate level and the first-year graduate lev- el. We-have updated and enhanced the existing chapters substantially with many new topics. Thus, wwe expect the publication of this edition in Chinese will help Chinese readers to learn and master the latest technology and put them into promising new applications. With best regards, CAPR SUAS THE 3 ACE TO Aa EE R.A, HATE A AVATAR EAT EARS. HEAR, BAPE TARAS A FE AN DF WHAM SAE. (EAN—MAA RRB MRR, BORE TE IR th 51 TBO 8. HEMMER MMA SNES A. EPS EEA, HE BNI +E LR ATE AT OR CBI GP, RTM ASAT ROPER, WAT ARE IE SORIEH “BORE” RH. RNS ROEE, ARE MAIER T CARY PAT, SOA PH EE ER A PEAT ARTA. FESR FOL!) Jiawei Han, Micheline Kamber, and Jian Pei June 2012 | eae Dat Mining: Concepts and Techniques, Thiel Eton 2001 4F, Jiawei Han (#3¢Hi) Ml Micheline Kamber ti T RUSH ORA A BERT SUBSE ARB AI 1 AL. 2006 FE, HAT HE TAR ASB 2 Mo EK TP IEA (2012 4), RABE TAH 3, FARRAH SR, PANE ARES Jian Pei (484) o RECA EEIR, FAMMARERODLZ— RBRAROH, BUBER K, GPHAKAREERRO RD, HORTA TRA AOSTA, AROUND F, MITES SEE. ESL SAE. ARERR (UAE FILH ROSH ACE, Me BAHIA BE, AMAT RT ELA BD AT SU, ROTTER. PU, BUSH (MOCRLE PE AR SL KARAM) BRA ARR. ERR ART TMT IE, FBTR HAAR AMSA. a SCRE —TSEAWAL TR. PERERA. TM, AWARE LAI RAIA D ACES PH RA, BURL SEITE. PRA. EE Bik Wl, MAERSK. OK, SERA MESA RR. AWM, Bee SER TUR Fa BAS AE ESR TS BR, RA ok AS A eS THAMBAMK. SR, BGC AAA BARE NE A RAE HY ERRRARHS. MRED SAS CRNA RAMT BH RLS — . MEAL FAS ARS ETD, TORR VET LRU EA RR SR BAB LMA BAT UF, CEMA LL EH, Jiawei Han BARS VOREVE PUI BE TRB AI 1 AN 2 MR, ASAE TE, FP aR BEEREABLFTA AG AIR IIA PL BATA SB BA HH A EE SFE He SE FE, BER AMAIA RAST BSE LH T BCR, HAT AMOR BB ARPEGUR, TALIS. ACTA, BCA, ULES SOUT. BAO BAF RABE AR SMA CBRE THES ABORT ILE RAE, WK, SHEAR RARAW SIGMOD, ICDE ABSA AIL KDD ARAAETE AED) 0 AN 1 RROD A EY 11 AE AEA BT, BT SRORBRE ABER, MATES A. RAMA MRK WAIHI, ELE A a FAG ER A SE BR OY, EP. PA, TLRS APE. AUER, Sl. HR, ARAMA, ACOA. PMN, BEART SK ot AAU BALE A] AE AERA, GES AE A A BR AL AE, Web BLM AI STS AAG LEAS HEH ALAS APE, TG LS FG SE A HR A ARE OT SE 3B 3 HAS HURT MRUETT T SRST, ZEAL TBA. DAO PEARL RE WULOROAS. IEEE a SUE FEA OLAP HR, Hist PETIT OK, FRAP. FP, REAR, J SHCRRAHOSAIA. BAM, BTA TE. Beis — HO TTS MAB TOR, ES BRAW ER, SHPUNTA LL, O83 ie Vi HAREAATEAS. SUR DL LL SERA ATA LA ES OUR LATE , 5 ABT ASE 2 ALA BS SURAT AGL A T RUN, AR ASE 3 FL NAR, HORA RHMASCSETRE, EHREANR ERAN MOIRA HEH RARE RECSATSES. Jiawei Han BR EMERRTONAS, ARMS, BME AAR MLS filo MEAN AMICK MEANS - HEALER, MELANOMAS ELM — LEME SL Aft) Bliss £2. Jiawei Han SCABIES MBAR EAR Ieee, ACM ATEEE 2. (hi A EARS MRE UA, SLR ACM SIGK- DD G83 (2004) , IEEE HEPES ARAL (2005) Ail IEEE W. Wallace McDowell (2009) . Re. HEAR, MESA, ERR, Bh. MENS 1 AAT ARR RAE AVR. ME, Wee SHENAE IL 3 MEO AR. PEER PER RSM TR 3 RENT CHE. ARIAL A BAI YE Jiawei Han BEBE, FEIC ALIS 1 AL, 2 AR, TEE HATA TI SCE, HEARNE AY A IAS A SB EL HP BE A PES — SITAR 2 ANB 3 OP RRS TFA. SUT LEE A], AMT RRA BR RRS ES LL FES 3 MAURIE, RTBU TARGA GPE. BERLE 1 WR. GB 2 ARATE ANE BR, BIRTHS, MME. RERNABw EE. eA RGB, (LDA MER My BUI ANY Sho FSP GRAA 4 Zeb, SEAM RIE. edu. cn, FSA HEAR BUTE RR RAB OTE A I ABE ARMAS, HB RATE Pi — EER BARRE. AURA FER , FRPSSINT A OMB, BE 26 3 HAS PEAS BT E45, Jiawei Han BEE BM Eve 2012 #6 A | Pee Ri sT Dia Mining: Concepts sod Techniques, Taint Ean JOR AEE, AS. By SOESMRERWERAEA, ALERSRRRIS VERS BAR, TRUE SBCA AT DHE FERAL, HOUR RARE. BORE 2 ASE. HAREM SHUR OE. 19891990 EM TMA Simon Fraser KEE R, RSWARE HR. 1999 FSH RA Wright State EH HAESTBAR, ASRGHEIOITE. RI AUT RCE | FRR AEX 60 Feil. BRA 89h, VRAERBET Pang-Ning Tan, Michael Steinbach #] Vipin Kumar 4) (GRIESE) o RAME Hb, PRARAF ARSE, PLES. B HPHAMASHH BH, PHAM ESREE LESH, Gournal of Computer Science and Technology), (Frontiers of Computer HARD CHES. coe See, ERS i (1998, 2001). earanee el % (2009). Jb: MERER KBR (2011) PRM, Atk “AEA AR” (2002), “BCT ARTI LIE A AEs” (2004). PARANA” (2005), HES MASA PRAM AIT 120 Binh, MBEAN AE (Moving Objects Management; Models, ‘Techniques, and Applications) (Springer) , (XML cH SR) (PEHROES ERE) iM EBM OR SE BE, LIE Web Bet Seal. XML AGRE RS. BECERRA. RAMS 953 RFE I Data Mining; Cancepts and Techniques, ‘Third Eton STK ABARE UB i. HBR “super crunchers” (2 Bi) ROPETTAT AIPA tte SAT SACRE Ae AAG BL TE RSE EE PE EB SEAT PK #: RRUUMRACRPRUBHARS , WRG LUTE MRIT EA Ht, EAA DA: ABI TAT AAS BE Pe ORF EARS, MHRA TY PS AAI. AL A i SAA PARNER. UES ARBGE. AREA HH, UME (Twitter) F, A-ML. SRR, FEAL OTR, LMR AUT EIA FR, BE ko tT FE SALI 3 WAY ZERE, aR olan, HB). MPMEARK, HMMA Oe, MERA i. AE Te AHS, PRAWAAEM MAP. PCT ERA, RARE AMER VRAER A FATE ARF PARTE Io AVE — HERERO IC ST, FF ELE RE RS 3 Wo PMA SEMIN: (ASE ATRL, ABL 100 1913051 FA 2006 FLEW IE, ERM, MAMA. (RS, UR A. MER, AT —h; AROMA BR, MARR, UTES. BM, top-k HSN SRE ROR ARK RE—BKF AMARA ERB SS, CLEARS SH waLR— AEE WBA TH 5 Jiawei, Micheline, Jian (4% T BOB 2 REY DT TK, RH PRC EL, BBE PE IK Christos Falontsos FAR - MERE FAL AUG HK), BA ANE (Bin, SVD/PCA, Avie, AF | 582 nore Data Mining: Concepts and Techniques, Thind Fiton PATHE (AER. ITER. ACR SGE, GSE SGE) PTFE BL. AREA HE. IER BST ROME Lk, BRA ARE, ADWARE. BONAR, BSI Baa RAR HPO. SAMIR. RRB, BOAMAM TZ —. BITE, ABE. ALS RAL: PROTA REE HATA FUR PAE HET TA, EXE EMILE ERR 747, Jiawei Han #1 Micheline Kamber (JE Gl HERB) 545 BRAS MH A BSE — HB HERBIER. CHA T PASI CET RAT AISI, MAT ADS NT RT SHER, EU LOSS SRA ERE RMT. YMRS, WRT IE BRI. ean RARCSHR, BA TRSABAM, wy, FI, A. AR Bil, Ha5025 [al . 5 RAST. BATA AG ET AR RL BECAME reeene TEL, CAR SCARS Se A TIT CS 5 AA Hae SEER Fe AUREL AOFM ITA ARBRE. Wh, BRIA 38, HIM, IRAE GAMA. 3k YESS FA SER ee SE, TM RAAT AE ESC ALITY BOM NAA HL RT DES PN A RST PEE, FAASES. REM UIE | ENT PPM, FF AEE 2 WTP SRE FER T AR. Jiawei Han Micheline Kamber Ze SRS 13% 77 iil — . » bain pa oe BUR. aA REAR Jim Gray Microsoft Research XB anh aE |i a | Data Mining: Concepts and Teebniqus, Tied Baton LS HOLE SIE ER T RTE AE EH. ABE TE FATE. FPR NT CURA OBE RE HES KE RSL LAMAR, LAR (TE REE ERR MG BAR, PSOE EAT SPRITE, TFA AK R HAT ZEB. Be Fe ER 46 Oh oRAM (KDD), A AMAA EMAAR AER, ORS MCE KU 4BPE, BGR-EPE, Web. FUWKEHa BERET. KAZRO VSAM SMEAR, (N—-TE ERM, BUSS PERRET. RRS. SLES. BCS, BUR PEER (eae. HRAE. AUER. ACCME. PIPER A RET Oe, BERRI TE XB a AHMBRIOEA, VET. AME. ARCHER RE, Alt, ein § FE ee eee Blah SE FUR AEWA. BAIT RR Erewinineriineesnant * BMA AAW. SUBIC LAF 20 Htc 80 EAL, 20 Hee 90 Eft TR ENA, OE PTERSIR, APSHRAKIM, TAHOMA MRR, CRORE HBO TLR, kD ARREARS AHR BAHSB 1 WR, SED MEARE, Be HARE AMG T BIG, ana MBAR AR KSA aE tate as, EA SRT, BAS, Web. ZIM, BLIP RI, HEHE A aT HY BEE PARAS EA PE NIE EE ME. Alt, Be TPE. stu PARA ATA, PERALTA ABATE AR I BO, TPES AE ARS AU Ub Fa 3S — AC BO TH 3 MA ANTPMAT SHISTT , IRAART SBWRARAR, SAT FE ARLE PREM A, BMP RICH EMT Cl, BoE Hh, SUR BSUCHR. ASAE) 1EIX — ABE HERP RAE, WP Ew, —ERELABSMEA, A-BAT. 2 WF RABE (BIW, ORR RR, BUR, the eae PASE RAR, VA, Web, SHRI ARTI BE) BLE SIP Bae AGO HBR AR) co YT SARE AD ER, FRAT 2 RRA RET HY FH BOA BINA Lb , (RN 3 MACE 3 ARORA ( BARE) x 1 RRUKT RUM SLD Sie, TIE FB AER ME ER ROLRABASE MAE. RRS RARMRERM, WIKRH, FHWA BGR CEAGE, UBM ARMARIM, MUATAE A, FFF, BARN. TALS. Awe HA BOE. SORBGE. A. HPA Web WR. KREDI AAR IRA DAB PRLS, NAHB ELLE HEY 3 B2 EPA Ra REE PRCT ROME BER BBARHT BAR PARES RE RE. REEL IS UH HHT GAR BRT BEBE OY HY LAL tk Sh, BPICAR, RE. ASAE OCT. 2 BARS HMBC ITE {EAU FE HERI AISPBARERABLA. WEA N ARERR WS, RSW CRUEE, BUR Seah, SUBIAA, Baas AR RIK. SB 4 RAS BRAUER, OMAP (RAPHRD) MABAFARAWTIC. HA EA BUR OE AM OLAP MAS RL, AP, ASCE, VR CPE HINRA. BS RTA We RAE, TBR, TERME HRI, IE Star-Cubing #42 OLAP Wik BS, RE SLUR. HEPES IOS HRMS A FP SLSR Ye AE AB a HET hea. 6 RAS 7 BPA RAMAR ET MPRA, ARMA. 6 RITA LE , MNOS AT, LO IEE TS TRI AS AER a AS Apriori S125 #1 FE, BIBUEE AR AS SRO TTI, LAH EH SL ANSE BY st sel, 42: SORA ATER. ER. B7 BTR AR. 1K , iA AIMS, HEHE AUS Aas Be, AR Pe ASE: BATE. RET MARE MAN, LWT EAR BSBA RARE RAK. HPT KNERENSAE, AAS OL Ho BB SPADA AT PRICE EA LOEW STS AEF LMU STE. BRAT CRAMER, USGA RK, OA RAUL BUR. BI BWTCARO ARAL, CSM. STS MAR LE TYROL, GEAUSTS RRS, k- ETRE, HEA AEC MRT. BRK, Es HEBET E 10 MMA RANA SMI, CHR Ae: AP ENE, SUS. AUT PE BRARAIHEIA. B11 RHIC BK, MERTEN BH GE. FSM SE, DRO Peskin IR H2BSWCSH ALA, KEPAARESWRAMRS AMEE, EAR BAR UPR, PERM) URI (8 ET BUEN TE ET RAWAEMETARO AK) WER. WRC RRR BL SURRTER, UAE TAA. Sua, FER 13 BRAC AMAR ATS HARES FRUIT, CATR TIRE Cin, BTA, APS EAE F Hh, VASSAR ZR, SCRA Web BUR. Rc MOSHE IR Dy tk AN) ig i Si 3 MI SWAG RRA—H. , RPI A RRS, PASI i, RGRSTE, ATE, DSO. HCE ESE AT. SRAM, PESTLE, UR T NN. BCR RE HSRAARMRA, ATARI AREA TM, PURER HSS, GAUL CARICOSGEM, UR, RSA SNwM, RK (LARSEN RREBSRAT . PASTRIES, TRAE APR Ee SPER EIR BO BRA: ES WH, SRRURRAAAH, WET Le a ORE RAMI T Betis IR OLAP 477 FREE HE EE, AES AB HP He HE: MSD HLT az TO, PATER, On BU. SAE STRIDES YTE FST. BAUR ARES PERE RA TUR — TETRA OBES, APE EAE oh — ER DFE BUR Se. RT UE BUS AR Ne Ae Sb, AR (une, ¢s. uiue, edu/ ~ hanj/b3 B www. booksite. mkp. com/datamining3e) BAe HE T AEE ATR eat. bnaaniies thhnaeen RAR, PRAT LGR RE ROSE, LIZA TEA) SEAT sB10m ti eampetiot sb tex| | matin ti Sea AT BP Ate mae tira AREA ERE, ORM LAPEER & Sm BK ARF BRA ARNOTT LS Fei I “EO RE PIE MOTI B OGRA BUNTY LRG “37 BURCH”, MM OLAP FNRI S72 WR AU AO BUIPAT Ln “FE 4 BE BU SOLS TALI” An SS ER BURST RR”, Re, UAT EAE TE REP BEA E , pF » BYTE iS, MLAS RENN ADL SOC BE PR PRI] LA SAAS PS RUA CES iE, MLTR TTA. AHS — BAT LE, CR RE RSE. PLES, BSB HRPTENARREN ES. BRSHBA HIG, WAAR. KA PUA A ee RHONA, RELGROMT SW AMM, RAR RT. A Mea AT TSE HURL, SEA TM SCALE PERT VA AR SE A I. FRE ARATE RIT BENT RE TOESE SCR BES RMB BAMA, ECR RNS ORR, ATR WARREN, TAPERS, SRSA TA, BAER. & PASTHSRAME, WERT EIA RME TR. REBATE ABR OY, 18 BRERA EBAM AAAS BRIM, BTS ia RUE Ts TR AT RARER. DEAT, EAT EAE A? °° GMYRAKTRUS, BAER EET OSAMA OR, AM, REARS LEMAR GAR, ERA ML RT AMERA IIL ARGO, RCRA HITE. © KMART HA. PREM MB MES, REARS HE EDOREHN fi BCE TS 6 BReWAR ALT EMBRAER. All, KHRKFREMN- ALA PM. th FH RMR BS A AL, UATE OC RAR, BET RARE SRB EAB FAS A SEAT EATS AP CR FR PWS PLN. RI, ZEA RTA, AEA. tah, ARE ETE ARRCRIRNS HRA, ADTHEA REA AGH, RBS EFA. APTN RAMP ETA, SHORES TRRRMEAM SBE WOAH, AS BE eI “DLL” BRP RT RIA. KREME PEACE. NSS RL eC, ALINE, BERR C Bk C++ UE THOME, URAL SESE, AS RIS RATA PoE OEE OEP BOHRA BFE TE HE MES . ABR ACB RUA NHLHER wow. cs, uiuc. edu/ ~ hanj/bk3, 5—-7-4E Morgan Kaufmann tH ieee) ij www. booksite. mip. com/datamining3eo 1X26 PA HE IA 5 AYER AU BIE BE ET SEES, DER ELH: © PRAVAITH. AKT ACY PowerPoint PEM BABE. © ABA ORB SB, AG 2 MINH 8 ~ 10 RR a THR AH HE, SOM TAR, SSR ER aE A TT a FR © AFM, A BES A RL eH AEH, °° RAMEY. MAA ANAIKT HP RR et FC A AAU, ATIF LER ° FRAUEN RALRAR, HMMA MERAA, BRS AUER ROR. WATE RR REAR 8 ee SCRA RHE, WPA AEE — LEB UliMine A ERE (hutp, //illimine. cs. uiue. edu) ° we © Hea, SARA A, MLE ARANDA RAS ZH RL AE SCR) G7 BAF tl FF REIT © APAR. PDF ix. © ABP MAME. KURA AEP HHR, —AARBES, RADE RR, J AH ABH. PRA RAE hanj@ cs. uine. edu FH AF DOT Ay x Data Mining: Conoapt and Techniques, Thind aition $8 3 RBC BEA UIUC HSH LAE TA A RAE ~ LR SE HAO AASB S fi, SEWER (DAIS) AGRI ANAE AL LA DUE Se AA eA] RAR TUBE LATTA SEA i 8) SCF ES RATE HL HP EEN. ERA A UIUC 2010—2011 324F CS412 # CS512 URERASSEAE, AMT AF MSE TSA, BOT WAR, RUT SHAG. RRA UR Morgan Kaufmann iti fi4L A A4T A David Bevans #fi Rick Adams, Ris} TTAERRATS (EA TREE A O. R AUSCS. RRAT BHT AEB Marilyn Rash FURUE ALBA, (MATER TEAST BATMAN AAR, 4A. NASA, YEE RBA (NSERC), LK IBM FEBS, BARGTIEGE, Google, HRUTEBS. WA. HP 3 BSAAMMALKE, OMIA RES, AAMT MMA MARR. ew PAFLIMER T BATMAN TIE RTI HS, BARN BRAN, RAT TN HEA ORH. 582 hat BEAT) UTUC Sie F2 128 A AE A A A. AES - MN PROPER SES fk (DAIS) BUM ANSE AE LA DS HA A a Be BAS, AMAT Ae AS ASN Se A AS TR ATT ES 2 ME Pe a ER. ME A LH: Gul Agha, Rakesh Agrawal, Loretta Auvil, Peter Bajesy, Geneva Belford, Deng Cai, Y. Dora Cai, Roy Cambell, Kevin C.-C. Chang, Surajit Chaudhuri, Chen Chen, Yixin Chen, Yuguo Chen, Hong Cheng, David Cheung, Shengnan Cong, Gerald DeJong, AnHai Doan, Guozhu Dong, Charios Ermopoulos, Martin Ester, Christos Faloutsos, Wei Fan, Jack C. Feng, Ada Fu, Michael Gar- land, Johannes Gehrke, Hector Gonzalez, Mehdi Harandi, Thomas Huang, Wen Jin, Chulyun Kim, Sangkyum Kim, Won Kim, Won- Young Kim, David Kuck, Young- Koo Lee, Harris Lewin, Xiaolei Li, Yifan Li, Chao Liu, Han Liu, Huan Liu, Hongyan Liu, Lei Liu, Ying Lu, Klara Nahrstedt, David Padua, Jian Pei, Lenny Pitt, Daniel Reed, Dan Roth, Bruce Schatz, Zheng Shao, Mare Snir, Zhaohui ‘Tang, Bhavani M. Thuraisingham, Josep Torrellas, Peter Tevet- 0 HSh, FRR AE MEK RASS kov, Benjamin W. Wah, Haixun Wang, Jianyong Wang, Ke Wang, Muyuan Wang, Wei Wang, Michael Welge, Marianne Winslett, Ouri Wolfson, Andrew Wu, Tianyi Wu, Dong Xin, Xifeng Yan, Jiong Yang, Xiaoxin Yin, Hwanjo Yu, Jeffrey X. Yu, Philip S. Yu, Maria Zemankova, ChengXiang Zhai, Yuanyuan Zhou, Wei Zou. Deng Cai #i ChengXiang Zhai X} SCARE AL Web FHP AT, Nifeng Yan xt AtEiR— i, Xiaoxin Yin Xf & HX RHA — WV Heth T FLAK, Hong Cheng, Charios Ermopoulos, Hector Gonzalez, David J. Hill, Chulyun Kim, Sangkyum Kim, Chao Liu, Hongyan Liu, Kasi M Manzoor, Tianyi Wu, Xifeng Yan, Xiaoxin Yin RUT FROM, Fe TR A AREA Morgan Kaufmann th GALI RATA Diane Cerra, BEBE ARES (EMAL) BAMA. LAE. RAB GK AV AS Alan Rose, RB} ANREP A DO Hh SR MRA, LHR, RATA MT ICR ARM, BRATZ Ri. WS, RATRIRNORA, BMI BREST SB 1 Raia Seal AB OE 5 Fe] RAS ASEM DBMiner SEE, RAPER AED Ta BR ATT Be Rs A OT A Ze OR GK A 44H: Rakesh Agrawal, Stella Atkins, Yvan Bedard, Binay Bhattacharya, (Yandong) Dora Cai, Nick Cercone, Surajit Chaudhuri, Sonny H.$.Chee, Jianping Chen, Ming- Syan Chen, Qing Chen, Qiming Chen, _ Shan Cheng, David Cheung, Shi Cong, Son Dao, Umeshwar Dayal, James Delgrande, Guozhu Dong, Carole Edwards, Max Egenhofer, Martin Ester, Usama Fayyad, Ling Feng, Ada Fu, Yongjian Fu, Daphne Gelbart, Randy Goebel, Jim Gray, Robert Grossman, Wan Gong, Yike Guo, Eli Hagen, Howard Hamilton, Jing He, Lamy Henschen, Jean Hou, Mei-Chun Hsu, Kan Hu, Haiming Huang, Yue Huang, Julia Itskevitch, Wen Jin, Tiko Kameda, Hiroyuki Kawano, Rizwan Kheraj, Eddie Kim, Won Kim, Krzysztof Koperski, Hans-Peter Kriegel, Vipin Kumar, Laks V.S.Lakshmanan, Joyce Man Lam, James Lau, Deyi Li, George (Wenmin) Li, Jin Li, Ze-Nian Li, Naney Liao, Gang Liu, Jungiang Liu, Ling Liu, Alan (Yijun) Lu, Hongjun Lu, Tong Lu, Wei Lu, Xuebin Lu, Wo-Shun Luk, Heikki Mannila, Runying Mao, Abhay Mehta, Gabor Melli, Alberto Mendelzon, Tim Merrett, Harvey Miller, Drew Miners, Behzad Mortazavi- Asl, Richard Muntz, Raymond T. Ng, Vicent Ng, Shojiro Nishio, Beng-Chin Ooi, Tamer Ozsu, Jian Pei, Gregory Piatetsky-Shapiro, Helen Pinto, Fred Popowich, Amynmohamed Rajan, Peter Scheuermann, Shashi Shekhar, Wei-Min Shen, Avi Silberschatz, Evangelos Simoudis, Nebojsa Stefanovic, Yin Jenny Tam, Simon Tang, Zhaohui Tang, Dick Tsur, Anthony K. H. Tung, Ke Wang, Wei Wang, Zhaoxia Wang, Tony Wind, Lara Winstone, Ju Wu, Betty (Bin) Xia, Cindy M. Xin, Xiaowei Xu, Qiang Yang, Yiwen Yin, Clement Yu, Jeffrey Yu, Philip S. Yu, Osmar R. Zaiane, Carlo Zaniolo, Shuhua Zhang, Zhong Zhang, Yvonne Zheng, Xiuofang Zhou, Hua Zhu, FU {EZR Jean Hou, Helen Pinto, Lara Winstone, Hua Zhu, Ria Ait] AS Dee ARABS fj) —ALS PA; RRR} Eugene Belchev, REA VDIEBEM T BE, PUT Ar BARI Morgan Kaufmann 44 CHAT BUT 8.4848 Diarie Cerra, RRULMKAEAR HS 1 SANTO. LO ASHE; BRAN TITTLE ED HH Howard Severson AIA AY TIS, ARAM PRENATAL. RTM AAMC E AR, NTR. Sk i, RATRWRIMAA, SUMMIT Ae Sb. nonce nneocaninaeenenssisecnsseitiatl | 28 fis Data Mining: Concepts and Techniques, Thied Eton viawei. Han (back: SEA AALE A — LST SOLER MY Bliss BUR. ALAR Se IL A BAR FS HT IE To UY HR TT ASF & 3, 44H ACM SIGKDD 8) HK (2004), IEEE +#B OLE SRAM (2005) AIEEE W. Wallace McDowell % (2009), ft #2 ACM #UIEEE 24-, fthiBHBfE (ACM Transactions on Knowledge Discovery from Data) fit EM (20062011) MPAA AYA, 44% CIERE Transactions on Knowledge and Data Engineering) #1 (Data Mining Knowledge Discovery) . Micheline Kamber ¢ hi @ AGILE SFA Concordia KERR ONE (ATES ML) EARL. WHE NSERC 48, fe ADTSERTE McGill KA*, Bae — REE AL THE 5 tA BRIE A RAL Ss FE TES PEI ORS AR ESE A BUNA EiNTB, dian Pei (efit) SUC DO Se - VERE SOU EBA. {TE Jiawei Han 14% SEP, F 2002 FARR - REA RAS ILE. ERIS BEE Web 12 BIMEBMRWEREREERR TIKES, PRWURS TPR AIK, ANIC S| RR PK, HRERR EK HS BEAR TT AE OY |H #/ | Data Mining; Concepts and Techniques, Thin! Edition HR IE FRSA Bar Re 3 2 we Bus eS IR sie L 1 1 1 5 ARR 1 1 Lt Catt LLL BOARDER ccc 1 LL 2 SabddR Me - 2 tha eae 3 MART AAS 1.3.1 SR 1.3.2 dee 1 1 4 1 3.3 BH ate 3.4 Se RAL aE BY LASHES 4 aes 1.4.2 1.4.3 wae 44 45 4.6 5.1 RHE 25.2 PBEA ” 1.5.3 HA RRS EEE ak TBI SSA 6.1 BRAT 6.2 Web 231 SCAN ER AG ae 7.2 RAE APRB~ Fi BAN Ao TA BT RUE ERA SH REAR BS 19 De 1.10 SCmRRERE ~~ B28 RMR 26 21 BOERS Re 26 211 21.2 213 214 pa | 2.1.6 2.2 SRA SETH 29 2.21 Ps BIMER: 4h. pAb ake Rak 2.2.2 RERRA: ME, WME, FE. ABE FOUI ANE BALE veers 32 2.2.3 RAB HS A A oi aT ak ReLs 2.3 Sete A BL 37 23.1 RPA TRA 37 23.2 RAY TER 38 2.3.3 REBAR TM. 2.3.4 PUA 42 2.3.5 TRL At RACK 2.4 SRSA ASE 44 2.4.1 aRAB SEM 5 OF EE 45 2.4.2 Fay ARSENE EB 46 2.4.3 By AEE EB 46 2.4.4 SUA Oa A BUT RIALS 24.5 FRR EERE 2.4.6 RERBRM MA 3.1 ane BR 3.11 MRE: AMARA ETD 1.2 KB RRM HH ZRAS BOR 2.1 RRB 2.2 RE SE 2.3 ABA EE HE SURE Re 3.1 pital BB 65 3.4 Bae rt Rabb Bana . 4.1 S49 25 Rob te 4.2 bik 4.3 Eka aA . 44 BETS oe 4,5 aya foxt ARG: BBA AB IAL 3.4.6 3.4.7 3.4.8 seeeteeeeee 3.4.9 BABE ARR ce 5 3, SUBIR SBR Be SL AE RE BRR OPAL voererereeee 73, 3.5.2 sBRLARIE AG Rae 3.5.3 mide BRIE 3.5.4 Be RADAR Re 6 3.5.5 BERR, RA AAA BA BTA a 3.5.6 MAREE DR Pa 3.8 SCHERER B48 SRCRSRUD ME 4.1 BGR FE: Se ALL PARE E 4.1.2 PAPER ARS HH Ba 413 Ate BR i 0b ste ak see 414 RRER: HSK RBS . 7 41S KMCRRA, SER BAER We Bae 16 BURR, R 17 RRR SURO MER. BR tk 43 OLAP s = ve ® * an 4.2.2 BY, Sibi BRAD OA 2.3 He: 2.4 REMPRAHH- 2.5 Aah OLAP ARE - 2.6 HSA SMe ka + B 4.3 FRC RTS OO +99 43.1 BARE RR MSO WER 4.3.2 Babe IKE 4.3.3 BAB AE aeeA 99 SEAR oo 100 EAB 101 4.3.4 MOR Lap A LAY S Hea AE 464K + 102 4.4 BUR EHS + 103 4.4.1 CAB ARH AHL SE HER cesceeeseeeeeeccesennsneee 103 4.4.2 431 OLAP ate. eB eal Foik te eG) 105 4.4.3 OLAP #itj is Aas + 107 4.4.4 OLAP ik 4 #4544: ROLAP, MOLAP, HOLAP 44 3%4% --- 107 4.5 SUR th: TREES -- 109 4.5.1. BABA AE AY BI A yaw 109 4.5.2 ney ABH 2 A A A RM, see 13 4.5.3 Res ah dh ey ey: 114 4.6 ANS 116 4.7 33 7 4.8 3 ' 19 BSR BARA R 121 5.1 BURSITIS: BARA + 121 5.11 i 122 5.1.2 sevseceeeeseessssneeesess 124 5.2 BARTERI ee 126 5.2.1 BRAA AI HOb FB BRI vere 126 5.2.2 BUC: ARADO FE seseeee 129 5.2.3. Star-Cubing: MHA Bat Hit Sakis BAR vee 132 5.2.4 Ay thik & Ht OLAP Hist IE : + 136 5.3. (EH cRNA aE PEA) eeecvsseeeeeceessenneeeneeeeeeee 141 141 5.3.2 ARR RAR: top-k ih oy Arai 5.4 BGR RI BM aS oH 147 5.4.1 Fa RAR ia RMAC 5.4.2 SHEAR: SR. HARE 5.4.3 RTH RR AD LAKE ARE 5.5 Na 5.6 St 5.7 SCWREERE HOS eieASewst, KRINAK EASA, 6.1 RARER 6.1.1 Maat: —PER at seeeeeeeeees 1ST 6.1.2 ERR, PRK 0] “ 6.2 FARIA 6.2.1 Apriori Shik: shit ALA KEP ERRRERE 160 6.2.2 BRR RH ARR PLAY oe ce 164 6.2.3 48 Aprion’ Hk Ag -- 165 624 RAK RAE K 166 6 169 6.2.6 sei ABA PRARA --- 170 ELAR EA a NY BUPA 6.4 Ni 176 6.5 BRR ee 177 6.6 SCHEER 179 B7e Basti ~ 180 7.1 RHE: TRA + 180 numnentsnennnmnanncnnensnenndi 7.2 Sia, SHIA PA ei 7.21 BAS RAR 7.2.2 iS BARA 7.2.3 REM ARA 7 3 1 2.4 ABARAT AR A Ae ABA 188 ET ARAARAAE 1.3.1 RAR ALD 6 7 tel 46k + 7.3.2 KREHRORAP A: RALAY RRR SMHR - SSR A Uh - 191 198, ors wes 199 5.2 BR Ae ew BEX + 7.6 eociee sium 7.6.1 RRA TE LISP 7.6.2 MAGA A - 7.9 SRR HH HR: RAS 241 8.1 ACHES 2ut B11 HARD - QU 812 PR A 2 8.2 PRIA 213 8.2.1 AURIS verre 214 8.2.2 Bibide Re 217 8.2.3 PEMA cree s+ 222 8.2.4 PAP etE Se Rha hy 224 8.2.5 RAL TALE 4E4B oe 225, 8.3 MMT vee 226 8.3.1 Meta ++ 227 8.3.2 te Mba e sees 227 8.4 EF RMMN A 8.4.1 428) IF-THEN Um) --- 230 8.4.2 RRA RT z 8.4.3 REAR A A SE wa J3 oh 5 Bars 8.5.1 8.5.2 PETREERARE - GRAS A FORK att RABE 8 Shik Mit BH RO ATA a fe ROC th A MRPRE . 6 RSET 8.61 mess 8.6.2 BR 8.6.3 a8 44 AdaBoost 247 8.6.4 RbLAAR - 249, 8.6.5 RERKEMMEH TR fe a 8.5.3 8.5.4 8.5.5 + 241 Bie Bike 8.5.6 ack - + 245 250 7 AN 8 a OL Bea AHL Ro Bi 9. 2 FUSS 9. 9, LL SRW RAB 2.2 RRMA 9.2.3 By edt sree 260 9.2.4 R&A Bette THE SHEL aml sean amet 3.2 cabs TNO te ‘EFL SSL 9.4.1 RRP - 9.4.2 REPL AAOR RR Ree x 9.5 HEHE (BUG) +++ 275 9.5.1 k- REAR RK = 9.5.2 RP RHR 9.6 Hibs + 277 9.6.1 ite Iba 277 9.6.2 ABRSRA 278 9.6.3 BARRA 2728 9.7 REPRE + 280 9.7.1 SRAK 9.7.2 FRESE 9.7.3 £HEA 9.7.4 BED - 9.8 AME 9.9 RB 9.10 SCRRTERE S10@ RAMA. ALAA 10.1 HEARST 10.1.1 HARRAMHT- 10.1.2 PRAM ORR 10.1.3 RAR A AR 10.2 WAST BE 10.2.1 k- 344i SEG GER ooeeeeeeenennsnenss 10.2.2 k-Psk: —PRT RAM RMR 10.3 BRR 10.3.1 RRHHPRAK R. 10.3.2 #2 10.3.3 + 298 300 eR LE BIRCH; 4&9 RAPER 89 SBPRRR ' BAAS RH SPRAKRR -- 303 RPE BRRA 10.3.4 Chameleon; 10.3.5 10.4 FEAT 10.4.1 DBSCAN: —#A TAH (RRR TEA 307 RH RABA 10.4.3 DENCLUE; AT SAR BAT BiB RK 10.5 BFR ATT ” 10.5.1 STING; #iHE& MA -- 312 10.5.2 CLIQUE, —## RAT Aprion 65-2 i) RK # 10.6 RASTA 10.6.1 eH RRA 10.6.2 i RAE 10.6.3 MERKRE 10.7 spi 10.8 Wii 10.9 SCRREERE ANE BARK 11 PARE RAS LL 11 HAR LL 2 RFR RRMHRK 1.3 MBRAAHIR 11.2.3 M1.2.4 Hiya 11.3 RAS PA AO iE 339 11.3.1 MS ate + 339 11.3.2 AMER E - 340 11.3.3 BRKA 343 1.4 BASRA 345 1.4.1 ROR 345 1.4.2 RAMRMRKAR 1S Ni eaeee 17 SCTE S128 Bee 12.1 REAR ROT 351 12.1.1 HER BAR 351

You might also like