Distributed Data Mining

Published by: Krishnamurthy Hegde on Jun 28, 2012
Copyright: Attribution Non-commercial

EMPOWERING SCIENTIFIC DISCOVERY BY DISTRIBUTED DATA MINING ON A GRID INFRASTRUCTURE

A PROPOSAL FOR DOCTORAL RESEARCH
by
Haimonti Dutta

SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY
AT
UNIVERSITY OF MARYLAND BALTIMORE COUNTY
1000 HILLTOP CIRCLE, BALTIMORE, MD, 21250
JULY 2006
 
Table of Contents

Table of Contents
Abstract

1 Introduction
  1.1 Motivation
  1.2 Proposed Research
  1.3 Objectives

2 Background
  2.1 The Grid
    2.1.1 Introduction
    2.1.2 The Grid Architecture
    2.1.3 Classification of Grids
  2.2 The Data Grid
    2.2.1 Introduction
    2.2.2 Data Distribution Scenarios
    2.2.3 Middleware, Protocols and Services
    2.2.4 Data Mining on the Grid
  2.3 Distributed Data Mining
    2.3.1 Introduction
    2.3.2 Classification
    2.3.3 Clustering
    2.3.4 Distributed Data Stream Mining
  2.4 The Challenges

3 Preliminary Work
  3.1 Introduction
  3.2 Orthogonal Decision Trees
    3.2.1 Decision Trees and the Fourier Representation
    3.2.2 Computing the Fourier Transform of a Decision Tree
    3.2.3 Construction of a Decision Tree from Fourier Spectrum
    3.2.4 Removing Redundancies from Ensembles
    3.2.5 Experimental Results
  3.3 DDM on Data Streams
    3.3.1 Introduction
    3.3.2 Experimental Results
    3.3.3 Monitoring in Resource Constrained Environments
    3.3.4 Grid Based Physiological Data Stream Monitoring - A Dream or Reality?
  3.4 DDM on Federated Databases
    3.4.1 The National Virtual Observatory
    3.4.2 Data Analysis Problem: Analyzing Distributed Virtual Catalogs
    3.4.3 The DEMAC system
    3.4.4 WS-DDM – DDM for Heterogeneously Distributed Sky-Surveys
    3.4.5 WS-CM – Cross-Matching for Heterogeneously Distributed Sky-Surveys
    3.4.6 DDM Algorithms: Definitions and Notation
    3.4.7 Virtual Catalog Principal Component Analysis
    3.4.8 Case Study: Finding Galactic Fundamental Planes
    3.4.9 Summary

4 Future Work
  4.1 The DEMAC system - further explorations
    4.1.1 Grid-enabling DEMAC
    4.1.2 PCA based Outlier Detection on DEMAC
  4.2 Proposed Plan of Research

Bibliography
