Content Mining
WEEK 9
What is data mining?
Process of automatically searching large volumes of data for patterns
Also known as knowledge discovery (how to derive knowledge from a very large amount of data)
Some people prefer to use the term “content mining”
Example: prediction
Framework:
CRISP-DM (Cross-Industry Standard Process for Data Mining): it
is a data mining process model that describes the approaches
that data mining experts commonly use to tackle
problems
For example, when a bank customer falls behind on payments,
the bank studies that customer: where their salary comes from,
whether they are female or male, and how much they borrowed.
The customer is then classified.
Classification: predicting a class. The choices are already defined (yes or no), so the task
is to predict whether a case falls into Yes or No.
Example of decision tree:
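A decision tree for the bank example could be sketched as follows. This is a minimal, hand-written illustration, not a tree learned from data: the field names, the threshold, and the customer records are all hypothetical.

```python
# Hypothetical loan records; every field name and value is made up for illustration.
def predict_late_payment(customer):
    """Walk a small hand-written decision tree and return "Yes" or "No"."""
    # Root node: how large is the loan relative to the monthly salary?
    ratio = customer["loan_amount"] / customer["monthly_salary"]
    if ratio <= 5:
        return "No"
    # Branch for larger loans: is the salary paid into an account at this bank?
    if customer["salary_paid_at_this_bank"]:
        return "No"
    return "Yes"


customers = [
    {"name": "A", "monthly_salary": 10_000_000, "loan_amount": 20_000_000,
     "salary_paid_at_this_bank": False},
    {"name": "B", "monthly_salary": 5_000_000, "loan_amount": 60_000_000,
     "salary_paid_at_this_bank": True},
    {"name": "C", "monthly_salary": 4_000_000, "loan_amount": 50_000_000,
     "salary_paid_at_this_bank": False},
]

# Classify each customer into one of the two predefined classes.
predictions = {c["name"]: predict_late_payment(c) for c in customers}
print(predictions)
```

In practice the splits would be learned from historical customer data by a tool such as Orange rather than written by hand.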
Text mining
Is the process of extracting interesting, non-trivial information and knowledge from
unstructured text
Also known as:
- Intelligent text analysis
- Text data mining
- Document mining
- Unstructured data management
- Knowledge discovery in text
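One very simple form of text mining is counting which terms occur most often in unstructured text. The sketch below assumes a tiny hand-picked stop-word list and two made-up example sentences; real tools apply far more thorough preprocessing.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list for this illustration.
STOP_WORDS = {"the", "a", "of", "and", "to", "in", "is", "for", "from"}


def term_frequencies(text):
    """Lowercase the text, split it into words, and count the non-stop-words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(t for t in tokens if t not in STOP_WORDS)


# Made-up example documents.
documents = [
    "Text mining is the extraction of knowledge from unstructured text.",
    "Data mining searches large volumes of data for patterns.",
]

# Accumulate term counts over the whole collection.
totals = Counter()
for doc in documents:
    totals.update(term_frequencies(doc))

print(totals.most_common(3))
```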
Web mining:
- Is the extraction of interesting, potentially useful patterns (extracting interesting
patterns, but from the internet)
- ….
- ….
Three knowledge discovery domains that pertain to web mining
Corpus: text data that has already been organized, so the results are much more valid when it is processed
Note: when working in Orange, use English text, because Orange's preprocessing is not yet
reliable for Indonesian.