
MMABA3 – BIG DATA

UNDERSTANDING AND ANALYZING THE BASIC CONCEPTS AND LATEST DEVELOPMENTS OF BIG DATA ANALYTICS

This material belongs to Universitas Prasetiya Mulya


Do not upload and share this material to public domain. For private use only!
AI, Machine Learning, and Big Data

• How are these three related?
• Can Big Data Analytics be done without ML and AI?

The Definition

• Artificial Intelligence: a broad term for when a machine can respond intelligently to its environment.
• Machine Learning: when a machine keeps improving its performance, even after you’ve stopped programming it.
• Deep Learning: ML using neural networks and vast amounts of data.

Data mining involves exploring and analyzing large blocks of information to glean meaningful patterns and trends.

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data, and applies that knowledge and those actionable insights across a broad range of application domains.

How AI Works

Artificial Intelligence

AI Tools 2023

10 Useful AI Tools You'll Actually Want to Use! 2023 - YouTube


OpenAI GPT-4
Training cost: more than 100 million dollars
Training data: 45 GB (up from 17 GB)
Parameters: 1 trillion (estimated, up from 175 billion)
Training methods: supervised learning on a large dataset and reinforcement learning
Input prompt: not only text; images can also be used

Midjourney V5

Midjourney is a generative artificial intelligence program and service created and hosted by the San Francisco-based independent research lab Midjourney, Inc. Midjourney generates images from natural language descriptions, called "prompts", similar to OpenAI's DALL-E and Stable Diffusion.[1][2]

What is Machine Learning

KEY POINTS OF MACHINE LEARNING

Example: machine learning for predicting the weather

TASK (T): weather prediction
EXPERIENCE (E): historical data on indicators such as wind speed, air humidity, temperature, cloud formation, and rainfall level at a particular location
PERFORMANCE (P): the percentage of weather conditions predicted correctly (accuracy)
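
As a minimal sketch of how the performance measure (P) in the weather example might be computed, the snippet below compares predicted conditions with observed ones and reports the share predicted correctly. The function name `accuracy` and the five-day data are illustrative assumptions, not course data.

```python
def accuracy(predicted, observed):
    """Return the fraction of predictions that match the observed labels."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("need at least one observation")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return correct / len(observed)


# Hypothetical weather conditions for five days (made-up values).
observed = ["rain", "sunny", "cloudy", "rain", "sunny"]
predicted = ["rain", "sunny", "rain", "rain", "cloudy"]

score = accuracy(predicted, observed)
print(f"Accuracy: {score:.0%}")
```

Here the task (T) is the prediction itself, the experience (E) would be the historical records used to produce `predicted`, and `score` plays the role of the performance (P).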
TRADITIONAL PROGRAMMING VS MACHINE LEARNING

Traditional programming: a person writes the rules in the form of application code.

Machine learning: the model (the computer) is trained using data.

Mehra, Sidharth & Hasanuzzaman, Mohammed. (2020). Detection of Offensive Language in Social Media Posts
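
The contrast between hand-written rules and a model trained on data can be sketched as follows. This is only an illustration under assumed values: the 80% humidity rule, the function names, and the small history of observations are all hypothetical.

```python
def rule_based_will_rain(humidity):
    """Traditional programming: a person hard-codes the rule."""
    return humidity > 80  # threshold picked by the programmer (assumed value)


def train_threshold(humidities, rained):
    """Machine learning: pick the threshold that best fits labelled examples."""
    best_threshold, best_correct = None, -1
    for candidate in sorted(set(humidities)):
        # Count how many examples the rule "humidity >= candidate" gets right.
        correct = sum((h >= candidate) == r for h, r in zip(humidities, rained))
        if correct > best_correct:
            best_threshold, best_correct = candidate, correct
    return best_threshold


# Hypothetical history: humidity (%) and whether it rained that day.
humidities = [55, 60, 65, 70, 75, 85, 90]
rained = [False, False, False, True, True, True, True]

learned = train_threshold(humidities, rained)


def learned_will_rain(humidity):
    """Predict with the threshold learned from the data."""
    return humidity >= learned


print("Learned threshold:", learned)
print("Rule says rain at 75%?", rule_based_will_rain(75))
print("Model says rain at 75%?", learned_will_rain(75))
```

In the first function the rule never changes unless someone edits the code; in the second, supplying different data yields a different threshold without rewriting the program.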

BUT WHY MACHINE LEARNING?
You wouldn’t want to be this guy: checking all the data by eye and by hand.

• No human experience yet
• Can’t explain the experience
• Many solutions (adaptation)
• Situation changes
• Large amount of data
• Humans are too expensive
MACHINE LEARNING TYPES

• Supervised: uses a labelled dataset (E) to predict a target variable (T)
• Unsupervised: uses an unlabelled dataset (E) to discover or learn patterns (T)
• Semi-supervised: uses both labelled and unlabelled data (E) to predict or to learn patterns (T)
• Reinforcement Learning: uses data produced by iterative simulation (E) to reach a goal (T) (increasing the reward / reducing the error)
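
The last type in the list, learning from repeated simulation in order to increase reward, can be sketched with a small epsilon-greedy "multi-armed bandit". This is one simple technique chosen for illustration, not necessarily the method the course has in mind; the reward probabilities, step count, and function name `run_bandit` are assumptions.

```python
import random


def run_bandit(reward_probs, steps=5000, epsilon=0.1, seed=0):
    """Estimate the value of each action by trial and error."""
    rng = random.Random(seed)
    counts = [0] * len(reward_probs)    # how often each action was tried
    values = [0.0] * len(reward_probs)  # running average reward per action
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random action.
            action = rng.randrange(len(reward_probs))
        else:
            # Exploit: pick the action that currently looks best.
            action = max(range(len(values)), key=values.__getitem__)
        # Simulated environment: reward 1 with the action's probability.
        reward = 1.0 if rng.random() < reward_probs[action] else 0.0
        counts[action] += 1
        # Incremental update of the average reward for this action.
        values[action] += (reward - values[action]) / counts[action]
    return values, counts


# Hypothetical environment with three actions of differing payoff.
values, counts = run_bandit([0.2, 0.5, 0.8])
best = max(range(len(values)), key=values.__getitem__)
print("Estimated values:", [round(v, 2) for v in values])
print("Best action found:", best)
```

No labelled dataset is supplied here: the experience (E) is generated by the agent's own simulated attempts, and the goal (T) is to collect as much reward as possible.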

Supervised Learning

STEP 1: Training
STEP 2: Predicting

Different Types Based on Target Variable
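
The two steps can be sketched in plain Python. The example assumes the usual split by target variable, a numeric target (regression) and a categorical target (classification); the slide's own figure is not reproduced here, and all function names and data are hypothetical.

```python
def train_regression(xs, ys):
    """STEP 1, numeric target: fit y = slope * x + intercept by least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def predict_regression(model, x):
    """STEP 2, numeric target: apply the fitted line to a new input."""
    slope, intercept = model
    return slope * x + intercept


def train_classifier(xs, labels):
    """STEP 1, categorical target: store the mean feature value per class."""
    sums, counts = {}, {}
    for x, label in zip(xs, labels):
        sums[label] = sums.get(label, 0.0) + x
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def predict_class(centroids, x):
    """STEP 2, categorical target: choose the class with the nearest mean."""
    return min(centroids, key=lambda label: abs(centroids[label] - x))


# Hypothetical labelled data: study hours -> exam score (numeric target).
reg_model = train_regression([1, 2, 3, 4], [52, 54, 56, 58])
predicted_score = predict_regression(reg_model, 5)

# Hypothetical labelled data: humidity (%) -> weather (categorical target).
clf_model = train_classifier([40, 45, 85, 90], ["dry", "dry", "rain", "rain"])
predicted_weather = predict_class(clf_model, 80)

print(predicted_score, predicted_weather)
```

In both cases training consumes inputs together with their known targets, and predicting uses only a new input.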

Simple Math Notation on Training Step

UNSUPERVISED LEARNING

The Advantages
Growing importance in a number of fields

• subgroups of breast cancer patients grouped by their gene expression measurements
• groups of shoppers characterised by their browsing and purchase histories
• movies grouped by the ratings assigned by movie viewers
• topic modelling of text documents (NLP)

Easier to obtain unlabeled data than labelled data
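
Grouping shoppers without any labels, as in the second example above, can be sketched with a one-dimensional k-means. K-means is one common clustering technique picked here for illustration; the purchase counts, starting centers, and function name `kmeans_1d` are assumptions.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group numbers around the given starting centers (plain k-means in 1-D)."""
    centers = list(centers)
    assignments = []
    for _ in range(iterations):
        # Assignment step: attach each value to its nearest center.
        assignments = [
            min(range(len(centers)), key=lambda k: abs(v - centers[k]))
            for v in values
        ]
        # Update step: move each center to the mean of its assigned values.
        for k in range(len(centers)):
            members = [v for v, a in zip(values, assignments) if a == k]
            if members:
                centers[k] = sum(members) / len(members)
    return centers, assignments


# Hypothetical monthly purchase counts per shopper; no labels are attached.
purchases = [1, 2, 3, 20, 22, 24]
centers, groups = kmeans_1d(purchases, centers=[0.0, 10.0])

print("Group centers:", centers)
print("Group of each shopper:", groups)
```

The algorithm is never told which shoppers are "occasional" or "frequent"; the two groups emerge from the data alone.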

Reinforcement Learning

Deep Learning

Google

Other Google AI-powered services

Any others?
How Tesla Autopilot Works

Can we apply Tesla Self Driving in Indonesia?

AI Startup in Indonesia

Share your insights with us
• In your current job or activities, how might Big Data, AI and ML bring about changes that have an impact?
• In your view, what are the challenges in applying these technologies?
• What can you do to overcome those challenges?
• Is applying these technologies worth it or not?

Mini Case Study 1 – Session 4

10-minute presentation and 5-minute Q&A


END OF SESSION
Lecturer: Sindhu Wardhana
Email: wardhana.sindhu@gmail.com
WA: +6281 399 29 499
