Professional Documents
Culture Documents
CAPSTONE PROJECT
Learning Outcomes
billion by 2025.
class XI to
the meaning of the capstone project, apply
concepts learned in
In this unit, you will learn about for your AI model and also learn
will understand which algorithm to apply
real-world problems. Next, you
solve will learn about the concept of
At the end of the chapter, you
about different types of testing methodologies.
Loss functions.
doable in a
particular timeframe
contemporary
and helps in your career advancement
Experiential Learning
Video Session
Scan the QR code or visit the following link to watch the video: What is a Capstone Project
www
outube conm vatchiv yBs2Vb'5HIS4
After watching the video, answer the following questio
What is a Capstone
Project? What is its purpose?
Some of the examples of capstone projects in Al, from which you can pick up to develop, are as follows:
Image Classifier
Analysing social media to assess emotions
R Using regression to predict a trend
Understanding
Al modelling
the nature of
H EvaluationH Deployment
t is essential to determine which of these questions you're asking, and in what way answering it helps solve your
problem.
Brainy Fact
There are patterns in nature even! The Golden Ratio' or 'Divine Proportion' i.e., 1.618 is found in sunflowers,
daisies, chrysanthemums, etc. Leonardo Da Vinci used this ratio in his paintings like The Vitruvian Man, Mona
Lisa. Even the Taj Mahal has been built using the Golden Ratio. The ratio of any two terms of the Fibonacci
series like 233/144 is 1.618. Fascinating isn't it!
words:
1 Understand the problem and express the problem in your own
Understand the required inputs and outputs
Ask questions for clarity (in class, these questions may be directed to the teacher, however, you can alca
4. Code the smaller parts one by one. Use the following methodology:
Analyse how to implement the code.
Write the code/query.
Test each code individually.
Fix the problem(s), if any.
Imagine you want to create your first website. How would you decompose this task? Think about the following while
decomposing:
What colour combination can be used?
How many web pages are to be included on the website?
Experiential Learning
Video Session
Scan the QR code orvisit the followinglink to watch the video: Introduction to Decomposition
https://www.youtube.com/watch?v=rxsYpP2-omg
orANO
After watching the video, answer the following questions:
What is decomposition?
Intelligence-XI
1 1 4 | Touchpad Artificial
Usingan Analytical Approach
In data science, it is common to solve problems and answer questions using data analysis. Typically, data scientists
As data analytics becomes more accessible and prevalent, data scientists need a core methodology that can
provide a guiding strategy, regardless of the technology, the volume of relevant data, or the approach. This
methodology emphasizes many of the new approaches in data science. It consists of 10 steps that form an
iterative process using data to discover information. Each step plays an important role in the context of the
overall methodology.
Business Analytic
understanding approach
Feedback Data
requirements
Data
Deployment
collection
Data
Evaluation
understanding
Data
Modelling
preparation
to modity data requirements accordingly and collect new and/or more data. loday's high-performance database
analytics enable data scientists to utilize large datasets that contain large or even all of the available data. Due to
this, predictive models are able to better predict rare events such as disease or system failure.
Stage 7: Modelling
The modelling stage, which with the initial version of the prepared data set, focuses on
begins constructing
predictive or descriptive models based on the previously stated analytic approach. To develop a prediction model,
data scientists employ a training set (historical data in which the desired
outcome is already known).
As businesses receive intermediate insights, the modelling process is often very iterative, leading to refinements in
data preparation
and model formulation. Data scientists
may attempt numerous algorithms with their respective
parameters for a specific technique to get the best model for the available variables.
Stage 8: Evaluation
The data scientist
reviews the model during development and before
deployment to determine its quality and ensure that it
correctly and completely answers the business problem.
can interpret the model's quality and efficacy in solving the problem by
producing numerous diagnostic metrics
and other outputs such as tables and graphs.
can utilizetesting set for predictive models (which is separate from the
a
training set but follows the same
orobability distribution and has a known outcome.) The testing set is used to assess the
model and adjust it as
necessary.
Ai Reboot ****' ******** '** *** ****** ******* *** ******************************** *********
********'***************
Brainy Fact
Envision, the award-winning iOS and Android smartphone app that allows blind and visually impaired people
to independently access visual information around them, announced plans to integrate its Al-powered
software technology into Google Glass in March 2020. The combination of Envision's software and Google
Glass gives blind and visually impaired users with a substantially less invasive and hands-free manner of
accessing the world around them, giving them greater freedom and independence to access and 'see' the
world around them.
Video Session
a n the QR code or vrsit the following link to watch the video: How Envision Works
Model Validation
There are mainly two types of validation methods which are Train Test Split Evaluation and Cross Validation. Let us
leam about them in detail.
Experiential Learning
Video Session
Scan the QR code or visit the following link to watch the video: Training and Testing
https://www.youtubecom/watch?v=P2NqrFp8usY
After watching the video, the 0MANGE
answer
following question:
What is the role of data in training a model?
OR
_rain, X_test, y_train, y_test = t r a i n _ t e s t _ s p l i t (X, y, train_size=0.67)
Example:
#split a dataset into train and test sets
from sklearn.datasets import make_blobs
from sklearn.model_selection import train_test_split
#create dataset
, Y = make_blobs (n_samples=1000)
Output:
(500, 2) (500, 2) (500,) (500,)
Out of 1000 samples 50% (500) is for training set and 50% (500) is for test set.
Cross-Validation Procedure
learning machine sample models on a small of data. The
Cross-validation is resampling technique for evaluating
a
the number of into which a given data sample should
process includes only one parameter, k, which specifies groups
be divided. As a result, the process is frequently referred to as k-fold cross-validation. For example, k=10 for 10-fold
cross-validation. It's a popular strategy since it's straightforward to grasp and produces a less biased or optimistic
estimate of model competence than other approaches, such as a simple train/test split.
Iteration1
Iteration 2 0 0OC
Iteration 3 O0TOO0g
Tteration kk- O 00 0©OOO0
All data
ExperientialLearning
Video Session
Scan the QR code orvisit the following link to watch the video: K-Fold Cross Validation Intro to
Machine Learning
https.//wwwyoutube.com/watch?v=TIgfjmp-4BA
ORANGE
After watching the video, answer the following question:
What is the role of cross validation in training and testing a model?
For the same reasons, for larger datasets, a simple train-test split is sufficient. It will run faster, and you may have
enough data that reusing a portion of it for a train test split is unnecessary.
Cross-validation is mostly the method ofchoice since it allows your model to be trained
multiple train-test splits.
This gives a good idea of how well your model will perform on data not seen before. On the other hand, Train Test
Classification Regression
Loss
Log Loss Mean Square Error/Quadratic
Quantile Loss
Hinge Loss
MSE
MSE will never be negative because the errors are always squared.
Disadvantage
incorrect forecast, the squaring part of the function multiplies the error.
If our model makes a single particularly
outliers and instead seek a more fully-rounded model
cases, we don't worry about these
However, in many real-life
that performs well enough on the majority of cases
Video Session
Scan the QR code or visit the following link to watch the video: Mean Squared Error MSE
https.//wwwyoutube.com/watch?v=Mhw_-xHVmaE
After watching the video, answer the following question: MAN OE
Calculating MSE in Python, the mean_squared_error function gives the Mean squared error regression loss.
from sklearn.metrics import mean_squared_error
Y_true 13, -0.5, 2, 7.2] #list of actual values
Y_pred- [2.5, 0.0, 2.3, 8] #list of predicted values
print"MSE value=", mean_squared_error (y_true, Y_pred)) #returns MSE value
100-
80
.I..- Residual
60 Error
40
20
1 2 3 4 5 7 8
No. of Hours Studied 10
(X)
RMSE
Predicted Actual
N
The errors are squared betore being averaged in RMSE. This basically means that RMSE
gives larger mistakes a
This
higher weight. sug9gests that RMSE is far more beneficial when substantial erors exist and have a significant
impact on the model's performance. This characteristic is important in many mathematical calculations since it
avoids taking the absolute value of the error. The RMSE of a good model should be less than 180. The lower the
RMSE value, the higher the model's performance.
The errors are squared before being
averaged in RMSE. This basically means that RMSE gives larger mistakes a3
higher weight. This suggests that RMSE is far more beneficial when substantial errors
exist and have a
impact on the model's performance.
significant
This characteristic is
important in many mathematical calculations since it avoids taking the absolute value of the
error. The RMSE of a good model should be less than 180. The lower the RMSE value, the higher the model's
performance.
Experiential Learning
Video Session
Scan the QR code or visit the following link to watch the video: U01V05 Calculating RMSE in Excel
https://www.youtube.com/watch?v=G8j8KAJJlw
After watching the video, answer the following question:
What do you mean by RMSE?
mean_diff_sq = d i f f _ s q . mean ()
return rmse_val
Print ("actual values are: " + str (["%.4f" & i for i in y_true]))
Hyperparameters
parameters whose values govern the learning process. They also determine the values of
Hyperparameters are
model parameters learned by a learning algorithm. They are 'top-level parameters that regulate the learning
process and the model parameters that come from it, as the prefix hyper suggests. Since the model cannot
modify its values during learning/training, hyperparameters are said to be external to the model. Some examples
of hyperparameters are
The ratio of train-test split
Optimization algorithms' learning rate (e.g. gradient descent)
In a neural network, the activation function selected (e.g. Sigmoid, ReLU, Tanh)
The loss function that the model will
employ
A neural network's number of hidden layers
At a Glance
A
capstone project is a comprehensive, independent, and final project undertaken as
designed to assess the skills, knowledge, and expertise a student has acquired. part of a curriculum
A successful
problem-defining process requires a basic analysis and evaluation of the project-related
problems, their reasons, and methods.
Design Thinking methodology provides a solution-based approach
solving problems.
to
During coding, we follow problem decomposition
problems as well. methodology which can be applied to real-life
Once the business
problem is clearly stated, the data scientist can define
solving the problem. an
analytical approach to
The
analytical approach chosen characterizes the requirements for the data.
During the initial data collection phase, data scientists
unstructured, and semi-structured) relevant to the identify available data sources (structured,
The modelling stage, which problem area.
begins with the initial version of the prepared data set, focuses
predictive or descriptive models based on the previously stated on
constructing
The data scientist reviews the analytic approach.
model
quality and ensure that it correctly and during development and before
deployment to determine its
The train test procedure measures
completely answers the business
problem.
the performance of machine
make predictions on data that was
not used to train the model.
learning algorithms when they need to
The training dataset is used to
fine-tune the machine learning model and train the
.Test dataset algorithms make algorithm.
predictions using the input elements from the
Cross-validationis a
resampling technique for
training data.
of data. evaluating machine learning models on a small sample
Loss functions can be divided into two categories: regression losses and classification losses.
Ai Quiz
A.Tick () the correct option.
Which of the following is not the part of Design Thinking, a 5 Stage Process?
a. Empathize b. Sympathize.
d. Define
c. Prototype
is a project where students must research a topic independently to get a deep understanding
A
of the subject matter.
a. It is sensitive to outliers
In problem decomposition:
and then restate the problem in your own words
Understand the problem
ii. Gather all simple facts to create a complicated piece
C.
Mean Square Error