Professional Documents
Culture Documents
1
Executive Summary 3
Gaps identified 3
Kashat / TensorGraph engagement options: 4
Consultancy agreement to support LFD close the identified gaps 4
Technical Partnership to transition the LFD assets to TensorGraph 5
2
Executive Summary
TensorGraph is a Machine Learning Solution provider hired by Kashat to audit the Machine Learning
technology and processes built and supported by LFD.
Gaps identified
After reviewing the LFD ML models (Algo or classifier) and processes, we identified 10 gaps which we
categorized by criticality
High:
1. ML Models optimization were focused on reducing the cost associated with default rate more than it
was focused on reducing the cost of lost opportunities. LFD and Kashat have not developed a
comprehensive KPI to track both costs.
2. Correlation between data quality coming from traffic generation (marketing campaigns) and classifier
performance is not analyzed/tracked.
3. No ML model versioning system is currently used.
4. No versioned gold test dataset is currently used.
5. No data partner or expert data labeller (professional underwriter) was engaged in building test
datasets, and monitoring ML model performance.
Medium:
6. No applicant profile similarity metrics were used to analyze LFD historical data
7. No systematic A/B testing were conducted for ML models benchmarking
8. No focused study for feature correlation analysis (affects classifier performance)
Low:
9. No academic partner
10. No focused study for feature reduction
We estimated the cost of using the current ML model to be around 22% of the amount disbursed.
Our calculation is based on:
● 10,000 loans/month
3
● Average ticket size 400EGP and interest rate of 28%
● Tra c with 43% bad applicants
The credit score of 55-60 chosen by LFD to achieve a default rate below than 25% for new
customers (red line) is at the expense of a high cost of lost opportunities (green line).
We believe that the ratio of the LFD model total cost to disturbsed amount (estimated to be 22%)
will increase as you drive more tra c to the kashat application which will result in higher
rejection rate and higher customer acquisition cost. The high rejection rate in June 2022 could
be attributed to the issue explained above.
4
● Technical Partnership to transition the LFD assets to TensorGraph
Timeline: 9 month for transition in three milestones
Estimated cost during transition: ~$30,000/month
Estimated cost post transition and up to 500,000 disbursed loans: ~$40,000 - 60,000/month
Estimated cost post transition and up to 1Million disbursed loans): ~$70,000 - 120,000/month
This is a high level estimate and if there is an interest to explore in further detail, TensorGraph will
prepare a detailed plan. TensorGraph has a similar type of engagement and it was very successful.
TensorGraph will help Kashat hire a small team (ML engineer, data scientist, and ML architect) and
the rest of the team will be under the TensorGraph workforce.
5
The LFD current ML approach
The LFD model is based on a logistic regression classifier with ~220 features for the new customers.
𝑋 : 𝐿𝑜𝑎𝑛 𝑟𝑒𝑞𝑢𝑒𝑠𝑡𝑒𝑟,
𝑌 : [𝑔𝑒𝑛𝑑𝑒𝑟, 𝑚𝑎𝑟𝑖𝑡𝑎𝑙_𝑠𝑡𝑎𝑡𝑢𝑠, 𝑎𝑐𝑐𝑜𝑢𝑛𝑡𝑠_𝑛𝑎𝑚𝑒, 𝑎𝑐𝑐𝑜𝑢𝑛𝑡𝑠_𝑡𝑦𝑝𝑒, 𝑐𝑎𝑙𝑙𝑠, ...., 𝑎𝑝𝑝_𝑖𝑛𝑠𝑡𝑎𝑙𝑙_𝑑𝑎𝑡𝑎_𝑚𝑖𝑠𝑠𝑖𝑛𝑔]
𝑍 : 𝑆𝑐𝑜𝑟𝑒 𝑓𝑟𝑜𝑚 0 𝑡𝑜 100 𝑟𝑒𝑝𝑟𝑒𝑠𝑒𝑛𝑡𝑠 𝑡ℎ𝑒 𝑙𝑖𝑘𝑒𝑙𝑖ℎ𝑜𝑜𝑑 𝑜𝑓 𝑑𝑒𝑓𝑎𝑢𝑙𝑡𝑖𝑛𝑔
Limitations : Summary
After analyzing artifacts provided by LFD we noticed 3 main limitations
6
Limitations : Details
7
Model performance provided by LFD
According to results shared by LFD, it is clear that the algorithm is still struggling with new customer
datasets and features which might suggest the need for a detailed
- Feature importance analysis
- Feature correlation analysis
- Feature reduction and correlation resolution
8
LFD Model cost
Our calculation is based on:
● 10,000 loans/month
● Average ticket size 400EGP and interest rate of 28%
● Tra c with 43% bad applicants
9
75-80 0 633,472 633,472.00
10
TensorGraph Technical proposal
Overview
Working towards the goal of minimizing the overall cost of AI classifiers while maximizing the revenue
generated using the system, we propose the following 3 pillars to track and maintain.
We believe that the key to a successful AI based commercial application/solution is not the outcome alone,
but the MLOps process and the ecosystem governing the typical procedure to produce the final outcome,
the ML model.
11
- Kashat Marketing team
- Kashat Underwriters
- Investors and board of directors
Given the above premises, we propose our MLOps iterative process of model production that will be used to
● Develop
● Analyze
● Interpret and explain
each model iteration, which could be summarized as follows.
12