Group 5 Project

INTELLIGENT SYSTEMS
Fashion e-commerce
Abhimanyu Yadav (22M2303)
Abhishek Pratap Singh K (22M2304)
Deepak Padhi (22M2334)
Dhawal Swaroop (22M2336)
Madhusudan Sawant (22M2385)
Rakesh Kore (22M2375)
Rithvik Jawalkar (22M2343)
Satyam Kumar (22M2384)
Soumya (22M2411)
Sr. No. Topic Page No.
1. Using Clickstream data for targeted approach 1
when acquiring new customers
2. Fashioning a Personalized Data Model: The 7

Cornerstone of Recommendations
3. Enhanced Dynamic Pricing with RNN 23

implementation:
4. Sentiment Analysis on Product Reviews and 26

other feedback
5. Image Recognition and matching for item 27

selection and customer profiling
6. Personalizing websites with dynamic content 28

layouts - Personalized filtering
PART 1: TARGETING NEW CUSTOMERS
Using Clickstream data for targeted approach when acquiring new customers
By utilising clickstream data and adopting a targeted approach, e-commerce businesses can attract and retain
high-value customers, leading to higher revenue and profitability. Personalised advertising scenarios can be a
useful tool for boosting e-commerce sales and reducing advertising costs.
Data Requirements Customer Data:- Unique customer IDs or user identifiers to track
individual customer behaviour
Item Data:- Item descriptions, categories, prices, and popularity

rankings
Time-Stamped Data:- Duration of visits, time spent on specific

pages, and the frequency of visits over time
Historical Data:- To identify recurring patterns, preferences, and

changes in customer behaviour over time
Data Source Data generated by user interactions on the e-commerce website
CRM Systems:- Integrating customer data, purchase history and

interactions with business
Third-Party Data Providers:- Supplement internal data sources
Technique Generating Customer Value Metric

Methodology to get Customer’s Value Metric
The Specific Item ID link is connected to the second file Product DB (PDB),
which contains detailed information about the products, such as product name,
price, current discount, etc. By analyzing the product-related data, the proposed
strategy can recommend personalized advertising scenarios that are tailored to
each customer’s preferences and needs
The CM index is the total weighted sum of the visited links, where the weights
of possible links are denoted as w1, w2, w3, and w4. The CM index takes into
account all types of adjustments and parameter settings, such as defining the
minimum time spent on the webpage to ensure that the customer’s awareness
of the presented information is sufficient. Only the customers with a sufficient
value of the index CM can be assigned to the advertising campaign for showing
ads to them.
The session starts with the customer visiting the webpage showing a group of
items, then it follows by clicking on a specific product item and transferring it to
the basket. The customer then proceeds to preview another product before
making a purchase of a specific item. At each step of the browsing session, the
CM index is recalculated and checked if it has reached the limit value L.
Customers with a CM index above the limit L are assigned to the advertising
campaign, and their CM index is set to the initial value for starting new
calculation. If the CM index does not reach the limit value L, it is updated
dynamically based on the customer’s activities. The initial CM index can be set
to different values depending on the customer’s merits. The default initial value
is 0, but it can be set as a bigger value for the customers who have registered
with the e-shop, agreed to receive promotional information, or engaged in other
interactions that might be feasible to start dynamic calculation of CM from
higher initial CM index value
Identify & personalize ad to target lookalike customers
Data Model
Customer ID (unique identifier)

Demographics (age, gender, location,
etc.)
Consent flag (indicates user consent
for data collection)
Purchase history (products purchased,
timestamps, quantities, etc.)
● Source of Data: Collected using already existing customer data
● Demographics: Age, gender, location, income, etc.
● Purchase history: Products purchased, quantity, frequency, and total spend.
● Website behavior: Pages viewed, search terms used, time spent on specific
products, and clicks on promotional offers.
● Engagement data: Social media interactions, email open rates, and website sign-
ups
● Requirement: Ensuring data collected is consistent
● Address missing values
● Remove outliers & inconsistencies
Segment ID (unique identifier)

Description (e.g., "High-value
customers," "Frequent buyers of
specific category")
Techniques to be incorporated:
● Feature Engineering to create new features from available data-points ( average order value, product
category preference)
● Collaborative filtering for Modeling
Steps for implementing:

● After data preparation, User-Item Matrix is developed with cell values indicating the user's interaction
with the item
● Matrix Factorization: Decomposes the user-item matrix into lower-dimensional matrices representing
hidden preferences of users and items. This allows identifying users with similar latent factors,
indicating similar preferences which identifies Similar Users
● Similar high-value customers identification: Analyzes the predicted preferences of high-value
customers (e.g., frequent buyers, high spenders) and identifies other customers who share similar
predicted preferences
● The model learns from the prepared customer data, identifying patterns and relationships between
features and desired outcome (e.g., high-value customer or repeat buyer)
● Generate lookalike audience: Users with similar predicted preferences to high-value customers can
be considered part of the lookalike audience, potentially exhibiting similar purchasing behavior. Once
the model is trained and validated, it can identify individuals outside the customer base who share
similar characteristics to high-value customers
● Lookalike audience evaluation: Assess the generated audience based on its similarity to target
segment (e.g., high-value customers) by comparing their purchase history, demographics, or other
relevant attributes
Target advertisement to Lookalike customers:
● Export the audience: Export the generated lookalike audience list to the advertising platform
(e.g., Facebook Ads, Google Ads)
● Target ads: Utilize the lookalike audience for targeted advertising campaigns, showcasing
relevant products, special offers, or brand messaging tailored to their predicted interests and
purchase behavior
Benefit to Organization & Lookalike Customers:
PART 2: Prompting the First purchase & Increasing purchase frequency

In the digital realm of fashion e-commerce, catering to individual
preferences is key to fostering customer loyalty and driving sales. This
part delves into the world of collaborative filtering (CF), a robust
technique that leverages data to personalize the shopping experience
for each customer. We'll explore the data model, potential techniques,
implementation steps, anticipated outcomes, and the benefits for
both customers and organizations.
The Indian Fashion eCommerce market is predicted to reach

US$19,605.9 million by 2024 and accounts for 18.7% of the total eCommerce market in India. The expected
CAGR (2024-2028) will be 12.6%, resulting in a projected market volume of US$31,520.5 million by 2028.
Fashioning a Personalized Data Model: The Cornerstone of Recommendations
A comprehensive data model forms the bedrock of a successful CF system, capturing information from diverse
sources to paint a detailed picture of individual preferences and trends.
Customer Data:
● Demographics: Age, gender, location, and income.

● Preferences: Capture preferred brands, styles, colors, sizes, and price ranges.
● Purchase History: For valuable insights into buying patterns and interests.
● Wishlist and Saved Items: Showcase potential future purchases and areas of interest.
● Reviews and Ratings: Offer valuable feedback on products
● Implicit Data: Browsing behavior, search history, and click-through rates.
Product Data:
● Product Attributes: Category, brand, style, color, size, material, price, and seasonality.
● Product Descriptions and Reviews: Enrich data with details and customer sentiment.
● Product Images and Videos: Showcase product details and visual characteristics.
Data Sources:
● Customer sign-up information

● Website and app activity
● Purchase history
● Wishlist and saved items
● Reviews and ratings
● Search history
● Click-through rates (CTRs)
● Social media data (with consent)
● Fashion trend analysis platforms
By harnessing this diverse data set, the system creates a user-item matrix, where rows represent users and
columns represent items. Each cell reflects the user's interaction with the corresponding item, such as
purchase history, ratings, or browsing behavior.
Collaborative Filtering for Personalized Recommendations
Among the various data-driven and knowledge-driven techniques suitable for e-commerce personalization,
collaborative filtering (CF) is both, powerful and versatile approach.
CF leverages the fundamental premise that users with similar preferences tend to purchase similar items. It
operates in three core steps:
● Building the User-Item Matrix: As mentioned earlier, this matrix forms the foundation of CF, capturing
user interactions with various items.
● Identifying Similar Users: Employing similarity measures like cosine similarity, Pearson correlation
coefficient, or Jaccard similarity, the system identifies users who exhibit similar tastes and preferences
based on their interaction patterns within the user-item matrix.
● Recommendation Generation: The system analyzes the purchase history or interactions of similar
users and recommends items that these users have interacted with but which the target user has not
yet encountered.
Advantages of CF:
● Scalability: CF can handle large data sets effectively, making it suitable for e-commerce platforms with
a vast user base.
● Cold Start Problem Mitigation: Even for new users with limited purchase history, CF can leverage
data from similar users to generate relevant recommendations.
● Interpretability: Understanding the logic behind CF recommendations allows for easier troubleshooting
and improvement.
While CF offers a strong foundation, it can be further enhanced by incorporating other techniques like content-
based filtering, which recommends items based on the attributes of previously purchased items, or hybrid
approaches that combine CF with other techniques for a more comprehensive recommendation strategy.
Implementation
Building a successful CF system involves a well-defined process, encompassing several key stages:
● Data Preprocessing: Ensure data quality by cleaning missing values, handling inconsistencies, and
normalizing data if necessary.
● Model Selection: Choose an appropriate similarity measure and recommendation generation algorithm
based on factors like data characteristics, desired outcomes, and computational resources.
● Model Training: Train the chosen model using the prepared user-item interaction data. This training
process allows the model to learn the underlying relationships between users and items.
● Evaluation: Measure the model's performance using metrics like precision, recall, and NDCG
(Normalized Discounted Cumulative Gain). These metrics assess the accuracy and relevance of the
recommendations generated by the model.
● Continuous Refinement: Regularly update the model with new data and experiment with different
hyperparameters and algorithms to optimize its performance and tailor it to the specific needs of the
platform.
Anticipated Outcomes
A well-implemented collaborative filtering recommender system promises tangible benefits and positive
outcomes:
● Increased Sales Conversions: Relevant recommendations drive higher sales conversions by

presenting customers with products they're more likely to purchase.
● Enhanced Customer Engagement: Personalized experiences keep customers engaged and
encourage repeat visits by providing continuous value and relevant suggestions.
● Improved Product Discovery: Recommendations highlight items that align with customer preferences,
leading to the discovery of new and exciting products that could have been overlooked in general
browsing.
● Optimized Inventory Management: Analyzing buying patterns and successful recommendations can
guide inventory and purchasing decisions, minimizing overstocking and maximizing the visibility of
popular items.
Benefits to Customers: Curating a Personalized Fashion Experience
By adopting CF techniques, e-commerce platforms create highly personalized experiences for their customers.
This translates into numerous benefits:
● Effortless Product Discovery: CF uncovers items that align with their unique preferences, reducing
the time and effort spent browsing through vast collections.
● Elevated Fashion Sense: Exposure to a wider range of products and styles, handpicked based on
their preferences, expands their sartorial horizons and encourages experimentation.
● Streamlined Shopping Experience: Recommendations tailored to their interests and preferences
create a smoother and more enjoyable shopping experience.
● Enhanced Confidence in Purchases: Reviews and recommendations from users with similar tastes
reduce uncertainty and boost confidence in purchase decisions, particularly for new or unfamiliar items.
Benefits to the Organization: Building a Competitive Advantage
Accurate recommendations not only boost sales but also drive up the average order value as customers are
more likely to add multiple suggested items to their carts. Personalization strengthens customer relationships
by fostering loyalty, reducing churn, and stimulating positive word-of-mouth, contributing to long-term growth.
Moreover, analyzing successful recommendations provides valuable insights into customer preferences,
market trends, and growth opportunities. Leveraging collaborative filtering for recommendation systems
enables targeted marketing campaigns, optimizing marketing spend by efficiently reaching the most
relevant audiences with customized offers.
Image recognition system:
● Data Required:
Product Images: High-quality images of your fashion items from various angles (front, back, sides) with
consistent backgrounds and lighting.
Product Labels: Information associated with each image, such as product ID, name, brand, category (e.g.,
dress, shirt), material, color, size, etc.
User Interaction Data: If possible, track user interactions with product images (clicks, views, purchases) to
understand user preferences and refine recommendations.
● Data Sources
Internal Data: Your product images and associated information stored in your product management system.
External Data: Collaborate with fashion photographers for diverse, high-quality images or consider licensing
image datasets specific to fashion.
● Data Preprocessing:
Resizing and Normalization: Uniform image size across the dataset for consistent processing.
Data Cleaning: Remove blurry, low-resolution images, or images with incorrect labels.
Data Augmentation: Artificially create variations of existing images (e.g., rotations, flips) to increase training
data size and improve model robustness.
● Modelling:
1. Choosing a Model Architecture
Convolutional Neural Networks (CNNs): The standard choice for image-related tasks due to their ability to
learn hierarchical visual features. Popular CNN models for image recognition include:
● VGGNet
● ResNet
Pretrained Models and Transfer Learning: Start with an existing, pre-trained model (like those listed above)
trained on large image datasets (like ImageNet). Refine it on your specific fashion dataset. This often leads to
faster training and better results than training from scratch.
2. Feature Extraction
The CNN acts as a feature extractor: Input an image; the lower layers of the CNN learn basic features like
edges and color gradients. Higher layers learn more complex combinations of features, representing different
parts or styles of clothing.
Output: The output from the CNN is a set of features (sometimes called an "embedding") that represent the
image in a compact, numerical form.
3. Classification
Model Type: Use a classifier like a Support Vector Machine (SVM), Random Forest, or a simple fully
connected neural network layer.
Training: This classifier is trained on the extracted features and their corresponding labels (product categories,
attributes). The goal is to learn to map these features to the correct labels.
Prediction: When given a new image, the same CNN extracts its features, and the trained classifier predicts its
product category or attributes.
4. Similarity Search & Recommendation Engine
Feature Space: The features extracted by the CNN act as a "fashion search index" – items that look similiar
are placed close together in a multidimensional space.
Nearest Neighbors: When a user interacts with an image, find the closest images (nearest neighbors) in this
feature space. These closest images are the system's findings.
Example: Recommending shirt in our catalog similar to image
● User uploads an image of a white shirt.

● The CNN extracts features representing its color, shape, texture, etc.
● A nearest-neighbor search finds other dresses in the dataset with similar features.
● The system recommends these other dresses to the user.
Myntra is using the similar model in their home page
Benefits for -
Users: Time saving, discovering new styles.
Organizations: Boost sales, better customer engagement and improve customer satisfaction.
CNN-Based Recommendation System
Introduction
We proposed a personalized Fashion Recommender system that generates recommendations for the user
based on an input given. Unlike the conventional systems that rely on the user's previous purchases and
history, this project aims at using an image of a product given as input by the user to generate
recommendations since many-a-time people see something that they are interested in and tend to look for
products that are similar to that. We use neural networks to process the images from Fashion Product Images
Dataset and the Nearest neighbour backed recommender to generate the final recommendations.
Methodology
Data Preprocessing
The initial stage involves preparing the fashion images to be compatible with the ResNet50 model. This
includes resizing, normalization, and augmentation techniques to enhance the diversity and quality of
the dataset for better training outcomes.
Model Training
The core of the system relies on transfer learning from ResNet50, a proven model in image recognition
tasks. The project enhances the model by adding custom layers tailored to the specifics of fashion item
recognition, enabling fine-tuning to the project's unique requirements.
Inventory Database Creation

Creating an inventory database involves cataloging each item with metadata, including images,
category labels, and potentially other features like color, size, and brand. This database is indexed
efficiently to facilitate rapid retrieval of items during the recommendation process. Techniques such as
hashing or tree structures (e.g., KD-trees) can be used to optimize the search process.
Recommendation Generation
For generating recommendations, the system extracts features from the user-input fashion image using
the trained CNN. These features represent high-level attributes of the clothing item, such as texture,
shape, and color. The Nearest Neighbor algorithm then searches the inventory database for items with
similar features. This search can be accelerated using approximation methods or indexing structures to
ensure scalability. The most similar items are presented to the user as recommendations, providing a
personalized shopping experience.
As shown in the figure Initially, the neural networks are trained and then an inventory is selected for generating
recommendations and a database is created for the items in inventory. The nearest neighbour’s algorithm is
used to find the most relevant products based on the input image and recommendations are generated.
Once the data is pre-processed, the neural networks are trained, utilizing transfer learning from ResNet50.
More additional layers are added in the last layers that replace the architecture and weights from ResNet50 in
order to fine-tune the network model to serve the current issue. The figure shows the ResNet50 architecture.
Due to constraints in time and resources, the images from Kaggle Fashion Product Images Dataset are used
for the experiment. The inventory is then run through the neural networks to classify and generate embeddings
and the output is then used to generate recommendations. The Figure shows a sample set of inventory data
To generate recommendations, our proposed approach uses Sklearn Nearest neighbours. This allows us to
find the nearest neighbours for the given input image. The similarity measure used in this Project is the Cosine
Similarity measure. The top 5 recommendations are extracted from the database and their images are
displayed.
KBS for VENDOR/SUPPLIER MANAGEMENT
Introduction: We will be utilizing a rule-based expert system with the following functionalities
Data Validation and Standardization:

● Implement rules to validate and standardize product information entered by vendors.
● Enforce data standards, ensuring consistency in attributes such as size, color, and material.
Automated Categorization:
● Utilize rule-based systems to automatically categorize products based on predefined criteria.
● Create rules that assign products to appropriate categories and subcategories, improving the
organization of your catalog
Pricing Rules:
● Set up pricing rules to automatically calculate and adjust product prices based on predefined criteria.
● Incorporate rules for discounts, promotions, and bundling strategies.
Content Duplication Prevention:
● Establish rules to detect and prevent the listing of duplicate products on the platform.
● Ensure a clean and organized catalog without redundant entries.
Notification and Alerts:
● Set up rules to send automated notifications to vendors or administrators for various events, such as
low stock, high demand, or new product additions.
Data Requirements and Sources:
Data Validation and Standardization:
● Product ID
● Vendor ID
● Vendor historic transactions
● Product Name: Ensure consistency and adherence to naming conventions.
● Attributes (Size, Color, Material): Validate and standardize attribute values.
● Category: Ensure accurate categorization based on predefined categories and subcategories.
● Brand: Validate and standardize brand names.
Sources: Vendor Input Forms, API Integrations ( Integrate with vendor systems through APIs to validate
and standardize data during data exchange),External Data Providers
Automated Categorization:
● Category IDs: Assign products to appropriate categories and subcategories based on predefined rules.
● Attribute details (e.g., Style, Type): Use attributes to further refine categorization.
Sources: Fashion Taxonomy Databases( for categories and subcategories, as well as attributes),
ML model for categorization.
Pricing Rules:
● Price: Apply pricing rules based on factors such as cost, competitor prices, or market demand.
● Discounts, Promotions: Implement rules for applying discounts and promotions dynamically.
Sources: Competitor Pricing Data, Marketplace Data, Cost Data (from vendors)
Content Duplication Prevention:

● Product Name: Check for duplicate product names to prevent identical entries.
● SKU (Stock Keeping Unit): Use unique SKUs to identify and prevent duplicate listings.
● Attributes: Cross-check external identifiers to avoid duplication.
Sources: Internal Database, External Identifiers
Notification and Alerts:

● Stock Levels: Set up alerts for low stock levels or when products are out of stock.
● Price Changes: Notify relevant parties of significant price changes.
● Policy Violation incidents: Send alerts for non-compliance with platform policies.
Sources: Inventory Management System, Price Change Logs

Designing a KBS:
1. Knowledge Base:
a. Data Validation & Standardization:

● Validation Rules:
● Define rules for validating data upon entry.
● Establish checks for completeness, accuracy, and adherence to standards.
● Standardization Rules:
● Specify rules for standardizing data formats and units.
● Include guidelines for consistent naming conventions.
b. Automated Categorization:
● Category Rules:
● Develop rules to categorize products based on attributes.
● Utilize historical data and fashion taxonomies to inform categorization.
c. Pricing Rules:
● Dynamic Pricing Rules:
● Formulate rules for dynamic pricing based on market trends, competition, and cost.
● Incorporate strategies for discounts, promotions, and pricing adjustments.
d. Content Duplication Prevention:

● Duplicate Detection Rules:
● Define algorithms or rules to identify duplicate entries.
● Consider text matching, external identifiers, and attribute matching.
e. Notification and Alerts:

● Event Triggering Rules:
● Specify conditions for triggering notifications and alerts.
● Identify events such as low stock, price changes, and policy violations.
2. Inference Engine:
● Decision-Making Logic:
● Implement the logic that interprets rules and makes decisions.
● Use if-then statements or rule-based engines for processing.
3. User Interface (UI):

● Vendor Interface:
● Design interfaces for vendors to input data with built-in validation checks.
● Provide feedback on standardization requirements during data entry.
● Admin Interface:
● Create a dashboard for administrators to monitor and manage rules.
● Include tools for rule customization and adjustment.
4. Data Storage:
● Central Database:
● Store clean, standardized data centrally.
● Facilitate quick access for categorization, pricing, and duplication checks.
5. External Data Sources:

● API Integrations:
● Integrate with external data providers for validation, enrichment, and market insights.
● Ensure seamless data exchange with vendor systems.
6. Reporting and Analysis:

● Analytics Dashboard:
● Incorporate tools for analyzing system performance, identifying trends, and monitoring key
metrics.
● Generate reports on data quality, categorization accuracy, and pricing effectiveness.
7. Feedback Loop:
● User Feedback Mechanism:
● Establish a system for vendors and administrators to provide feedback on the effectiveness of
rules.
● Use feedback to refine and improve the knowledge base continuously.
Benefits:
1. Improved Data Quality:

○ Data Validation & Standardization: Enhance the quality and accuracy of product information by
enforcing standardized formats and validating data integrity. This results in a cleaner and more
reliable dataset.
2. Efficient Operations:
○ Automated Categorization: Streamline the categorization process by automating the assignment
of products to appropriate categories. This reduces manual effort, speeds up product listing, and
ensures consistency in categorization.
3. Dynamic and Optimal Pricing:
○ Pricing Rules: Implement dynamic pricing strategies based on market trends, competitor pricing,
and internal factors. This enables your platform to respond quickly to changes in the market,
optimize pricing, and stay competitive.
4. Enhanced User Experience:
○ Content Duplication Prevention: Avoid confusion and provide a better user experience by
preventing the listing of duplicate products. This ensures a clean and organized catalog, making
it easier for customers to find what they're looking for.
5. Proactive Notification and Alert System:
○ Notification and Alerts: Increase operational efficiency by proactively addressing issues such as
low stock, significant price changes, or policy violations. This ensures timely responses and
helps in maintaining a smooth and compliant marketplace.
PART 3: Maintaining the Customer Experience to increase loyalty
Enhanced Dynamic Pricing with AI/ML implementation:
Customer Profile: Customers who follow fashion influencers and enjoy discovering new trends. She/he values
quality and is willing to pay a premium for unique pieces but is budget-conscious and seeks good deals.
Data Requirement and Data Sources:

Source ● CRM System: Purchase history (items, prices, dates,
discounts), abandoned carts, customer support
interactions.
● Website/App Analytics: Browsing behaviour (viewed
products, time spent, clickstream data), product interactions
(adding to wishlist, sharing on social media).
● Customer Profile Data: Demographics (age, location,
income), interests (preferred brands, styles)
● Market Research: Popular styles, seasonal trends,
competitor pricing for similar items.
● Social Media: Influencer engagement data (likes,
comments, shares) related to specific brands, styles, and
products.
● Public Data: Weather data (influencing clothing choices),
local economic indicators (consumer spending trends).
Data Types:
● Categorical: Product categories (dresses, tops, etc.),
brands, colours, sizes, discount codes, influencer names,
weather conditions.
● Numerical: Prices, purchase frequency, time spent
browsing, number of clicks, social media engagement
metrics, income level.
● Sequential: Product browsing history (sequence of viewed
items), purchase history (chronological order of purchases).
Techniques Feature Engineering:
● Combine features: "Product category + season" to

understand seasonal purchasing patterns.
● Create new features: Time difference between last
purchase and current visit, average time spent on product
pages from similar categories.
● Text analysis: Extract sentiment from customer reviews and
social media mentions to understand brand and product
perception.
AI/ML Techniques:
Model Selection:
● Recurrent Neural Networks (RNNs): Capture sequential

data patterns in browsing and purchase history.
● Reinforcement Learning: The model learns optimal pricing

strategies through trial and error simulations.
Model Training:
● Split data into training, validation, and testing sets.

● Train the chosen models on the training set, optimising
hyperparameters through validation to improve model
performance and generalisation.
● Evaluate model performance on the testing set using
metrics like Mean Squared Error (MSE) and R-squared for
regression models, and Precision/Recall for
recommendation models.
The implementation can be done in following steps:
● Data Preprocessing: Clean, normalise, and engineer features from various data sources.
● Model Training and Evaluation: Train and evaluate different AI/ML models based on chosen
techniques and data types.
● API Development: Develop an API to connect the chosen model to the e-commerce platform for real-
time price adjustments.
● Integration and Deployment: Integrate the API with the platform and deploy the dynamic pricing
system for individual customers.
● Monitoring and Refinement: Continuously monitor model performance, customer behavior, and
market trends. Retrain and refine the model based on new data and insights.
The expected outcomes for this implementation are as follows:
● Increased conversion rate: Personalized pricing nudges consumers towards purchasing items she's
interested in at prices she finds acceptable.
● Improved customer satisfaction: consumer feels valued and appreciates the tailored shopping
experience that caters to her preferences and budget.
● Optimized pricing strategy: The company maximizes revenue by offering consumers the right price
without compromising margins.
● Enhanced customer segmentation: AI/ML insights help the company create targeted marketing
campaigns and promotions for different customer segments.
● Data-driven decision making: The company can make informed decisions about product pricing,
inventory management, and marketing strategies based on real-time customer data and AI/ML
analysis.
The following of AI ML implementation benefits through dynamic pricing;
For Consumers:
● Personalized shopping experience: consumer finds products she loves at reasonable prices, leading
to a more enjoyable shopping experience.
● Transparency and choice: The company can explain the factors influencing dynamic pricing, allowing
consumers to make informed purchase decisions.
For the Company:

● Increased revenue and profitability: Dynamic pricing helps capture more sales opportunities and
optimize pricing for individual customers.
● Improved customer lifetime value: Satisfied customers are more likely to return and spend more in
the long run.
● Enhanced brand image: By providing a personalised and customer-centric experience, the company
can build a positive brand image and reputation.
Sentiment Analysis on Product Reviews and other feedback
Data Requirements Text data reviewing products and product specific service
Data Source Customer Reviews, Order feedback inputs, return reasoning and
other text based inputs from customers from CRMs
Techniques NLP for sentiment analysis with classification, then clustering of

products based on sellers, brands and sentiments
For initial sentiment, we’ll be going through the text data, determining whether a review, comment or feedback
related to a particular product ID is negative or positive. This can easily be done by going through product and
user ID attribution to a review, preprocessing, tokenization and n-grams based sentiment scoring, using nltk
libraries.
https://colab.research.google.com/drive/1XMGLlaHhnVtYIN9sZ2TgrJWQza4hvhQK?usp=sharing
But in the later stages of implementation, we’ll be shifting to a more

attribute focused sentiment model instead of a single, combined
sentiment. This would involve attribution of given sentiment into
different features mentioned in a review like:
● Size
● Quality, Durability, Comfort
● Delivery
● Fashion trend / looks
● Packaging & other seller attributes
For this, simple NLP libraries won’t be sufficient and thus would
involve integration with existing GPT implementations (like
FashionGPT), with in-built Fashion quotient into their systems. These GPT systems will be able to attribute
negative sentiments to each of the product factors effectively due to their inbuilt LLMs that have been acclimated
to fashion queries and issues.
When clustered on Multidimensional planes (product, brand, seller) using simple clustering methodologies like
K-means clustering, we can identify which sellers and brands are plagued by what kind of complaints in the
negative sentiment. This can help us take action against sellers with repeatedly bad performance in areas of
accountability and also share the same info with the brands with feedback/warnings regarding the same.
Outcome: Through this, the feedback received by customers will be acted upon, if based on high volumes of
negative sentiment surrounding a specific product, focusing on specific attributes for improvement, thus
appealing to existing customers (their voices being heard).
Image Recognition and matching for item selection and customer profiling
Data Requirements Image datasets for clothing, i/p images from customer interest,
description text from all product and image postings
Data Source Internal display images, MNIST fashion for training, social media
fashion image interests, clickstream data from images, banners,
Techniques NLP + Image based classification like CLIP, Clustering techniques

on images of interest to identify different aspects of interest
We have a 2 step approach to the implementation of this use case:
1. Creation of an image model using CNNs and pretrained fashion models:
This involves either training a model from scratch on fashion image datasets or utilizing existing models like R-
CNN Fashion-MNIST or CLIP and adding additional layers on top using images from our portfolio of fashion
products. This will build a functional model which will be able to:
a. Separate all the fashion objects from a given image into separate
entities
b. Identify each fashion entity by mapping it into a given object in
catalog (in case of lower matching scores, mapping it to a generic
fashion name instead)
Now we can take in user input as images, allowing them to quickly

select clothing from our website as soon as they see it in real life as
well, using a click of an image.
2. Deriving insights and customer profiling from images of interest:
Using NLP tokenizers with associated image to create

different fashion interest keywords for images (genz,
classic, chic). CLIP models do so from the very onset of
training itself, so it makes it easier to integrate images with
text descriptions.
Using social media integration (fashion related posts with
customer interest), banner data, and purchased item
keywords to profile the customer’s fashion sense in
different categories and use it for further personalization
within other AI implementations
Outcomes: Search improvement within own collection of products, use of image clicks to develop interests
and give user exactly what they need
Personalizing websites with dynamic content layouts - Personalized filtering
Data Requirements Product metadata that the user interacts with, user-generated search data
Data Source Analytics engine, clickstream data, purchase data, filter data, CRM, etc
Techniques AI based personalization platforms, affinity based personalization
Features of dynamic content layouts:
● Dynamic Content Blocks: Creating dynamic sections on websites that adapt based on user behavior.
For example - displaying a “Recommended for You” section with personalized product
recommendations.
● Personalized Product Listings:Arranging product listings based on user preferences.For example -
showing products in the user’s preferred category or style at the top.
● Tailored Navigation Menus: Customized navigation menus based on user interests. For example -
highlighting relevant categories or collections.
● Individualized Product Pages: Customized product pages with content relevant to the user. For
example - show reviews, related products, and styling tips.
The steps involved in affinity profiling are:
1. Gathering data: Metadata connected to a product that they have recently interacted with is collected.
Consider the scenario where a user generates the the following:

● 4 pageviews in which the product color was green
● 6 for red
● 2 for blue
● Gender - men
● Price range < Rs 500
The visitor then places 2 items in their cart, a red and a green t-shirt, but ultimately purchases the red
one. The following data is gathered:
2. Engagement score calculation: The score of each attribute value is updated by the type of interaction
with it as well as when it occurred. A correlation between the user interaction and an attribute value
must first be determined.
In our example, let's say the default weights for interactions are as follows:
● Purchase: 4X
● Add to cart: 3X
● Add to wishlist: 2X
● Product view: 1X
Engagement score = interaction type weight x attribute value count
3. Recency score calculation: Since user behavior and preferences change over time, importance must be
given to how recently the interaction has taken place.
Let's say the default weights for recency are as follows:

● Real-time or current browsing session: 8X
● Recent history or in the last month: 2X
● All-time or the last six months 1X
Recency score = recency weight (interaction weight x attribute value count)
Now the affinity profile reflects two strong scores (engagement and recency), enabling us to showcase
products the user is more likely to engage with at any given moment in time.
Outcomes: In this manner, dynamically re-ordering content and product categories according to user
preferences will allow visitors to quickly find what they’ve come for.
● Instead of defaulting search results to a ‘featured products’ page, retailers can utilize user-generated
search data to personalize results automatically, arranging product grids according to each user’s
buying preferences.
● If a user constantly filters according to rating, for instance, they can automatically be presented with a
default sorting order that features the highest-rated products first.
● If user tends to purchase for himself in fixed size over last few purchases, set auto filters for his gender,
his size on products being searched, to enable quick selection and purchase for a better optimized
journey
Employing a sophisticated personalization platform with predictive analytics that automatically tailors and
transforms the layout according to the preferences of individual user segments.

Group 5 Project

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Group 5 Project

Uploaded by

Copyright:

Available Formats

INTELLIGENT SYSTEMS

Abhimanyu Yadav (22M2303)

Abhishek Pratap Singh K (22M2304)

Deepak Padhi (22M2334)

Dhawal Swaroop (22M2336)

Madhusudan Sawant (22M2385)

Rakesh Kore (22M2375)

Rithvik Jawalkar (22M2343)

Satyam Kumar (22M2384)

2. Fashioning a Personalized Data Model: The 7

3. Enhanced Dynamic Pricing with RNN 23

4. Sentiment Analysis on Product Reviews and 26

5. Image Recognition and matching for item 27

6. Personalizing websites with dynamic content 28

Item Data:- Item descriptions, categories, prices, and popularity

Time-Stamped Data:- Duration of visits, time spent on specific

Historical Data:- To identify recurring patterns, preferences, and

Data Source Data generated by user interactions on the e-commerce website

CRM Systems:- Integrating customer data, purchase history and

Third-Party Data Providers:- Supplement internal data sources

Technique Generating Customer Value Metric

Customer ID (unique identifier)

Segment ID (unique identifier)

Steps for implementing:

PART 2: Prompting the First purchase & Increasing purchase frequency

The Indian Fashion eCommerce market is predicted to reach

Fashioning a Personalized Data Model: The Cornerstone of Recommendations

● Demographics: Age, gender, location, and income.

● Customer sign-up information

Collaborative Filtering for Personalized Recommendations

● Increased Sales Conversions: Relevant recommendations drive higher sales conversions by

Benefits to the Organization: Building a Competitive Advantage

1. Choosing a Model Architecture

● User uploads an image of a white shirt.

Inventory Database Creation

Data Validation and Standardization:

Data Requirements and Sources:

Data Validation and Standardization:

Content Duplication Prevention:

Sources: Internal Database, External Identifiers

Notification and Alerts:

Sources: Inventory Management System, Price Change Logs

a. Data Validation & Standardization:

d. Content Duplication Prevention:

e. Notification and Alerts:

3. User Interface (UI):

5. External Data Sources:

6. Reporting and Analysis:

1. Improved Data Quality:

Enhanced Dynamic Pricing with AI/ML implementation:

Data Requirement and Data Sources:

Techniques Feature Engineering:

● Combine features: "Product category + season" to

● Recurrent Neural Networks (RNNs): Capture sequential

● Reinforcement Learning: The model learns optimal pricing

● Split data into training, validation, and testing sets.

The implementation can be done in following steps:

The following of AI ML implementation benefits through dynamic pricing;

For the Company:

Techniques NLP for sentiment analysis with classification, then clustering of

But in the later stages of implementation, we’ll be shifting to a more

Techniques NLP + Image based classification like CLIP, Clustering techniques

1. Creation of an image model using CNNs and pretrained fashion models: