You are on page 1of 13

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/359511246

DIGITAL BUSINESS TECHNOLOGY AND MANAGEMENT COURSEWORK 2-


"Data Management and Operations Improvement for a Children Book Club
with Digital Technologies"

Article · January 2022

CITATIONS READS

0 1,844

1 author:

Shivani Mishra
Xalient
35 PUBLICATIONS 1 CITATION

SEE PROFILE

All content following this page was uploaded by Shivani Mishra on 28 March 2022.

The user has requested enhancement of the downloaded file.


REPORT ON

“Data Management and Operations Improvement for a


Children Book Club with Digital Technologies”

Submitted to

University of Liverpool

In Partial Fulfillment of the Requirement for the module of

DIGITAL BUSINESS TECHNOLOGY AND


MANAGEMENT (EBUS603)
BY
SHIVANI MISHRA (201602773)

MANAGEMENT SCHOOL
UNIVERSITY OF LIVERPOOL

1|Page
CONTENTS
Sl.no. Topic Pg.no.

1 Introduction 3

2 Database Design 5

3 Improvements 7

Data and Data sources


Relational database purpose
NoSQL database tools
4 References 11

LIST OF FIGURES
Fig.no. Topic Pg.no.

1 Reference of the business case: http://bookabees.com/ 3

2 ER diagram for Bookabees 5

3 High-level Web Usage Mining Process 7


4 Major application areas of web usage mining 7
5 Big data analytics with Hadoop structure 9

2|Page
1. INTRODUCTION

Fig 1: Reference of the business case: http://bookabees.com/

Bookabees provides customized book services to subscribed children through its digital service
portal and free delivery via public post service. The customers pay a monthly membership fee
via direct debit for a particular Subscription Type. Customers explore the collection online and
build a "Book Box" from the Recommended Book List based on their Age Groups and personal
preferences created by industry experts. The company keeps track of statistical data from book
services.

Now, several issues could be addressed in this business case like:

1. Need for real-time recommendation system: The recommendation system only uses static
settings like tracking each user's search and updating the user's interests. The book club can
utilize a recommendation method like collaborative filtering, which combines item- and user-
based suggestions for titles with similar attributes and products popular among like-minded
people. [Shaikh, Rathi and Janrao, 2017]

3|Page
2. Access to e-books preview before buying: According to research, customers regularly use
physical elements in stores as external memory cues, creating new desires or stimulating
forgotten needs, validating the business adage that "unseen is unsold." [Hui et al., 2013]

3. Reminder to customers for wish listed books, out-of-stock books and new books in store:
Excessive online shopping cart abandonment rates might impede e-commerce enterprises'
growth in a competitive industry. As a result, to automatically detect online shopping cart
abandoners, data-driven solutions are necessary to extract relevant information from enormous
volumes of data. [Rausch, Derra and Wolf, 2020]

4|Page
2. DATABASE DESIGN
Database design is a step in the database development process that entails analysing a problem
description (specifications and needs) and presenting all relevant results for constructing a
logical data structure. The purpose, demands, requirements, and limitations for data required
to support particular organizational functions are more or less specified in the issue
specification. [J Letkowski, 2015]

Fig 2: ER diagram for Bookabees

This diagram shows the relationship between different tables of the specified business case.

Details for each of the tables are mentioned below:

1. Customer Table: This table contains all the details of the customer and is related to all
other tables either in 1:1 or N:N relationship.
2. Address Table: This table contains the address of every customer for book box delivery.
It is joined with the customer table and the cardinality relationship 1:1, i.e., each
customer will have only one address.

5|Page
3. Subscription Table: This table contains subscription type (1,3 or 5 books per month)
and monthly subscription fee, depending on the subscription type. This table is joined
with the customer table, and the relationship is 1:1 as each customer will have only one
subscription plan.
4. Order Table: This table contains all the order details. A customer can order more than
one book. Similarly, the order can be placed by several customers. Hence it’s an N: N
relationship.
5. Feedback Table: This table contains feedback given by customers for every book
purchased. Feedback given is ranking (1-5) and comment. Multiple customers can
provide feedback for one book. Similarly, a customer can provide feedback on different
books. Hence N: N relationship. This table is joined to the Statistical info table by N:N
relationship because multiple customers can give multiple feedbacks.
6. Book Box Table: This contains the details of the book selected by the customer by
browsing through the online catalogue. The online catalogue is based on the
recommendation given by the industry expert. Book Box is based on recommendation.
For every recommendation, one Book Box is created. So, the relationship is 1:1. Book
Box is made for every customer. Hence the relationship is 1:1.
7. Book Recommendation: This table contains the details like recommendations by
industry experts as per age group. This table is connected to the subscription table
because the book recommendation list is prepared based on the monthly subscription
plan (1:1). The book recommendation table is also connected to the customer table as
for every customer; one book recommendation is prepared as per his interest and
education need (1:1).
8. Statistical Information: This table contains the data about the count of the different
books purchased. This can be analysed to know the most popular book and the most
popular book category preferred by the customer. This table is joined with the customer
table, feedback and book recommendation table.

6|Page
3. IMPROVEMENTS
3.1. Data and Data Sources:

1. Data such as the item that was searched online can be a valuable data resource. Finding
similarities and differences in the search log will aid in comprehending customer behaviour.
Customers' web navigation is valuable information that may be analysed in-depth, utilising
Sequence analysis and other techniques such as clustering to discover patterns. This analysis'
findings can be used to generate a marketing strategy. [Jiang et al., 2012]

Fig 3: High-level Web Usage Mining Process

Fig 4: Major application areas of web usage mining

7|Page
2. Ranking and comments from the feedback table can be used for opinion and sentiment
analysis which help to understand customer behaviour better. [Kumar, Desai and Majumdar,
2016]. The industry expert uses this to prepare a book recommendation list according to
different customer experiences. Sentiment analysis is a technique for converting unstructured
data into useful information. Sentiment analysis insights may assist a corporation in making
accurate changes and transformations to their business. It might be in areas where stakeholders
have the most negative feelings, such as product features, pricing, return policies, customer
service, or prices, or it could be in areas where stakeholders are most positive, such as net
banking, price matching, and so on. These strategic measures benefit firms in the following
ways:

• Become more competitive.


• Attract new clients
• Retain current clients
• Increase your product and service sales.
• Reduce the number of customers you serve.
• Increase the profitability of your customers
• Enhance marketing strategies and communications

3. Web server log and customer navigation data can be used to investigate web bot traffic
patterns. Web bot detection and online purchase prediction using artificial intelligence (AI)
approaches are hot topics right now. An up-to-date, large-volume e-commerce dataset on Web
traffic is required to construct credible models of Web traffic, Web user navigation, and e-
customer behaviour. Similarly, to train and test new models and procedures, AI challenges
need a substantial amount of reliable, real-world data. [Chodak, Suchacka and Chawla, 2020]

3.2. Relational database purpose:

A relational database will not be an effective tool for the proposed improved system. The data
is massive, and the data gathered from the web for customer navigation is unstructured.
Relational databases will not be able to handle large amounts of data. Ample data storage,
administration, and processing will be significant concerns. The unstructured data cannot be
managed in the traditional relational database management system (RDBMS). As a result, data
growth necessitates a rethinking of data capture, storage, and processing approaches.

Hadoop will be a fantastic tool for managing large amounts of heterogeneous data and
analysing past shopping records for better recommendations. Big Data analytics has enabled

8|Page
organizations to create successful business strategies and remain competitive. [Jaskaran Singh
and Varun Singla, 2015].

Fig 5: Big data analytics with Hadoop structure

Hadoop was created as an open source platform for handling massive data analytics via batch
processing. It was built with the following principles in mind: less reliance on expensive high-
end hardware platforms and infrastructure, parallel processing to reduce computing time, not
moving data from disc to the central application to be processed, embrace failure, build
applications that are less reliant on infrastructures, and take advantage of Hadoop's flexibility.
[Ajah and Nweke, 2019]

3.3. NoSQL database tools:

To manage these data for the purpose of analytics the best NoSQL tool would be MongoDB.
This NoSQL database is a high-performing system that is both scalable and nimble. This open
source database designed in C++ includes document-oriented storage. Full index support, high
availability across WANs and LANs with easy replication, horizontal scalability, document-
based rich searches, flexibility in data processing and aggregation, as well as suitable training,

9|Page
support, and advice will be given. Several other NoSQL databases include Couch DB,
RavenDB, Redis etc.

A distributed database system is needed that can operate several nodes and is partition tolerant
to accommodate this much data. It means that even if one of the nodes fails for whatever reason,
the system should continue to function normally. The document-based approach of MongoDB
will be a wonderful fit for book clubs that want to combine hundreds of various data sources
since it will give a single unified view of the data. MongoDB is the only database that has
numerous storage engines in a single installation. This configuration is advantageous for
transferring data between storage engine technologies. It is accomplished by using local
replication.[Jose and Abraham, 2017]

MongoDB is used in the tech stacks of over 3400 firms, according to the website stackshare.io.
Uber, Google, eBay, Nokia, and Coinbase are just a few of the companies involved.

10 | P a g e
4. REFERENCES
1. Ajah, I.A. and Nweke, H.F. (2019). Big Data and Business Analytics: Trends,
Platforms, Success Factors and Applications. Big Data and Cognitive Computing,
[online] 3(2), p.32. Available at: https://www.mdpi.com/2504-2289/3/2/32/htm
[Accessed 13 Jan. 2022].
2. Chodak, G., Suchacka, G. and Chawla, Y. (2020). HTTP-level e-commerce data
based on server access logs for an online store. Computer Networks, [online] 183,
p.107589. Available at:
https://www.sciencedirect.com/science/article/pii/S1389128620312263 [Accessed 13
Jan. 2022].

3. Hui, S.K., Inman, J.J., Huang, Y. and Suher, J. (2013). The Effect of In-Store Travel
Distance on Unplanned Spending: Applications to Mobile Promotion
Strategies. Journal of Marketing, [online] 77(2), pp.1–16. Available at:
https://journals.sagepub.com/doi/10.1509/jm.11.0436 [Accessed 12 Jan. 2022].

4. Jaskaran Singh and Varun Singla (2015). Download Limit Exceeded. [online]
citeseerx.ist.psu.edu. Available at:
https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.695.5308&rep=rep1&type
=pdf.

5. Jiang, Q., Tan, C.-H., Phang, C.W. and Wei, K.K. (2012). Using Sequence Analysis to
Classify Web Usage Patterns across Websites. [online] IEEE Xplore. Available at:
https://ieeexplore.ieee.org/abstract/document/6149134 [Accessed 13 Jan. 2022].

6. Jose, B. and Abraham, S. (2017). Exploring the merits of nosql: A study based on
mongodb. 2017 International Conference on Networks & Advances in Computational
Technologies (NetACT). [online] Available at:
https://ieeexplore.ieee.org/abstract/document/8076778 [Accessed 13 Jan. 2022].

7. Kumar, K.L.S., Desai, J. and Majumdar, J. (2016). Opinion mining and sentiment
analysis on online customer review. [online] IEEE Xplore. Available at:
https://ieeexplore.ieee.org/abstract/document/7919584.

11 | P a g e
8. Letkowski, J., 2015. Doing database design with MySQL. Journal of Technology Research, 6,
p.1. ResearchGate. (2015). (16) (PDF) Doing database design with MySQL. [online]
Available at:
https://www.researchgate.net/publication/271910489_Doing_database_design_with_
MySQL [Accessed 13 Jan. 2022].

9. Rausch, T.M., Derra, N.D. and Wolf, L. (2020). They are predicting online shopping
cart abandonment with machine learning approaches. International Journal of Market
Research, [online] 64(1), pp.89–112. Available at:
https://journals.sagepub.com/doi/abs/10.1177/1470785320972526 [Accessed 12 Jan.
2022].

10. Shaikh, S., Rathi, S. and Janrao, P. (2017). Recommendation System in E-Commerce
Websites: A Graph-Based Approached. 2017 IEEE 7th International Advance
Computing Conference (IACC). [online] Available at:
https://ieeexplore.ieee.org/abstract/document/7976923 [Accessed 12 Jan. 2022]

12 | P a g e

View publication stats

You might also like