Professional Documents
Culture Documents
https://doi.org/10.22214/ijraset.2023.52836
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
Abstract: Analysts need to be able to identify insights into the data as it grows and becomes more complex. In the business world,
it's common for organizations, even small ones, to be overwhelmed by data. They have a lot of spreadsheets, databases, and other
documents that need to be looked at to help them make decisions. [5]Unfortunately, this procedure takes a lot of time and a lot of
manual labor.
Additionally, it costs a lot, especially if the user implements it. The systematic application of statistical [3] and logical techniques
to describe and illustrate, summarize, and evaluate data is known as data analysis [22-23]. One of the fast-growing techniques
for identifying data trends is data analysis [13]. Because speed and accuracy are the foundations of this system, they are why it is
so well-known.
The entire procedure by which this tool could take the place of the current method is detailed in this paper. A device that brings
those fields together is integrating the various tools used by individual users. The UNkNOT tool's flexibility for integration into
existing security systems and frameworks is designed to guarantee data integrity and confidentiality.
Keywords: Data Analysis Tool, Data Analytics, Data Viz, Data Visualization, Unknot, Exploratory Data Analysis
I. INTRODUCTION
Unknot Data Analysis Tool [23] is a highly customizable program that helps users unknot their databases. The tool provides users
with a straightforward approach to exploring and researching their data, which can help them source new information or discover
patterns within their databases.
The tool will allow its users to create their parameters for exploration and analysis. When it comes to performance monitoring there
are several tools on the market, but sadly many of these tools do not allow monitoring databases in a very simple way. By making it
possible to efficiently handle and manipulate large amounts of data, automate tasks, and train and deploy AI models, this tool can
help with AI-based data analysis. This tool can help data scientists and AI engineers focus on building better models and analyzing
the results rather than getting bogged down in data management and manipulation by offering effective methods for handling large
amounts of data.
The tool will provide an easy way to access, process, and share centralized information on various stages of your workflow, as well
as help with managing security policies. It also allows users to personalize their tool by altering its functionality and user interface.
The Unknot tool is the ultimate data validation and extraction tool, designed to provide users with the most robust and thorough
results. It provides a holistic solution in 4 phases: 1st phase focuses on working with existing data on a local device; 2nd phase
focuses on taking data from the user (can be local or global) given that the device is local; 3rd phase focuses on securing the data
integrity and confidentiality. And finally, after 3rd phase, the tool can be hosted on a global level/platform.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 5825
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
3. DASH Dash [14-17] is a Python framework for building web-based data visualization Complex, Integration issues
and analysis tools. One of the key features of Dash is its ability to connect to a
wide range of data sources.
4. PLOTLY Plotly [14-25] is a powerful data visualization library for Python. It allows Confusing initial setup to use Plotly
developers to create interactive, web- based plots and graphics, such as scatter without an online account, and lots of
plots, line plots, bar plots, and more. code to write.
Start
Take path
as input
other
Display Visual
ig 3.2: Phase 1 Layout
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 5826
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
V. PHASE 1 FINDINGS
During the first part of the process, the user must enter the path of the file present in his local server/system/storage. Users will be
given a list of choices from which the user can operate based on their preference for visuals shown in below figures.
Fig 5.1: Visuals to Choose From Fig 5.2: Pie Chart Visual
User will enter their preferred choice and will be asked for the respective attribute/attributes for analysis. After entering the path, the
system will display data for verification. To prevent re-running of the program, the User will get the choice to choose exit or the
next preferred chart/visual for analysis. In charts, users will get features like downloading the whole chart, or the preferred region
only, taking visuals for a specific attribute or an area only. Users can dismiss the dashboard at any point by entering ‘0’.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 5827
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
REFERENCES
[1] K. Johnson, B. Lee, and J. Smith. (2020). Data analysis methods for large datasets. Journal of Big Data, 7(2), 23-38.
[2] S. Chen, X. Zhang, and Y. Liu. (2021). Machine learning approaches for predictive analytics. Data Mining and Knowledge Discovery, 35(1), 73-8
[3] W. McKinney, "pandas: a foundational Python library for data analysis and statistics", Python for High Performance and Scientific Computing, vol. 14, no. 9,
2011
[4] X. Cai, H. Langtangen and H. Moe, "On the Performance of the Python Programming Language for Serial and Parallel Scientific Computations", Scientific
Programming, vol. 13, no. 1, pp. 31-56, 200
[5] J. Van Der Donckt, J. Van der Donckt, E. Deprost and S. Van Hoecke, "Plotly-Resampler: Effective Visual Analytics for Large Time Series," 2022 IEEE
Visualization and Visual Analytics (VIS), Oklahoma City, OK, USA, 2022, pp. 21-25, doi: 10.1109/VIS54862.2022.00013
[6] G. Iyer, S. DuttaDuwarah and A. Sharma, "DataScope: Interactive visual exploratory dashboards for large multidimensional data," 2017 IEEE Workshop on
Visual Analytics in Healthcare (VAHC), Phoenix, AZ, USA, 2017, pp. 17-23, doi: 10.1109/VAHC.2017.8387496
[7] Kabita Sahoo, Abhaya Kumar Samal, Jitendra Pramanik, and Subhendu Kumar Pani. Exploratory data analysis using python. International Journal of
Innovative Technology and Exploring Engineering (IJITEE), 2019
[8] Wes McKinney. Python for data analysis: Data wrangling with Pandas, NumPy, and IPython. OReilly Media, Inc., 2012
[9] Fabio Nelli. Python data analytics: Data analysis and science using PANDAs, Matplotlib and the Python Programming Language. Apress, 2015.
[10] Dr Ossama Embarak, Embarak, and Karkal. Data analysis and visualization using python. Springer, 2018.
[11] Pramanik, Jitendra & Samal, Abhaya Kumar & Sahoo, Kabita & Pani, Dr. Subhendu. (2019). Exploratory Data Analysis using Python. International Journal of
Innovative Technology and Exploring Engineering. 8. 4727-4735
[12] Kiranbala Nongthombam , Deepika Sharma, 2021, Data Analysis using Python, INTERNATIONAL JOURNAL OF ENGINEERING RESEARCH &
TECHNOLOGY (IJERT) Volume 10, Issue 07 (July 2021
[13] Wes McKinney and the Pandas Development Team,pandas: powerful Python data analysi
[14] Stancin, Igor and Alan Jović. “An overview and comparison of free Python libraries for data mining and big data analysis.” 2019 42nd International
Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) (2019): 977-982
[15] Harshal S. Kudale, Mihir V. Phadnis, Pooja J. Chittar, Kalpesh P. Zarkar,DATA ANALYSIS AND VISUALIZATION OF OLYMPICS USING PYSPARK
AND DASH-PLOTLY,202
[16] Pritchard, L., White, J. A., Birch, P. R. J., Toth, I. K. GenomeDiagram: a python package for the visualization of large-scale genomic data. Bioinformatics,
Volume 22, Issue 5, 1 March 2006, Pages 616–617. DOI: 10.1093/bioinformatics/btk021
[17] Shammamah Hossain,Visualization of Bioinformatics Data with Dash Bio,201
[18] Nagpal, Abhinav & Gabrani, Goldie. (2019). Python for Data Analytics, Scientific and Technical Applications. 140-145. 10.1109/AICAI.2019.8701341
[19] Wes McKinney, Python for Data Analysis(BookZZ.org),201
[20] Carson Sievert,Interactive web-based data visualization with R, plotly, and shiny(CRC press),202
[21] Nelli, Fabio. (2018). Python Data Analytics: With Pandas, NumPy, and Matplotlib. 10.1007/978-1-4842-3913-1
[22] "Data Wrangling with Python" by Jacqueline Kazil and Katharine Jarmul (2017) - O'Reilly Media, ISBN: 978-1491948811
[23] "Data Analysis with Pandas and Python" by Fabio Nelli (2017) - Packt Publishing, ISBN: 978-1787125933
[24] "Hands-On Data Analysis with Pandas" by Kevin Markham (2019) - Packt Publishing, ISBN: 978-1801092913
[25] "Python for Data Analysis and Visualization: A Hands-On Guide to Pandas, Matplotlib, Seaborn and Plotly" by Hadelin de Ponteves (2021) - Udemy, ISBN:
978-1801249073
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 5828