The team aims to develop unified software to extract data from various sources like images, PDFs, text, and documents. The software will use tools like import.io, selenium, and Hadoop for big data extraction and provide graphical representations of analyzed data. It will also feature integrated keyword searching of a locally stored database. The team is from Rajalakshmi Institute of Technology and their problem addresses bulk evidence collection, deleted data recovery, large-scale data analysis and comparison for the Madhya Pradesh Police.
The team aims to develop unified software to extract data from various sources like images, PDFs, text, and documents. The software will use tools like import.io, selenium, and Hadoop for big data extraction and provide graphical representations of analyzed data. It will also feature integrated keyword searching of a locally stored database. The team is from Rajalakshmi Institute of Technology and their problem addresses bulk evidence collection, deleted data recovery, large-scale data analysis and comparison for the Madhya Pradesh Police.
The team aims to develop unified software to extract data from various sources like images, PDFs, text, and documents. The software will use tools like import.io, selenium, and Hadoop for big data extraction and provide graphical representations of analyzed data. It will also feature integrated keyword searching of a locally stored database. The team is from Rajalakshmi Institute of Technology and their problem addresses bulk evidence collection, deleted data recovery, large-scale data analysis and comparison for the Madhya Pradesh Police.
Problem Statement Organization Name: Madhya Pradesh Police.
PS Code: AT980
Problem Statement Title: Big Data Searching.
Team Name: ClassyCoders
Team Leader Name: VISWESH S
Institute Code (AISHE): 2117
Institute Name: RAJALAKSHMI INSTITUTE OF TECHNOLOGY
Theme Name: Blockchain & Cybersecurity
Idea/Approach Details
⮚ Our objective is to develop an unified software to
extract data from various sources such as images, pdf, text, documents, etc. ⮚ The software uses various big data extraction tools such as import.io, selenium and Hadoop. ⮚ Our unique feature is that we are analyzing and providing various graphical representations of the data by using data analytics and visualzation techniques. ⮚ The software also will have an integrated feature to search keywords by accessing the locally stored database. 2 Idea/Approach Details Use cases: Dependencies/Tools used: ⮚ To collect bulk evidence from •NoSQL Database: HBase,MongoDB, smartphones or laptops of accused. ZooKeeper ⮚ For quick recovery of deleted data. •Storage: HDFS and S3 ⮚ To analyze large blocks of data and compare with existing keywords •Processing: Datameer, BigSheets, provided in database. Mechanical Turk, R
⮚ For graphical representation of •MapReduce: Hive, Hadoop, S4, Flume,
various big Data. Cascading
•Servers: Heroku, Google App Engine
3 Team Member Details Team Leader Name: VISWESH S Branch: BE Stream: CSE Year: II Team Member 1 Name: TARUN KUMAR S Branch: BE Stream: CSE Year: II Team Member 2 Name: SURESH KISHNA A Branch: BE Stream: CSE Year: II Team Member 3 Name: YATISHWAR GV Branch: BE Stream: CSE Year: II Team Member 4 Name: VENKATA SAI DEEPA A Branch: BE Stream: CSE Year: II Team Member 5 Name: THARUN M Branch: BE Stream: CSE Year: II Team Mentor 1 Name: Mr. R. Arun Kumar Category: ACADEMIC Expertise: Blockchain Domain Experience (in years): 2