Professional Documents
Culture Documents
❖ Every group must choose one project, every group consists of (5 : 8) students
❖ Every Project is available for 9 groups
❖ Project grads 10 points.
Notes : register more than one idea ,if first choice is not available , you will execute the available idea
for you
Project 1:
Re-write SQL Join algorithms (inner - outer joins) using map-reduce and compare with Hive execution
time.
Project 2:
Develop marketing web-based application (CRUD operations) that uses hive as Back-end Database
and do full text search.
Project 3:
Create Desktop GUI for HADOOP frame work enabling user to:
▪ Start and Stop Hadoop.
▪ Load MapReduce JAR File.
▪ Input data files.
▪ Determine Output Data files.
▪ Copy result data into local file.
Project 4:
Create Desktop GUI for HIVE frame work enabling user to:
▪ Create Schema DDL
▪ Load schema Data tables DDL
▪ Queering schema by SQL queries DML
▪ Store query result into local file system
Project 5:
Project 7:
Using what you learn in big data to develop a distributed search engine to crawel 100,000 web pages
f from the news websites and index them using PageRank technique .all work must run in a
distributed fashion on Spark/Hadoop cluster.
Finally , provide search services on the craweled pages.
Project 8:
- You have number of documents , do inverted index on them and search engine
- do OR , AND operation for examples
Hello appears in : Doc1 , Doc3
World appears in : Doc1 , Doc2 ,
Hello And World : result is : Doc1
Hello OR World : result is : Doc1 , Doc2 ,Doc3