2. runs on the top of HDFS 3. Used for semi-structured, structured data in the form of tables. 4. Support scaling. 5. Provides fault-tolerant 6. Well suited for real-time data processing. 7. It runs on Hadoop cluster. HBase can be used in the following scenarios: 1. Huge Data 2. Fast Random Access 3. Structured Data 4. Variable Schema 5. Need of Compression 6. Need of Sharding A+B+C+D=E -------------1 A+C+D=E