Professional Documents
Culture Documents
2. What is Cassandra?
Cassandra is one of the most favored NoSQL distributed database management
systems by Apache. With open source technology, Cassandra is efficiently designed to
store and manage large volumes of data without any failure. Highly scalable for Big
Data models and originally designed by Facebook, Apache Cassandra is written in Java
comprising flexible schemas. Apache Cassandra has no single point of failure.There are
various types of NoSQL databases and Cassandra is a hybrid of column-oriented and
key-value store database. The keyspace is outermost container for an application and
table or column family in Cassandra is keyspace entity.
Learn more about Cassandra from this tutorial page.
SPM primarily administers Cassandra metrics and various OS and JVM metrics.
Besides Cassandra, SPM also monitors Hadoop, Spark, Solr, Storm, zookeeper
and other Big Data platforms. The main features of SPM include correlation of
events and metrics, distributed transaction tracing, creating real-time graphs
with zooming, anomaly detection and heartbeat alerting.
7. Define memtable.
Similar to table, memtable is in-memory/write-back cache space consisting of content in
key and column format. The data in memtable is sorted by key, and each ColumnFamily
consist of a distinct memtable that retrieves column data via key. It stores the writes
until it is full, and then flushed out.
8. What is SSTable? How is it different from other relational tables?
SSTable expands to ‘Sorted String Table,’ which refers to an important data file in
Cassandra and accepts regular written memtables. They are stored on disk and exist
for each Cassandra table. Exhibiting immutability, SStables do not allow any further
addition and removal of data items once written. For each SSTable, Cassandra creates
three separate files like partition index, partition summary and a bloom filter.
9. Explain the concept of Bloom Filter.
Associated with SSTable, Bloom filter is an off-heap (off the Java heap to native
memory) data structure to check whether there is any data available in the SSTable
before performing any I/O disk operation.Learn more about Apache Cassandra- A Brief
Intro in this insightful blog now!