Column-Based NoSQL:
By Manish Dey
USN 1EP23AD021
Understanding NoSQL and Data Modeling
NoSQL Overview
NoSQL databases handle large and diverse data flexibly.
Schema-less design
Horizontal scaling
Varied storage models
Data Modeling
Model design depends on the NoSQL type and use case.
Document, key-value, column, graph types
Optimizing for query patterns
Column-Based Architecture
Column Family Storage
Data stored by columns instead of rows for efficiency.
Wide Tables
Supports tables with many dynamic columns per row.
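As a rough illustration of why a column-oriented layout can help aggregate queries, the sketch below holds the same records once row by row and once column by column. The sample data and the names `rows`, `columns`, `average_from_rows`, and `average_from_columns` are hypothetical, and real column stores organize data on disk in far more elaborate ways.

```python
# Hypothetical sensor records, used only for illustration.
rows = [
    {"sensor_id": "s1", "temperature": 21.5, "humidity": 40},
    {"sensor_id": "s2", "temperature": 23.0, "humidity": 38},
    {"sensor_id": "s3", "temperature": 22.5, "humidity": 42},
]

# Column-oriented view of the same data: one list of values per column.
columns = {name: [row[name] for row in rows] for name in rows[0]}


def average_from_rows(records, field):
    """Row layout: every full record is visited to pull out one field."""
    values = [record[field] for record in records]
    return sum(values) / len(values)


def average_from_columns(cols, field):
    """Column layout: only the requested column is read."""
    values = cols[field]
    return sum(values) / len(values)


row_result = average_from_rows(rows, "temperature")
column_result = average_from_columns(columns, "temperature")
```

Both functions return the same average. The difference is how much data each one has to touch: the column version never looks at `sensor_id` or `humidity`.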
Distributed Design
Shards data horizontally across nodes for scalability.
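Horizontal sharding can be sketched as a function that maps each row key to a node. The example below is a simplified assumption, not the partitioner of any particular database: it hashes the key and takes the result modulo the node count. The function name `node_for_key` and the node names are hypothetical, and production systems typically use consistent hashing so that adding a node moves only a small share of the data.

```python
import hashlib


def node_for_key(row_key, nodes):
    """Pick a node for a row key using a stable hash (simplified sketch)."""
    digest = hashlib.md5(row_key.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an integer token.
    token = int.from_bytes(digest[:8], "big")
    return nodes[token % len(nodes)]


# Hypothetical three-node cluster.
cluster = ["node-a", "node-b", "node-c"]
placement = {key: node_for_key(key, cluster) for key in ["user:1", "user:2", "user:3", "user:4"]}
```

Because the hash is deterministic, every client that knows the node list routes the same key to the same node without consulting a central directory.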
Key Features and Advantages
High Performance
Optimizes column-oriented queries, reducing read/write load.
Scalable
Easily handles huge data volumes across clusters.
Flexible Schema
Adds or modifies columns without downtime.
Efficient Compression
Columns with similar data leverage compression effectively.
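To see why a column of similar values compresses well, here is a minimal sketch using run-length encoding. This is one common technique chosen for illustration, not a claim about what any specific database uses; the names `run_length_encode`, `run_length_decode`, and `status_column` are hypothetical.

```python
from itertools import groupby


def run_length_encode(values):
    """Collapse consecutive repeats into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


def run_length_decode(runs):
    """Expand (value, count) pairs back into the original sequence."""
    return [value for value, count in runs for _ in range(count)]


# A hypothetical status column: values in one column tend to repeat.
status_column = ["OK"] * 4 + ["WARN"] * 2 + ["OK"] * 3
encoded = run_length_encode(status_column)
restored = run_length_decode(encoded)
```

Nine stored values shrink to three pairs here. A row layout would interleave these statuses with unrelated fields, which breaks up the runs and leaves less to compress.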
Use Cases: Where Column-Based Shines
Analytics Platforms
Fast column scans for large data aggregates.
IoT Data Storage
Storing sensor data that arrives in wide, sparse columns.
Recommendation Engines
Quickly retrieve user and product attributes from columns.
Financial Services
Real-time analytics on transactional data at scale.
Popular Column-Based NoSQL Systems
Apache Cassandra
Decentralized, high availability, supports multi-datacenter replication.
Apache HBase
Built on Hadoop, provides strong consistency and random access.
ScyllaDB
Compatible with Cassandra, optimized for low latency and high throughput.
Comparing Column-Based with Other NoSQL Types
NoSQL Type     Strengths                        Typical Use Cases
Column-based   Fast aggregations, scalable      Analytics, IoT data
Document       Flexible schemas, rich queries   Content management
Key-value      Simple, ultra-fast lookups       Cache, session storage
Graph          Relationship queries             Social networks, fraud detection
Conclusion:
Assess Your Data
Evaluate if your data needs wide, sparse columns and scalability.
Align With Use Cases
Consider analytics, IoT, or real-time big data needs.
Choose the Right System
Select a column database fitting your workload and budget.