Balaji Srinivasan

What I Learned from Reading the Bigtable Paper

1 minutes (276 words)

Authors: Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson Hsieh, et al.
Published: 2006, OSDI
Link: research.google/pubs/pub27898


πŸ”—πŸ” What’s the Problem?

Google needed a scalable, highly available storage system that could handle petabytes of structured data β€” across thousands of machines β€” for diverse workloads like web indexing, Google Earth, and personalization data.


πŸ”—πŸ’‘ Core Idea

Bigtable introduced a sparse, distributed, persistent multidimensional sorted map β€” where:

(row key, column key, timestamp) β†’ value

Rows are lexicographically ordered. Columns are grouped into families. Versions are timestamped.

This flexible model allows efficient range scans, time-versioned data, and horizontally scalable reads/writes across commodity hardware.


πŸ”—βš™οΈ How It Works (Short Version)


πŸ”—πŸ€” What I Liked


πŸ”—πŸ§± Mental Models or Lessons


Tags: #software-engineering #papers #distribute-systems