When you work with a huge table
[CASE I] Brute force
Brute force your way through the table.
If you try to find a row without an index, you have to scan the entire table; the best you can do is throw multi-processing or multi-threading at it and scan chunks in parallel. (Sketch below.)
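A minimal sketch of that brute-force parallel scan, assuming a hypothetical SQLite file "app.db" with a table users(id, name); the file name, table, and the searched value are all assumptions for illustration.

```python
# Brute force: split the table into id ranges and full-scan each range
# in a separate process. No index is used anywhere.
import sqlite3
from multiprocessing import Pool

DB_PATH = "app.db"   # hypothetical database file
CHUNK = 100_000      # rows handled per worker task

def scan_range(bounds):
    """Full scan of one id range; each worker opens its own connection."""
    lo, hi = bounds
    conn = sqlite3.connect(DB_PATH)
    try:
        cur = conn.execute(
            "SELECT id, name FROM users WHERE id BETWEEN ? AND ? AND name = ?",
            (lo, hi, "alice"),
        )
        return cur.fetchall()
    finally:
        conn.close()

if __name__ == "__main__":
    total = sqlite3.connect(DB_PATH).execute("SELECT MAX(id) FROM users").fetchone()[0]
    ranges = [(lo, min(lo + CHUNK - 1, total)) for lo in range(1, total + 1, CHUNK)]
    with Pool() as pool:
        matches = [row for rows in pool.map(scan_range, ranges) for row in rows]
    print(matches)
```

Parallelism only hides the cost: every worker still reads its whole chunk from disk.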
[CASE II] Indexing
Can I avoid processing the entire table?
📝 The best approach is indexing.
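A minimal sketch of the indexing approach, reusing the same hypothetical SQLite table users(id, name); the index name is also an assumption.

```python
# Indexing: with an index on name, the lookup no longer scans the table.
import sqlite3

conn = sqlite3.connect("app.db")
conn.execute("CREATE INDEX IF NOT EXISTS idx_users_name ON users(name)")

# EXPLAIN QUERY PLAN should now report a search using idx_users_name
# instead of a full table scan.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM users WHERE name = ?", ("alice",)
).fetchall()
print(plan)

rows = conn.execute("SELECT id FROM users WHERE name = ?", ("alice",)).fetchall()
print(rows)
conn.close()
```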
[CASE III] Partitioning
Partition the table on disk, breaking it into multiple smaller parts.
The data is separated into different logical locations (still on the same host).
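A minimal sketch of range partitioning, written as PostgreSQL-style DDL held in Python strings; the events table, its columns, and the date ranges are hypothetical.

```python
# Partitioning: one logical table, several smaller physical pieces on the
# same host. Queries filtering on created_at only touch the matching piece.
PARTITIONED_DDL = [
    """
    CREATE TABLE events (
        id         BIGINT NOT NULL,
        created_at DATE   NOT NULL,
        payload    TEXT
    ) PARTITION BY RANGE (created_at);
    """,
    "CREATE TABLE events_2023 PARTITION OF events"
    " FOR VALUES FROM ('2023-01-01') TO ('2024-01-01');",
    "CREATE TABLE events_2024 PARTITION OF events"
    " FOR VALUES FROM ('2024-01-01') TO ('2025-01-01');",
]

def apply_ddl(conn):
    """Run the DDL against an open DB-API connection (e.g. psycopg2)."""
    with conn.cursor() as cur:
        for stmt in PARTITIONED_DDL:
            cur.execute(stmt)
    conn.commit()
```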
[CASE IV] Sharding
Use multiple hosts.
Divide the disks; separate the data physically.
You reduce the physical size of each table, but complexity goes up (sketch below).
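A minimal sketch of application-level hash sharding; the shard host names and the idea of routing by user id are assumptions for illustration.

```python
# Sharding: pick a host from the key, so each host stores only a slice
# of the data.
import hashlib

SHARDS = [
    "db-shard-0.internal",
    "db-shard-1.internal",
    "db-shard-2.internal",
]

def shard_for(user_id: int) -> str:
    """Route a user id to one physical host."""
    digest = hashlib.sha256(str(user_id).encode()).hexdigest()
    return SHARDS[int(digest, 16) % len(SHARDS)]

# Every query for this user now hits one smaller table on one host, but
# cross-shard joins and transactions are now your problem -- the added complexity.
print(shard_for(42))
```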
(Size descending) Shard -> Partition -> Index
Conclusion
Before applying solutions like these, avoid having a billion-row table in the first place.
That's the first thing.
Can't avoid it? Use indexing, partitioning, and sharding (like DynamoDB does).
[Twitter] Keep a single profile table with id, name, and follower / following counts as integer columns, maintained at the write level.
Update these counters every time someone follows or is followed, instead of counting rows on every read.
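A minimal sketch of that write-level counter idea, assuming hypothetical profiles and follows tables in SQLite; the schema and function are illustrative, not Twitter's actual implementation.

```python
# Denormalized counters: write the follow edge and bump both integer
# counts in one transaction, so reads never scan the follows table.
import sqlite3

conn = sqlite3.connect("app.db")

def follow(follower_id: int, followee_id: int) -> None:
    with conn:  # single transaction
        conn.execute(
            "INSERT INTO follows (follower_id, followee_id) VALUES (?, ?)",
            (follower_id, followee_id),
        )
        conn.execute(
            "UPDATE profiles SET following_count = following_count + 1 WHERE id = ?",
            (follower_id,),
        )
        conn.execute(
            "UPDATE profiles SET follower_count = follower_count + 1 WHERE id = ?",
            (followee_id,),
        )
```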