node investigate large db size

Testnet's database ballooned to an alarming size:

6.9G	data.testnet/akashd/data/tx_index.db
1023M	data.testnet/akashd/data/cs.wal
2.3G	data.testnet/akashd/data/state.db
5.0G	data.testnet/akashd/data/blockstore.db
40K	data.testnet/akashd/data/evidence.db
30G	data.testnet/akashd/data/application.db
45G	data.testnet/akashd/data
268K	data.testnet/akashd/config
45G	data.testnet/akashd

[ ] How much space does our various models take up in the database? How much of the data is the index (key), how much is the value?
deployment.Deployment
deployment.Group
market.Order
market.Bid
market.Lease
[ ] make some estimations looking at data_size(lease_count)
[ ] explore some options for reducing db size
uint IDs instead of large compound keys?
is any data present in both index and value?
[ ] any tools we can write or use to monitor/analyse database?
can we expose (prometheus) metrics?

Related to #604 - adding more indexes may increase unless we re-type our keys (use uints).

May 21 '20 04:05 boz

@akhilkumarpilli can you add the observations here.

Jun 20 '20 17:06 anilcse

Ran single-node testnet on two instances to investigate db size. We ran around 89K blocks.

Node with pruning syncable has application db with size 4.2 MB. Whereas node with pruning nothing has application db with size 328 MB.

We can see the clear difference on application db size. pruning syncable (default) has better optimization and pruning nothing will use in a lot of disk.

Jun 22 '20 06:06 akhilkumarpilli

For our Validators, why are we running them with --pruning nothing vs. --pruning syncable? Can we reset the database and then restart with --pruning syncable?

My VPS is at 72% disk usage. It's a good thing we are about to shutdown the validators for phase 2 as I estimate that I have 2 to 3 weeks before running out of storage space.

Jun 22 '20 12:06 lightiv

@lightiv we have state load issues with other pruning strategies: https://github.com/cosmos/cosmos-sdk/issues/5746

Jun 22 '20 13:06 anilcse

Do we consider this solved with the latest network upgrade @boz ?

Jul 22 '21 14:07 hydrogen18