Bulk uploader not making equal shards

As the docs say:

> Using this compression setting (Snappy) provides a good compromise between the need for a high compression ratio and efficient CPU usage.

I think the right settings depend on the performance of your machine, the size of the dataset, and which indexes are used. There are no definitive numbers; you have to run the import several times and build up your own experience.
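One rough way to build that experience is to wrap each trial run in GNU time and compare wall-clock time and peak memory as you change one setting at a time. This is just a sketch; the file names are placeholders for your own dataset and schema:

```sh
# Time one trial import. With GNU time, -v prints elapsed wall time and
# "Maximum resident set size", which is what you want to compare across runs.
# data.rdf.gz and data.schema are placeholder names -- substitute your own files.
/usr/bin/time -v dgraph bulk -f data.rdf.gz -s data.schema
```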

I can’t find any documentation for --num_go_routines. As I remember, increasing it raises both import speed and memory consumption.
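For reference, here is a minimal sketch of how the flag might be set on a bulk load. The file names and the shard count are illustrative only, so verify the exact flags against `dgraph bulk --help` for your version:

```sh
# Hypothetical bulk-load invocation; data.rdf.gz and data.schema are
# placeholder file names. Raising --num_go_routines tends to speed up the
# import at the cost of higher memory usage, so tune it to your machine.
dgraph bulk \
  -f data.rdf.gz \
  -s data.schema \
  --num_go_routines=4 \
  --reduce_shards=2
```

Start with a small value, watch memory during the run, and raise it only while the import keeps getting faster without exhausting RAM.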