Thanks, --ignore_errors should be helpful. Not sure why I missed that option. I did look for something like that.
I noticed the logs are different since I upgraded to the latest version. It would be great if it was possible to have some stats dumped on a crash.
The Bulk Loader docs discuss some performance tuning options. I’d appreciate a more detailed discussion on what they do and how they’re related as well as some quantifications/estimates on how much memory would be required in various cases.
If I plan to load something like 8192 rdf.gz files totalling close to 2TB, what kind of hardware would you say I need to run the bulk loader?