Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The process is too slow #1

Open
harsh9t opened this issue Nov 4, 2016 · 2 comments
Open

The process is too slow #1

harsh9t opened this issue Nov 4, 2016 · 2 comments

Comments

@harsh9t
Copy link

harsh9t commented Nov 4, 2016

Hi,

I am using dbpedia 2015-04 version of dbpedia to generate the graph.db file for neo4j. There are a total of 71 .nt files to be loaded. So, far only 1 file has been loaded which 1.2 GB (name: category_labels_en.nt) in size and it took almost one whole day to do so. I am just wondering how much time it will take to load and process all 71 files.

Any insights?

@tiepologian
Copy link
Owner

Hi!

The time it takes depends on your hardware.
One of the most important factors is disk speed. Do you have an SSD disk?
Are you running a virtual machine? Or a VPS server? If you are, it will be very slow.

On our hardware the entire process, for all files, takes less than an hour.
Cheers

@harsh9t
Copy link
Author

harsh9t commented Nov 8, 2016

The machine I am using has a RAID storage of 6TB, there is also a 512 GB SSD but I am not using it as it is shared between users. The machine is a 12 core Xeon 2.8 GHz processor, 128 GB RAM device.

So far it has just managed to crunch 55 files out of 71 in 3 days :D I am just wondering and waiting it to finish. Any further comments?

Also one more question, How can I add multiple files in one single python command? I tried python $ dbpediaNeo4j.py /home/batman/git/dbpediaNeo4j/db_nt/ ; but the scripts terminates loading 0 relationships and no error. It is not able to take multiple .nt files.. should i mention each file separately?

*apologies for lame questions in advance :) *

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants