Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

doc: distributed training #16

Open
wey-gu opened this issue Dec 5, 2023 · 0 comments
Open

doc: distributed training #16

wey-gu opened this issue Dec 5, 2023 · 0 comments
Labels
documentation Improvements or additions to documentation

Comments

@wey-gu
Copy link
Owner

wey-gu commented Dec 5, 2023

How to do distributed training:

Load data and prepare on graph partition

import dgl

g = ...  # load the DGLGraph object with nebula-dgl
dgl.distributed.partition_graph(g, 'mygraph', 2, 'data_root_dir')

It'll output the partitioned graph as:

data_root_dir/
  |-- mygraph.json          # metadata JSON. File name is the given graph name.
  |-- part0/                # data for partition 0
  |  |-- node_feats.dgl     # node features stored in binary format
  |  |-- edge_feats.dgl     # edge features stored in binary format
  |  |-- graph.dgl          # graph structure of this partition stored in binary format
  |
  |-- part1/                # data for partition 1
     |-- node_feats.dgl
     |-- edge_feats.dgl
     |-- graph.dgl

See more on the reference docs:

ref:

Prepare distributed training env

  • create a cluster of machines
  • upload training script and partitioned data to each cluster
    • Could consider NFS/JuiceFS for ease of data access from distributed servers
  • SSH access, prepare SSH pub key to enable password-less SSH auth
  • Launch training job

ref:

@wey-gu wey-gu added the documentation Improvements or additions to documentation label Dec 5, 2023
@wey-gu wey-gu changed the title NebulaGraph FAQ Doc: distributed training Dec 5, 2023
@wey-gu wey-gu changed the title Doc: distributed training doc: distributed training Dec 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

1 participant