Add distributed feature info for distributed training #7678
Codecov Report
@@ Coverage Diff @@
## master #7678 +/- ##
==========================================
- Coverage 91.58% 91.05% -0.54%
==========================================
Files 452 454 +2
Lines 25534 25783 +249
==========================================
+ Hits 23385 23476 +91
- Misses 2149 2307 +158
Can we rebase this PR on the other two PRs to avoid a duplicated diff? Otherwise, this one is hard to review.
This code is part of the overall distributed training support for PyG. (This PR is to replace #7678)

This PR was originally designed for the DistFeature class and has now been merged with LocalFeatureStore:

- Add partition/RPC info to LocalFeatureStore, such as num_partition, partition_idx, feature_pb (the feature partition book), partition_meta, RpcRouter, etc.
- Add one new class (RpcCallFeatureLookup) to do the actual remote RPC feature lookup work.
- Add one API (.lookup_features()) that looks up features on the local node and on remote nodes for the sampled global node IDs/edge IDs, based on the torch RPC APIs.
- Add one unit test under the test/distributed/ folder to verify local and remote feature lookup.

We have now combined the local feature store and the distributed feature properties (partition info and RPC remote access APIs) into one FeatureStore. Later on, we will rename the class from LocalFeatureStore to PartitionFeatureStore in another PR.

Please let us know if you have any comments. Thanks.

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Matthias Fey <matthias.fey@tu-dortmund.de>
Co-authored-by: root <root@skyocean.sh.intel.com>
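For reviewers, the local/remote lookup described above can be illustrated roughly as follows. This is only a minimal pure-Python sketch of the routing idea, not the code in this PR: the function signature, the dict-based stores, and the `remote_lookup` callable (which stands in for a torch RPC call) are all hypothetical, and only the names `feature_pb` and `partition_idx` are taken from the description.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Sequence

Feature = List[float]


def lookup_features(
    global_ids: Sequence[int],
    feature_pb: Sequence[int],            # global id -> owning partition
    partition_idx: int,                   # partition held by this node
    local_features: Dict[int, Feature],   # features stored on this node
    remote_lookup: Callable[[int, List[int]], List[Feature]],
) -> List[Feature]:
    """Hypothetical sketch: fetch features for global ids, local or remote."""
    # Group the request positions by the partition that owns each id.
    positions_by_part: Dict[int, List[int]] = defaultdict(list)
    for pos, gid in enumerate(global_ids):
        positions_by_part[feature_pb[gid]].append(pos)

    out: List[Feature] = [[] for _ in global_ids]
    for part, positions in positions_by_part.items():
        ids = [global_ids[p] for p in positions]
        if part == partition_idx:
            # Local partition: read directly from the local store.
            feats = [local_features[g] for g in ids]
        else:
            # Remote partition: one batched request per partition
            # (a stand-in for an RPC call in a real setup).
            feats = remote_lookup(part, ids)
        # Scatter the results back into the original request order.
        for p, f in zip(positions, feats):
            out[p] = f
    return out


# Toy setup: partition 0 owns ids 0-2, partition 1 owns ids 3-5.
feature_pb = [0, 0, 0, 1, 1, 1]
stores = {
    0: {0: [0.0], 1: [1.0], 2: [2.0]},
    1: {3: [3.0], 4: [4.0], 5: [5.0]},
}


def fake_remote(part: int, ids: List[int]) -> List[Feature]:
    # Simulates the remote side answering a batched lookup.
    return [stores[part][g] for g in ids]


result = lookup_features([4, 1, 3, 0], feature_pb, 0, stores[0], fake_remote)
```

In this toy setup, ids 1 and 0 are served from the local store and ids 4 and 3 go through a single batched call to the simulated remote partition, while the output keeps the order of the requested ids.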