Remove Supervision Edge Outward Assumption from V2 GiGL #316

mkolodner-sc · 2025-09-08T18:43:41Z

Scope of work done

Removes the assumption that the supervision edge type is always directed outward from the V2 GiGL codebase. We are removing this assumption since it can introduce confusion and greater difficulty in maintainability to be internally flipped edge types due to the assumption that our supervision edge is always outward direction. We are reducing opinionation here by removing this requirement.

Specifically, it does this by:

Removing the flipping step when we convert labels to edge types
Remove the flipping step when we are splitting the data
Remove the flipping step when we are initializing the neighborloader with some supervision edge type
Make it so that if we partition_labels, we don't automatically assume src as the root node type and instead base the root node type on the edge direction
Adds corresponding tests

Following this change, we make no underlying assumption about the supervision edge type in the graph and assume that it also complies with the sampling_edge_direction provided. This means for a supervision edge type A -> B which is directed outward, we would have A by the anchor node type and B be the supervision node type. If the graph was inward, we'd have B be the anchor node type and A be the supervision node type.

This provided supervision edge type should adhere to what labels were specified in data preprocessor. A follow-up item from this PR is to add a check that our data preprocessor config labels adhere to the task metadata.

Where is the documentation for this feature?: N/A

Did you add automated tests or write a test plan?

Updated Changelog.md? NO

Ready for code review?: NO

kmontemayor2-sc

Hi Matt, thanks for the work.

I think for future readers of this PR (and also me rn...) it may be useful to add more description to the PR about why we do these things/etc. Where we were, why we are changing, and what the rules are now, etc.

kmontemayor2-sc · 2025-09-09T16:32:55Z

python/tests/unit/distributed/distributed_neighborloader_test.py

Hmmm, the DBLP dataset contains both (paper, to, author), and (author, to, paper) right 1?

I have some concerns about some case with only (paper, to, author) and a supervision edge type (author, to, paper) having previously worked, since we reversed the edge type, but now it would fail.

Does this seem correct to you? And if so could we add some test for a supervision edge type whose either doesn't have a corresponding message passing edge type, or whose message passing edge type isn't reciprocal?

Sure, I can aim to increase the test coverage here, thanks!

mkolodner-sc added 5 commits September 8, 2025 18:23

Initial commit

72da94c

Update test

29ad91f

Update

6f647ef

Fix

32ee56a

Update

dadffae

kmontemayor2-sc reviewed Sep 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove Supervision Edge Outward Assumption from V2 GiGL #316

Remove Supervision Edge Outward Assumption from V2 GiGL #316

Uh oh!

mkolodner-sc commented Sep 8, 2025 •

edited

Loading

Uh oh!

kmontemayor2-sc left a comment

Uh oh!

kmontemayor2-sc Sep 9, 2025

Uh oh!

mkolodner-sc Sep 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Remove Supervision Edge Outward Assumption from V2 GiGL #316

Are you sure you want to change the base?

Remove Supervision Edge Outward Assumption from V2 GiGL #316

Uh oh!

Conversation

mkolodner-sc commented Sep 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kmontemayor2-sc left a comment

Choose a reason for hiding this comment

Uh oh!

kmontemayor2-sc Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

mkolodner-sc Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mkolodner-sc commented Sep 8, 2025 •

edited

Loading