Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled.#5038
Conversation
Summary of ChangesHello @JesseStutler, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request addresses a critical bug in the scheduler where jobs configured with subgroups were unable to be scheduled if they did not also specify a hard network topology mode. The changes update the allocation logic to properly account for jobs with subjob policies, allowing them to proceed through the scheduling process as intended. This ensures greater flexibility and correctness in how jobs with complex structures are handled by the system. Highlights
🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console. Changelog
Activity
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
|
Let me also add e2e tests and further verify it |
There was a problem hiding this comment.
Code Review
This pull request successfully addresses an issue where jobs with subgroups but without a hard network topology mode were not being scheduled correctly. The fix correctly routes jobs with sub-job policies to the allocateForJob function, which is capable of handling them. The addition of a new test case ensures this scenario is now covered. I have a couple of suggestions to refine the logic for populating tasksNoHardTopology to improve efficiency and code clarity.
There was a problem hiding this comment.
Pull request overview
Fixes a scheduling failure in the allocate action for jobs that have subgroups (SubGroupPolicy/subjobs) when networkTopology.mode is not hard, by ensuring those jobs use the subjob-aware allocation path.
Changes:
- Route jobs with
SubGroupPolicythroughallocateForJob(same path as hard-topology jobs) instead of the default-subjob-only path. - Update
tasksNoHardTopologycomments/initialization to reflect the adjusted scheduling paths. - Add a regression test covering allocation when subgroups exist without hard topology at the job level.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| pkg/scheduler/actions/allocate/allocate.go | Uses allocateForJob for subjob-policy jobs to avoid default-subjob lookups that can fail when real subjobs exist. |
| pkg/scheduler/actions/allocate/allocate_test.go | Adds a test case to ensure subjob-policy jobs can be allocated without hard job-level topology. |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
bb447a2 to
e6cee05
Compare
…de could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com>
e6cee05 to
56c6901
Compare
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: hzxuzhonghu The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
/cherry-pick release-1.14 |
|
@hzxuzhonghu: once the present PR merges, I will cherry-pick it on top of release-1.14 in a new PR and assign it to you. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/lgtm |
|
@hzxuzhonghu: new pull request created: #5041 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/lgtm |
commit f4db07d Merge: 1000067 01823d1 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Fri Feb 13 11:26:57 2026 +0800 Merge pull request volcano-sh#5039 from volcano-sh/copilot/update-kubernetes-compatibility-map Reorganize Kubernetes compatibility matrix: show 5 latest versions with newest first commit 01823d1 Author: Jesse Stutler <jesseincomparable@hotmail.com> Date: Fri Feb 13 10:20:07 2026 +0800 Add version compatibility archive and update README with reference link commit 1000067 Merge: 6ba0e29 56c6901 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Thu Feb 12 16:49:57 2026 +0800 Merge pull request volcano-sh#5038 from JesseStutler/fix_4871 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. commit f22bea4 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 08:42:26 2026 +0000 Reorganize Kubernetes compatibility table: show only 5 latest versions with newest on top and left Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit 56c6901 Author: JesseStutler <chenzicong4@huawei.com> Date: Thu Feb 12 16:05:35 2026 +0800 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com> commit 9125f3e Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:15:55 2026 +0000 Add Kubernetes compatibility for Volcano v1.13 and v1.14 Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit f4e0f66 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:14:39 2026 +0000 Initial plan
commit f4db07d Merge: 1000067 01823d1 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Fri Feb 13 11:26:57 2026 +0800 Merge pull request volcano-sh#5039 from volcano-sh/copilot/update-kubernetes-compatibility-map Reorganize Kubernetes compatibility matrix: show 5 latest versions with newest first commit 01823d1 Author: Jesse Stutler <jesseincomparable@hotmail.com> Date: Fri Feb 13 10:20:07 2026 +0800 Add version compatibility archive and update README with reference link commit 1000067 Merge: 6ba0e29 56c6901 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Thu Feb 12 16:49:57 2026 +0800 Merge pull request volcano-sh#5038 from JesseStutler/fix_4871 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. commit f22bea4 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 08:42:26 2026 +0000 Reorganize Kubernetes compatibility table: show only 5 latest versions with newest on top and left Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit 56c6901 Author: JesseStutler <chenzicong4@huawei.com> Date: Thu Feb 12 16:05:35 2026 +0800 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com> commit 9125f3e Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:15:55 2026 +0000 Add Kubernetes compatibility for Volcano v1.13 and v1.14 Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit f4e0f66 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:14:39 2026 +0000 Initial plan Signed-off-by: devzizu <jazevedo960@gmail.com>
commit f4db07d Merge: 1000067 01823d1 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Fri Feb 13 11:26:57 2026 +0800 Merge pull request volcano-sh#5039 from volcano-sh/copilot/update-kubernetes-compatibility-map Reorganize Kubernetes compatibility matrix: show 5 latest versions with newest first commit 01823d1 Author: Jesse Stutler <jesseincomparable@hotmail.com> Date: Fri Feb 13 10:20:07 2026 +0800 Add version compatibility archive and update README with reference link commit 1000067 Merge: 6ba0e29 56c6901 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Thu Feb 12 16:49:57 2026 +0800 Merge pull request volcano-sh#5038 from JesseStutler/fix_4871 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. commit f22bea4 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 08:42:26 2026 +0000 Reorganize Kubernetes compatibility table: show only 5 latest versions with newest on top and left Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit 56c6901 Author: JesseStutler <chenzicong4@huawei.com> Date: Thu Feb 12 16:05:35 2026 +0800 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com> commit 9125f3e Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:15:55 2026 +0000 Add Kubernetes compatibility for Volcano v1.13 and v1.14 Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit f4e0f66 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:14:39 2026 +0000 Initial plan Signed-off-by: devzizu <jazevedo960@gmail.com>
commit f4db07d Merge: 1000067 01823d1 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Fri Feb 13 11:26:57 2026 +0800 Merge pull request volcano-sh#5039 from volcano-sh/copilot/update-kubernetes-compatibility-map Reorganize Kubernetes compatibility matrix: show 5 latest versions with newest first commit 01823d1 Author: Jesse Stutler <jesseincomparable@hotmail.com> Date: Fri Feb 13 10:20:07 2026 +0800 Add version compatibility archive and update README with reference link commit 1000067 Merge: 6ba0e29 56c6901 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Thu Feb 12 16:49:57 2026 +0800 Merge pull request volcano-sh#5038 from JesseStutler/fix_4871 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. commit f22bea4 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 08:42:26 2026 +0000 Reorganize Kubernetes compatibility table: show only 5 latest versions with newest on top and left Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit 56c6901 Author: JesseStutler <chenzicong4@huawei.com> Date: Thu Feb 12 16:05:35 2026 +0800 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com> commit 9125f3e Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:15:55 2026 +0000 Add Kubernetes compatibility for Volcano v1.13 and v1.14 Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit f4e0f66 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:14:39 2026 +0000 Initial plan Signed-off-by: devzizu <jazevedo960@gmail.com>
commit f4db07d Merge: 1000067 01823d1 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Fri Feb 13 11:26:57 2026 +0800 Merge pull request volcano-sh#5039 from volcano-sh/copilot/update-kubernetes-compatibility-map Reorganize Kubernetes compatibility matrix: show 5 latest versions with newest first commit 01823d1 Author: Jesse Stutler <jesseincomparable@hotmail.com> Date: Fri Feb 13 10:20:07 2026 +0800 Add version compatibility archive and update README with reference link commit 1000067 Merge: 6ba0e29 56c6901 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Thu Feb 12 16:49:57 2026 +0800 Merge pull request volcano-sh#5038 from JesseStutler/fix_4871 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. commit f22bea4 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 08:42:26 2026 +0000 Reorganize Kubernetes compatibility table: show only 5 latest versions with newest on top and left Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit 56c6901 Author: JesseStutler <chenzicong4@huawei.com> Date: Thu Feb 12 16:05:35 2026 +0800 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com> commit 9125f3e Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:15:55 2026 +0000 Add Kubernetes compatibility for Volcano v1.13 and v1.14 Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit f4e0f66 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:14:39 2026 +0000 Initial plan Signed-off-by: devzizu <jazevedo960@gmail.com>
commit f4db07d Merge: 1000067 01823d1 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Fri Feb 13 11:26:57 2026 +0800 Merge pull request volcano-sh#5039 from volcano-sh/copilot/update-kubernetes-compatibility-map Reorganize Kubernetes compatibility matrix: show 5 latest versions with newest first commit 01823d1 Author: Jesse Stutler <jesseincomparable@hotmail.com> Date: Fri Feb 13 10:20:07 2026 +0800 Add version compatibility archive and update README with reference link commit 1000067 Merge: 6ba0e29 56c6901 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Thu Feb 12 16:49:57 2026 +0800 Merge pull request volcano-sh#5038 from JesseStutler/fix_4871 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. commit f22bea4 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 08:42:26 2026 +0000 Reorganize Kubernetes compatibility table: show only 5 latest versions with newest on top and left Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit 56c6901 Author: JesseStutler <chenzicong4@huawei.com> Date: Thu Feb 12 16:05:35 2026 +0800 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com> commit 9125f3e Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:15:55 2026 +0000 Add Kubernetes compatibility for Volcano v1.13 and v1.14 Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit f4e0f66 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:14:39 2026 +0000 Initial plan Signed-off-by: devzizu <jazevedo960@gmail.com>
commit f4db07d Merge: 1000067 01823d1 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Fri Feb 13 11:26:57 2026 +0800 Merge pull request volcano-sh#5039 from volcano-sh/copilot/update-kubernetes-compatibility-map Reorganize Kubernetes compatibility matrix: show 5 latest versions with newest first commit 01823d1 Author: Jesse Stutler <jesseincomparable@hotmail.com> Date: Fri Feb 13 10:20:07 2026 +0800 Add version compatibility archive and update README with reference link commit 1000067 Merge: 6ba0e29 56c6901 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Thu Feb 12 16:49:57 2026 +0800 Merge pull request volcano-sh#5038 from JesseStutler/fix_4871 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. commit f22bea4 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 08:42:26 2026 +0000 Reorganize Kubernetes compatibility table: show only 5 latest versions with newest on top and left Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit 56c6901 Author: JesseStutler <chenzicong4@huawei.com> Date: Thu Feb 12 16:05:35 2026 +0800 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com> commit 9125f3e Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:15:55 2026 +0000 Add Kubernetes compatibility for Volcano v1.13 and v1.14 Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit f4e0f66 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:14:39 2026 +0000 Initial plan Signed-off-by: devzizu <jazevedo960@gmail.com>
commit f4db07d Merge: 1000067 01823d1 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Fri Feb 13 11:26:57 2026 +0800 Merge pull request volcano-sh#5039 from volcano-sh/copilot/update-kubernetes-compatibility-map Reorganize Kubernetes compatibility matrix: show 5 latest versions with newest first commit 01823d1 Author: Jesse Stutler <jesseincomparable@hotmail.com> Date: Fri Feb 13 10:20:07 2026 +0800 Add version compatibility archive and update README with reference link commit 1000067 Merge: 6ba0e29 56c6901 Author: Volcano Bot <49986348+volcano-sh-bot@users.noreply.github.com> Date: Thu Feb 12 16:49:57 2026 +0800 Merge pull request volcano-sh#5038 from JesseStutler/fix_4871 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. commit f22bea4 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 08:42:26 2026 +0000 Reorganize Kubernetes compatibility table: show only 5 latest versions with newest on top and left Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit 56c6901 Author: JesseStutler <chenzicong4@huawei.com> Date: Thu Feb 12 16:05:35 2026 +0800 Fixed issue where jobs with subgroups but not hard networkTopology.mode could not be scheduled. Signed-off-by: zhengchenyu <zhengchenyu16@163.com> Signed-off-by: JesseStutler <chenzicong4@huawei.com> commit 9125f3e Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:15:55 2026 +0000 Add Kubernetes compatibility for Volcano v1.13 and v1.14 Co-authored-by: JesseStutler <38534065+JesseStutler@users.noreply.github.com> commit f4e0f66 Author: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Date: Thu Feb 12 06:14:39 2026 +0000 Initial plan Signed-off-by: devzizu <jazevedo960@gmail.com>

What type of PR is this?
/kind bug
What this PR does / why we need it:
When networkTopology.mode is not hard, but subgroups exist, scheduling is not possible, we need to support it.
This pr is inherited from #4872
Which issue(s) this PR fixes:
Fixes #4871
Special notes for your reviewer:
Does this PR introduce a user-facing change?