-
Notifications
You must be signed in to change notification settings - Fork 235
OCM-20819 | CNV-56734 | feat: prevent vCPU overcommit #2588
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
OCM-20819 | CNV-56734 | feat: prevent vCPU overcommit #2588
Conversation
|
/retest |
|
For other reviewers: here's what Claude said when I asked it to compare this PR against #2535 (the previous attempt):
@BraeTroutman can you address the scripts/generate-policy-config.py issue? Can you also provide steps on how to test this PR in stage, so we can confirm that this won't cause an issue with existing Windows VMs in production? |
492caff to
c92f4e1
Compare
fc88a88 to
ddecd42
Compare
AWS is legally responsible for having limited and accurate vCPU allocation for Microsoft License-Included customers on AWS EC2. Having a misallocation of vCPU would result in a breach of agreement between AWS and Microsoft for License-Included Windows software. This change deploys VAP on HCPs that prevents the overcommit of vCPUs for windows VMs.
ddecd42 to
23b1cea
Compare
|
LGTM for the VAP and VAP bindings from IAM perspective. It won't affect customer without WinLI label, and the VAP test cases make sense. We would better have another eye from virtualization experts regarding the @BraeTroutman for the |
jcanocan
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for taking care of this yourself! Excelent job!
deploy/srep-vap/vcpu-overcommit/104-windows-server-2016-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
deploy/srep-vap/vcpu-overcommit/104-windows-server-2016-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
deploy/srep-vap/vcpu-overcommit/104-windows-server-2016-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
deploy/srep-vap/vcpu-overcommit/105-windows-server-2019-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
deploy/srep-vap/vcpu-overcommit/105-windows-server-2019-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
deploy/srep-vap/vcpu-overcommit/106-windows-server-2022-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
deploy/srep-vap/vcpu-overcommit/106-windows-server-2022-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
deploy/srep-vap/vcpu-overcommit/107-windows-server-2025-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
deploy/srep-vap/vcpu-overcommit/107-windows-server-2025-vcpu-restrict.yaml
Outdated
Show resolved
Hide resolved
@feichashao At the moment these preferences are tightly coupled to the validating admission policies, but are temporary and will be removed once the preferences are shipped by default with CNV's next release. If it's not too much trouble, it's ideal that the VM Preference definitions remain with the VAP 🙏 |
87a73f5 to
a6b6cd2
Compare
|
LGTM from IAM domain perspective. Need approval from TL/FL. |
|
LGTM |
58cabe8 to
18899a4
Compare
|
This looks great @BraeTroutman. Please just update the README so there's no confusion about the test cases (here's a Claude-generated fix that you're welcome to use: README.md) We're going to hold this during the change freeze, but I'm +1 on merging afterwards. /hold |
|
/approve |
Co-authored-by: Javier Cano Cano <jcanocan@redhat.com>
18899a4 to
ed89fbf
Compare
|
@BraeTroutman: all tests passed! Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: abyrne55, BraeTroutman The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
feature
What this PR does / why we need it?
AWS is legally responsible for having limited and accurate vCPU allocation for Microsoft License-Included customers on AWS EC2. Having a misallocation of vCPU would result in a breach of agreement between AWS and Microsoft for License-Included Windows software. This change deploys VAP on HCPs that prevents the overcommit of vCPUs for windows VMs.
This change makes use of a dedicated ManagedCluster label that is only present on clusters with Windows LI enabled. That means all existing customer workloads will not be impacted by this change.
Which Jira/Github issue(s) this PR fixes?
Fixes #CNV-56734 #OCM-20819
Special notes for your reviewer:
Demonstration of local testing/procedure for stage validation here
Pre-checks (if applicable):
Tested latest changes against a cluster
Included documentation changes with PR
If this is a new object that is not intended for the FedRAMP environment (if unsure, please reach out to team FedRAMP), please exclude it with: