Skip to content

Conversation

@sunzhaohua2
Copy link
Contributor

@sunzhaohua2 sunzhaohua2 commented Oct 14, 2025

Summary by CodeRabbit

  • Tests
    • Added extensive end-to-end suites for bidirectional status translation between Machine API and Cluster API, covering replicas, readiness, conditions, provider status, errors, same-name and cross-version scenarios.
  • Test Utilities
    • Added YAML-capable provider-status parsing and a unique-name generator to support parallel, cross-version tests.
  • Test Helpers
    • Introduced skip-wait variants and test-scoping adjustments to enable parallel resource creation and more reliable assertions.
  • Chores
    • Increased e2e test timeout for longer runs.

@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Oct 14, 2025
@openshift-ci-robot
Copy link

openshift-ci-robot commented Oct 14, 2025

@sunzhaohua2: This pull request references OCPCLOUD-3017 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.21.0" version, but no target version was set.

Details

In response to this:

/hold
For machineset migration split pr raise and merge

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@sunzhaohua2 sunzhaohua2 marked this pull request as draft October 14, 2025 08:09
@coderabbitai
Copy link

coderabbitai bot commented Oct 14, 2025

📝 Walkthrough

Walkthrough

Adds extensive e2e status-conversion tests for MAPI↔CAPI, YAML-based AWS providerStatus parsing and verification helpers, skip-wait MachineSet creation and readiness helpers, pervasive GinkgoHelper() insertions, a UniqueName utility, and extends the e2e Makefile timeout.

Changes

Cohort / File(s) Summary
Status conversion tests
e2e/status_conversion_test.go
New comprehensive end-to-end tests for bidirectional MAPI↔CAPI status conversion covering MachineSet and Machine flows, providerStatus (AWSMachine) translation, generation/replicas/readiness/conditions, error propagation, same-name and cross-direction scenarios, and parallel creation paths.
Machine providerStatus helpers
e2e/machine_migration_helpers.go
Imported sigs.k8s.io/yaml; added getAWSProviderStatusFromMachine (unmarshal AWSMachineProviderStatus from Machine.Status.ProviderStatus.Raw) and verifyMAPIMachineProviderStatus (Gomega verification). Added GinkgoHelper() calls and nil/YAML unmarshal checks.
MachineSet creation & wait helpers
e2e/machineset_migration_helpers.go
Added skip-wait variants createCAPIMachineSetSkipWait, createMAPIMachineSetWithAuthoritativeAPISkipWait, wrappers delegating to skip-wait variants, and waitForMAPIMachineSetReady. Many helpers now call GinkgoHelper() and support conditional skip-wait for parallel tests.
Common utilities
e2e/migration_common.go
Added fmt and time imports and a new public helper UniqueName(prefix string) string that appends a nanosecond timestamp for generating unique resource names.
CI / Makefile
Makefile
Increased e2e target timeout from 120m to 180m.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant Test
  participant MAPI_API as MAPI API Server
  participant CAPI_API as CAPI API Server
  participant Controllers as Sync Controllers
  participant AWS as AWS providerStatus

  Test->>MAPI_API: create MAPI MachineSet / Machine (maybe skipWait)
  Test->>CAPI_API: create CAPI MachineSet / Machine (parallel)
  MAPI_API->>Controllers: notify desired state / status changes
  CAPI_API->>Controllers: notify desired state / status changes
  Controllers->>AWS: reconcile provider resources, obtain providerStatus
  AWS-->>Controllers: return AWSMachineProviderStatus
  Controllers->>MAPI_API: update Machine.Status.ProviderStatus.Raw
  Controllers->>CAPI_API: update MachineSet/Machine.Status and Conditions
  Test->>MAPI_API: read ProviderStatus raw
  Test->>CAPI_API: read Status / Conditions
  Test->>Test: verify via YAML unmarshal and Gomega matchers
Loading

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

🐇
I nibble YAML, whiskers all a-quiver,
GinkgoHelper hops where tests deliver.
Machines mirror, statuses entwine,
I stitch their bits with a carrot-line,
Hooray — migrations hop and shiver.

🚥 Pre-merge checks | ✅ 2 | ❌ 1
❌ Failed checks (1 warning)
Check name Status Explanation Resolution
Docstring Coverage ⚠️ Warning Docstring coverage is 77.42% which is insufficient. The required threshold is 80.00%. Write docstrings for the functions missing them to satisfy the coverage threshold.
✅ Passed checks (2 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title clearly and specifically describes the main change: implementing status syncing for MAPI/CAPI migration with comprehensive e2e testing automation, which aligns with the substantial additions of status conversion tests and helper functions.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing touches
  • 📝 Generate docstrings
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 golangci-lint (2.5.0)

Error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/docs/product/migration-guide for migration instructions
The command is terminated due to an error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/docs/product/migration-guide for migration instructions


Comment @coderabbitai help to get the list of available commands and usage tips.

@openshift-ci openshift-ci bot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. labels Oct 14, 2025
@openshift-ci openshift-ci bot requested review from RadekManak and mdbooth October 14, 2025 08:11
@openshift-ci-robot
Copy link

openshift-ci-robot commented Oct 21, 2025

@sunzhaohua2: This pull request references OCPCLOUD-3017 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.21.0" version, but no target version was set.

Details

In response to this:

/hold
Waiting for machineset migration split pr to merge

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci-robot
Copy link

openshift-ci-robot commented Dec 22, 2025

@sunzhaohua2: This pull request references OCPCLOUD-3017 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@sunzhaohua2 sunzhaohua2 marked this pull request as ready for review December 22, 2025 15:04
@openshift-ci openshift-ci bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Dec 22, 2025
@openshift-ci openshift-ci bot requested a review from damdo December 22, 2025 15:04
@openshift-ci-robot
Copy link

openshift-ci-robot commented Dec 22, 2025

@sunzhaohua2: This pull request references OCPCLOUD-3017 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Summary by CodeRabbit

  • Tests
  • Added comprehensive end-to-end test suite validating status field translation and synchronization across API versions.
  • Introduced testing helpers for enhanced provider status verification and error scenario handling.
  • Extended test coverage for both success paths and failure scenarios in resource conversion workflows.

✏️ Tip: You can customize this high-level summary in your review settings.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Nitpick comments (1)
e2e/machine_migration_helpers.go (1)

346-354: Using Expect inside a transform function defeats Eventually retry semantics.

When getAWSProviderStatusFromMachine is used as a transform function within Eventually, any Expect failure will cause an immediate test failure instead of allowing Eventually to retry. If ProviderStatus.Raw is nil or YAML unmarshalling fails on an early attempt (before status is populated), the test fails prematurely.

Consider returning an error-indicating value that the matcher can handle, or restructure to allow retries.

🔎 Proposed fix
 // getAWSProviderStatusFromMachine extracts and unmarshals the AWSMachineProviderStatus from a MAPI Machine.
 func getAWSProviderStatusFromMachine(mapiMachine *mapiv1beta1.Machine) *mapiv1beta1.AWSMachineProviderStatus {
-	Expect(mapiMachine.Status.ProviderStatus.Raw).ToNot(BeNil())
+	if mapiMachine.Status.ProviderStatus == nil || mapiMachine.Status.ProviderStatus.Raw == nil {
+		return nil
+	}
 
 	providerStatus := &mapiv1beta1.AWSMachineProviderStatus{}
-	Expect(yaml.Unmarshal(mapiMachine.Status.ProviderStatus.Raw, providerStatus)).To(Succeed())
+	if err := yaml.Unmarshal(mapiMachine.Status.ProviderStatus.Raw, providerStatus); err != nil {
+		return nil
+	}
 
 	return providerStatus
 }

Then update the matcher usage to handle nil:

verifyMAPIMachineProviderStatus(mapiMachine,
    SatisfyAll(
        Not(BeNil()),
        HaveField("InstanceState", HaveValue(Equal(...))),
    ),
)
📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Cache: Disabled due to data retention organization setting

Knowledge base: Disabled due to data retention organization setting

📥 Commits

Reviewing files that changed from the base of the PR and between 10f36ad and b56da67.

📒 Files selected for processing (2)
  • e2e/machine_migration_helpers.go
  • e2e/status_conversion_test.go
🔇 Additional comments (12)
e2e/machine_migration_helpers.go (1)

22-22: LGTM!

Import addition is appropriate for the new YAML unmarshalling functionality.

e2e/status_conversion_test.go (11)

1-21: LGTM!

Imports are well-organized and appropriate for the comprehensive status conversion testing.


23-32: LGTM!

Proper platform and feature gate guards ensure tests only run in appropriate environments.


34-57: LGTM!

Well-structured test setup with proper cleanup via DeferCleanup.


66-84: Consider potential stale reference issue.

Line 75 compares against mapiMachineSet.Status.ReadyReplicas which is captured when the Eventually on line 68-71 completes. However, since komega.Object(mapiMachineSet) updates the object in place, this should work correctly. Just noting this pattern for awareness.


138-150: Good documentation.

The comment clearly explains why only the Paused condition is verified for paused MachineSets.


461-491: Good practice: Pending tests with bug references.

Using PIt with explicit bug URLs (OCPBUGS-63340) is the right approach for known issues that are being tracked.


557-568: Address comparison assumes ordering.

The index-based address comparison (cm.Status.Addresses[i] vs mapiMachine.Status.Addresses[i]) will fail if addresses are returned in different orders between MAPI and CAPI. If ordering isn't guaranteed by the conversion logic, consider using ContainElements or sorting before comparison.

This is acceptable if the conversion preserves order, but worth verifying.


647-667: LGTM!

Good coverage of providerStatus conversion including instanceState verification.


765-791: Pending test appropriately marked with bug reference.

The PIt with OCPBUGS-70136 reference correctly marks this as pending until the underlying bug is fixed.


777-790: Note: This test uses the helper function with the Expect issue.

When this pending test is enabled, be aware that verifyMAPIMachineProviderStatus uses getAWSProviderStatusFromMachine which has Expect assertions inside a transform function. Applying the fix suggested in machine_migration_helpers.go will be needed.


805-816: Good use of PIt for pending tests with bug tracking.

Multiple tests appropriately marked as pending with OCPBUGS-63183 reference for the same-name scenario.

Also applies to: 832-842, 844-869

@sunzhaohua2
Copy link
Contributor Author

/test e2e-aws-capi-techpreview

@openshift-ci-robot
Copy link

openshift-ci-robot commented Dec 23, 2025

@sunzhaohua2: This pull request references OCPCLOUD-3017 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Summary by CodeRabbit

  • Tests
  • Added a comprehensive end-to-end test suite validating status translation and synchronization between Machine API and Cluster API across many scenarios (replicas, conditions, phases, provider status, error propagation).
  • Introduced testing helpers to verify provider status transformations and YAML-based providerStatus decoding.
  • Expanded coverage for success paths, failure scenarios, cross-namespace and cross-authority conversions.

✏️ Tip: You can customize this high-level summary in your review settings.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Nitpick comments (2)
e2e/status_conversion_test.go (2)

557-570: Consider order-independent address comparison.

The current comparison assumes addresses are in the same order in both MAPI and CAPI. If address ordering is not guaranteed to be preserved during conversion, this could cause flaky test failures.

🔎 Suggested order-independent comparison
 Eventually(komega.Object(capiMachine), capiframework.WaitMedium, capiframework.RetryMedium).Should(
     WithTransform(func(cm *clusterv1.Machine) bool {
         if len(cm.Status.Addresses) != len(mapiMachine.Status.Addresses) {
             return false
         }
-        for i, mapiAddr := range mapiMachine.Status.Addresses {
-            capiAddr := cm.Status.Addresses[i]
-            if string(capiAddr.Type) != string(mapiAddr.Type) || capiAddr.Address != mapiAddr.Address {
-                return false
+        for _, mapiAddr := range mapiMachine.Status.Addresses {
+            found := false
+            for _, capiAddr := range cm.Status.Addresses {
+                if string(capiAddr.Type) == string(mapiAddr.Type) && capiAddr.Address == mapiAddr.Address {
+                    found = true
+                    break
+                }
+            }
+            if !found {
+                return false
             }
         }
         return true
     }, BeTrue()),

736-749: Consider order-independent address comparison here as well.

Same concern as the MAPI→CAPI address comparison - the index-based comparison assumes preserved ordering.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Cache: Disabled due to data retention organization setting

Knowledge base: Disabled due to data retention organization setting

📥 Commits

Reviewing files that changed from the base of the PR and between b56da67 and 44fc78c.

📒 Files selected for processing (2)
  • e2e/machine_migration_helpers.go
  • e2e/status_conversion_test.go
🚧 Files skipped from review as they are similar to previous changes (1)
  • e2e/machine_migration_helpers.go
🧰 Additional context used
🧬 Code graph analysis (1)
e2e/status_conversion_test.go (5)
e2e/framework/util.go (1)
  • IsMachineAPIMigrationEnabled (41-76)
pkg/conversion/mapi2capi/interface.go (2)
  • MachineSet (29-31)
  • Machine (24-26)
e2e/framework/framework.go (7)
  • WaitLong (25-25)
  • RetryLong (19-19)
  • WaitMedium (24-24)
  • RetryMedium (18-18)
  • WaitShort (23-23)
  • RetryShort (17-17)
  • CAPINamespace (14-14)
e2e/framework/machine.go (2)
  • GetMachine (75-86)
  • GetAWSMachine (61-72)
e2e/framework/machineset.go (3)
  • WaitForMachineSet (137-173)
  • ScaleCAPIMachineSet (224-235)
  • GetNewestMachineFromMachineSet (204-221)
🔇 Additional comments (8)
e2e/status_conversion_test.go (8)

1-21: LGTM!

Imports are well-organized and appropriate for the e2e test suite. The use of dot imports for Ginkgo/Gomega is a common pattern in Kubernetes e2e tests.


23-32: LGTM!

Good use of skip conditions to ensure tests only run on AWS platforms with the MachineAPIMigration feature enabled. The Ordered decorator ensures deterministic test execution.


66-84: LGTM!

The pattern of waiting for MAPI status to stabilize and then verifying CAPI synchronization is correct. The komega.Object() matcher refreshes the object on each poll, ensuring fresh values are used.


205-230: LGTM!

The error status conversion test correctly handles the case where either ErrorReason or ErrorMessage (or both) may be set. The conditional assertions ensure proper matching without failing if only one field is populated.


461-490: LGTM on pending tests with bug tracking.

Good practice to mark tests as pending (PIt) with inline bug references (OCPBUGS-63340). This maintains test coverage visibility while avoiding false negatives until the underlying issues are resolved.


794-803: LGTM!

The cross-authority scenario (CAPI Machine created first, then MAPI with ClusterAPI authority) is properly set up. Individual test cases appropriately wait for the required status fields to be populated.


34-246: Well-structured test suite for MAPI→CAPI MachineSet conversions.

The test organization is clean with:

  • Proper resource lifecycle management (BeforeAll setup, DeferCleanup teardown)
  • Clear separation between normal operation and error condition tests
  • Comprehensive coverage of status fields, conditions, and selector conversion

304-309: Access to Deprecated.V1Beta1 should use defensive patterns or explicit nil checks.

Direct access to capiMachineSet.Status.Deprecated.V1Beta1.FullyLabeledReplicas at lines 306 and 397 may panic if the pointer is not initialized. Use HaveValue() matcher (as line 81 does for ReadyReplicas) or add a nil guard in test setup to ensure the field is initialized before the test assertions.

@sunzhaohua2
Copy link
Contributor Author

/test unit

@openshift-ci-robot
Copy link

openshift-ci-robot commented Dec 23, 2025

@sunzhaohua2: This pull request references OCPCLOUD-3017 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Summary by CodeRabbit

  • Tests
  • Added a comprehensive end-to-end test suite validating status translation and synchronization between Machine API and Cluster API across many scenarios (replicas, conditions, phases, provider status, error propagation).
  • Introduced helpers to verify provider-status transformations and YAML-based providerStatus decoding; expanded coverage for success, failure, cross-namespace and cross-authority conversions.
  • Chores
  • Extended e2e test timeout to accommodate longer-running scenarios.

✏️ Tip: You can customize this high-level summary in your review settings.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (1)
Makefile (1)

60-60: Timeout increase appears reasonable given the new test coverage.

The 50% increase in e2e test timeout (120m → 180m) aligns with the comprehensive status conversion test suite being added. However, consider verifying that the full runtime approaches this limit, as unnecessarily long timeouts can delay failure feedback in CI.

Optional: If the new tests are the primary driver of increased runtime, consider whether any tests could be parallelized or optimized to reduce overall execution time.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Cache: Disabled due to data retention organization setting

Knowledge base: Disabled due to data retention organization setting

📥 Commits

Reviewing files that changed from the base of the PR and between 44fc78c and ff70d3f.

📒 Files selected for processing (1)
  • Makefile

@sunzhaohua2
Copy link
Contributor Author

/test e2e-aws-capi-techpreview

@shellyyang1989
Copy link

@sunzhaohua2 is it ready for review?

@sunzhaohua2
Copy link
Contributor Author

@sunzhaohua2 is it ready for review?

yes

})

It("should convert status.availableReplicas from MAPI to CAPI status and status.Deprecated.V1Beta1", func() {
By("Verifying CAPI MachineSet status.availableReplicas is synchronized")
Copy link
Contributor

@theobarberbany theobarberbany Jan 6, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm curious about this (thinking out loud) - we're waiting for the field to be populated, and once it is we are stating it's synchronised. This works because we start with no CAPI machine set, right, so population == sync?

Should we be looking at SynchronizedGeneration too? Given that the By( says we're waiting for synchronisation (and that's a field in status)? We likely only need to do it once, in the BeforeAll? and it may speed up the tests a little?

})

// bug https://issues.redhat.com/browse/OCPBUGS-63183
It("should convert CAPI Machine phase to MAPI Machine phase", func() {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this be a PIt() ? Or has the bug been resolved?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:) I removed the comment, we hit the bug only when the capi machine exists

.PHONY: e2e
e2e: ## Run e2e tests against active kubeconfig
./hack/test.sh "./e2e/..." 120m
./hack/test.sh "./e2e/..." 180m
Copy link
Contributor

@theobarberbany theobarberbany Jan 6, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:'(

Should we consider splitting the e2es / sharing early?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@theobarberbany I raised a pr here to add shard_count openshift/release#73182

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@theobarberbany I'd like to keep it as is for now to align with the job's time setting, I'll add the shard_count parameter to the job after the OTE pr is merged.

@openshift-ci openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Feb 2, 2026
@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 2, 2026

@sunzhaohua2: This pull request references OCPCLOUD-3017 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Summary by CodeRabbit

  • Tests
  • Added extensive end-to-end tests covering bidirectional status translation between Machine API and Cluster API (versions, conditions, provider status, error propagation, edge cases, same-name scenarios) and broadened use of the Ginkgo/Gomega DSL for clearer, scoped assertions and cleanup.
  • Test Utilities
  • Added helpers for provider-status parsing and unique-name generation to support parallel and cross-version test scenarios.
  • Chores
  • Increased e2e test timeout to accommodate longer-running scenarios.

✏️ Tip: You can customize this high-level summary in your review settings.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@sunzhaohua2
Copy link
Contributor Author

/test unit

@sunzhaohua2
Copy link
Contributor Author

/test e2e-aws-capi-techpreview

@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 2, 2026

@sunzhaohua2: This pull request references OCPCLOUD-3017 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Summary by CodeRabbit

  • Tests
  • Added comprehensive end-to-end tests validating bidirectional status translation between Machine API and Cluster API (versions, conditions, provider status, error propagation, same-name and edge-case scenarios).
  • Test Utilities
  • Added helpers for provider-status parsing and unique-name generation to support parallel and cross-version test scenarios.
  • Test Helpers
  • Introduced skip-wait variants and Ginkgo scoping adjustments to enable parallel resource creation and more reliable test assertions.
  • Chores
  • Increased e2e test timeout for longer-running scenarios.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@sunzhaohua2
Copy link
Contributor Author

/test e2e-aws-capi-techpreview

@sunzhaohua2 sunzhaohua2 changed the title OCPCLOUD-3017: e2e testing automation: Implement Status Syncing for MAPI/CAPI migration OCPCLOUD-3355: e2e testing automation: Implement Status Syncing for MAPI/CAPI migration Feb 2, 2026
@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 2, 2026

@sunzhaohua2: This pull request references OCPCLOUD-3355 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Summary by CodeRabbit

  • Tests
  • Added comprehensive end-to-end tests validating bidirectional status translation between Machine API and Cluster API (versions, conditions, provider status, error propagation, same-name and edge-case scenarios).
  • Test Utilities
  • Added helpers for provider-status parsing and unique-name generation to support parallel and cross-version test scenarios.
  • Test Helpers
  • Introduced skip-wait variants and Ginkgo scoping adjustments to enable parallel resource creation and more reliable test assertions.
  • Chores
  • Increased e2e test timeout for longer-running scenarios.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 2, 2026

@sunzhaohua2: This pull request references OCPCLOUD-3355 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Summary by CodeRabbit

  • Tests
  • Added extensive end-to-end tests covering bidirectional status translation between Machine API and Cluster API (versions, conditions, provider status, error propagation, same-name and edge-case scenarios).
  • Test Utilities
  • Added utilities for provider-status parsing (including YAML-based provider status handling) and unique-name generation to support parallel and cross-version tests.
  • Test Helpers
  • Introduced skip-wait variants and test-scoping adjustments to enable parallel resource creation and more reliable assertions.
  • Chores
  • Increased e2e test timeout for longer-running scenarios.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@sunzhaohua2
Copy link
Contributor Author

/test e2e-aws-capi-techpreview

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
e2e/machineset_migration_helpers.go (1)

98-103: ⚠️ Potential issue | 🟡 Minor

Fail fast on unknown MachineAuthority values.

Without a default case, unexpected values silently skip readiness waits and can mask test issues.

🛠️ Suggested fix
@@
 switch machineAuthority {
 case mapiv1beta1.MachineAuthorityMachineAPI:
 	mapiframework.WaitForMachineSet(ctx, cl, machineSetName)
 case mapiv1beta1.MachineAuthorityClusterAPI:
 	capiframework.WaitForMachineSet(cl, machineSetName, capiframework.CAPINamespace)
+default:
+	Fail(fmt.Sprintf("unknown authoritativeAPI type: %v", machineAuthority))
 }
@@
 switch machineAuthority {
 case mapiv1beta1.MachineAuthorityMachineAPI:
 	mapiframework.WaitForMachineSet(ctx, cl, machineSetName)
 case mapiv1beta1.MachineAuthorityClusterAPI:
 	capiframework.WaitForMachineSet(cl, machineSetName, capiframework.CAPINamespace)
+default:
+	Fail(fmt.Sprintf("unknown authoritativeAPI type: %v", machineAuthority))
 }

Also applies to: 121-126

🤖 Fix all issues with AI agents
In `@e2e/status_conversion_test.go`:
- Around line 110-146: The assertions wrap non-pointer int32 replica fields with
HaveValue causing failures; update the checks against mapiMSAuthMAPI and
capiMSMirrorMAPI so that Status.ReadyReplicas, Status.AvailableReplicas and
Status.Deprecated.V1Beta1.{ReadyReplicas,AvailableReplicas} are matched directly
(e.g., BeNumerically(">", 0) or Equal(...)) instead of HaveValue(Equal(...));
keep the existing HaveValue usage for CAPI v1beta2 pointer fields (the
Status.ReadyReplicas checks that currently use HaveValue should remain).

@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 3, 2026

@sunzhaohua2: This pull request references OCPCLOUD-3355 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Summary by CodeRabbit

  • Tests
  • Added extensive end-to-end suites for bidirectional status translation between Machine API and Cluster API, covering replicas, readiness, conditions, provider status, errors, same-name and cross-version scenarios.
  • Test Utilities
  • Added YAML-capable provider-status parsing and a unique-name generator to support parallel, cross-version tests.
  • Test Helpers
  • Introduced skip-wait variants and test-scoping adjustments to enable parallel resource creation and more reliable assertions.
  • Chores
  • Increased e2e test timeout for longer runs.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Feb 3, 2026

@sunzhaohua2: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/okd-scos-e2e-aws-ovn b493e38 link false /test okd-scos-e2e-aws-ovn
ci/prow/e2e-openstack-ovn-techpreview 49a6d93 link true /test e2e-openstack-ovn-techpreview

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@sunzhaohua2
Copy link
Contributor Author

/test e2e-aws-capi-techpreview

@sunzhaohua2
Copy link
Contributor Author

@damdo I updated this pr to create machineset/machine in parallel, checked log, Status Conversion Tests time was reduced about 24 minutes.

In the new job 2018512646269046784 Status Conversion Tests took about 32m.

MachineSet Status Conversion: ~19m36m (04:15:32 → 04:35:08)
Machine Status Conversion: ~12m29s (04:35:08 → 04:47:37)

In the previous job 2016986669105811456 Status Conversion Tests took about 56 m.

Started: 01/30/26 00:22:03
Ended: 01/30/26 01:18:28

I also raised a pr to optimize migration e2e to reduce running time #451 ci/prow/e2e-aws-capi-techpreview e2e duration reduced from ~4h10m to 3h15m, this also covered this pr's status testing.

@damdo
Copy link
Member

damdo commented Feb 3, 2026

/assign @damdo @theobarberbany @nrb

@nrb
Copy link
Contributor

nrb commented Feb 3, 2026

The changes for speeding up the test times look good to me.

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Feb 3, 2026
@openshift-ci-robot
Copy link

Tests from second stage were triggered manually. Pipeline can be controlled only manually, until HEAD changes. Use command to trigger second stage.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. lgtm Indicates that a PR is ready to be merged.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants