Skip to content

Improvements to the daily checker buildfarm #57

@Crola1702

Description

@Crola1702

Description

Throughout the green buildfarm subproject development, we've found some issues with current daily checker workflow. The current prioritization method of the checker workflow.

Some considerations we should have for improved daily checker workflow:

Prioritization:

  • Consecutive test failures:
    • Errors occurring three times in a row will be prioritized
    • Errors that have appeared three times at least in a 2 week windows, will also be prioritized
  • Flaky test categorization:
    • ~7%: or errors that happened only once: Not reported
    • 7-20%: reported an announced in the weekly meetings. Consider as known issues
    • 20%-100%: reported and assigned a develioper to take a look
  • Keep track of disabled (skipped?) tests

Check buildfarm script

  • Modification to report tests failing 3 times in a row
  • Gazebo jobs should trigger a warning if failing 2 times in a row and not debbuilders
  • Warnings should be addressed, so we should monitor often jobs failing for 5 times in a row
  • We should monitor the output of a single sql script that give us jobs that dont have any success:

Sample report.

This is an example of how new reports should look like:

Sample report:

Buildfarmer log

Probably skip for saturday and sunday (show big log diff on Monday)

New X items to investigate (+/- Y): ?? No new issues!

Show new reports that didn’t existed yesterday

Build regressions:

Show only consistent regressions
For build regressions should keep just 1 time in a row

  • Issue in job : failed X times in a row

  • Issue in job : happened Y times in the last 2 weeks (flaky)

    Ignore ClosedChanel exception ones

Test regressions:

  • Issue in job : failed X times in a row
  • Issue in job : happened Y times in the last 2 weeks (flaky)

Warnings:

  • Job contains warnings (+/- X)

Continue investigating: X items (+/- Y):

Show reports that still exists from yesterday

Build regressions:

Show only consistent regressions
For build regressions should keep just 1 time in a row

  • Issue in job : failed X times in a row
  • Issue in job : happened Y times in the last 2 weeks (flaky)

Test regressions:

  • Issue in job : failed X times in a row
  • Issue in job : happened Y times in the last 2 weeks (flaky)

Warnings:

  • Job contains warnings

Old issues:

Show known issues

Jobs to check:

  • Job hasn’t passed in days

Reported issues

Integration with gh cli

  • Issue hasn’t been updated in days
  • Issue hasn’t happened in days. should check!

Disabled/Skipped tests:

  • Total: (+/- X)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions