tests/xtimer_usleep: fail with negative offsets #11493

fjmolinas · 2019-05-07T06:54:02Z

Contribution description

This PR modifies the xtimer_usleep automated test to recognize negative offsets with respect to the target sleep time.

When reviewing #11044 I realized tests/xtimer_usleep ~~wasn't passing because the python script was not recognizing negative integers as acceptable values for the timer offset, this is done now.~~ was failing because of a pexpect timeout and not because of negative offsets (sleeping less than the specified time).

Testing procedure

With frdm-kw41z or usb-kw41z (or any board which produces a negative offset with respect to the target usleep time)

→ make -C tests/xtimer_usleep BOARD=usb-kw41z flash test

~~Failed without this PR, succeeds with it.~~ Fails correctly with this PR.

Issues/PRs references

cladmi

The issue is not the test; the test is correct.
When we say sleep 10us it should sleep at least ~~10s~~ 10us.

There is an issue in sys/include/xtimer/tick_conversion.h: for some configurations, if you ask for the number of ticks for 1s, it returns fewer ticks than 1s requires, so it sleeps 0.9-something seconds, which is wrong.
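To make the shortfall concrete, here is a minimal sketch. It is not the code from tick_conversion.h: it assumes a 32768 Hz timer and a conversion that truncates, and the names `TIMER_HZ`, `us_to_ticks_floor`, `us_to_ticks_ceil` and `ticks_to_us` are hypothetical.

```python
# Illustrative only: assumes a 32768 Hz timer and truncating integer
# division; this is not RIOT's actual conversion code.
TIMER_HZ = 32768
US_PER_SEC = 1_000_000


def us_to_ticks_floor(us):
    """Convert microseconds to ticks, rounding down (may sleep too little)."""
    return (us * TIMER_HZ) // US_PER_SEC


def us_to_ticks_ceil(us):
    """Convert microseconds to ticks, rounding up (never sleeps too little)."""
    return (us * TIMER_HZ + US_PER_SEC - 1) // US_PER_SEC


def ticks_to_us(ticks):
    """Whole microseconds that `ticks` timer ticks actually last."""
    return (ticks * US_PER_SEC) // TIMER_HZ


if __name__ == "__main__":
    for target in (10_000, 50_000, 98_765):
        short = ticks_to_us(us_to_ticks_floor(target))
        safe = ticks_to_us(us_to_ticks_ceil(target))
        print(f"target {target} us: floor sleeps {short} us, ceil sleeps {safe} us")
```

Under these assumptions, a 10000 us target truncates to 327 ticks, which last only about 9979 us, while rounding up gives 328 ticks and a sleep slightly longer than requested.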

fjmolinas · 2019-05-07T12:05:52Z

Aha! Thanks for the clarification @cladmi. When you say there is an issue with sys/include/xtimer/tick_conversion.h, are you talking about a specific open issue? I couldn't find one.

I'll close this PR after pointing to or opening an issue (if there isn't one already).

kaspar030 · 2019-05-07T12:08:33Z

> When we say sleep 10us it should sleep at least 10s.

More like "as close to 10us as possible". 9.99us is more accurate than 10.02.

fjmolinas · 2019-05-07T12:43:56Z

> More like "as close to 10us as possible". 9.99us is more accurate than 10.02.

@cladmi @kaspar030 So what is actually the expected behavior:
A) sleep as close as possible to the target time
B) sleep at least the target time

This is not clear from the documentation. Whichever is the case, I think we should make it more explicit.

cladmi · 2019-05-07T12:56:48Z

> > When we say sleep 10us it should sleep at least ~~10s~~ 10us.
>
> More like "as close to 10us as possible". 9.99us is more accurate than 10.02.

So in a time-slotted networking protocol, if I say:

next_timeslot_start = now + convert(10ms);
sleep_until(next_timeslot_start); // or with a delay, it's just to give the idea
send_packet();

I may just send in the previous timeslot every time.

I do not feel it more accurate to send a message before I am allowed to.
I do not feel it more accurate to write to hardware before it's ready.
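A small sketch of the timeslot argument, with made-up numbers: the slot length `SLOT_US` and the helper names are hypothetical, and the sender is assumed to sleep from its current time until the start of the next slot.

```python
# Illustrative only: slot length and wake-up times are made-up numbers.
SLOT_US = 10_000  # hypothetical timeslot length in microseconds


def slot_index(t_us):
    """Return the index of the timeslot that contains time t_us."""
    return t_us // SLOT_US


def sends_in_intended_slot(now_us, actual_sleep_us):
    """True if a sender that sleeps until the next slot wakes up inside it."""
    intended = slot_index(now_us) + 1
    return slot_index(now_us + actual_sleep_us) == intended


if __name__ == "__main__":
    # Sleeping 21 us too little still lands in the previous slot.
    print(sends_in_intended_slot(0, 9_979))
    # Sleeping slightly too long lands in the intended slot.
    print(sends_in_intended_slot(0, 10_009))
```

Waking up even a fraction of a tick early puts `send_packet()` in the slot before the one that was intended, whereas waking up slightly late only costs a little of the intended slot.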

That MUST be documented in the API, which should also explain how to proceed if one wants to be sure that one can do:

t0 = now();
timer_set(10ms, callback, arg=t0);

and have in the callback:
t0 = arg;
assert(now() > t0 + 10ms);

From the timer libraries I remember, they all lean towards ensuring that the timer fires at least after, or waits longer than, the time specified. Like the POSIX guarantees.

kaspar030 · 2019-05-07T13:56:46Z

> So in a time-slotted networking protocol, if I say:

I think we need to keep the timescales in context. I'd expect my timer subsystem to quantize any real time interval to the closest hardware timer tick instead of always rounding up. That would make it at most +/- half a tick wrong, being spot on on average. Always rounding up would mean it is between 0 and a whole tick late (half a tick on average).

Your argumentation for a timeslot would shift the problem to the end of the timeslot (always rounding up might make the protocol exceed its timeslot every time).

In practice, we're talking +/- 0-15us vs +0-30us for a 32kHz timer quantized to 1000000us. The better option here is to either work with the low-level ticks or choose a faster timer. Or choose one that corrects in software via spinning, but that is inefficient.
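The two rounding policies can be compared numerically. The following is a sketch under the assumption of a 32768 Hz timer; `quantization_error_us` and `error_range` are hypothetical helpers, not part of xtimer.

```python
import math

TIMER_HZ = 32768  # assumed 32 kHz-class timer
TICK_US = 1_000_000 / TIMER_HZ  # about 30.5 us per tick


def quantization_error_us(interval_us, rounding):
    """Error (actual - requested) in us after rounding to whole ticks."""
    exact_ticks = interval_us / TICK_US
    ticks = rounding(exact_ticks)
    return ticks * TICK_US - interval_us


def error_range(rounding, intervals):
    """Smallest and largest error over all requested intervals."""
    errors = [quantization_error_us(us, rounding) for us in intervals]
    return min(errors), max(errors)


if __name__ == "__main__":
    intervals = range(1_000, 2_000)
    print("nearest:", error_range(round, intervals))
    print("up:     ", error_range(math.ceil, intervals))
```

Rounding to the nearest tick keeps the error within half a tick in either direction, so some wake-ups are early. Rounding up keeps the error between zero and one tick, so the wake-up is never early.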

I think we agree that the timer peripheral itself must not sleep too little. Is that the issue here?

fjmolinas · 2019-05-07T14:20:43Z

> I think we agree that the timer peripheral itself must not sleep too little. Is that the issue here?

What is too little? Is -15us too little?

cladmi · 2019-05-07T15:03:19Z

Returning early, by whatever amount, is not what I would expect as a user, and it is not what the API says.

If my resolution is 30us, I know I cannot get better precision than a few multiples of 30us. I should also take all the delays into account to know what I get in the worst case, and I must deal with the worst case because it can happen. If my precision is lower than what I need, I am screwed.

The current usage assumes there is a way to get a timer that fires only after the time has expired.
The sema module relies on this, and it is used by posix_semaphore.

tests/posix_semaphore on frdm-kw41z.

2019-05-07 16:20:51,089 - INFO # ######################### TEST4:
2019-05-07 16:20:51,090 - INFO # first: sem_init s1
2019-05-07 16:20:51,094 - INFO # first: wait 1 sec for s1
2019-05-07 16:20:52,093 - INFO # first: timed out
2019-05-07 16:20:52,098 - INFO # first: waited only 999970 usec => FAILED
2019-05-07 16:20:52,099 - INFO # ######################### DONE

And here it also timed out, this time about 30us late:

2019-05-07 16:37:30,761 - INFO # ######################### TEST4:
2019-05-07 16:37:30,761 - INFO # first: sem_init s1
2019-05-07 16:37:30,765 - INFO # first: wait 1 sec for s1
2019-05-07 16:37:31,764 - INFO # first: timed out
2019-05-07 16:37:31,769 - INFO # first: waited 1000031 usec
2019-05-07 16:37:31,770 - INFO # ######################### DONE

So code using xtimer must assume a tick of difference both in the future and in the past, not only for sleep but also for xtimer_set, i.e. for callbacks run in interrupt context.

And for the precision, you can only assume the guarantee that holds in the worst case. If one high priority thread can do small sleeps, a lower priority thread should already assume it can be descheduled for XTIMER_BACKOFF ticks at any time, as the other thread could have xtimer_usleep spin for up to XTIMER_BACKOFF ticks.
And if this happens just before calling the xtimer_usleep/xtimer_set function, it would be at least 5 ticks after what is expected, as it is a relative time.

With timing, I would only consider the worst case.

miri64 · 2019-05-07T16:40:25Z

Murdock has been hanging on this PR for over 8 hours, so I requeued it.

MrKevinWeiss · 2019-05-08T14:23:43Z

Just to chime in, I think the timer should wait at least the time specified (both for sleep functions and for timer periph functions). One case: if you expect something to be triggered after your sleep but you wake up too early, you take the value of a previous event.

If everyone agrees on that, then we probably shouldn't merge the test change. If everyone disagrees (at least from the sleep perspective), we should update the docs to say this is the behaviour of sleep, since my assumption for sleep is "at least", not "about"... Maybe update the docs either way...

MichelRottleuthner · 2019-05-08T16:04:21Z

> (...) a lower priority thread should already assume it can be descheduled for XTIMER_BACKOFF ticks at any time (...)

This illustrates why trying to hit the point as precisely as possible (maybe ahead of time) doesn't really gain anything.

> That would make it at most +/- half a tick wrong, being spot on on average.

This would only be true if there were no delays introduced such as with the above effect of (de)scheduling.

As "spot on" cannot be guaranteed anyway, I'd also vote for "wait at least for the provided duration".
That way we have at least a well-defined guarantee in one direction. When very precise timing is required, sleep functions also wouldn't be the go-to solution.

kaspar030 · 2019-05-09T07:11:54Z

> As "spot on" cannot be guaranteed anyway, I'd also vote for "wait at least for the provided duration".
> That way we have at least a well-defined guarantee in one direction.

Ok, you guys have convinced me. +1 for never sleeping too little, as in, Tafter >= Tbefore + interval.

fjmolinas · 2019-05-13T09:13:26Z

OK, so we agree on the expected behaviour of xtimer_usleep(TARGET_TIME). I still think the test is wrong: it should recognize the negative offset but fail because of it. Right now the test defines an upper and lower 5% error margin for the sleep time (INTERNAL_JITTER).

Nonetheless, since the expect regex does not capture negative values, the test fails on any negative offset, but not in the right way. I have changed the test so that the lower bound is equal to TARGET_TIME and the upper bound keeps the 5% margin.

The result is having the test fail as:

2019-05-13 11:02:14,078 - INFO # main(): This is RIOT! (Version: 2019.07-devel-162-gcd0ab-pr_xtimer_doc)
2019-05-13 11:02:14,081 - INFO # Running test 5 times with 7 distinct sleep times
2019-05-13 11:02:14,083 - INFO # Please hit any key and then ENTER to continue
a
2019-05-13 11:02:14,155 - INFO # Slept for 9979 us (expected: 10000 us) Offset: -21 us
Invalid timeout 9979 ,expected 10000 < timeout < 10500
Host max error  500
error           -21

Instead of:

2019-05-13 11:03:02,812 - INFO # Slept for 12116 us (expected: 12122 us) Offset: -6 us
2019-05-13 11:03:02,922 - INFO # Slept for 98754 us (expected: 98765 us) Offset: -11 us
2019-05-13 11:03:02,997 - INFO # Slept for 74981 us (expected: 75000 us) Offset: -19 us
2019-05-13 11:03:02,998 - INFO # Test ran for 1732575 us
Timeout in expect script at "child.expect(u"Slept for (\\d+) us \\(expected: (\\d+) us\\) Offset: (\\d+) us")" (tests/xtimer_usleep/tests/01-run.py:36)

@cladmi @kaspar030 Does this change make sense?
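The check described in this comment can be sketched roughly as follows. This is not the actual tests/xtimer_usleep/tests/01-run.py: it uses the standard `re` module instead of pexpect, `check_sleep_line` and `LINE_RE` are hypothetical names, and treating a sleep of exactly the target time as passing is an assumption.

```python
import re

# Assumed 5% upper margin, mirroring the INTERNAL_JITTER mentioned above.
INTERNAL_JITTER = 0.05

# "-?" lets the offset group match negative values such as "-21".
LINE_RE = re.compile(
    r"Slept for (\d+) us \(expected: (\d+) us\) Offset: (-?\d+) us"
)


def check_sleep_line(line):
    """Return (ok, slept, expected, offset) for one line of test output."""
    match = LINE_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognized output line: {line!r}")
    slept, expected, offset = (int(group) for group in match.groups())
    upper = expected * (1 + INTERNAL_JITTER)
    # Lower bound is the target itself: sleeping less than requested fails.
    ok = expected <= slept <= upper
    return ok, slept, expected, offset


if __name__ == "__main__":
    line = "Slept for 9979 us (expected: 10000 us) Offset: -21 us"
    print(check_sleep_line(line))
```

With the optional minus sign in the last group, a line with a negative offset is parsed and then rejected by the bounds check, instead of never matching and ending in an expect timeout.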

cladmi · 2019-05-13T16:12:26Z

I am running the test on many boards right now.

I would like to have the explanation that the time for usleep should not be in the past emphasized in the commit description too.

Otherwise I agree with the change.

cladmi · 2019-05-13T16:19:48Z

I ran the test successfully on:

arduino-mega2560 frdm-k64f iotlab-m3 msba2 mulle native nrf52dk nucleo-f103rb pba-d-01-kw2x sltb001a stm32f3discovery

And it failed as expected on frdm-kw41z.

Only need to update commit message, otherwise I agree.

- xtimer_usleep(timeout) should sleep for at least timeout us. Negative offset, i.e. sleeping less than the specified time is incorrect.

fjmolinas · 2019-05-14T07:07:14Z

> I would like to have the explanation that the time for usleep should not be in the past emphasized in the commit description too.

@cladmi not sure what you mean by not being in the past; in xtimer_usleep(timeout), timeout is always in the future. I did add more detail in the commit, stating that sleeping less than the specified time is incorrect behavior.

cladmi · 2019-05-14T10:44:38Z

Sorry, I meant in the past from the expected end time, your comment said it correctly.

cladmi

ACK, reviewed and tested. It now fails with an appropriate message on the frdm-kw41z.

fjmolinas added the Type: bug and Area: tests labels May 7, 2019

fjmolinas requested review from aabadie and cladmi May 7, 2019 06:54

fjmolinas mentioned this pull request May 7, 2019

boards/kw41z*: add common configuration and use it with existing kw41z boards #11044

Merged

kaspar030 added the CI: run tests and CI: ready for build labels May 7, 2019

cladmi previously requested changes May 7, 2019

cladmi requested a review from MichelRottleuthner May 7, 2019 12:32

jcarrano added the Discussion: contested label May 8, 2019

fjmolinas force-pushed the pr_xtimer_usleep_negative branch from f2e6661 to b124a02 Compare May 13, 2019 09:11

fjmolinas changed the title ~~tests/xtimer_usleep: recognize negative offset~~ tests/xtimer_usleep: fail with negative offsets May 13, 2019

tests/xtimer_usleep: fail with negative offset

a687a26

- xtimer_usleep(timeout) should sleep for at least timeout us. Negative offset, i.e. sleeping less than the specified time is incorrect.

fjmolinas force-pushed the pr_xtimer_usleep_negative branch from b124a02 to a687a26 Compare May 14, 2019 07:03

cladmi approved these changes May 14, 2019

cladmi merged commit dde4fe0 into RIOT-OS:master May 14, 2019

cladmi mentioned this pull request May 14, 2019

API change, uart input not working anymore on previously working setups #11525

Closed

fjmolinas mentioned this pull request Jul 14, 2019

tests/xtimer_periodic_wakeup: allow for negative difference #11382

Closed

fjmolinas deleted the pr_xtimer_usleep_negative branch August 7, 2019 15:43

7 participants