add regular scheduled functions, now also callable on `yield()` #6039

d-a-v · 2019-05-02T14:19:49Z

added bool schedule_function_us(std::function<bool(void)> fn, uint32_t repeat_us)
lambda must return true to be not removed from the schedule function list
if repeat_us is 0, then the function is called only once.

Legacy schedule_function() is preserved

~~Linked list management is simplified~~

This addition allows network drivers like ethernet chips on lwIP to be regularly called

even if some user code loops on receiving data without getting out from main loop
(callable from yield())
without the need to call the driver handling function
(transparent)

This may be also applicable with common libraries (mDNS, Webserver, )

added bool schedule_function_us(std::function<bool(void)> fn, uint32_t repeat_us) lambda must return true to be not removed from the schedule function list if repeat_us is 0, then the function is called only once. Legacy schedule_function() is preserved Linked list management is simplified This addition allows network drivers like ethernet chips on lwIP to be regularly called - even if some user code loops on receiving data without getting out from main loop (callable from yield()) - without the need to call the driver handling function (transparent) This may be also applicable with common libraries (mDNS, Webserver, )

This is *dependant* on this esp8266 arduino core pull request: esp8266/Arduino#6039 Fix #3 (comment)

earlephilhower · 2019-05-02T18:31:22Z

This may be more of a scheduled_functions general question, but what about interrupt safety? One use I see for this is to put a notice into a queue in an IRQ (instead of actually doing work there). But you could also have things like repeated ones, now, in the main app. So isn't there a race condition in the list update stage (or if, say, an IRQ happens while parsing through the existing list)?

d-a-v · 2019-05-03T01:59:03Z

but what about interrupt safety?

You are right.
Per #2218 (comment), critical sections are now protected.

edit: still not sure this is the right way
edit2: is it better (per Arduino.h comment) ?

uint32_t savedPS = xt_rsil(15);
// do work here
xt_wsr_ps(savedPS); // restore the state

earlephilhower · 2019-05-03T14:20:33Z

That looks good and seems like it will preserve the last IRQ level which is what is needed.

I think you also need locking around get_fn. It's not likely (so would be ugly to debug) but you could get an IRQ in the middle of it and a call to schedule_fcn()...which will call get_fn() again and your linked list operations will go wonky.

cores/esp8266/Arduino.h

dok-net

May I bring the existence of cores/esp8266/interrupts.h to your attention? That actually gets included in Esp.cpp already.
About the use of volatile, I have some reserverations - IIRC, memory fences must be handled differently.

cores/esp8266/FunctionalInterrupt.cpp

dok-net · 2019-05-05T23:47:23Z

I've grepped across the source tree, and interrupts are disabled in one form or the other in any of these places:

libraries/EEPROM/EEPROM.cpp: noInterrupts();
libraries/ESP8266SdFat/src/SdCard/SdioTeensy.cpp: noInterrupts();
libraries/ESP8266SdFat/src/SpiDriver/DigitalPin.h: cli();
tools/sdk/lwip2/builder/glue-esp/lwip-esp.c: ets_intr_lock();
cores/esp8266/core_esp8266_timer.cpp: uint32_t savedPS = xt_rsil(15); // stop other interrupts
cores/esp8266/core_esp8266_wiring_digital.cpp: uint32_t savedPS = xt_rsil(15); // stop other interrupts
cores/esp8266/interrupts.h: _state = xt_rsil(15);

I gather, that timer and GPIO interrupt handlers by design decision ("AVR compatibility" gets mentioned) always run with interrupts disabled, reducing the changes that an ISR gets interrupted to about zero.

I think tools/sdk/lwip2/builder/glue-esp/lwip-esp.c maintains a linked list a lot like the one that's being discussed here, and stands out as having a rather large block of code during which interrupts are disabled.
I intend to take some measurements as to the effect disabling interrupts has on the receive timings of EspSoftwareSerial and report these soon, just be be on the safe side, please mind my word of caution that too much interrupt blocking adversely affects the real-time characteristics of the platform. Also, there is a chance that the data structure being used for the schedule queue is non-optimal for this case, and there might a problem with atomic operations / memory fences even in the case where interrupts get disabled. Is anyone knowledgeable of what the compiler and CPU do to register aliases during interrupts, i.e., do they get written to RAM and restored from RAM upon interrupt entry and return? I doubt that.

Now, late at night, all I can contribute toward a solution is this article, containing supposedly lock-free code for a queue:
https://stackoverflow.com/questions/871234/circular-lock-free-buffer

I will need to look into that for EspSoftwareSerial, the ISR buffer interacts with user code and has all the possibility of issues that I suggest this PR might have.

d-a-v · 2019-05-06T00:19:24Z

May I bring the existence of cores/esp8266/interrupts.h to your attention? That actually gets included in Esp.cpp already.

Sure, and thanks for it. I wasn't aware (!)

About the use of volatile, I have some reserverations - IIRC, memory fences must be handled differently.

This volatile is unnecessary, I forgot to remove it.

please mind my word of caution that too much interrupt blocking adversely affects the real-time characteristics of the platform

Sure but they are necessary to avoid races.
Lock-free queue structure is very interesting.
Maybe a new issue would have to be opened to discuss about your comment about locking:

use the best/correct API everywhere
consider lock-free linked lists
your results about measurements on the receive timings of EspSoftwareSerial when disabling interrupts

Is anyone knowledgeable of what the compiler and CPU do to register aliases during interrupts, i.e., do they get written to RAM and restored from RAM upon interrupt entry and return? I doubt that.

Registers used in a function (or all registers) are pushed on the stack and restored before the "reti".
I've seen that with gcc on other archs. I believe this is always mandatory when dealing with interrupts.

dok-net · 2019-05-06T00:54:11Z

Registers used in a function (or all registers) are pushed on the stack and restored

That's what I believe and that's the reason to use std::atomic load and store - otherwise all interrupt locking may not be much help. Or am I completely missing something?

earlephilhower · 2019-05-06T15:42:24Z

@dok-net, what specifically are you worried about?

The SDK blob IRQ wrapper stores all the registers in use before calling an IRQ function and then restores then before returning from interrupt. I don't think there's any particular concern there, it's a simple and common operation.

On function entry in main code you call IRQ-disable as the first state changing operation. That either finishes uninterrupted, or an IRQ gets called. That IRQ may call the same function, (and can't be interrupted) so will run to completion changing the linked list). On return from interrupt the main app's locking code completes and it has full access to the updated list (there is no chance of the list being cached incoherently anywhere in the machine) and can do its own work...

dok-net · 2019-05-09T14:29:22Z

@d-a-v @earlephilhower Is there a quick explanation why linked lists are used? IIRC linked lists are discouraged for use where lock-free programming is an advantage. I just too a perfunctory glance at the code spots and couldn't figure out why linked links are used - if all the reason should be to keep entries in for repeated execution, pop and re-push should do the trick, too. I've implemented a lock-free (well, on ESP32, on single-threaded ESP8266 it disable IRQs after all ;-) ) ring buffer / circular queue for EspSoftwareSerial, which I am still testing. It's multi-producer, single-consumer capable, and could be a nice basis to #6039...

earlephilhower · 2019-05-09T14:47:00Z

@dok-net I didn't do any of the coding on this, but I imagine linked lists were used to minimize heap usage when only a few (or common case: 0) delayed functions are in play. There's no atomic TAS operations on the chip, though, so even simple things like mutexes aren't doable safely w/o stopping IRQs. I would not want to guarantee that std::atomic (if available, I don't remember if it compiled or not) works as-expected with interrupts on this chip, actually.

In any case, I'm still trying to understand the concern here. Logically, I think it's fine. Are you concerned about performance (i.e. you have a use where you have to add items to the queue at high frequency, etc.? W/only 32 slots even that's a pretty bounded problem) or something else?

dok-net

Due to multiple changes, I've left a PR with explanations - d-a-v#6

cores/esp8266/Schedule.cpp

dok-net · 2019-05-21T11:07:06Z

cores/esp8266/Schedule.cpp

+{
+    return schedule_function_us([&fn](){ fn(); return false; }, 0);
+}
+
 void run_scheduled_functions()


IMHO: This linked-list implementation is not - probably never was - preemption safe, generally a compiler will keep the values of all the pointers in registers, even on the single-core ESP8266 an IRQ will not flush the registers but just push them to the stack and restore them, therefore any IRQ that's scheduling functions fails during an ongoing run_scheduled_functions(). Blocking IRQs during the complete execution of run_scheduled_functions makes it thread/IRQ safe, but I don't think this is permissible from an IRQ performance POV.

Good point.
We don't want to lock while executing the scheduled function themselvses.
One solution is to tag variables with volatile.
Another one is to build a local copy of the list while being locked, then unlock and run that list.

I think I must step back on this.
Locking IRQ is the right thing to do when the value of a variable can be modified in a TAS block.
Even if there are registers holding some variables, they will not be changed while in a locked block because an IRQ won't occur in that block, regardless whether the variable is cached in a register.

I think the compiler will not / must not optimize a variable into registers when is it not declared locally.

cores/esp8266/Schedule.cpp

dok-net · 2019-05-23T07:15:16Z

@d-a-v What happens if a scheduled function/task calls yield() etc. itself? Calling schedule_function() recursively seems safe, but what about this?
I am beginning to think that loop() should be just another (initially) scheduled function, returning true forever, what do you think? That said, wasn't there a scheduler for adding loops() in place all the time...?

https://www.arduino.cc/en/Reference/SchedulerStartLoop
https://github.com/arduino-libraries/Scheduler/

d-a-v · 2019-05-23T07:43:59Z

What happens if a scheduled function/task calls yield() etc. itself?

My thinking is these scheduled functions should be thought as interrupt functions (that are artificially shifted out from sys stack).
Only the minimal should be done in them, no delay no yield.

If it happens we need to be calling other functions that themselves call yield, then we can add a boolean fence set up in run_schedule_functions that will be checked in yield family functions.

I am beginning to think that loop() should be just another (initially) scheduled function

That can be something we think about when we will move from nonos-sdk to rtos-sdk which will happen sometimes soon enough.

That said, wasn't there a scheduler for adding loops() in place all the time...?

check loop_task() and loop_wrapper()

dok-net · 2019-05-23T07:45:57Z

@d-a-v please revisit my edited comment above regarding https://github.com/arduino-libraries/Scheduler/
I would love to stick to the Arduino idea in ESP8266/ESP32 "Arduino" :-) :-)
What seems really interesting is the stack modifications or using the same "loop" stack for all "functions". The use of assembly language in the Arduino Scheduler is holding me back - on the other hand, loop_task() and loop_wrappper() are pretty low-level and I have too little understanding of the ESP8266 Arduino internals to participate in this discussion beyond asking some obvious questions.

d-a-v · 2019-05-23T08:02:12Z

(edited: task->stack)

These schedulers are multitasking schedulers with each its own task.
When we switch to rtos-sdk, we already have that with FreeRTOS pretty much like in arduino-esp32 which you know well because of EspSoftwareSerial compatible with esp32. On esp8266 we have limited ram so maybe it's not a good idea to have a separate stack for each task (as opposed to scheduled functions running on the same stack).

dok-net · 2019-05-23T09:56:25Z

@d-a-v I've rebased d-a-v#6 - there's still something to do, one critical bug and performance related nullptr and rvalue refs.

d-a-v · 2019-05-23T10:21:09Z

one critical bug

Which is it ?

dok-net · 2019-05-23T10:24:24Z

Null pointer access in last line of get_fn_unsafe

Proposed changes from review

dok-net · 2019-05-24T08:05:55Z

@d-a-v Big oops, something we've all missed:

if (item->callNow)
{
    if (item->mFunc())
    {
         lastRecurring = item;
    }

lastRecurring should get updated if (!item->callNow) :

while (toCall)
{
    scheduled_fn_t* item = toCall;
    toCall = toCall->mNext;
    if (!item->callNow || item->mFunc())
    {
        lastRecurring = item;
    }
    else
    {
        InterruptLock lockAllInterruptsInThisScope;

        if (sFirst == item)
            sFirst = sFirst->mNext;
        else if (lastRecurring)
            lastRecurring->mNext = item->mNext;

        if (sLast == item)
            sLast = lastRecurring;

        recycle_fn_unsafe(item);
        }
    }

dok-net · 2019-05-24T08:22:15Z

@d-a-v Could we maybe agree to rename toCall as next? I've really a deep expectation in my mind that toCall is the current, to call, item, and it's making the piece of code very hard to reason about.

devyte · 2019-05-24T19:25:13Z

In that case:
toCall => nextItem
item => currentItem or currItem
or something along those lines

d-a-v · 2019-05-24T20:48:08Z

Already updated in #6137

d-a-v requested review from earlephilhower, igrr and devyte May 2, 2019 14:19

d-a-v added a commit to d-a-v/W5500lwIP that referenced this pull request May 2, 2019

transparent polling

b468cc8

This is *dependant* on this esp8266 arduino core pull request: esp8266/Arduino#6039 Fix #3 (comment)

d-a-v mentioned this pull request May 2, 2019

FunctionalInterrupt/ScheduledFunctions should be in a library, not in core_esp8266_wiring_digital #6038

Closed

d-a-v added 2 commits May 3, 2019 03:56

protect critical sections

4f09a64

Merge branch 'master' into recurrentscheduledfunctions

b6564c2

d-a-v force-pushed the recurrentscheduledfunctions branch from b58f12a to b6564c2 Compare May 3, 2019 01:56

d-a-v added 2 commits May 3, 2019 11:05

fix emulation on host, use alternate interruopt locking method

545dd35

fix emulation on host

aa4f87e

d-a-v added 3 commits May 3, 2019 23:02

critical code protection (wip)

896b686

add IRAM attrs where relevant

fac39ed

add host emulation fake defines

6e48831

dok-net mentioned this pull request May 4, 2019

Expose attachInterruptArg in Arduino.h and updated base for functional interrupts #6047

Open

d-a-v and others added 2 commits May 4, 2019 14:26

fix per esp8266#6039 (comment) @mhightower83

b9bf95b

Merge branch 'master' into recurrentscheduledfunctions

37128a2

earlephilhower reviewed May 5, 2019

View reviewed changes

cores/esp8266/Arduino.h Outdated Show resolved Hide resolved

wonderful idea with a class and its destructor for interrupt locking

36ac7dd

dok-net reviewed May 5, 2019

View reviewed changes

cores/esp8266/FunctionalInterrupt.cpp Outdated Show resolved Hide resolved

remove duplicate interrupt lock class

499d2ea

d-a-v added 4 commits May 22, 2019 10:54

uodates per review

c064a58

fixes per review

8f022c6

fixes per review

e2ad457

fix inverted logic missed from review

16c7e62

d-a-v requested a review from devyte May 22, 2019 09:19

dok-net suggested changes May 22, 2019

View reviewed changes

d-a-v and others added 4 commits May 23, 2019 10:11

fix per review #6 (1/2)

7982a7f

fix dangling pointer per #6 last point - thanks!

65603a3

pass lambdas with const refs

24474c8

Proposed changes from review

d9c2270

Initial count not applied anymore

df839c2

d-a-v added 2 commits May 23, 2019 12:34

Merge pull request #6 from dok-net/d-a-v/recurrentscheduledfunctions

40dbcc1

Proposed changes from review

cosmetics

8e06c30

d-a-v force-pushed the recurrentscheduledfunctions branch from 6b95675 to 8e06c30 Compare May 23, 2019 10:43

Merge branch 'master' into recurrentscheduledfunctions

a98cf9d

d-a-v merged commit b551992 into esp8266:master May 23, 2019

dok-net mentioned this pull request May 24, 2019

Refactor linked list to lock-free ring buffer #6139

Closed

d-a-v deleted the recurrentscheduledfunctions branch May 24, 2019 13:24

hreintke mentioned this pull request May 25, 2019

scheduled functions: fixes #6137

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add regular scheduled functions, now also callable on `yield()` #6039

add regular scheduled functions, now also callable on `yield()` #6039

d-a-v commented May 2, 2019 •

edited

Loading

earlephilhower commented May 2, 2019

d-a-v commented May 3, 2019 •

edited

Loading

earlephilhower commented May 3, 2019 •

edited

Loading

dok-net left a comment

dok-net commented May 5, 2019

d-a-v commented May 6, 2019

dok-net commented May 6, 2019 •

edited by d-a-v

Loading

earlephilhower commented May 6, 2019 •

edited

Loading

dok-net commented May 9, 2019

earlephilhower commented May 9, 2019

dok-net left a comment

dok-net May 21, 2019

d-a-v May 23, 2019

d-a-v May 23, 2019 •

edited

Loading

dok-net commented May 23, 2019 •

edited

Loading

d-a-v commented May 23, 2019

dok-net commented May 23, 2019 •

edited

Loading

d-a-v commented May 23, 2019 •

edited

Loading

dok-net commented May 23, 2019

d-a-v commented May 23, 2019

dok-net commented May 23, 2019

dok-net commented May 24, 2019 •

edited

Loading

dok-net commented May 24, 2019

devyte commented May 24, 2019 •

edited

Loading

d-a-v commented May 24, 2019

add regular scheduled functions, now also callable on yield() #6039

add regular scheduled functions, now also callable on yield() #6039

Conversation

d-a-v commented May 2, 2019 • edited Loading

earlephilhower commented May 2, 2019

d-a-v commented May 3, 2019 • edited Loading

earlephilhower commented May 3, 2019 • edited Loading

dok-net left a comment

Choose a reason for hiding this comment

dok-net commented May 5, 2019

d-a-v commented May 6, 2019

dok-net commented May 6, 2019 • edited by d-a-v Loading

earlephilhower commented May 6, 2019 • edited Loading

dok-net commented May 9, 2019

earlephilhower commented May 9, 2019

dok-net left a comment

Choose a reason for hiding this comment

dok-net May 21, 2019

Choose a reason for hiding this comment

d-a-v May 23, 2019

Choose a reason for hiding this comment

d-a-v May 23, 2019 • edited Loading

Choose a reason for hiding this comment

dok-net commented May 23, 2019 • edited Loading

d-a-v commented May 23, 2019

dok-net commented May 23, 2019 • edited Loading

d-a-v commented May 23, 2019 • edited Loading

dok-net commented May 23, 2019

d-a-v commented May 23, 2019

dok-net commented May 23, 2019

dok-net commented May 24, 2019 • edited Loading

dok-net commented May 24, 2019

devyte commented May 24, 2019 • edited Loading

d-a-v commented May 24, 2019

add regular scheduled functions, now also callable on `yield()` #6039

add regular scheduled functions, now also callable on `yield()` #6039

d-a-v commented May 2, 2019 •

edited

Loading

d-a-v commented May 3, 2019 •

edited

Loading

earlephilhower commented May 3, 2019 •

edited

Loading

dok-net commented May 6, 2019 •

edited by d-a-v

Loading

earlephilhower commented May 6, 2019 •

edited

Loading

d-a-v May 23, 2019 •

edited

Loading

dok-net commented May 23, 2019 •

edited

Loading

dok-net commented May 23, 2019 •

edited

Loading

d-a-v commented May 23, 2019 •

edited

Loading

dok-net commented May 24, 2019 •

edited

Loading

devyte commented May 24, 2019 •

edited

Loading