Multi-core in llbooter, capmgr, sl, cos_kernel_api(locks) and other related #346

phanikishoreg · 2018-04-09T15:26:33Z

Summary of this Pull Request (PR)

This PR has all the required (at least the basic facilities for foundation) for cross-cpu and parent-child communication and for any 2 parties to communicate (through channels) and bug-fixes in core interfaces and libraries for multi-core.

channel:

new interface for providing shared memory channels.

capmgr:

per-core data structures for threads and per-core initialization
implement "channel" interface for creating/mapping shared memory with a channel key.
capmgr_thd_retrieve api to return the thdid of the initthd in the child component to aid in getting the initthd(schedthd) for a lazy child thread lookup.

sched/schedinit:

bug-fix in schedinit_child interface where thread was being retrieved on a call to sl_thd_lkup, so added sl_thd_try_lkup (in sl) to avoid that.
Make schedinit_child return the cbuf_t ID to the child after parent creates a shared memory, initializes a ring-buffer in it and returns the ID to the child to map in that shared memory and process events from there.

llbooter:

per-core data structures and per-core initialization: Scheduling information is tracked for every core for a system with per-core hierarchies
TODO: run-script and/or configuration (static) for identifying per-core hierarchies.

sl:

per-core data structures: per-core global data and sl_thd structures
sl_xcpu: For cross-cpu (cross-core) communication. API for asynchronous thread creation on destination core. Only one API for cross-core thread creation is implemented and the rest return -ENOSUP for now. (TODO: rename to sl_xcore).
TODO: sl_init to be passed a CPU bitmap for initializing for only those cores and for creating asnds for communication with only those cores. Right now, its hardcoded in sl_init to initialize for NUM_CPU number of cores.
sl_child: For parent-child communication. To enable child threads to make invocations into the parent and be blocked. This lets the parent notify it's child scheduler of its thread blocking.

cos_kernel_api:

bug-fixes for multi-core parallelism.
locks for making cap/mem bumps and vas expands atomic. This is not great because it is not wait-free. This is the reason for not marking "This PR is mature" checkbox.

ck submodule:

added ck submodule in components/lib/ck and using SPSC and MPSC ck_ring in sl_child and sl_xcpu respectively.

Intent for your PR

Choose one (Mandatory):

This PR is for a code-review and is intended to get feedback, but not to be pulled yet.
This PR is mature, and ready to be integrated into the repo.

Reviewers (Mandatory):

@gparmer @ryuxin @yzcode

Code Quality

As part of this pull request, I've considered the following:

Style:

Comments adhere to the Style Guide (SG)
Spacing adhere's to the SG
Naming adhere's to the SG
All other aspects of the SG are adhered to, or exceptions are justified in this pull request
I have run the auto formatter on my code before submitting this PR (see doc/auto_formatter.md for instructions)

Code Craftsmanship:

I've made an attempt to remove all redundant code
I've considered ways in which my changes might impact existing code, and cleaned it up
I've formatted the code in an effort to make it easier to read (proper error handling, function use, etc...)
I've commented appropriately where code is tricky
I agree that there is no "throw-away" code, and that code in this PR is of high quality

Testing

I've tested the code using the following test programs (provide list here):

microbooter, unit_capmgr, unit_schedcomp, unit_hiersched for 1, 2 and 4 cores. (some on HW, some on Qemu)
unit_fprr - modified to support multi-core and added unit-test for cross-core thread creation.
tested parent-child communication in a hacky way to confirm that the child is able to dequeue the exact requests from parent.

* This really is reverting back to use the SL_THD_WOKEN state for a thread that could call block() and was just woken() that could result in a race condition between block/wakeup events. * However, when the scheduler receives events from the kernel, they're for AEPs and must not have races. More importantly, kernel events could be redundant blocked/unblocked events. * NOTE: There could be some complex interleaving with AEP threads that I may be missing, could still have some issues. Will need to debug those as they occur.

* most changes are related to having PER_CORE variables/data-structures.

…into smp Conflicts RESOLVED: src/components/include/cos_defkernel_api.h src/components/include/cos_thd_init.h src/components/lib/cos_defkernel_api.c FIXED TO USE CACHE_ALIGNED. TODO: capmgr/sched implementations!

* TODO? Cannot create per-core thread ids because stack allocation uses unique thread ids for that. * TODO: sched component per core! * TODO: sl global data structures * Made sure that llbooter /capmgr data structures are set up such that we can have core-specific hierarchies. TODO: runscript design for that and passing that in init_args??

Conflicts RESOLVED: src/platform/i386/lapic.c

* Schedulers are implemented such that we can have core-specific hierarchies. This still needs to be designed and implemented in the linker/loader. * TODO: inter-core scheduler notifications.

…R register

* This is a workaround for issues in cos_kernel_api data-structures that are prone to races. * TODO: 1. Figure out where the races are! Use ticket locks for cos_kernel_api frontier modifications.

* Fixed race bugs in cos_kernel_api that lead to capability slot reuse. * This allows booter/capmgr to run 1 to N-1 CPUs to run initialization in parallel. (no more serializing required). * There are still some random crashes (noticed one in microbooter for heap alloc), I'll continue to debug those. But this is much stable version than before! * Tested my unit test for hierarchical scheduling with ~9 components, capmgr test, raw scheduler (scheduler that uses raw kernel API) and micro booter all on HW by running with NUM_CPU = 4.

* using locks in the cos_kernel_api!

* Talked to Yuxin about using locks here. Issues, 1. We could be propagating errors to user-level and not use locks. I think these are not performance intensive paths and the idea here is only to maintain consistency of the shared frontiers (across cores) and not to solve for example system call errors, which we'd still need to either propagate to the user or BUG()/assert() as we've now. 2. Reason for locks, vs retry logic for data structure consistencies: Having locks ensure that we don't update frontiers when we don't need to so we don't have holes created in the capability slots. Plus, again, if we need to propagate retries, we can do that with `try_lock` instead of with the internal API failures. Which would ensure we maintain simple code and less complex retry logic around it. 3. Whether locks are a long-term solution: I'd think the locks are mainly here for maintaining the cos_kernel_api library data-structure consistency and that around the APIs that are not performance sensitive or for which we don't upper-bound (??). So unless we want to have a totally lock free user-level(user of the lib) retry version, this doesn't feel like a short-term solution to me! 4. We could be work to make the lock usage more efficient: Perhaps we can. Ex: In bump_alloc_generic, we don't need locks to be used for per-core bump allocs. I have not thought this through but I don't think it matters as much but there could be room for optimizing the usage! * TODO: The only issue I think after this is with `cos_page_bump_alloc()` and I need to fix it. In my opinion I think it is probably not related to frontiers (very well could be) but is mostly related to atomicity of bump_expand and that PTE may need additional extension.. This is still not a complete thought, will debug deeper.. The reason I say this is because `page_bump_alloc()` in most of my tests don't crash except for micro_booter which tests it more rigorously in that the page_bump_allocs() cause additional PTEs to be allocated and if executions interleave, they should be causing issues!

* There is a race bug that leads to asserts trigger in page_bump_alloc from a system call. In my understanding PTEs are allocated by one core and used by other core which does also valloc.. I'm going to debug this further to get to the core of the problem.. But this fix makes things flawless on multi-core (tested with 8 cores).

Conflicts RESOLVED: src/components/implementation/tests/micro_booter/micro_booter.c src/kernel/include/shared/cos_types.h src/platform/i386/chal.c

* added a long description in sl_thd.h for what the idea is for the fix for AEP threads.. Essentially, if a thread is not blocked on cos_rcv, a kernel "unblocked" event should not bother what the thread state is at the user-level and the user-level block/yield should reset kernel state in the scheduler data structure as the thread is running or has been running without the scheduler's knowledge! Therefore kernel event isn't really a thread state, designed it that way!

…ken_race

* sl_xcpu_xxxx() api for cross-core requests! Currently only a single API sl_xcpu_thd_alloc() is supported and tested! * `sl_global_init()` is called from inside `sl_init()` on the first call to it, so global init is not exposed! TODO: pass the cpu bitmap in `sl_init` for information about which cpu/cores does this scheduler run on! Right now, it's hard coded within sl_init! * Added ability to create `asndcap` from one core to all other cores for cross-core notifications!

into shmmgr_channels

* Channels are a more generic term and using the same channel keys for ASND->ARCV and SHARED MEM would be ideal as they together form a communication channel. This change is toward that goal!

* This interface allows us to create a channel based on a static namespace using the cos_channelkey_t.. If two components want to communicate with each other but not worry about their ancestry or hierarchy, this API along with AEP creation API and shared keys enables two arbitrary components to talk to each other! We need access control for which components can talk to which others! Perhaps based on the dependencies in runscripts?? (This is not currently done however!). * I've also modified to return the mapped information if we make a duplicate map call either in memmgr or channel interface! * I was slightly confused between channel vs shmchannel. But I chose channel because it's shorter and perhaps allows us to also for ex, combine shared memory creation and asnd creation API using a channel key!

…mposite into smp Conflicts RESOLVED: src/components/implementation/capmgr/naive/cap_info.c src/components/implementation/capmgr/naive/cap_info.h src/components/implementation/tests/unit_capmgr/unit_capmgr.c src/components/implementation/tests/unit_capmgr_shmmap/unit_capmgr_shmmap.c src/components/lib/sl/sl_capmgr.c src/components/lib/sl/sl_raw.c * TODO: TESTING!

* Added `capmgr_asnd_rcv_create()` api to avoid `introspection` on a initthread to be able to create a snd endpoint.

* A ring buffer (ck_ring) is initialized in a shared memory region that is shared between a parent and its child (SPSC). * Parent creates the shared memory, initializes the ring buffer and returns the cbuf ID to the child through the schedinit_child interface! * Child then maps that address and setup it's ring/ring buffer pointers. * Every scheduler then in its `sl_sched_loop()` check if there are events in their ring buffer if there are, they're processed like other kernel events or inter-core events! Only BLOCK/WAKEUP are supported currently. * TODO: Upon `capmgr_thd_retrieve()` I'd need to get the init thread id of the child component that has this thread! This is because, the lazy thread retrieval doesn't know the mappings for child spdid to child threads! * Tested the notification path but not the actual blocks & wakeups!

* This API is used to retrieve the thread lazily, so knowing the initthd of the client component is essential at this point to setup the thd->schedthd of the thread being retrieved!

* Replaced multiple arrays of per core information with a struct that contains all of them and cache alignment for the struct, to save space!

* sl_xcpu_xxxx API are for cross-cpu requests.. passing in the cpuid of the current core as the argument is restricted in my design! * also made the return value reflect the internal error by returning the errno!

Conflicts RESOLVED: src/kernel/include/shared/cos_config.h src/kernel/include/shared/cos_types.h

…ken_race

* change `sched_blocked` to `rcv_suspended` * comments update for `rcv_suspended` * cos_switch to sched thread with current tcap on -EPERM for cos switch with sched tcap.

Woken race fix for any thread

Conflicts RESOLVED: src/components/include/sl.h src/components/lib/sl/sl_sched.c

gparmer · 2018-04-21T20:53:07Z

I really hope that you addressed all of the issues. I looked through the commits after feedback, and there isn't much there...but I don't remember what the feedback was at this point, so that might be OK. I'm taking you on your word ;-)

phanikishoreg · 2018-04-21T21:19:38Z

I wasn't expecting a merge of this so soon. ;-)
I mean what I've tested works but this PR is not reviewed at all. This was created when you were away for RTAS conference.

The first few comments are for my changes in cos_kernel_api and those were through a direct commit (in my forked repo) before me creating this PR that I shared for some feedback while I was debugging and I've addressed those.

I'm not sure if I should revert the PR so it can go through the review process.

phanikishoreg · 2018-04-21T21:35:57Z

Never mind. I just realized this is to "smp" branch and not mainline ppos.
My mistake to not have noticed that it was not a PR to mainline ppos.

We can do the review process when I PR it to the mainline.

gparmer · 2018-04-22T13:16:12Z

I misunderstood. I thought you said before that this was ready and post-changes. I thought I did give feedback. Did I only give feedback on a subset of the PR?

phanikishoreg · 2018-04-22T16:00:00Z

I have said that for the other "ready to merge" PRs and but not this one I think. :-)
Yeah, you provided feedback for a commit that I directly referenced on slack some time back.

phanikishoreg added 30 commits March 9, 2018 18:49

Per-core scheduling structures

0fdb013

Merge branch 'ppos' of https://github.com/gwsystems/composite into smp

3427c92

Fully working micro_booter multi-core (Qemu tested)

11b9f33

* most changes are related to having PER_CORE variables/data-structures.

Consistency check in timer calibration and lapic ds CACHE_ALIGNED

47948ad

Merge branch 'smp' of https://github.com/gwsystems/composite into smp

3fe3bc5

Conflicts RESOLVED: src/platform/i386/lapic.c

Cleaning up my debug mess

aa72b8c

BLOCK on TCAP expiry for threads with TCAPs

1e1cefa

Updated for core-partitioned scheduling

3895314

* Schedulers are implemented such that we can have core-specific hierarchies. This still needs to be designed and implemented in the linker/loader. * TODO: inter-core scheduler notifications.

Merge branch 'smp' of https://github.com/gwsystems/composite into smp

f0c96a2

Merge branch 'smp' of https://github.com/gwsystems/composite into smp

daabdcb

Avoid calibration where we have TSC-DEADLINE and Processor freq in MS…

49bfe52

…R register

Serializing initialization in llbooter/capmgr

8dcf7b6

* This is a workaround for issues in cos_kernel_api data-structures that are prone to races. * TODO: 1. Figure out where the races are! Use ticket locks for cos_kernel_api frontier modifications.

Updated cos_kernel_api to atomically update frontiers.

7ca26e6

* using locks in the cos_kernel_api!

Merge branch 'smp' of https://github.com/gwsystems/composite into smp

400f807

Conflicts RESOLVED: src/components/implementation/tests/micro_booter/micro_booter.c src/kernel/include/shared/cos_types.h src/platform/i386/chal.c

Merge branch 'ppos' of https://github.com/gwsystems/composite into wo…

d84cdf7

…ken_race

Move sl_global_data to sl_global_cpu_data for per core global info

0acf8b0

Merge branch 'cbufid_only' of https://github.com/phanikishoreg/composite

538a97c

into shmmgr_channels

Renamed cos_aepkey_t to cos_channelkey_t

9c002f6

* Channels are a more generic term and using the same channel keys for ASND->ARCV and SHARED MEM would be ideal as they together form a communication channel. This change is toward that goal!

Fixed sl xcpu to use sl_raw/sl_capmgr for asnd creation

ed4e63c

* Added `capmgr_asnd_rcv_create()` api to avoid `introspection` on a initthread to be able to create a snd endpoint.

phanikishoreg and others added 12 commits April 6, 2018 12:34

capmgr_thd_retrieve to return the initthd's tid

6a94d8c

* This API is used to retrieve the thread lazily, so knowing the initthd of the client component is essential at this point to setup the thd->schedthd of the thread being retrieved!

Merge branch 'ppos' of https://github.com/gwsystems/composite into smp

816a036

Check for current thd's schedthd in schedinit_child

d56dac5

capmgr per-core data structure optimized

da38741

* Replaced multiple arrays of per core information with a struct that contains all of them and cache alignment for the struct, to save space!

Consistency check in sl_xcpu api and errno.

859e23c

* sl_xcpu_xxxx API are for cross-cpu requests.. passing in the cpuid of the current core as the argument is restricted in my design! * also made the return value reflect the internal error by returning the errno!

Merge branch 'smp' of https://github.com/gwsystems/composite into smp

87eddf6

Conflicts RESOLVED: src/kernel/include/shared/cos_config.h src/kernel/include/shared/cos_types.h

Merge branch 'ppos' of https://github.com/gwsystems/composite into wo…

dd7757b

…ken_race

Feedback fixes

e92c6c3

* change `sched_blocked` to `rcv_suspended` * comments update for `rcv_suspended` * cos_switch to sched thread with current tcap on -EPERM for cos switch with sched tcap.

feedback cleanup

5a1fd70

Merge pull request gwsystems#345 from phanikishoreg/woken_race

c131a32

Woken race fix for any thread

Bugfix in xcore thread creation in sl

67c993d

Merge branch 'ppos' of https://github.com/gwsystems/composite into smp

95b1590

Conflicts RESOLVED: src/components/include/sl.h src/components/lib/sl/sl_sched.c

gparmer merged commit b52203a into gwsystems:smp Apr 21, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-core in llbooter, capmgr, sl, cos_kernel_api(locks) and other related #346

Multi-core in llbooter, capmgr, sl, cos_kernel_api(locks) and other related #346

phanikishoreg commented Apr 9, 2018

gparmer commented Apr 21, 2018

phanikishoreg commented Apr 21, 2018

phanikishoreg commented Apr 21, 2018 •

edited

Loading

gparmer commented Apr 22, 2018

phanikishoreg commented Apr 22, 2018

Multi-core in llbooter, capmgr, sl, cos_kernel_api(locks) and other related #346

Multi-core in llbooter, capmgr, sl, cos_kernel_api(locks) and other related #346

Conversation

phanikishoreg commented Apr 9, 2018

Summary of this Pull Request (PR)

Intent for your PR

Reviewers (Mandatory):

Code Quality

Testing

gparmer commented Apr 21, 2018

phanikishoreg commented Apr 21, 2018

phanikishoreg commented Apr 21, 2018 • edited Loading

gparmer commented Apr 22, 2018

phanikishoreg commented Apr 22, 2018

phanikishoreg commented Apr 21, 2018 •

edited

Loading