-
Notifications
You must be signed in to change notification settings - Fork 384
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Signed-off-by: Timo K <toger5@hotmail.de>
- Loading branch information
Showing
1 changed file
with
150 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,150 @@ | ||
# MSC0000: MatrixRTC | ||
|
||
This MSC defines the modules with which the matrix real time system is build with. | ||
|
||
The MatrixRTC specification is separated into different modules. | ||
|
||
- The MatrixRTC room state that defines the state of the real time application.\ | ||
It is the source of truth for: | ||
- Who is part of a session | ||
- Who is connected via what technology/backend | ||
- Metadata per device used by other participants to decide whether the streams | ||
from this source are of interest / need to be subscribed. | ||
- The RTC backend. | ||
- It defines how to connect the participating peers. | ||
- Livekit is the standard for this as of writing. | ||
- Defines how to connect to a server/other peers, how to update the connection, | ||
how to subscribe to different streams... | ||
- Another planned backend is a full mesh implementation based on MSC3401. | ||
- The RTCSession types (application) have their own per application spec. | ||
- Calls can be done with an application of type `m.call` see (TODO: link call msc) | ||
- The application defines all the details of the RTC experience: | ||
- How to interpret the metadata of the member events. | ||
- What streams to connect to. | ||
- What data in which format to sent over the RTC channels. | ||
|
||
This MSC will focus on the matrix room state which can be seen as the most high | ||
level signalling of a call: | ||
|
||
## Proposal | ||
|
||
Each RTC session is made out of a collection of `m.rtc.member` events. | ||
Each `m.rtc.member` event defines the application type: `application` | ||
and a `call_id`. And is stored in a state event of type `m.rtc.member`. | ||
The first element of the state key is the `userId` and the second the `deviceId`. | ||
(see [this proposal for state keys](https://github.com/matrix-org/matrix-spec-proposals/pull/3757#issuecomment-2099010555) | ||
for context about second/first state key.) | ||
|
||
### The MatrixRTC room state | ||
|
||
Everything required for working MatrixRTC | ||
(current session, sessions history, join/leave events, ...) only | ||
require one event type. | ||
|
||
A complete `m.rtc.member` state event looks like this: | ||
|
||
```json | ||
// event type: m.rtc.member | ||
// event key: ["@user:matrix.domain", "DEVICEID"] | ||
{ | ||
"m.application": "m.my_session_type", | ||
"m.call_id": "", | ||
"focus_active": {...FOCUS_A}, | ||
"foci_preferred": [ | ||
{...FOCUS_1}, | ||
{...FOCUS_2} | ||
] | ||
} | ||
``` | ||
|
||
giving us the information, that user: `@user:matrix.domain` with device `DEVICEID` | ||
is part of an RTCSession of type `m.call` in the scope/sub-session `""` (empty | ||
string as call id) connected over `FOCUS_A`. This is all information that is needed | ||
for another room member to detect the running session and join it. | ||
|
||
There is **no event** to represent a session. This event would include shared | ||
information where it is not trivial to decide who has authority over it. | ||
Instead the session is a computed value based on `m.rtc.member` events. | ||
The list of events with the same `m.application` and `m.call_id` represent one session. | ||
This array allows to compute fields like participant count, start time ... | ||
|
||
Sending an empty `m.rtc.member` event represents a leave action. | ||
Sending a well formatted `m.rtc.member` represents a join action. | ||
|
||
Based on the value of `m.application`, the event might include additional parameters | ||
required to provide additional session parameters. | ||
|
||
> A thirdRoom like experience could include the information of an approximate position | ||
on the map, so that clients can omit connecting to participants that are not in their | ||
area of interest. | ||
|
||
#### Historic sessions | ||
|
||
Since there is no singe entry for a historic session (because of the owner ship discussion), | ||
historic sessions need to be computed and most likely cached on the client. | ||
|
||
Each state event can either mark a join or leave: | ||
|
||
- join: `prev_state.m.appliation != current_state.m.appliation` && | ||
Check warning on line 88 in proposals/XXXX-matrixRTC.md GitHub Actions / Spell Check with Typos
|
||
`prev_state.m.call_id != current_state.m.call_id` && | ||
`current_state.m.appliation != undefined` | ||
(where an empty `m.rtc.member` event would imply `state.m.appliation == undefined`) | ||
- leave: `prev_state.m.appliation != current_state.m.appliation` && | ||
Check warning on line 92 in proposals/XXXX-matrixRTC.md GitHub Actions / Spell Check with Typos
|
||
`prev_state.m.call_id != current_state.m.call_id` && | ||
`current_state.m.appliation == undefined` | ||
|
||
Based on this one can find user sessions. (The range between a join and a leave | ||
event) of specific times. | ||
The collection of all overlapping user sessions with the same `call_id` and | ||
`application` define one MatrixRTC history event. | ||
|
||
### The RTC backend | ||
|
||
`foci_active` and `foci_preferred` are used to communicate | ||
|
||
- how a user is connected to the session (`foci_active`) | ||
- what connection method this user knows about would like to connect with. | ||
|
||
The only enforced parameter of a `foci_preferred` or `foci_active` is `type`. | ||
Based on the focus type a different amount of parameters might be needed to, | ||
communicate how to connect to other users. | ||
`foci_preferred` and `foci_active` can have different parameters so that it is, | ||
possible to use a combination of the two to figure our that everyone is connected | ||
with each other. | ||
|
||
Only users with the same type can connect in one session. If a frontend does | ||
not support the used type they cannot connect. | ||
|
||
Each focus type will get its own MSC in which the detailed procedure to get from | ||
the foci information to working webRTC connections to the streams of all the | ||
participants is explained. | ||
|
||
- [`livekit`](www.example.com) TODO: create `livekit` focus MSC and add link here. | ||
- [`full_mesh`](https://github.com/matrix-org/matrix-spec-proposals/pull/3401) | ||
TODO: create `full-mesh` focus MSC based on[MSC3401](https://github.com/matrix-org/matrix-spec-proposals/pull/3401) | ||
and add link here. | ||
|
||
### The RTCSession types (application) | ||
|
||
Each session type might have its own specification in how the different streams | ||
are interpreted and even what focus type to use. This makes this proposal extremely | ||
flexible. A Jitsi conference could be added by introducing a new `m.application` | ||
and a new focus type and would be MatrixRTC compatible. It would not be compatible | ||
with applications that do not use the Jitsi focus but clients would know that there | ||
is an ongoing session of unknown type and unknown focus and could display/represent | ||
this in the user interface. | ||
|
||
To make it easy for clients to support different RTC session types, the recommended | ||
approach is to provide a matrix widget for each session type, so that client developers | ||
can use the widget as the first implementation if they want to support this RTC | ||
session type. | ||
|
||
Each application should get its own MSC in which the all the additional | ||
fields are explained and how the communication with the possible foci is | ||
defined: | ||
|
||
- [`m.call`](www.example.com) TODO: create `m.call` MSC and add link here. | ||
|
||
## Potential issues | ||
## Alternatives | ||
## Security considerations |