diff options
author | Thomas Voss <mail@thomasvoss.com> | 2024-11-27 20:54:24 +0100 |
---|---|---|
committer | Thomas Voss <mail@thomasvoss.com> | 2024-11-27 20:54:24 +0100 |
commit | 4bfd864f10b68b71482b35c818559068ef8d5797 (patch) | |
tree | e3989f47a7994642eb325063d46e8f08ffa681dc /doc/rfc/rfc8848.txt | |
parent | ea76e11061bda059ae9f9ad130a9895cc85607db (diff) |
doc: Add RFC documents
Diffstat (limited to 'doc/rfc/rfc8848.txt')
-rw-r--r-- | doc/rfc/rfc8848.txt | 1482 |
1 files changed, 1482 insertions, 0 deletions
diff --git a/doc/rfc/rfc8848.txt b/doc/rfc/rfc8848.txt new file mode 100644 index 0000000..c9173f3 --- /dev/null +++ b/doc/rfc/rfc8848.txt @@ -0,0 +1,1482 @@ + + + + +Internet Engineering Task Force (IETF) R. Hanton +Request for Comments: 8848 Cisco Systems +Category: Experimental P. Kyzivat +ISSN: 2070-1721 + L. Xiao + Beijing Chuangshiyoulian + C. Groves + January 2021 + + + Session Signaling for Controlling Multiple Streams for Telepresence + (CLUE) + +Abstract + + This document is about Controlling Multiple Streams for Telepresence + (CLUE) signaling. It specifies how the CLUE protocol and the CLUE + data channel are used in conjunction with each other and with + existing signaling mechanisms, such as SIP and the Session + Description Protocol (SDP), to produce a telepresence call. + +Status of This Memo + + This document is not an Internet Standards Track specification; it is + published for examination, experimental implementation, and + evaluation. + + This document defines an Experimental Protocol for the Internet + community. This document is a product of the Internet Engineering + Task Force (IETF). It represents the consensus of the IETF + community. It has received public review and has been approved for + publication by the Internet Engineering Steering Group (IESG). Not + all documents approved by the IESG are candidates for any level of + Internet Standard; see Section 2 of RFC 7841. + + Information about the current status of this document, any errata, + and how to provide feedback on it may be obtained at + https://www.rfc-editor.org/info/rfc8848. + +Copyright Notice + + Copyright (c) 2021 IETF Trust and the persons identified as the + document authors. All rights reserved. + + This document is subject to BCP 78 and the IETF Trust's Legal + Provisions Relating to IETF Documents + (https://trustee.ietf.org/license-info) in effect on the date of + publication of this document. Please review these documents + carefully, as they describe your rights and restrictions with respect + to this document. Code Components extracted from this document must + include Simplified BSD License text as described in Section 4.e of + the Trust Legal Provisions and are provided without warranty as + described in the Simplified BSD License. + +Table of Contents + + 1. Introduction + 2. Terminology + 3. Media Feature Tag Definition + 4. SDP Grouping Framework CLUE Extension Semantics + 4.1. General + 4.2. The CLUE Data Channel and the CLUE Grouping Semantic + 4.3. CLUE-Controlled Media and the CLUE Grouping Semantic + 4.4. SDP Semantics for CLUE-Controlled Media + 4.4.1. Signaling CLUE Encodings + 4.4.1.1. Referencing Encodings in the CLUE Protocol + 4.4.2. Negotiating Receipt of CLUE Capture Encodings in SDP + 4.5. SDP Offer/Answer Procedures + 4.5.1. Generating the Initial Offer + 4.5.2. Generating the Answer + 4.5.2.1. Negotiating Use of CLUE and the CLUE Data Channel + 4.5.2.2. Negotiating CLUE-Controlled Media + 4.5.2.3. Negotiating Non-CLUE-controlled Media + 4.5.3. Processing the Initial Offer/Answer Negotiation + 4.5.3.1. Successful CLUE Negotiation + 4.5.3.2. CLUE Negotiation Failure + 4.5.4. Modifying the Session + 4.5.4.1. Adding and Removing CLUE-Controlled Media + 4.5.4.2. Enabling CLUE Mid-Call + 4.5.4.3. Disabling CLUE Mid-Call + 4.5.4.4. CLUE Protocol Failure Mid-Call + 5. Interaction of the CLUE Protocol and SDP Negotiations + 5.1. Independence of SDP and CLUE Negotiation + 5.2. Constraints on Sending Media + 5.3. Recommendations for Operating with Non-atomic Operations + 6. Interaction of the CLUE Protocol and RTP/RTCP CaptureID + 6.1. CaptureID Reception during MCC Redefinition + 7. Multiplexing of CLUE-Controlled Media Using BUNDLE + 7.1. Overview + 7.2. Usage of BUNDLE with CLUE + 7.2.1. Generating the Initial Offer + 7.2.2. Multiplexing of the Data Channel and RTP Media + 8. Example: A Call between Two CLUE-Capable Endpoints + 9. Example: A Call between a CLUE-Capable and Non-CLUE Endpoint + 10. IANA Considerations + 10.1. New SDP Grouping Framework Attribute + 10.2. New SIP Media Feature Tag + 11. Security Considerations + 12. References + 12.1. Normative References + 12.2. Informative References + Acknowledgements + Authors' Addresses + +1. Introduction + + To enable devices to participate in a telepresence call, where they + select the sources they wish to view, receive those media sources, + and display them in an optimal fashion, Controlling Multiple Streams + for Telepresence (CLUE) employs two principal and interrelated + protocol negotiations. SDP [RFC4566], conveyed via SIP [RFC3261], is + used to negotiate the specific media capabilities that can be + delivered to specific addresses on a device. Meanwhile, CLUE + protocol messages [RFC8847], transported via a CLUE data channel + [RFC8850], are used to negotiate the Capture Sources available, their + attributes, and any constraints in their use. They also allow the + far-end device to specify which Captures they wish to receive. It is + recommended that those documents be read prior to this one as this + document assumes familiarity with those protocols and hence uses + terminology from each with limited introduction. + + Beyond negotiating the CLUE channel, SDP is also used to negotiate + the details of supported media streams and the maximum capability of + each of those streams. As the CLUE Framework [RFC8845] defines a + manner in which the Media Provider expresses their maximum Encoding + Group capabilities, SDP is also used to express the encoding limits + for each potential Encoding. + + Backwards compatibility is an important consideration of the + protocol: it is vital that a CLUE-capable device contacting a device + that does not support CLUE is able to fall back to a fully functional + non-CLUE call. The document also defines how a non-CLUE call may be + upgraded to CLUE mid-call and, similarly, how CLUE functionality can + be removed mid-call to return to a standard non-CLUE call. + +2. Terminology + + The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", + "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and + "OPTIONAL" in this document are to be interpreted as described in + BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all + capitals, as shown here. + + This document uses terminology defined in the CLUE Framework + [RFC8845]. + + A few additional terms specific to this document are defined as + follows: + + CLUE-controlled media: A media "m=" line that is under CLUE control; + the Capture Source that provides the media on this "m=" line is + negotiated in CLUE. See Section 4 for details on how this control + is signaled in SDP. There is a corresponding "non-CLUE- + controlled" media term. + + non-CLUE device: A device that supports standard SIP and SDP but + either does not support CLUE or does support CLUE but does not + currently wish to invoke CLUE capabilities. + + RTCP: RTP Control Protocol. + + SCTP: Stream Control Transmission Protocol. + + STUN: Session Traversal Utilities for NAT. + +3. Media Feature Tag Definition + + The "sip.clue" media feature tag [RFC3840] indicates support for CLUE + in SIP [RFC3261] calls. A CLUE-capable device SHOULD include this + media feature tag in its REGISTER requests and OPTION responses. It + SHOULD also include the media feature tag in INVITE and UPDATE + [RFC3311] requests and responses. + + Presence of the media feature tag in the contact field of a request + or response can be used to determine that the far end supports CLUE. + +4. SDP Grouping Framework CLUE Extension Semantics + +4.1. General + + This section defines a new SDP Grouping Framework [RFC5888] extension + called 'CLUE'. + + The CLUE extension can be indicated using an SDP session-level + 'group' attribute. Each SDP media "m=" line that is included in this + group, using SDP media-level mid attributes, is CLUE controlled by a + CLUE data channel that is also included in this CLUE group. + + Currently, only support for a single CLUE group is specified; support + for multiple CLUE groups in a single session is outside the scope of + this document. A device MUST NOT include more than one CLUE group in + its SDP message unless it is following a specification that defines + how multiple CLUE channels are signaled and is able to either + determine that the other side of the SDP exchange supports multiple + CLUE channels or fail gracefully in the event it does not. + +4.2. The CLUE Data Channel and the CLUE Grouping Semantic + + The CLUE data channel [RFC8850] is a bidirectional data channel + [RFC8831] used for the transport of CLUE messages, conveyed within an + SCTP over DTLS connection. This channel must be established before + CLUE protocol messages can be exchanged and CLUE-controlled media can + be sent. + + The data channel is negotiated over SDP as described in [RFC8864]. A + CLUE-capable device wishing to negotiate CLUE MUST also include a + CLUE group in their SDP Offer or Answer and include the "mid" of the + "m=" line for the data channel in that group. The CLUE group MUST + include the "mid" of the "m=" line for one (and only one) data + channel. + + Presence of the data channel in the CLUE group in an SDP Offer or + Answer also serves, along with the "sip.clue" media feature tag, as + an indication that the device supports CLUE and wishes to upgrade the + call to include CLUE-controlled media. A CLUE-capable device SHOULD + include a data channel "m=" line in offers and, when allowed by + [RFC3264], answers. + +4.3. CLUE-Controlled Media and the CLUE Grouping Semantic + + CLUE-controlled media lines in an SDP are "m=" lines in which the + content of the media streams to be sent is negotiated via the CLUE + protocol [RFC8847]. For an "m=" line to be CLUE controlled, its + "mid" attribute value MUST be included in the CLUE group. CLUE- + controlled media is controlled by the CLUE protocol as negotiated on + the CLUE data channel with a "mid" included in the CLUE group. + + "m=" lines not specified as being under CLUE control follow normal + rules for media streams negotiated in SDP as defined in documents + such as [RFC3264]. + + The restrictions on CLUE-controlled media that are defined below + always apply to "m=" lines in an SDP Offer or Answer, even if + negotiation of the data channel in SDP failed due to lack of CLUE + support by the remote device or for any other reason, or in an offer + if the recipient does not include the "mid" of the corresponding "m=" + line in their CLUE group. + +4.4. SDP Semantics for CLUE-Controlled Media + +4.4.1. Signaling CLUE Encodings + + The CLUE Framework [RFC8845] defines the concept of "Encodings", + which represent the sender's encode ability. Each Encoding the Media + Provider wishes to signal is done so via an "m=" line of the + appropriate media type, which MUST be marked as sendonly with the + "a=sendonly" attribute or as inactive with the "a=inactive" + attribute. + + The encoder limits of active (e.g., "a=sendonly") Encodings can then + be expressed using existing SDP syntax. For instance, for H.264, see + Table 6 in Section 8.2.2 of [RFC6184] for a list of valid parameters + for representing encoder sender stream limits. + + These Encodings are CLUE controlled and hence MUST include a "mid" in + the CLUE group as defined above. + + In addition to the normal restrictions defined in [RFC3264], the + stream MUST be treated as if the "m=" line direction attribute had + been set to "a=inactive" until the Media Provider has received a + valid CLUE 'configure' message specifying the Capture to be used for + this stream. This means that RTP packets MUST NOT be sent until + configuration is complete, while non-media packets such as STUN, + RTCP, and DTLS MUST be sent as per their relevant specifications, if + negotiated. + + Every "m=" line representing a CLUE Encoding MUST contain a "label" + attribute as defined in [RFC4574]. This label is used to identify + the Encoding by the sender in CLUE 'advertisement' messages and by + the receiver in CLUE 'configure' messages. Each label used for a + CLUE-controlled "m=" line MUST be different from the label on all + other "m=" lines in the CLUE group, unless an "m=" line represents a + dependent stream related to another "m=" line (such as a Forward + Error Correction (FEC) stream), in which case it MUST have the same + label value as the "m=" line on which it depends. + +4.4.1.1. Referencing Encodings in the CLUE Protocol + + CLUE Encodings are defined in SDP but can be referenced from CLUE + protocol messages -- this is how the protocol defines which Encodings + are a part of an Encoding Group (in 'advertisement' messages) and + which Encoding is used to encode a specific Capture (in 'configure' + messages). The labels on the CLUE-controlled "m=" lines are the + references that are used in the CLUE protocol. + + Each <encID> (in encodingIDList) in a CLUE 'advertisement' message + SHOULD represent an Encoding defined in SDP; the specific Encoding + referenced is a CLUE-controlled "m=" line in the most recent SDP + Offer/Answer message sent by the sender of the 'advertisement' + message with a label value corresponding to the text content of the + <encID>. If the <encID> is not defined in SDP, it MUST be one it + anticipates sending in a subsequent SDP Offer/Answer exchange. + + Each <encodingID> (in captureEncodingType) in a CLUE 'configure' + message MUST represent an Encoding defined in SDP; the specific + Encoding referenced is a CLUE-controlled "m=" line in the most recent + SDP Offer/Answer message received by the sender of the 'configure' + message with a label value corresponding to the text content of the + <encodingID>. + + Note that the non-atomic nature of SDP/CLUE protocol interaction may + mean that there are temporary periods where an <encID>/<encodingID> + in a CLUE message does not reference an SDP "m=" line, or where an + Encoding represented in SDP is not referenced in a CLUE protocol + message. See Section 5 for specifics. + +4.4.2. Negotiating Receipt of CLUE Capture Encodings in SDP + + A receiver who wishes to receive a CLUE stream via a specific + Encoding requires an "a=recvonly" "m=" line that matches the + "a=sendonly" Encoding. + + These "m=" lines are CLUE controlled and hence MUST include their + "mid" in the CLUE group. They MAY include a "label" attribute, but + this is not required by CLUE, as only label values associated with + "a=sendonly" Encodings are referenced by CLUE protocol messages. + +4.5. SDP Offer/Answer Procedures + +4.5.1. Generating the Initial Offer + + A CLUE-capable device sending an initial SDP Offer of a SIP session + and wishing to negotiate CLUE will include an "m=" line for the data + channel to convey the CLUE protocol, along with a CLUE group + containing the "mid" of the data channel "m=" line. + + For interoperability with non-CLUE devices, a CLUE-capable device + sending an initial SDP Offer SHOULD NOT include any "m=" line for + CLUE-controlled media beyond the "m=" line for the CLUE data channel, + and it SHOULD include at least one non-CLUE-controlled media "m=" + line. + + If the device has evidence that the receiver is also CLUE capable, + for instance, due to receiving an initial INVITE with no SDP but + including a "sip.clue" media feature tag, the above recommendation is + waived, and the initial offer MAY contain "m=" lines for CLUE- + controlled media. + + With the same interoperability recommendations as for Encodings, the + sender of the initial SDP Offer MAY also include "a=recvonly" media + lines to preallocate "m=" lines to receive media. Alternatively, it + MAY wait until CLUE protocol negotiation has completed before + including these lines in a new offer/answer exchange -- see Section 5 + for recommendations. + +4.5.2. Generating the Answer + +4.5.2.1. Negotiating Use of CLUE and the CLUE Data Channel + + If the recipient of an initial offer is CLUE capable, and the offer + contains both an "m=" line for a data channel and a CLUE group + containing the "mid" for that "m=" line, they SHOULD negotiate data + channel support for an "m=" line and include the "mid" of that "m=" + line in a corresponding CLUE group. + + A CLUE-capable recipient that receives an "m=" line for a data + channel but no corresponding CLUE group containing the "mid" of that + "m=" line MAY still include a corresponding data channel "m=" line if + there are any other non-CLUE protocols it can convey over that + channel, but the use of the CLUE protocol MUST NOT be negotiated on + this channel. + +4.5.2.2. Negotiating CLUE-Controlled Media + + If the initial offer contained "a=recvonly" CLUE-controlled media + lines, the recipient SHOULD include corresponding "a=sendonly" CLUE- + controlled media lines for accepted Encodings, up to the maximum + number of Encodings it wishes to advertise. As CLUE-controlled + media, the "mid" of these "m=" lines MUST be included in the + corresponding CLUE group. The recipient MUST set the direction of + the corresponding "m=" lines of any remaining "a=recvonly" CLUE- + controlled media lines received in the offer to "a=inactive". + + If the initial offer contained "a=sendonly" CLUE-controlled media + lines, the recipient MAY include corresponding "a=recvonly" CLUE- + controlled media lines, up to the maximum number of Capture Encodings + it wishes to receive. Alternatively, it MAY wait until CLUE protocol + negotiation has completed before including these lines in a new + offer/answer exchange -- see Section 5 for recommendations. The + recipient MUST set the direction of the corresponding "m=" lines of + any remaining "a=sendonly" CLUE-controlled media lines received in + the offer to "a=inactive". + +4.5.2.3. Negotiating Non-CLUE-controlled Media + + A CLUE-controlled device implementation MAY prefer to render initial, + single-stream audio and/or video for the user as rapidly as possible, + transitioning to CLUE-controlled media once that has been negotiated. + Alternatively, an implementation MAY wish to suppress initial media, + only providing media once the final, CLUE-controlled streams have + been negotiated. + + The receiver of the initial offer, if making the call CLUE-enabled + with their SDP Answer, can make their preference clear by their + action in accepting or rejecting non-CLUE-controlled media lines. + Rejecting these "m=" lines will ensure that no non-CLUE-controlled + media flows before the CLUE-controlled media is negotiated. In + contrast, accepting one or more non-CLUE-controlled "m=" lines in + this initial answer will enable initial media to flow. + + If the answerer chooses to send initial non-CLUE-controlled media in + a CLUE-enabled call, Section 4.5.4.1 addresses the need to disable it + once the CLUE-controlled media is fully negotiated. + +4.5.3. Processing the Initial Offer/Answer Negotiation + + In the event that both the offer and answer include a data channel + "m=" line with a "mid" value included in corresponding CLUE groups, + CLUE has been successfully negotiated, and the call is now CLUE + enabled. If not, then the call is not CLUE enabled. + +4.5.3.1. Successful CLUE Negotiation + + In the event of successful CLUE enablement of the call, devices MUST + now begin negotiation of the CLUE channel; see [RFC8850] for + negotiation details. If negotiation is successful, the sending of + CLUE protocol messages [RFC8847] can begin. + + A CLUE-capable device MAY choose not to send RTP on the non-CLUE- + controlled channels during the period in which control of the CLUE- + controlled media lines is being negotiated (though RTCP MUST still be + sent and received as normal). However, a CLUE-capable device MUST + still be prepared to receive media on non-CLUE-controlled media lines + that have been successfully negotiated as defined in [RFC3264]. + + If either side of the call wishes to add additional CLUE-controlled + "m=" lines to send or receive CLUE-controlled media, they MAY now + send a SIP request with a new SDP Offer following the normal rules of + SDP Offer/Answer and any negotiated extensions. + +4.5.3.2. CLUE Negotiation Failure + + In the event that the negotiation of CLUE fails and the call is not + CLUE enabled once the initial offer/answer negotiation completes, + then CLUE is not in use in the call. CLUE-capable devices MUST + either revert to non-CLUE behavior or terminate the call. + +4.5.4. Modifying the Session + +4.5.4.1. Adding and Removing CLUE-Controlled Media + + Subsequent offer/answer exchanges MAY add additional "m=" lines for + CLUE-controlled media or activate or deactivate existing "m=" lines + per the standard SDP mechanisms. + + In most cases, at least one additional exchange after the initial + offer/answer exchange will be required before both sides have added + all the Encodings and the ability to receive Encodings that they + desire. Devices MAY delay adding "a=recvonly" CLUE-controlled "m=" + lines until after CLUE protocol negotiation completes -- see + Section 5 for recommendations. + + Once CLUE media has been successfully negotiated, devices SHOULD + ensure that non-CLUE-controlled media is deactivated by setting their + ports to 0 in cases where it corresponds to the media type of CLUE- + controlled media that has been successfully negotiated. This + deactivation may require an additional SDP exchange or may be + incorporated into one that is part of the CLUE negotiation. + +4.5.4.2. Enabling CLUE Mid-Call + + A CLUE-capable device that receives an initial SDP Offer from a non- + CLUE device SHOULD include a new data channel "m=" line and + corresponding CLUE group in any subsequent offers it sends, to + indicate that it is CLUE capable. + + If, in an ongoing non-CLUE call, an SDP Offer/Answer exchange + completes with both sides having included a data channel "m=" line in + their SDP and with the "mid" for that channel in a corresponding CLUE + group, then the call is now CLUE enabled; negotiation of the data + channel and subsequently the CLUE protocol begins. + +4.5.4.3. Disabling CLUE Mid-Call + + If, during an ongoing CLUE-enabled call, a device wishes to disable + CLUE, it can do so by following the procedures for closing a data + channel as defined in Section 6.6.1 of [RFC8864]: sending a new SDP + Offer/Answer exchange and subsequent SCTP Stream Sequence Number + (SSN) reset for the CLUE channel. It MUST also remove the CLUE + group. Without the CLUE group, any "m=" lines that were previously + CLUE controlled no longer are; implementations MAY disable them by + setting their ports to 0 or MAY continue to use them -- in the latter + case, how they are used is outside the scope of this document. + + If a device follows the procedure above, or an SDP Offer/Answer + negotiation completes in a fashion in which either the "m=" CLUE data + channel line was not successfully negotiated and/or one side did not + include the data channel in the CLUE group, then CLUE for this call + is disabled. In the event that this occurs, CLUE is no longer + enabled. Any active "m=" lines still included in the CLUE group are + no longer CLUE controlled, and the implementation MAY either disable + them in a subsequent negotiation or continue to use them in some + other fashion. If the data channel is still present but not included + in the CLUE group semantic, CLUE protocol messages MUST no longer be + sent. + +4.5.4.4. CLUE Protocol Failure Mid-Call + + In contrast to the specific disablement of the use of CLUE described + above, the CLUE channel may fail unexpectedly. Two circumstances + where this can occur are: + + * The CLUE data channel terminates, either gracefully or + ungracefully, without any corresponding SDP renegotiation. + + * A channel error of the CLUE protocol causes it to return to the + IDLE state as defined in Section 6 of [RFC8847]. + + In this circumstance, implementations SHOULD continue to transmit and + receive CLUE-controlled media on the basis of the last negotiated + CLUE messages, until the CLUE protocol is re-established (in the + event of a channel error) or disabled mid-call by an SDP exchange as + defined in Section 4.5.4.3. Implementations MAY choose to send such + an SDP request to disable CLUE immediately or MAY continue on in a + call-preservation mode. + +5. Interaction of the CLUE Protocol and SDP Negotiations + + Information about media streams in CLUE is split between two message + types: SDP, which defines media addresses and limits, and the CLUE + channel, which defines properties of Capture Devices available, scene + information, and additional constraints. As a result, certain + operations, such as advertising support for a new transmissible + Capture with an associated stream, cannot be performed atomically, as + they require changes to both SDP and CLUE messaging. + + This section defines how the negotiation of the two protocols + interact, provides some recommendations on dealing with intermediate + stages in non-atomic operations, and mandates additional constraints + on when CLUE-configured media can be sent. + +5.1. Independence of SDP and CLUE Negotiation + + To avoid the need to implement interlocking state machines with the + potential to reach invalid states if messages were to be lost, or be + rewritten en route by middleboxes, the state machines in SDP and CLUE + operate independently. The state of the CLUE channel does not + restrict when an implementation may send a new SDP Offer or Answer; + likewise, the implementation's ability to send a new CLUE + 'advertisement' or 'configure' message is not restricted by the + results of or the state of the most recent SDP negotiation (unless + the SDP negotiation has removed the CLUE channel). + + The primary implication of this is that a device may receive an SDP + Offer/Answer message with a CLUE Encoding for which it does not yet + have Capture information or receive a CLUE 'configure' message + specifying a Capture Encoding for which the far end has not + negotiated a media stream in SDP. + + CLUE messages contain an <encID> (in encodingIDList) or <encodingID> + (in captureEncodingType), which is used to identify a specific + Encoding or captureEncoding in SDP; see [RFC8846] for specifics. The + non-atomic nature of CLUE negotiation means that a sender may wish to + send a new CLUE 'advertisement' message before the corresponding SDP + message. As such, the sender of the CLUE message MAY include an + <encID> that does not currently match a CLUE-controlled "m=" line + label in SDP; a CLUE-capable implementation MUST NOT reject a CLUE + protocol message solely because it contains <encID> elements that do + not match a label in SDP. + + The current state of the CLUE Participant or Media Provider/Consumer + state machines does not affect compliance with any of the normative + language of [RFC3264]. That is, they MUST NOT delay an ongoing SDP + exchange as part of a SIP server or client transaction; an + implementation MUST NOT delay an SDP exchange while waiting for CLUE + negotiation to complete or for a 'configure' message to arrive. + + Similarly, a device in a CLUE-enabled call MUST NOT delay any + mandatory state transitions in the CLUE Participant or Media + Provider/Consumer state machines due to the presence or absence of an + ongoing SDP exchange. + + A device with the CLUE Participant state machine in the ACTIVE state + MAY choose to delay moving from ESTABLISHED to ADV (Media Provider + state machine) or from ESTABLISHED to WAIT FOR CONF RESPONSE (Media + Consumer state machine) based on the SDP state. See [RFC8847] for + CLUE state machine specifics. Similarly, a device MAY choose to + delay initiating a new SDP exchange based on the state of their CLUE + state machines. + +5.2. Constraints on Sending Media + + While SDP and CLUE message states do not impose constraints on each + other, both impose constraints on the sending of media -- CLUE- + controlled media MUST NOT be sent unless it has been negotiated in + both CLUE and SDP: an implementation MUST NOT send a specific CLUE + Capture Encoding unless its most recent SDP exchange contains an + active media channel for that Encoding AND it has received a CLUE + 'configure' message specifying a valid Capture for that Encoding. + +5.3. Recommendations for Operating with Non-atomic Operations + + CLUE-capable devices MUST be able to handle states in which CLUE + messages make reference to EncodingIDs that do not match the most + recently received SDP, irrespective of the order in which SDP and + CLUE messages are received. While these mismatches will usually be + transitory, a device MUST be able to cope with such mismatches + remaining indefinitely. However, this document makes some + recommendations on message ordering for these non-atomic transitions. + + CLUE-capable devices MUST ensure that any inconsistencies between SDP + and CLUE signaling are temporary by sending updated SDP or CLUE + messages as soon as the relevant state machines and other constraints + permit. + + Generally, implementations that receive messages with incomplete + information will be most efficient if they wait until they have the + corresponding information they lack before sending messages to make + changes related to that information. For example, an answerer that + receives a new SDP Offer with three new "a=sendonly" CLUE "m=" lines + for which it has received no CLUE 'advertisement' message providing + the corresponding capture information would typically include + corresponding "a=inactive" lines in its answer, and it would only + make a new SDP Offer with "a=recvonly" when and if a new + 'advertisement' message arrives with Captures relevant to those + Encodings. + + Because of the constraints of SDP Offer/Answer and because new SDP + negotiations are generally more 'costly' than sending a new CLUE + message, implementations needing to make changes to both channels + SHOULD prioritize sending the updated CLUE message over sending the + new SDP message. The aim is for the recipient to receive the CLUE + changes before the SDP changes, allowing the recipient to send their + SDP Answers without incomplete information and reducing the number of + new SDP Offers required. + +6. Interaction of the CLUE Protocol and RTP/RTCP CaptureID + + The CLUE Framework [RFC8845] allows for Multiple Content Captures + (MCCs): Captures that contain multiple source Captures, whether + composited into a single stream or switched based on some metric. + + The Captures that contribute to these MCCs may or may not be defined + in the 'advertisement' message. If they are defined and the MCC is + providing them in a switched format, the recipient may wish to + determine which originating source Capture is currently being + provided, so that they can apply geometric corrections based on that + Capture's geometry or take some other action based on the original + Capture information. + + To do this, [RFC8849] allows for the CaptureID of the originating + Capture to be conveyed via RTP or RTCP. A Media Provider sending + switched media for an MCC with defined originating sources MUST send + the CaptureID in both RTP and RTCP, as described in the mapping + document. + +6.1. CaptureID Reception during MCC Redefinition + + Because the RTP/RTCP CaptureID is delivered via a different channel + to the 'advertisement' message in which in the contents of the MCC + are defined, there is an intrinsic race condition in cases where the + contents of an MCC are redefined. + + When a Media Provider redefines an MCC that involves CaptureIDs, the + reception of the relevant CaptureIDs by the recipient will either + lead or lag reception and the processing of the new 'advertisement' + message by the recipient. As such, a Media Consumer MUST NOT be + disrupted by any of the following scenarios in any CLUE-controlled + media stream it is receiving, whether that stream is for a static + Capture or for an MCC (as any static Capture may be redefined to an + MCC in a later 'advertisement' message): + + * By receiving RTP or RTCP containing a CaptureID when the most + recently processed 'advertisement' message means that no media + CaptureIDs are expected. + + * By receiving RTP or RTCP without CaptureIDs when the most recently + processed 'advertisement' message means that media CaptureIDs are + expected. + + * By receiving a CaptureID in RTP or RTCP for a Capture defined in + the most recently processed 'advertisement' message, but which the + same 'advertisement' message does not include in the MCC. + + * By receiving a CaptureID in RTP or RTCP for a Capture not defined + in the most recently processed 'advertisement' message. + +7. Multiplexing of CLUE-Controlled Media Using BUNDLE + +7.1. Overview + + A CLUE call may involve sending and/or receiving significant numbers + of media streams. Conventionally, media streams are sent and + received on unique ports. However, each separate port used for this + purpose may impose costs that a device wishes to avoid, such as the + need to open that port on firewalls and NATs, the need to collect + Interactive Connectivity Establishment (ICE) candidates [RFC8445], + etc. + + The BUNDLE extension [RFC8843] can be used to negotiate the + multiplexing of multiple media lines onto a single 5-tuple for + sending and receiving media, allowing devices in calls to another + BUNDLE-supporting device to potentially avoid some of the above + costs. + + While CLUE-capable devices MAY support the BUNDLE extension for this + purpose, supporting the extension is not mandatory for a device to be + CLUE compliant. + + A CLUE-capable device that supports BUNDLE SHOULD also support rtcp- + mux [RFC5761]. However, a CLUE-capable device that supports rtcp-mux + may or may not support BUNDLE. + +7.2. Usage of BUNDLE with CLUE + + This specification imposes no additional requirements or restrictions + on the usage of BUNDLE when used with CLUE. There is no restriction + on combining CLUE-controlled media lines and non-CLUE-controlled + media lines in the same BUNDLE group or in multiple such groups. + However, there are several steps an implementation may wish to take + to ameliorate the cost and time requirements of extra SDP Offer/ + Answer exchanges between CLUE and BUNDLE. + +7.2.1. Generating the Initial Offer + + BUNDLE mandates that the initial SDP Offer MUST use a unique address + for each "m=" line with a non-zero port. Because CLUE + implementations generally will not include CLUE-controlled media + lines, with the exception of the data channel in the initial SDP + Offer, CLUE devices that support large numbers of streams can avoid + ever having to open large numbers of ports if they successfully + negotiate BUNDLE. + + An implementation that does include CLUE-controlled media lines in + its initial SDP Offer while also using BUNDLE must take care to avoid + rendering its CLUE-controlled media lines unusable in the event the + far end does not negotiate BUNDLE if it wishes to avoid the risk of + additional SDP exchanges to resolve this issue. This is best + achieved by not sending any CLUE-controlled media lines in an initial + offer with the 'bundle-only' attribute unless it has been established + via some other channel that the recipient supports and is able to use + BUNDLE. + +7.2.2. Multiplexing of the Data Channel and RTP Media + + BUNDLE-supporting CLUE-capable devices MAY include the data channel + in the same BUNDLE group as RTP media. In this case, the device MUST + be able to demultiplex the various transports -- see Section 9.2 of + the BUNDLE specification [RFC8843]. If the BUNDLE group includes + protocols other than the data channel transported via DTLS, the + device MUST also be able to differentiate the various protocols. + +8. Example: A Call between Two CLUE-Capable Endpoints + + This example illustrates a call between two CLUE-capable Endpoints. + Alice, initiating the call, is a system with three cameras and three + screens. Bob, receiving the call, is a system with two cameras and + two screens. A call-flow diagram is presented, followed by a summary + of each message. + + To manage the size of this section, the SDP snippets only illustrate + video "m=" lines. SIP ACKs are not always discussed. Note that + BUNDLE is not in use. + + +----------+ +-----------+ + | Alice | | Bob | + | | | | + +----+-----+ +-----+-----+ + | | + | | + | SIP INVITE 1 | + |--------------------------------->| + | | + | | + | SIP 200 OK 1 | + |<---------------------------------| + | | + | | + | SIP ACK 1 | + |--------------------------------->| + | | + | | + | | + |<########### MEDIA 1 ############>| + | 1 video A->B, 1 video B->A | + |<################################>| + | | + | | + | | + |<================================>| + | CLUE DATA CHANNEL ESTABLISHED | + |<================================>| + | | + | | + | CLUE OPTIONS | + |<*********************************| + | | + | | + | CLUE OPTIONS RESPONSE | + |*********************************>| + | | + | | + | CLUE ADVERTISEMENT 1 | + |*********************************>| + | | + | | + | CLUE ADVERTISEMENT 2 | + |<*********************************| + | | + | | + | CLUE ACK 1 | + |<*********************************| + | | + | | + | CLUE ACK 2 | + |*********************************>| + | | + | | + | SIP INVITE 2 (+3 sendonly) | + |--------------------------------->| + | | + | | + | CLUE CONFIGURE 1 | + |<*********************************| + | | + | | + | SIP 200 OK 2 (+2 recvonly) | + |<---------------------------------| + | | + | | + | CLUE CONFIGURE RESPONSE 1 | + |*********************************>| + | | + | | + | SIP ACK 2 | + |--------------------------------->| + | | + | | + | | + |<########### MEDIA 2 ############>| + | 2 video A->B, 1 video B->A | + |<################################>| + | | + | | + | SIP INVITE 3 (+2 sendonly) | + |<---------------------------------| + | | + | | + | CLUE CONFIGURE 2 | + |*********************************>| + | | + | | + | SIP 200 OK 3 (+2 recvonly) | + |--------------------------------->| + | | + | | + | CLUE CONFIGURE RESPONSE 2 | + |<*********************************| + | | + | | + | SIP ACK 3 | + |<---------------------------------| + | | + | | + | | + |<########### MEDIA 3 ############>| + | 2 video A->B, 2 video B->A | + |<################################>| + | | + | | + | | + v v + + In SIP INVITE 1, Alice sends Bob a SIP INVITE with the basic audio + and video capabilities and data channel included in the SIP body as + per [RFC8841]. Alice also includes the "sip.clue" media feature tag + in the INVITE. A snippet of the SDP showing the grouping attribute + and the video "m=" line are shown below. Alice has included a "CLUE" + group and the mid corresponding to a data channel in the group (3). + Note that Alice has chosen not to include any CLUE-controlled media + in the initial offer -- the "mid" value of the video line is not + included in the "CLUE" group. + + ... + a=group:CLUE 3 + ... + m=video 6002 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=sendrecv + a=mid:2 + ... + m=application 6100 UDP/DTLS/SCTP webrtc-datachannel + a=setup:actpass + a=sctp-port: 5000 + a=dcmap:2 subprotocol="CLUE";ordered=true + a=mid:3 + + Bob responds with a similar SDP in SIP 200 OK 1, which also has a + "CLUE" group including the "mid" value of a data channel; due to + their similarity, no SDP snippet is shown here. Bob wishes to + receive initial media and thus includes corresponding non-CLUE- + controlled audio and video lines. Bob also includes the "sip.clue" + media feature tag in the 200 OK. Alice and Bob are each now able to + send a single audio and video stream. This is illustrated as MEDIA + 1. + + With the successful initial SDP Offer/Answer exchange complete, Alice + and Bob are also free to negotiate the CLUE data channel. This is + illustrated as CLUE DATA CHANNEL ESTABLISHED. + + Once the data channel is established, CLUE protocol negotiation + begins. In this case, Bob was the DTLS client (sending "a=active" in + his SDP Answer) and hence is the CLUE Channel Initiator. He sends a + CLUE OPTIONS message describing his version support. On receiving + that message, Alice sends her corresponding CLUE OPTIONS RESPONSE. + + With the OPTIONS phase complete, Alice now sends her CLUE + 'advertisement' message (CLUE ADVERTISEMENT 1). She advertises three + static Captures representing her three cameras. She also includes + switched Captures suitable for systems with one or two screens. All + of these Captures are in a single Capture Scene, with suitable + Capture Scene Views that tell Bob he should subscribe to the three + static Captures, the two switched Captures, or the one switched + Capture. Alice has no simultaneity constraints, so all six Captures + are included in one simultaneous set. Finally, Alice includes an + Encoding Group with three Encoding IDs: "enc1", "enc2", and "enc3". + These Encoding IDs aren't currently valid but will match the next SDP + Offer she sends. + + Bob received CLUE ADVERTISEMENT 1 but does not yet send a 'configure' + message, because he has not yet received Alice's Encoding + information; thus, he does not know if she will have sufficient + resources in order to send him the two streams he ideally wants at a + quality he is happy with. Because Bob is not sending an immediate + 'configure' message with the "ack" element set, he must send an + explicit 'ack' message (CLUE ACK 1) to signal receipt of CLUE + ADVERTISEMENT 1. + + Bob also sends his CLUE 'advertisement' message (CLUE ADVERTISEMENT + 2) -- though the diagram shows that this occurs after Alice sends + CLUE ADVERTISEMENT 1, Bob sends his 'advertisement' message + independently and does not wait for CLUE ADVERTISEMENT 1 to arrive. + He advertises two static Captures representing his cameras. He also + includes a single composed Capture for single-screen systems, in + which he will composite the two camera views into a single video + stream. All three Captures are in a single Capture Scene, with + suitable Capture Scene Views that tell Alice she should subscribe to + either the two static Captures or the single composed Capture. Bob + also has no simultaneity constraints, so he includes all three + Captures in one simultaneous set. Bob also includes a single + Encoding Group with two Encoding IDs: "foo" and "bar". + + Similarly, Alice receives CLUE ADVERTISEMENT 2 but does not yet send + a 'configure' message, because she has not yet received Bob's + Encoding information; instead, she sends an 'ack' message (CLUE ACK + 2). + + Both sides have now sent their CLUE 'advertisement' messages, and an + SDP exchange is required to negotiate Encodings. For simplicity, in + this case, Alice is shown sending an INVITE with a new offer; in many + implementations, both sides might send an INVITE, which would be + resolved by use of the 491 Request Pending resolution mechanism from + [RFC3261]. + + Alice now sends SIP INVITE 2. She maintains the sendrecv audio, + video, and CLUE "m=" lines, and she adds three new sendonly "m=" + lines to represent the three CLUE-controlled Encodings she can send. + Each of these "m=" lines has a label corresponding to one of the + Encoding IDs from CLUE ADVERTISEMENT 1. Each also has its mid added + to the grouping attribute to show they are controlled by the CLUE + data channel. A snippet of the SDP showing the grouping attribute, + data channel, and video "m=" lines are shown below: + + ... + a=group:CLUE 3 4 5 6 + ... + m=video 6002 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=sendrecv + a=mid:2 + ... + m=application 6100 UDP/DTLS/SCTP webrtc-datachannel + a=sctp-port: 5000 + a=dcmap:2 subprotocol="CLUE";ordered=true + a=mid:3 + ... + m=video 6004 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016 + a=sendonly + a=mid:4 + a=label:enc1 + m=video 6006 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016 + a=sendonly + a=mid:5 + a=label:enc2 + m=video 6008 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016 + a=sendonly + a=mid:6 + a=label:enc3 + + Bob now has all the information he needs to decide which streams to + configure, allowing him to send both a CLUE 'configure' message and + his SDP Answer. As such, he now sends CLUE CONFIGURE 1. This + requests the pair of switched Captures that represent Alice's scene, + and he configures them with encoder ids "enc1" and "enc2". + + Bob also sends his SDP Answer as part of SIP 200 OK 2. Alongside his + original audio, video, and CLUE "m=" lines, he includes three + additional "m=" lines corresponding to the three added by Alice: two + active recvonly "m= "lines and an inactive "m=" line for the third. + He adds their "mid" values to the grouping attribute to show they are + controlled by the CLUE data channel. A snippet of the SDP showing + the grouping attribute and the video "m=" lines are shown below (mid + 100 represents the CLUE data channel, which is not shown): + + ... + a=group:CLUE 11 12 13 100 + ... + m=video 58722 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=sendrecv + a=mid:10 + ... + m=video 58724 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=recvonly + a=mid:11 + m=video 58726 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=recvonly + a=mid:12 + m=video 58728 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=inactive + a=mid:13 + + Alice receives Bob's CLUE CONFIGURE 1 message and sends CLUE + CONFIGURE RESPONSE 1 to acknowledge its reception. She does not yet + send the Capture Encodings specified, because at this stage, she + hasn't processed Bob's answer SDP and thus hasn't negotiated the + ability for Bob to receive these streams. + + On receiving SIP 200 OK 2 from Bob, Alice sends her SIP ACK (SIP ACK + 2). She is now able to send the two streams of video Bob requested + -- this is illustrated as MEDIA 2. + + The constraints of offer/answer meant that Bob could not include his + Encoding information as new "m=" lines in SIP 200 OK 2. As such, Bob + now sends SIP INVITE 3 to generate a new offer. Along with all the + streams from SIP 200 OK 2, Bob also includes two new sendonly + streams. Each stream has a label corresponding to the Encoding IDs + in his CLUE ADVERTISEMENT 2 message. He also adds their "mid" values + to the grouping attribute to show they are controlled by the CLUE + data channel. A snippet of the SDP showing the grouping attribute + and the video "m=" lines are shown below (mid 100 represents the CLUE + data channel, which is not shown): + + ... + a=group:CLUE 11 12 14 15 100 + ... + m=video 58722 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=sendrecv + a=mid:10 + ... + m=video 58724 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=recvonly + a=mid:11 + m=video 58726 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=recvonly + a=mid:12 + m=video 0 RTP/AVP 96 + a=mid:13 + m=video 58728 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016 + a=sendonly + a=label:foo + a=mid:14 + m=video 58730 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016 + a=sendonly + a=label:bar + a=mid:15 + + Having received this, Alice now has all the information she needs to + send her CLUE 'configure' message and her SDP Answer. In CLUE + CONFIGURE 2, she requests the two static Captures from Bob to be sent + on Encodings "foo" and "bar". + + Alice also sends SIP 200 OK 3, matching two recvonly "m=" lines to + Bob's new sendonly lines. She includes their "mid" values in the + grouping attribute to show they are controlled by the CLUE data + channel. Alice then deactivates the initial non-CLUE-controlled + media, as bidirectional CLUE-controlled media is now available. A + snippet of the SDP showing the grouping attribute and the video "m=" + lines are shown below (mid 3 represents the data channel, not shown): + + ... + a=group:CLUE 3 4 5 7 8 + ... + m=video 0 RTP/AVP 96 + a=mid:2 + ... + m=video 6004 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016 + a=sendonly + a=mid:4 + a=label:enc1 + m=video 6006 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016 + a=sendonly + a=mid:5 + a=label:enc2 + m=video 0 RTP/AVP 96 + a=mid:6 + m=video 6010 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=recvonly + a=mid:7 + m=video 6012 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=recvonly + a=mid:8 + + Bob receives Alice's CLUE CONFIGURE 2 message and sends CLUE + CONFIGURE RESPONSE 2 to acknowledge its reception. Bob does not yet + send the Capture Encodings specified, because he hasn't yet received + and processed Alice's SDP Answer and negotiated the ability to send + these streams. + + Finally, on receiving SIP 200 OK 3, Bob is now able to send the two + streams of video Alice requested -- this is illustrated as MEDIA 3. + + Both sides of the call are now sending multiple video streams with + their sources defined via CLUE negotiation. As the call progresses, + either side can send a new 'advertisement' or 'configure' message or + the new SDP Offers/Answers to add, remove, or change what they have + available or want to receive. + +9. Example: A Call between a CLUE-Capable and Non-CLUE Endpoint + + In this brief example, Alice is a CLUE-capable Endpoint making a call + to Bob, who is not CLUE capable (i.e., is not able to use the CLUE + protocol). + + +----------+ +-----------+ + | Alice | | Bob | + | | | | + +----+-----+ +-----+-----+ + | | + | | + | SIP INVITE 1 | + |--------------------------------->| + | | + | | + | 200 0K 1 | + |<---------------------------------| + | | + | | + | SIP ACK 1 | + |--------------------------------->| + | | + | | + | | + |<########### MEDIA 1 ############>| + | 1 video A->B, 1 video B->A | + |<################################>| + | | + | | + | | + | | + v v + + In SIP INVITE 1, Alice sends Bob a SIP INVITE including the basic + audio and video capabilities and data channel in the SDP body as per + [RFC8841]. Alice also includes the "sip.clue" media feature tag in + the INVITE. A snippet of the SDP showing the grouping attribute and + the video "m=" line are shown below. Alice has included a "CLUE" + group and the mid corresponding to a data channel in the group (3). + Note that Alice has chosen not to include any CLUE-controlled media + in the initial offer -- the "mid" value of the video line is not + included in the "CLUE" group. + + ... + a=group:CLUE 3 + ... + m=video 6002 RTP/AVP 96 + a=rtpmap:96 H264/90000 + a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600 + a=sendrecv + a=mid:2 + ... + m=application 6100 UDP/DTLS/SCTP webrtc-datachannel + a=sctp-port: 5000 + a=dcmap:2 subprotocol="CLUE";ordered=true + a=mid:3 + + Bob is not CLUE capable and hence does not recognize the "CLUE" + semantic for the grouping attribute, nor does he support the data + channel. IN SIP 200 OK 1, he responds with an answer that includes + audio and video, but with the data channel zeroed. + + From the lack of a CLUE group, Alice understands that Bob does not + support CLUE, or does not wish to use it. Both sides are now able to + send a single audio and video stream to each other. At this point, + Alice begins to send her fallback video: in this case, it's likely a + switched view from whichever camera shows the current loudest + participant on her side. + +10. IANA Considerations + +10.1. New SDP Grouping Framework Attribute + + This document registers the following semantics with IANA in the + "Semantics for the 'group' SDP Attribute" subregistry (under the + "Session Description Protocol (SDP) Parameters" registry) per + [RFC5888]: + + +===========================+=======+==============+===========+ + | Semantics | Token | Mux Category | Reference | + +===========================+=======+==============+===========+ + | CLUE-controlled "m=" line | CLUE | NORMAL | RFC 8848 | + +---------------------------+-------+--------------+-----------+ + + Table 1 + +10.2. New SIP Media Feature Tag + + This specification registers a new media feature tag in the SIP + [RFC3261] tree per the procedures defined in [RFC2506] and [RFC3840]. + + Media feature tag name: sip.clue + + ASN.1 Identifier: 30 + + Summary of the media feature indicated by this tag: This feature tag + indicates that the device supports CLUE-controlled media. + + Values appropriate for use with this feature tag: Boolean. + + The feature tag is intended primarily for use in the following + applications, protocols, services, or negotiation mechanisms: + This feature tag is most useful in a communications application + for describing the capabilities of a device to use the CLUE + control protocol to negotiate the use of multiple media streams. + + Related standards or documents: RFC 8848 + + Security Considerations: Security considerations for this media + feature tag are discussed in Section 11 of RFC 8848. + + Name(s) & email address(es) of person(s) to contact for further + information: Internet Engineering Steering Group <iesg@ietf.org> + + Intended usage: COMMON + +11. Security Considerations + + CLUE makes use of a number of protocols and mechanisms, either + defined by CLUE or long-standing. The Security Considerations + section of the CLUE Framework document [RFC8845] addresses the need + to secure these mechanisms by following the recommendations of the + individual protocols. + + Beyond the need to secure the constituent protocols, the use of CLUE + does impose additional security concerns. One area of increased risk + involves the potential for a malicious party to subvert a CLUE- + capable device to attack a third party by driving large volumes of + media (particularly video) traffic at them by establishing a + connection to the CLUE-capable device and directing the media to the + victim. While this is a risk for all media devices, a CLUE-capable + device may allow the attacker to configure multiple media streams to + be sent, significantly increasing the volume of traffic directed at + the victim. + + This attack can be prevented by ensuring that the media recipient + intends to receive the media packets. As such, all CLUE-capable + devices MUST support key negotiation and receiver intent assurance + via DTLS / Secure Real-time Transport Protocol (SRTP) [RFC5763] on + CLUE-controlled RTP "m=" lines, and they MUST use it or some other + mechanism that provides receiver intent assurance. All CLUE- + controlled RTP "m" lines must be secured and implemented using + mechanisms such as SRTP [RFC3711]. CLUE implementations MAY choose + not to require the use of SRTP to secure legacy (non-CLUE-controlled) + media for backwards compatibility with older SIP clients that are + incapable of supporting it. + + CLUE also defines a new media feature tag that indicates CLUE + support. This tag may be present even in non-CLUE calls, which + increases the metadata available about the sending device; this can + help an attacker differentiate between multiple devices and identify + otherwise anonymized users via the fingerprint of features their + device supports. To prevent this, SIP signaling used to set up CLUE + sessions SHOULD always be encrypted using TLS [RFC5630]. + + The CLUE protocol also carries additional information that could be + used to help fingerprint a particular user or to identify the + specific version of software being used. The CLUE Framework + [RFC8847] provides details about these issues and how to mitigate + them. + +12. References + +12.1. Normative References + + [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate + Requirement Levels", BCP 14, RFC 2119, + DOI 10.17487/RFC2119, March 1997, + <https://www.rfc-editor.org/info/rfc2119>. + + [RFC3711] Baugher, M., McGrew, D., Naslund, M., Carrara, E., and K. + Norrman, "The Secure Real-time Transport Protocol (SRTP)", + RFC 3711, DOI 10.17487/RFC3711, March 2004, + <https://www.rfc-editor.org/info/rfc3711>. + + [RFC3840] Rosenberg, J., Schulzrinne, H., and P. Kyzivat, + "Indicating User Agent Capabilities in the Session + Initiation Protocol (SIP)", RFC 3840, + DOI 10.17487/RFC3840, August 2004, + <https://www.rfc-editor.org/info/rfc3840>. + + [RFC4574] Levin, O. and G. Camarillo, "The Session Description + Protocol (SDP) Label Attribute", RFC 4574, + DOI 10.17487/RFC4574, August 2006, + <https://www.rfc-editor.org/info/rfc4574>. + + [RFC5763] Fischl, J., Tschofenig, H., and E. Rescorla, "Framework + for Establishing a Secure Real-time Transport Protocol + (SRTP) Security Context Using Datagram Transport Layer + Security (DTLS)", RFC 5763, DOI 10.17487/RFC5763, May + 2010, <https://www.rfc-editor.org/info/rfc5763>. + + [RFC5888] Camarillo, G. and H. Schulzrinne, "The Session Description + Protocol (SDP) Grouping Framework", RFC 5888, + DOI 10.17487/RFC5888, June 2010, + <https://www.rfc-editor.org/info/rfc5888>. + + [RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC + 2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174, + May 2017, <https://www.rfc-editor.org/info/rfc8174>. + + [RFC8831] Jesup, R., Loreto, S., and M. Tüxen, "WebRTC Data + Channels", RFC 8831, DOI 10.17487/RFC8831, January 2021, + <https://www.rfc-editor.org/info/rfc8831>. + + [RFC8841] Holmberg, C., Shpount, R., Loreto, S., and G. Camarillo, + "Session Description Protocol (SDP) Offer/Answer + Procedures for Stream Control Transmission Protocol (SCTP) + over Datagram Transport Layer Security (DTLS) Transport", + RFC 8841, DOI 10.17487/RFC8841, January 2021, + <https://www.rfc-editor.org/info/rfc8841>. + + [RFC8843] Holmberg, C., Alvestrand, H., and C. Jennings, + "Negotiating Media Multiplexing Using the Session + Description Protocol (SDP)", RFC 8843, + DOI 10.17487/RFC8843, January 2021, + <https://www.rfc-editor.org/info/rfc8843>. + + [RFC8845] Duckworth, M., Ed., Pepperell, A., and S. Wenger, + "Framework for Telepresence Multi-Streams", RFC 8845, + DOI 10.17487/RFC8845, January 2021, + <https://www.rfc-editor.org/info/rfc8845>. + + [RFC8846] Presta, R. and S P. Romano, "An XML Schema for the + Controlling Multiple Streams for Telepresence (CLUE) Data + Model", RFC 8846, DOI 10.17487/RFC8846, January 2021, + <http://www.rfc-editor.org/info/rfc8846>. + + [RFC8847] Presta, R. and S P. Romano, "Protocol for Controlling + Multiple Streams for Telepresence (CLUE)", RFC 8847, + DOI 10.17487/RFC8847, January 2021, + <https://www.rfc-editor.org/info/rfc8847>. + + [RFC8849] Even, R. and J. Lennox, "Mapping RTP Streams to + Controlling Multiple Streams for Telepresence (CLUE) Media + Captures", RFC 8849, DOI 10.17487/RFC8849, January 2021, + <https://www.rfc-editor.org/info/rfc8849>. + + [RFC8850] Holmberg, C., "Controlling Multiple Streams for + Telepresence (CLUE) Protocol Data Channel", RFC 8850, + DOI 10.17487/RFC8850, January 2021, + <https://www.rfc-editor.org/info/rfc8850>. + + [RFC8864] Drage, K., Makaraju, M., Ejzak, R., Marcon, J., and R. + Even, Ed., "Negotiation Data Channels Using the Session + Description Protocol (SDP)", RFC 8864, + DOI 10.17487/RFC8864, January 2021, + <https://www.rfc-editor.org/info/rfc8864>. + +12.2. Informative References + + [RFC2506] Holtman, K., Mutz, A., and T. Hardie, "Media Feature Tag + Registration Procedure", BCP 31, RFC 2506, + DOI 10.17487/RFC2506, March 1999, + <https://www.rfc-editor.org/info/rfc2506>. + + [RFC3261] Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, + A., Peterson, J., Sparks, R., Handley, M., and E. + Schooler, "SIP: Session Initiation Protocol", RFC 3261, + DOI 10.17487/RFC3261, June 2002, + <https://www.rfc-editor.org/info/rfc3261>. + + [RFC3264] Rosenberg, J. and H. Schulzrinne, "An Offer/Answer Model + with Session Description Protocol (SDP)", RFC 3264, + DOI 10.17487/RFC3264, June 2002, + <https://www.rfc-editor.org/info/rfc3264>. + + [RFC3311] Rosenberg, J., "The Session Initiation Protocol (SIP) + UPDATE Method", RFC 3311, DOI 10.17487/RFC3311, October + 2002, <https://www.rfc-editor.org/info/rfc3311>. + + [RFC4566] Handley, M., Jacobson, V., and C. Perkins, "SDP: Session + Description Protocol", RFC 4566, DOI 10.17487/RFC4566, + July 2006, <https://www.rfc-editor.org/info/rfc4566>. + + [RFC5630] Audet, F., "The Use of the SIPS URI Scheme in the Session + Initiation Protocol (SIP)", RFC 5630, + DOI 10.17487/RFC5630, October 2009, + <https://www.rfc-editor.org/info/rfc5630>. + + [RFC5761] Perkins, C. and M. Westerlund, "Multiplexing RTP Data and + Control Packets on a Single Port", RFC 5761, + DOI 10.17487/RFC5761, April 2010, + <https://www.rfc-editor.org/info/rfc5761>. + + [RFC6184] Wang, Y.-K., Even, R., Kristensen, T., and R. Jesup, "RTP + Payload Format for H.264 Video", RFC 6184, + DOI 10.17487/RFC6184, May 2011, + <https://www.rfc-editor.org/info/rfc6184>. + + [RFC8445] Keranen, A., Holmberg, C., and J. Rosenberg, "Interactive + Connectivity Establishment (ICE): A Protocol for Network + Address Translator (NAT) Traversal", RFC 8445, + DOI 10.17487/RFC8445, July 2018, + <https://www.rfc-editor.org/info/rfc8445>. + +Acknowledgements + + Besides the authors, the team focusing on this document consists of: + Roni Even, Simon Pietro Romano, and Roberta Presta. + + Christian Groves, Jonathan Lennox, and Adam Roach have contributed + detailed comments and suggestions. + +Authors' Addresses + + Robert Hanton + Cisco Systems + + Email: rohanse2@cisco.com + + + Paul Kyzivat + + Email: pkyzivat@alum.mit.edu + + + Lennard Xiao + Beijing Chuangshiyoulian + + Email: lennard.xiao@outlook.com + + + Christian Groves + + Email: cngroves.std@gmail.com |