summaryrefslogtreecommitdiff
path: root/doc/rfc/rfc8848.txt
blob: c9173f3d2cbf83b6d80f38e81c00b6f16a947669 (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010
1011
1012
1013
1014
1015
1016
1017
1018
1019
1020
1021
1022
1023
1024
1025
1026
1027
1028
1029
1030
1031
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042
1043
1044
1045
1046
1047
1048
1049
1050
1051
1052
1053
1054
1055
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066
1067
1068
1069
1070
1071
1072
1073
1074
1075
1076
1077
1078
1079
1080
1081
1082
1083
1084
1085
1086
1087
1088
1089
1090
1091
1092
1093
1094
1095
1096
1097
1098
1099
1100
1101
1102
1103
1104
1105
1106
1107
1108
1109
1110
1111
1112
1113
1114
1115
1116
1117
1118
1119
1120
1121
1122
1123
1124
1125
1126
1127
1128
1129
1130
1131
1132
1133
1134
1135
1136
1137
1138
1139
1140
1141
1142
1143
1144
1145
1146
1147
1148
1149
1150
1151
1152
1153
1154
1155
1156
1157
1158
1159
1160
1161
1162
1163
1164
1165
1166
1167
1168
1169
1170
1171
1172
1173
1174
1175
1176
1177
1178
1179
1180
1181
1182
1183
1184
1185
1186
1187
1188
1189
1190
1191
1192
1193
1194
1195
1196
1197
1198
1199
1200
1201
1202
1203
1204
1205
1206
1207
1208
1209
1210
1211
1212
1213
1214
1215
1216
1217
1218
1219
1220
1221
1222
1223
1224
1225
1226
1227
1228
1229
1230
1231
1232
1233
1234
1235
1236
1237
1238
1239
1240
1241
1242
1243
1244
1245
1246
1247
1248
1249
1250
1251
1252
1253
1254
1255
1256
1257
1258
1259
1260
1261
1262
1263
1264
1265
1266
1267
1268
1269
1270
1271
1272
1273
1274
1275
1276
1277
1278
1279
1280
1281
1282
1283
1284
1285
1286
1287
1288
1289
1290
1291
1292
1293
1294
1295
1296
1297
1298
1299
1300
1301
1302
1303
1304
1305
1306
1307
1308
1309
1310
1311
1312
1313
1314
1315
1316
1317
1318
1319
1320
1321
1322
1323
1324
1325
1326
1327
1328
1329
1330
1331
1332
1333
1334
1335
1336
1337
1338
1339
1340
1341
1342
1343
1344
1345
1346
1347
1348
1349
1350
1351
1352
1353
1354
1355
1356
1357
1358
1359
1360
1361
1362
1363
1364
1365
1366
1367
1368
1369
1370
1371
1372
1373
1374
1375
1376
1377
1378
1379
1380
1381
1382
1383
1384
1385
1386
1387
1388
1389
1390
1391
1392
1393
1394
1395
1396
1397
1398
1399
1400
1401
1402
1403
1404
1405
1406
1407
1408
1409
1410
1411
1412
1413
1414
1415
1416
1417
1418
1419
1420
1421
1422
1423
1424
1425
1426
1427
1428
1429
1430
1431
1432
1433
1434
1435
1436
1437
1438
1439
1440
1441
1442
1443
1444
1445
1446
1447
1448
1449
1450
1451
1452
1453
1454
1455
1456
1457
1458
1459
1460
1461
1462
1463
1464
1465
1466
1467
1468
1469
1470
1471
1472
1473
1474
1475
1476
1477
1478
1479
1480
1481
1482
Internet Engineering Task Force (IETF)                         R. Hanton
Request for Comments: 8848                                 Cisco Systems
Category: Experimental                                        P. Kyzivat
ISSN: 2070-1721                                                         
                                                                 L. Xiao
                                                Beijing Chuangshiyoulian
                                                               C. Groves
                                                            January 2021


  Session Signaling for Controlling Multiple Streams for Telepresence
                                 (CLUE)

Abstract

   This document is about Controlling Multiple Streams for Telepresence
   (CLUE) signaling.  It specifies how the CLUE protocol and the CLUE
   data channel are used in conjunction with each other and with
   existing signaling mechanisms, such as SIP and the Session
   Description Protocol (SDP), to produce a telepresence call.

Status of This Memo

   This document is not an Internet Standards Track specification; it is
   published for examination, experimental implementation, and
   evaluation.

   This document defines an Experimental Protocol for the Internet
   community.  This document is a product of the Internet Engineering
   Task Force (IETF).  It represents the consensus of the IETF
   community.  It has received public review and has been approved for
   publication by the Internet Engineering Steering Group (IESG).  Not
   all documents approved by the IESG are candidates for any level of
   Internet Standard; see Section 2 of RFC 7841.

   Information about the current status of this document, any errata,
   and how to provide feedback on it may be obtained at
   https://www.rfc-editor.org/info/rfc8848.

Copyright Notice

   Copyright (c) 2021 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (https://trustee.ietf.org/license-info) in effect on the date of
   publication of this document.  Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document.  Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of
   the Trust Legal Provisions and are provided without warranty as
   described in the Simplified BSD License.

Table of Contents

   1.  Introduction
   2.  Terminology
   3.  Media Feature Tag Definition
   4.  SDP Grouping Framework CLUE Extension Semantics
     4.1.  General
     4.2.  The CLUE Data Channel and the CLUE Grouping Semantic
     4.3.  CLUE-Controlled Media and the CLUE Grouping Semantic
     4.4.  SDP Semantics for CLUE-Controlled Media
       4.4.1.  Signaling CLUE Encodings
         4.4.1.1.  Referencing Encodings in the CLUE Protocol
       4.4.2.  Negotiating Receipt of CLUE Capture Encodings in SDP
     4.5.  SDP Offer/Answer Procedures
       4.5.1.  Generating the Initial Offer
       4.5.2.  Generating the Answer
         4.5.2.1.  Negotiating Use of CLUE and the CLUE Data Channel
         4.5.2.2.  Negotiating CLUE-Controlled Media
         4.5.2.3.  Negotiating Non-CLUE-controlled Media
       4.5.3.  Processing the Initial Offer/Answer Negotiation
         4.5.3.1.  Successful CLUE Negotiation
         4.5.3.2.  CLUE Negotiation Failure
       4.5.4.  Modifying the Session
         4.5.4.1.  Adding and Removing CLUE-Controlled Media
         4.5.4.2.  Enabling CLUE Mid-Call
         4.5.4.3.  Disabling CLUE Mid-Call
         4.5.4.4.  CLUE Protocol Failure Mid-Call
   5.  Interaction of the CLUE Protocol and SDP Negotiations
     5.1.  Independence of SDP and CLUE Negotiation
     5.2.  Constraints on Sending Media
     5.3.  Recommendations for Operating with Non-atomic Operations
   6.  Interaction of the CLUE Protocol and RTP/RTCP CaptureID
     6.1.  CaptureID Reception during MCC Redefinition
   7.  Multiplexing of CLUE-Controlled Media Using BUNDLE
     7.1.  Overview
     7.2.  Usage of BUNDLE with CLUE
       7.2.1.  Generating the Initial Offer
       7.2.2.  Multiplexing of the Data Channel and RTP Media
   8.  Example: A Call between Two CLUE-Capable Endpoints
   9.  Example: A Call between a CLUE-Capable and Non-CLUE Endpoint
   10. IANA Considerations
     10.1.  New SDP Grouping Framework Attribute
     10.2.  New SIP Media Feature Tag
   11. Security Considerations
   12. References
     12.1.  Normative References
     12.2.  Informative References
   Acknowledgements
   Authors' Addresses

1.  Introduction

   To enable devices to participate in a telepresence call, where they
   select the sources they wish to view, receive those media sources,
   and display them in an optimal fashion, Controlling Multiple Streams
   for Telepresence (CLUE) employs two principal and interrelated
   protocol negotiations.  SDP [RFC4566], conveyed via SIP [RFC3261], is
   used to negotiate the specific media capabilities that can be
   delivered to specific addresses on a device.  Meanwhile, CLUE
   protocol messages [RFC8847], transported via a CLUE data channel
   [RFC8850], are used to negotiate the Capture Sources available, their
   attributes, and any constraints in their use.  They also allow the
   far-end device to specify which Captures they wish to receive.  It is
   recommended that those documents be read prior to this one as this
   document assumes familiarity with those protocols and hence uses
   terminology from each with limited introduction.

   Beyond negotiating the CLUE channel, SDP is also used to negotiate
   the details of supported media streams and the maximum capability of
   each of those streams.  As the CLUE Framework [RFC8845] defines a
   manner in which the Media Provider expresses their maximum Encoding
   Group capabilities, SDP is also used to express the encoding limits
   for each potential Encoding.

   Backwards compatibility is an important consideration of the
   protocol: it is vital that a CLUE-capable device contacting a device
   that does not support CLUE is able to fall back to a fully functional
   non-CLUE call.  The document also defines how a non-CLUE call may be
   upgraded to CLUE mid-call and, similarly, how CLUE functionality can
   be removed mid-call to return to a standard non-CLUE call.

2.  Terminology

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
   "OPTIONAL" in this document are to be interpreted as described in
   BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all
   capitals, as shown here.

   This document uses terminology defined in the CLUE Framework
   [RFC8845].

   A few additional terms specific to this document are defined as
   follows:

   CLUE-controlled media:  A media "m=" line that is under CLUE control;
      the Capture Source that provides the media on this "m=" line is
      negotiated in CLUE.  See Section 4 for details on how this control
      is signaled in SDP.  There is a corresponding "non-CLUE-
      controlled" media term.

   non-CLUE device:  A device that supports standard SIP and SDP but
      either does not support CLUE or does support CLUE but does not
      currently wish to invoke CLUE capabilities.

   RTCP:  RTP Control Protocol.

   SCTP:  Stream Control Transmission Protocol.

   STUN:  Session Traversal Utilities for NAT.

3.  Media Feature Tag Definition

   The "sip.clue" media feature tag [RFC3840] indicates support for CLUE
   in SIP [RFC3261] calls.  A CLUE-capable device SHOULD include this
   media feature tag in its REGISTER requests and OPTION responses.  It
   SHOULD also include the media feature tag in INVITE and UPDATE
   [RFC3311] requests and responses.

   Presence of the media feature tag in the contact field of a request
   or response can be used to determine that the far end supports CLUE.

4.  SDP Grouping Framework CLUE Extension Semantics

4.1.  General

   This section defines a new SDP Grouping Framework [RFC5888] extension
   called 'CLUE'.

   The CLUE extension can be indicated using an SDP session-level
   'group' attribute.  Each SDP media "m=" line that is included in this
   group, using SDP media-level mid attributes, is CLUE controlled by a
   CLUE data channel that is also included in this CLUE group.

   Currently, only support for a single CLUE group is specified; support
   for multiple CLUE groups in a single session is outside the scope of
   this document.  A device MUST NOT include more than one CLUE group in
   its SDP message unless it is following a specification that defines
   how multiple CLUE channels are signaled and is able to either
   determine that the other side of the SDP exchange supports multiple
   CLUE channels or fail gracefully in the event it does not.

4.2.  The CLUE Data Channel and the CLUE Grouping Semantic

   The CLUE data channel [RFC8850] is a bidirectional data channel
   [RFC8831] used for the transport of CLUE messages, conveyed within an
   SCTP over DTLS connection.  This channel must be established before
   CLUE protocol messages can be exchanged and CLUE-controlled media can
   be sent.

   The data channel is negotiated over SDP as described in [RFC8864].  A
   CLUE-capable device wishing to negotiate CLUE MUST also include a
   CLUE group in their SDP Offer or Answer and include the "mid" of the
   "m=" line for the data channel in that group.  The CLUE group MUST
   include the "mid" of the "m=" line for one (and only one) data
   channel.

   Presence of the data channel in the CLUE group in an SDP Offer or
   Answer also serves, along with the "sip.clue" media feature tag, as
   an indication that the device supports CLUE and wishes to upgrade the
   call to include CLUE-controlled media.  A CLUE-capable device SHOULD
   include a data channel "m=" line in offers and, when allowed by
   [RFC3264], answers.

4.3.  CLUE-Controlled Media and the CLUE Grouping Semantic

   CLUE-controlled media lines in an SDP are "m=" lines in which the
   content of the media streams to be sent is negotiated via the CLUE
   protocol [RFC8847].  For an "m=" line to be CLUE controlled, its
   "mid" attribute value MUST be included in the CLUE group.  CLUE-
   controlled media is controlled by the CLUE protocol as negotiated on
   the CLUE data channel with a "mid" included in the CLUE group.

   "m=" lines not specified as being under CLUE control follow normal
   rules for media streams negotiated in SDP as defined in documents
   such as [RFC3264].

   The restrictions on CLUE-controlled media that are defined below
   always apply to "m=" lines in an SDP Offer or Answer, even if
   negotiation of the data channel in SDP failed due to lack of CLUE
   support by the remote device or for any other reason, or in an offer
   if the recipient does not include the "mid" of the corresponding "m="
   line in their CLUE group.

4.4.  SDP Semantics for CLUE-Controlled Media

4.4.1.  Signaling CLUE Encodings

   The CLUE Framework [RFC8845] defines the concept of "Encodings",
   which represent the sender's encode ability.  Each Encoding the Media
   Provider wishes to signal is done so via an "m=" line of the
   appropriate media type, which MUST be marked as sendonly with the
   "a=sendonly" attribute or as inactive with the "a=inactive"
   attribute.

   The encoder limits of active (e.g., "a=sendonly") Encodings can then
   be expressed using existing SDP syntax.  For instance, for H.264, see
   Table 6 in Section 8.2.2 of [RFC6184] for a list of valid parameters
   for representing encoder sender stream limits.

   These Encodings are CLUE controlled and hence MUST include a "mid" in
   the CLUE group as defined above.

   In addition to the normal restrictions defined in [RFC3264], the
   stream MUST be treated as if the "m=" line direction attribute had
   been set to "a=inactive" until the Media Provider has received a
   valid CLUE 'configure' message specifying the Capture to be used for
   this stream.  This means that RTP packets MUST NOT be sent until
   configuration is complete, while non-media packets such as STUN,
   RTCP, and DTLS MUST be sent as per their relevant specifications, if
   negotiated.

   Every "m=" line representing a CLUE Encoding MUST contain a "label"
   attribute as defined in [RFC4574].  This label is used to identify
   the Encoding by the sender in CLUE 'advertisement' messages and by
   the receiver in CLUE 'configure' messages.  Each label used for a
   CLUE-controlled "m=" line MUST be different from the label on all
   other "m=" lines in the CLUE group, unless an "m=" line represents a
   dependent stream related to another "m=" line (such as a Forward
   Error Correction (FEC) stream), in which case it MUST have the same
   label value as the "m=" line on which it depends.

4.4.1.1.  Referencing Encodings in the CLUE Protocol

   CLUE Encodings are defined in SDP but can be referenced from CLUE
   protocol messages -- this is how the protocol defines which Encodings
   are a part of an Encoding Group (in 'advertisement' messages) and
   which Encoding is used to encode a specific Capture (in 'configure'
   messages).  The labels on the CLUE-controlled "m=" lines are the
   references that are used in the CLUE protocol.

   Each <encID> (in encodingIDList) in a CLUE 'advertisement' message
   SHOULD represent an Encoding defined in SDP; the specific Encoding
   referenced is a CLUE-controlled "m=" line in the most recent SDP
   Offer/Answer message sent by the sender of the 'advertisement'
   message with a label value corresponding to the text content of the
   <encID>.  If the <encID> is not defined in SDP, it MUST be one it
   anticipates sending in a subsequent SDP Offer/Answer exchange.

   Each <encodingID> (in captureEncodingType) in a CLUE 'configure'
   message MUST represent an Encoding defined in SDP; the specific
   Encoding referenced is a CLUE-controlled "m=" line in the most recent
   SDP Offer/Answer message received by the sender of the 'configure'
   message with a label value corresponding to the text content of the
   <encodingID>.

   Note that the non-atomic nature of SDP/CLUE protocol interaction may
   mean that there are temporary periods where an <encID>/<encodingID>
   in a CLUE message does not reference an SDP "m=" line, or where an
   Encoding represented in SDP is not referenced in a CLUE protocol
   message.  See Section 5 for specifics.

4.4.2.  Negotiating Receipt of CLUE Capture Encodings in SDP

   A receiver who wishes to receive a CLUE stream via a specific
   Encoding requires an "a=recvonly" "m=" line that matches the
   "a=sendonly" Encoding.

   These "m=" lines are CLUE controlled and hence MUST include their
   "mid" in the CLUE group.  They MAY include a "label" attribute, but
   this is not required by CLUE, as only label values associated with
   "a=sendonly" Encodings are referenced by CLUE protocol messages.

4.5.  SDP Offer/Answer Procedures

4.5.1.  Generating the Initial Offer

   A CLUE-capable device sending an initial SDP Offer of a SIP session
   and wishing to negotiate CLUE will include an "m=" line for the data
   channel to convey the CLUE protocol, along with a CLUE group
   containing the "mid" of the data channel "m=" line.

   For interoperability with non-CLUE devices, a CLUE-capable device
   sending an initial SDP Offer SHOULD NOT include any "m=" line for
   CLUE-controlled media beyond the "m=" line for the CLUE data channel,
   and it SHOULD include at least one non-CLUE-controlled media "m="
   line.

   If the device has evidence that the receiver is also CLUE capable,
   for instance, due to receiving an initial INVITE with no SDP but
   including a "sip.clue" media feature tag, the above recommendation is
   waived, and the initial offer MAY contain "m=" lines for CLUE-
   controlled media.

   With the same interoperability recommendations as for Encodings, the
   sender of the initial SDP Offer MAY also include "a=recvonly" media
   lines to preallocate "m=" lines to receive media.  Alternatively, it
   MAY wait until CLUE protocol negotiation has completed before
   including these lines in a new offer/answer exchange -- see Section 5
   for recommendations.

4.5.2.  Generating the Answer

4.5.2.1.  Negotiating Use of CLUE and the CLUE Data Channel

   If the recipient of an initial offer is CLUE capable, and the offer
   contains both an "m=" line for a data channel and a CLUE group
   containing the "mid" for that "m=" line, they SHOULD negotiate data
   channel support for an "m=" line and include the "mid" of that "m="
   line in a corresponding CLUE group.

   A CLUE-capable recipient that receives an "m=" line for a data
   channel but no corresponding CLUE group containing the "mid" of that
   "m=" line MAY still include a corresponding data channel "m=" line if
   there are any other non-CLUE protocols it can convey over that
   channel, but the use of the CLUE protocol MUST NOT be negotiated on
   this channel.

4.5.2.2.  Negotiating CLUE-Controlled Media

   If the initial offer contained "a=recvonly" CLUE-controlled media
   lines, the recipient SHOULD include corresponding "a=sendonly" CLUE-
   controlled media lines for accepted Encodings, up to the maximum
   number of Encodings it wishes to advertise.  As CLUE-controlled
   media, the "mid" of these "m=" lines MUST be included in the
   corresponding CLUE group.  The recipient MUST set the direction of
   the corresponding "m=" lines of any remaining "a=recvonly" CLUE-
   controlled media lines received in the offer to "a=inactive".

   If the initial offer contained "a=sendonly" CLUE-controlled media
   lines, the recipient MAY include corresponding "a=recvonly" CLUE-
   controlled media lines, up to the maximum number of Capture Encodings
   it wishes to receive.  Alternatively, it MAY wait until CLUE protocol
   negotiation has completed before including these lines in a new
   offer/answer exchange -- see Section 5 for recommendations.  The
   recipient MUST set the direction of the corresponding "m=" lines of
   any remaining "a=sendonly" CLUE-controlled media lines received in
   the offer to "a=inactive".

4.5.2.3.  Negotiating Non-CLUE-controlled Media

   A CLUE-controlled device implementation MAY prefer to render initial,
   single-stream audio and/or video for the user as rapidly as possible,
   transitioning to CLUE-controlled media once that has been negotiated.
   Alternatively, an implementation MAY wish to suppress initial media,
   only providing media once the final, CLUE-controlled streams have
   been negotiated.

   The receiver of the initial offer, if making the call CLUE-enabled
   with their SDP Answer, can make their preference clear by their
   action in accepting or rejecting non-CLUE-controlled media lines.
   Rejecting these "m=" lines will ensure that no non-CLUE-controlled
   media flows before the CLUE-controlled media is negotiated.  In
   contrast, accepting one or more non-CLUE-controlled "m=" lines in
   this initial answer will enable initial media to flow.

   If the answerer chooses to send initial non-CLUE-controlled media in
   a CLUE-enabled call, Section 4.5.4.1 addresses the need to disable it
   once the CLUE-controlled media is fully negotiated.

4.5.3.  Processing the Initial Offer/Answer Negotiation

   In the event that both the offer and answer include a data channel
   "m=" line with a "mid" value included in corresponding CLUE groups,
   CLUE has been successfully negotiated, and the call is now CLUE
   enabled.  If not, then the call is not CLUE enabled.

4.5.3.1.  Successful CLUE Negotiation

   In the event of successful CLUE enablement of the call, devices MUST
   now begin negotiation of the CLUE channel; see [RFC8850] for
   negotiation details.  If negotiation is successful, the sending of
   CLUE protocol messages [RFC8847] can begin.

   A CLUE-capable device MAY choose not to send RTP on the non-CLUE-
   controlled channels during the period in which control of the CLUE-
   controlled media lines is being negotiated (though RTCP MUST still be
   sent and received as normal).  However, a CLUE-capable device MUST
   still be prepared to receive media on non-CLUE-controlled media lines
   that have been successfully negotiated as defined in [RFC3264].

   If either side of the call wishes to add additional CLUE-controlled
   "m=" lines to send or receive CLUE-controlled media, they MAY now
   send a SIP request with a new SDP Offer following the normal rules of
   SDP Offer/Answer and any negotiated extensions.

4.5.3.2.  CLUE Negotiation Failure

   In the event that the negotiation of CLUE fails and the call is not
   CLUE enabled once the initial offer/answer negotiation completes,
   then CLUE is not in use in the call.  CLUE-capable devices MUST
   either revert to non-CLUE behavior or terminate the call.

4.5.4.  Modifying the Session

4.5.4.1.  Adding and Removing CLUE-Controlled Media

   Subsequent offer/answer exchanges MAY add additional "m=" lines for
   CLUE-controlled media or activate or deactivate existing "m=" lines
   per the standard SDP mechanisms.

   In most cases, at least one additional exchange after the initial
   offer/answer exchange will be required before both sides have added
   all the Encodings and the ability to receive Encodings that they
   desire.  Devices MAY delay adding "a=recvonly" CLUE-controlled "m="
   lines until after CLUE protocol negotiation completes -- see
   Section 5 for recommendations.

   Once CLUE media has been successfully negotiated, devices SHOULD
   ensure that non-CLUE-controlled media is deactivated by setting their
   ports to 0 in cases where it corresponds to the media type of CLUE-
   controlled media that has been successfully negotiated.  This
   deactivation may require an additional SDP exchange or may be
   incorporated into one that is part of the CLUE negotiation.

4.5.4.2.  Enabling CLUE Mid-Call

   A CLUE-capable device that receives an initial SDP Offer from a non-
   CLUE device SHOULD include a new data channel "m=" line and
   corresponding CLUE group in any subsequent offers it sends, to
   indicate that it is CLUE capable.

   If, in an ongoing non-CLUE call, an SDP Offer/Answer exchange
   completes with both sides having included a data channel "m=" line in
   their SDP and with the "mid" for that channel in a corresponding CLUE
   group, then the call is now CLUE enabled; negotiation of the data
   channel and subsequently the CLUE protocol begins.

4.5.4.3.  Disabling CLUE Mid-Call

   If, during an ongoing CLUE-enabled call, a device wishes to disable
   CLUE, it can do so by following the procedures for closing a data
   channel as defined in Section 6.6.1 of [RFC8864]: sending a new SDP
   Offer/Answer exchange and subsequent SCTP Stream Sequence Number
   (SSN) reset for the CLUE channel.  It MUST also remove the CLUE
   group.  Without the CLUE group, any "m=" lines that were previously
   CLUE controlled no longer are; implementations MAY disable them by
   setting their ports to 0 or MAY continue to use them -- in the latter
   case, how they are used is outside the scope of this document.

   If a device follows the procedure above, or an SDP Offer/Answer
   negotiation completes in a fashion in which either the "m=" CLUE data
   channel line was not successfully negotiated and/or one side did not
   include the data channel in the CLUE group, then CLUE for this call
   is disabled.  In the event that this occurs, CLUE is no longer
   enabled.  Any active "m=" lines still included in the CLUE group are
   no longer CLUE controlled, and the implementation MAY either disable
   them in a subsequent negotiation or continue to use them in some
   other fashion.  If the data channel is still present but not included
   in the CLUE group semantic, CLUE protocol messages MUST no longer be
   sent.

4.5.4.4.  CLUE Protocol Failure Mid-Call

   In contrast to the specific disablement of the use of CLUE described
   above, the CLUE channel may fail unexpectedly.  Two circumstances
   where this can occur are:

   *  The CLUE data channel terminates, either gracefully or
      ungracefully, without any corresponding SDP renegotiation.

   *  A channel error of the CLUE protocol causes it to return to the
      IDLE state as defined in Section 6 of [RFC8847].

   In this circumstance, implementations SHOULD continue to transmit and
   receive CLUE-controlled media on the basis of the last negotiated
   CLUE messages, until the CLUE protocol is re-established (in the
   event of a channel error) or disabled mid-call by an SDP exchange as
   defined in Section 4.5.4.3.  Implementations MAY choose to send such
   an SDP request to disable CLUE immediately or MAY continue on in a
   call-preservation mode.

5.  Interaction of the CLUE Protocol and SDP Negotiations

   Information about media streams in CLUE is split between two message
   types: SDP, which defines media addresses and limits, and the CLUE
   channel, which defines properties of Capture Devices available, scene
   information, and additional constraints.  As a result, certain
   operations, such as advertising support for a new transmissible
   Capture with an associated stream, cannot be performed atomically, as
   they require changes to both SDP and CLUE messaging.

   This section defines how the negotiation of the two protocols
   interact, provides some recommendations on dealing with intermediate
   stages in non-atomic operations, and mandates additional constraints
   on when CLUE-configured media can be sent.

5.1.  Independence of SDP and CLUE Negotiation

   To avoid the need to implement interlocking state machines with the
   potential to reach invalid states if messages were to be lost, or be
   rewritten en route by middleboxes, the state machines in SDP and CLUE
   operate independently.  The state of the CLUE channel does not
   restrict when an implementation may send a new SDP Offer or Answer;
   likewise, the implementation's ability to send a new CLUE
   'advertisement' or 'configure' message is not restricted by the
   results of or the state of the most recent SDP negotiation (unless
   the SDP negotiation has removed the CLUE channel).

   The primary implication of this is that a device may receive an SDP
   Offer/Answer message with a CLUE Encoding for which it does not yet
   have Capture information or receive a CLUE 'configure' message
   specifying a Capture Encoding for which the far end has not
   negotiated a media stream in SDP.

   CLUE messages contain an <encID> (in encodingIDList) or <encodingID>
   (in captureEncodingType), which is used to identify a specific
   Encoding or captureEncoding in SDP; see [RFC8846] for specifics.  The
   non-atomic nature of CLUE negotiation means that a sender may wish to
   send a new CLUE 'advertisement' message before the corresponding SDP
   message.  As such, the sender of the CLUE message MAY include an
   <encID> that does not currently match a CLUE-controlled "m=" line
   label in SDP; a CLUE-capable implementation MUST NOT reject a CLUE
   protocol message solely because it contains <encID> elements that do
   not match a label in SDP.

   The current state of the CLUE Participant or Media Provider/Consumer
   state machines does not affect compliance with any of the normative
   language of [RFC3264].  That is, they MUST NOT delay an ongoing SDP
   exchange as part of a SIP server or client transaction; an
   implementation MUST NOT delay an SDP exchange while waiting for CLUE
   negotiation to complete or for a 'configure' message to arrive.

   Similarly, a device in a CLUE-enabled call MUST NOT delay any
   mandatory state transitions in the CLUE Participant or Media
   Provider/Consumer state machines due to the presence or absence of an
   ongoing SDP exchange.

   A device with the CLUE Participant state machine in the ACTIVE state
   MAY choose to delay moving from ESTABLISHED to ADV (Media Provider
   state machine) or from ESTABLISHED to WAIT FOR CONF RESPONSE (Media
   Consumer state machine) based on the SDP state.  See [RFC8847] for
   CLUE state machine specifics.  Similarly, a device MAY choose to
   delay initiating a new SDP exchange based on the state of their CLUE
   state machines.

5.2.  Constraints on Sending Media

   While SDP and CLUE message states do not impose constraints on each
   other, both impose constraints on the sending of media -- CLUE-
   controlled media MUST NOT be sent unless it has been negotiated in
   both CLUE and SDP: an implementation MUST NOT send a specific CLUE
   Capture Encoding unless its most recent SDP exchange contains an
   active media channel for that Encoding AND it has received a CLUE
   'configure' message specifying a valid Capture for that Encoding.

5.3.  Recommendations for Operating with Non-atomic Operations

   CLUE-capable devices MUST be able to handle states in which CLUE
   messages make reference to EncodingIDs that do not match the most
   recently received SDP, irrespective of the order in which SDP and
   CLUE messages are received.  While these mismatches will usually be
   transitory, a device MUST be able to cope with such mismatches
   remaining indefinitely.  However, this document makes some
   recommendations on message ordering for these non-atomic transitions.

   CLUE-capable devices MUST ensure that any inconsistencies between SDP
   and CLUE signaling are temporary by sending updated SDP or CLUE
   messages as soon as the relevant state machines and other constraints
   permit.

   Generally, implementations that receive messages with incomplete
   information will be most efficient if they wait until they have the
   corresponding information they lack before sending messages to make
   changes related to that information.  For example, an answerer that
   receives a new SDP Offer with three new "a=sendonly" CLUE "m=" lines
   for which it has received no CLUE 'advertisement' message providing
   the corresponding capture information would typically include
   corresponding "a=inactive" lines in its answer, and it would only
   make a new SDP Offer with "a=recvonly" when and if a new
   'advertisement' message arrives with Captures relevant to those
   Encodings.

   Because of the constraints of SDP Offer/Answer and because new SDP
   negotiations are generally more 'costly' than sending a new CLUE
   message, implementations needing to make changes to both channels
   SHOULD prioritize sending the updated CLUE message over sending the
   new SDP message.  The aim is for the recipient to receive the CLUE
   changes before the SDP changes, allowing the recipient to send their
   SDP Answers without incomplete information and reducing the number of
   new SDP Offers required.

6.  Interaction of the CLUE Protocol and RTP/RTCP CaptureID

   The CLUE Framework [RFC8845] allows for Multiple Content Captures
   (MCCs): Captures that contain multiple source Captures, whether
   composited into a single stream or switched based on some metric.

   The Captures that contribute to these MCCs may or may not be defined
   in the 'advertisement' message.  If they are defined and the MCC is
   providing them in a switched format, the recipient may wish to
   determine which originating source Capture is currently being
   provided, so that they can apply geometric corrections based on that
   Capture's geometry or take some other action based on the original
   Capture information.

   To do this, [RFC8849] allows for the CaptureID of the originating
   Capture to be conveyed via RTP or RTCP.  A Media Provider sending
   switched media for an MCC with defined originating sources MUST send
   the CaptureID in both RTP and RTCP, as described in the mapping
   document.

6.1.  CaptureID Reception during MCC Redefinition

   Because the RTP/RTCP CaptureID is delivered via a different channel
   to the 'advertisement' message in which in the contents of the MCC
   are defined, there is an intrinsic race condition in cases where the
   contents of an MCC are redefined.

   When a Media Provider redefines an MCC that involves CaptureIDs, the
   reception of the relevant CaptureIDs by the recipient will either
   lead or lag reception and the processing of the new 'advertisement'
   message by the recipient.  As such, a Media Consumer MUST NOT be
   disrupted by any of the following scenarios in any CLUE-controlled
   media stream it is receiving, whether that stream is for a static
   Capture or for an MCC (as any static Capture may be redefined to an
   MCC in a later 'advertisement' message):

   *  By receiving RTP or RTCP containing a CaptureID when the most
      recently processed 'advertisement' message means that no media
      CaptureIDs are expected.

   *  By receiving RTP or RTCP without CaptureIDs when the most recently
      processed 'advertisement' message means that media CaptureIDs are
      expected.

   *  By receiving a CaptureID in RTP or RTCP for a Capture defined in
      the most recently processed 'advertisement' message, but which the
      same 'advertisement' message does not include in the MCC.

   *  By receiving a CaptureID in RTP or RTCP for a Capture not defined
      in the most recently processed 'advertisement' message.

7.  Multiplexing of CLUE-Controlled Media Using BUNDLE

7.1.  Overview

   A CLUE call may involve sending and/or receiving significant numbers
   of media streams.  Conventionally, media streams are sent and
   received on unique ports.  However, each separate port used for this
   purpose may impose costs that a device wishes to avoid, such as the
   need to open that port on firewalls and NATs, the need to collect
   Interactive Connectivity Establishment (ICE) candidates [RFC8445],
   etc.

   The BUNDLE extension [RFC8843] can be used to negotiate the
   multiplexing of multiple media lines onto a single 5-tuple for
   sending and receiving media, allowing devices in calls to another
   BUNDLE-supporting device to potentially avoid some of the above
   costs.

   While CLUE-capable devices MAY support the BUNDLE extension for this
   purpose, supporting the extension is not mandatory for a device to be
   CLUE compliant.

   A CLUE-capable device that supports BUNDLE SHOULD also support rtcp-
   mux [RFC5761].  However, a CLUE-capable device that supports rtcp-mux
   may or may not support BUNDLE.

7.2.  Usage of BUNDLE with CLUE

   This specification imposes no additional requirements or restrictions
   on the usage of BUNDLE when used with CLUE.  There is no restriction
   on combining CLUE-controlled media lines and non-CLUE-controlled
   media lines in the same BUNDLE group or in multiple such groups.
   However, there are several steps an implementation may wish to take
   to ameliorate the cost and time requirements of extra SDP Offer/
   Answer exchanges between CLUE and BUNDLE.

7.2.1.  Generating the Initial Offer

   BUNDLE mandates that the initial SDP Offer MUST use a unique address
   for each "m=" line with a non-zero port.  Because CLUE
   implementations generally will not include CLUE-controlled media
   lines, with the exception of the data channel in the initial SDP
   Offer, CLUE devices that support large numbers of streams can avoid
   ever having to open large numbers of ports if they successfully
   negotiate BUNDLE.

   An implementation that does include CLUE-controlled media lines in
   its initial SDP Offer while also using BUNDLE must take care to avoid
   rendering its CLUE-controlled media lines unusable in the event the
   far end does not negotiate BUNDLE if it wishes to avoid the risk of
   additional SDP exchanges to resolve this issue.  This is best
   achieved by not sending any CLUE-controlled media lines in an initial
   offer with the 'bundle-only' attribute unless it has been established
   via some other channel that the recipient supports and is able to use
   BUNDLE.

7.2.2.  Multiplexing of the Data Channel and RTP Media

   BUNDLE-supporting CLUE-capable devices MAY include the data channel
   in the same BUNDLE group as RTP media.  In this case, the device MUST
   be able to demultiplex the various transports -- see Section 9.2 of
   the BUNDLE specification [RFC8843].  If the BUNDLE group includes
   protocols other than the data channel transported via DTLS, the
   device MUST also be able to differentiate the various protocols.

8.  Example: A Call between Two CLUE-Capable Endpoints

   This example illustrates a call between two CLUE-capable Endpoints.
   Alice, initiating the call, is a system with three cameras and three
   screens.  Bob, receiving the call, is a system with two cameras and
   two screens.  A call-flow diagram is presented, followed by a summary
   of each message.

   To manage the size of this section, the SDP snippets only illustrate
   video "m=" lines.  SIP ACKs are not always discussed.  Note that
   BUNDLE is not in use.

                 +----------+                      +-----------+
                 |  Alice   |                      |    Bob    |
                 |          |                      |           |
                 +----+-----+                      +-----+-----+
                      |                                  |
                      |                                  |
                      | SIP INVITE 1                     |
                      |--------------------------------->|
                      |                                  |
                      |                                  |
                      |                     SIP 200 OK 1 |
                      |<---------------------------------|
                      |                                  |
                      |                                  |
                      | SIP ACK 1                        |
                      |--------------------------------->|
                      |                                  |
                      |                                  |
                      |                                  |
                      |<########### MEDIA 1 ############>|
                      |   1 video A->B, 1 video B->A     |
                      |<################################>|
                      |                                  |
                      |                                  |
                      |                                  |
                      |<================================>|
                      |   CLUE DATA CHANNEL ESTABLISHED  |
                      |<================================>|
                      |                                  |
                      |                                  |
                      | CLUE OPTIONS                     |
                      |<*********************************|
                      |                                  |
                      |                                  |
                      |            CLUE OPTIONS RESPONSE |
                      |*********************************>|
                      |                                  |
                      |                                  |
                      | CLUE ADVERTISEMENT 1             |
                      |*********************************>|
                      |                                  |
                      |                                  |
                      |             CLUE ADVERTISEMENT 2 |
                      |<*********************************|
                      |                                  |
                      |                                  |
                      | CLUE ACK 1                       |
                      |<*********************************|
                      |                                  |
                      |                                  |
                      |                       CLUE ACK 2 |
                      |*********************************>|
                      |                                  |
                      |                                  |
                      | SIP INVITE 2 (+3 sendonly)       |
                      |--------------------------------->|
                      |                                  |
                      |                                  |
                      |                 CLUE CONFIGURE 1 |
                      |<*********************************|
                      |                                  |
                      |                                  |
                      |       SIP 200 OK 2 (+2 recvonly) |
                      |<---------------------------------|
                      |                                  |
                      |                                  |
                      | CLUE CONFIGURE RESPONSE 1        |
                      |*********************************>|
                      |                                  |
                      |                                  |
                      | SIP ACK 2                        |
                      |--------------------------------->|
                      |                                  |
                      |                                  |
                      |                                  |
                      |<########### MEDIA 2 ############>|
                      |   2 video A->B, 1 video B->A     |
                      |<################################>|
                      |                                  |
                      |                                  |
                      |       SIP INVITE 3 (+2 sendonly) |
                      |<---------------------------------|
                      |                                  |
                      |                                  |
                      | CLUE CONFIGURE 2                 |
                      |*********************************>|
                      |                                  |
                      |                                  |
                      | SIP 200 OK 3 (+2 recvonly)       |
                      |--------------------------------->|
                      |                                  |
                      |                                  |
                      |        CLUE CONFIGURE RESPONSE 2 |
                      |<*********************************|
                      |                                  |
                      |                                  |
                      |                        SIP ACK 3 |
                      |<---------------------------------|
                      |                                  |
                      |                                  |
                      |                                  |
                      |<########### MEDIA 3 ############>|
                      |   2 video A->B, 2 video B->A     |
                      |<################################>|
                      |                                  |
                      |                                  |
                      |                                  |
                      v                                  v

   In SIP INVITE 1, Alice sends Bob a SIP INVITE with the basic audio
   and video capabilities and data channel included in the SIP body as
   per [RFC8841].  Alice also includes the "sip.clue" media feature tag
   in the INVITE.  A snippet of the SDP showing the grouping attribute
   and the video "m=" line are shown below.  Alice has included a "CLUE"
   group and the mid corresponding to a data channel in the group (3).
   Note that Alice has chosen not to include any CLUE-controlled media
   in the initial offer -- the "mid" value of the video line is not
   included in the "CLUE" group.

      ...
      a=group:CLUE 3
      ...
      m=video 6002 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=sendrecv
      a=mid:2
      ...
      m=application 6100 UDP/DTLS/SCTP webrtc-datachannel
      a=setup:actpass
      a=sctp-port: 5000
      a=dcmap:2 subprotocol="CLUE";ordered=true
      a=mid:3

   Bob responds with a similar SDP in SIP 200 OK 1, which also has a
   "CLUE" group including the "mid" value of a data channel; due to
   their similarity, no SDP snippet is shown here.  Bob wishes to
   receive initial media and thus includes corresponding non-CLUE-
   controlled audio and video lines.  Bob also includes the "sip.clue"
   media feature tag in the 200 OK.  Alice and Bob are each now able to
   send a single audio and video stream.  This is illustrated as MEDIA
   1.

   With the successful initial SDP Offer/Answer exchange complete, Alice
   and Bob are also free to negotiate the CLUE data channel.  This is
   illustrated as CLUE DATA CHANNEL ESTABLISHED.

   Once the data channel is established, CLUE protocol negotiation
   begins.  In this case, Bob was the DTLS client (sending "a=active" in
   his SDP Answer) and hence is the CLUE Channel Initiator.  He sends a
   CLUE OPTIONS message describing his version support.  On receiving
   that message, Alice sends her corresponding CLUE OPTIONS RESPONSE.

   With the OPTIONS phase complete, Alice now sends her CLUE
   'advertisement' message (CLUE ADVERTISEMENT 1).  She advertises three
   static Captures representing her three cameras.  She also includes
   switched Captures suitable for systems with one or two screens.  All
   of these Captures are in a single Capture Scene, with suitable
   Capture Scene Views that tell Bob he should subscribe to the three
   static Captures, the two switched Captures, or the one switched
   Capture.  Alice has no simultaneity constraints, so all six Captures
   are included in one simultaneous set.  Finally, Alice includes an
   Encoding Group with three Encoding IDs: "enc1", "enc2", and "enc3".
   These Encoding IDs aren't currently valid but will match the next SDP
   Offer she sends.

   Bob received CLUE ADVERTISEMENT 1 but does not yet send a 'configure'
   message, because he has not yet received Alice's Encoding
   information; thus, he does not know if she will have sufficient
   resources in order to send him the two streams he ideally wants at a
   quality he is happy with.  Because Bob is not sending an immediate
   'configure' message with the "ack" element set, he must send an
   explicit 'ack' message (CLUE ACK 1) to signal receipt of CLUE
   ADVERTISEMENT 1.

   Bob also sends his CLUE 'advertisement' message (CLUE ADVERTISEMENT
   2) -- though the diagram shows that this occurs after Alice sends
   CLUE ADVERTISEMENT 1, Bob sends his 'advertisement' message
   independently and does not wait for CLUE ADVERTISEMENT 1 to arrive.
   He advertises two static Captures representing his cameras.  He also
   includes a single composed Capture for single-screen systems, in
   which he will composite the two camera views into a single video
   stream.  All three Captures are in a single Capture Scene, with
   suitable Capture Scene Views that tell Alice she should subscribe to
   either the two static Captures or the single composed Capture.  Bob
   also has no simultaneity constraints, so he includes all three
   Captures in one simultaneous set.  Bob also includes a single
   Encoding Group with two Encoding IDs: "foo" and "bar".

   Similarly, Alice receives CLUE ADVERTISEMENT 2 but does not yet send
   a 'configure' message, because she has not yet received Bob's
   Encoding information; instead, she sends an 'ack' message (CLUE ACK
   2).

   Both sides have now sent their CLUE 'advertisement' messages, and an
   SDP exchange is required to negotiate Encodings.  For simplicity, in
   this case, Alice is shown sending an INVITE with a new offer; in many
   implementations, both sides might send an INVITE, which would be
   resolved by use of the 491 Request Pending resolution mechanism from
   [RFC3261].

   Alice now sends SIP INVITE 2.  She maintains the sendrecv audio,
   video, and CLUE "m=" lines, and she adds three new sendonly "m="
   lines to represent the three CLUE-controlled Encodings she can send.
   Each of these "m=" lines has a label corresponding to one of the
   Encoding IDs from CLUE ADVERTISEMENT 1.  Each also has its mid added
   to the grouping attribute to show they are controlled by the CLUE
   data channel.  A snippet of the SDP showing the grouping attribute,
   data channel, and video "m=" lines are shown below:

      ...
      a=group:CLUE 3 4 5 6
      ...
      m=video 6002 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=sendrecv
      a=mid:2
      ...
      m=application 6100 UDP/DTLS/SCTP webrtc-datachannel
      a=sctp-port: 5000
      a=dcmap:2 subprotocol="CLUE";ordered=true
      a=mid:3
      ...
      m=video 6004 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016
      a=sendonly
      a=mid:4
      a=label:enc1
      m=video 6006 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016
      a=sendonly
      a=mid:5
      a=label:enc2
      m=video 6008 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016
      a=sendonly
      a=mid:6
      a=label:enc3

   Bob now has all the information he needs to decide which streams to
   configure, allowing him to send both a CLUE 'configure' message and
   his SDP Answer.  As such, he now sends CLUE CONFIGURE 1.  This
   requests the pair of switched Captures that represent Alice's scene,
   and he configures them with encoder ids "enc1" and "enc2".

   Bob also sends his SDP Answer as part of SIP 200 OK 2.  Alongside his
   original audio, video, and CLUE "m=" lines, he includes three
   additional "m=" lines corresponding to the three added by Alice: two
   active recvonly "m= "lines and an inactive "m=" line for the third.
   He adds their "mid" values to the grouping attribute to show they are
   controlled by the CLUE data channel.  A snippet of the SDP showing
   the grouping attribute and the video "m=" lines are shown below (mid
   100 represents the CLUE data channel, which is not shown):

      ...
      a=group:CLUE 11 12 13 100
      ...
      m=video 58722 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=sendrecv
      a=mid:10
      ...
      m=video 58724 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=recvonly
      a=mid:11
      m=video 58726 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=recvonly
      a=mid:12
      m=video 58728 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=inactive
      a=mid:13

   Alice receives Bob's CLUE CONFIGURE 1 message and sends CLUE
   CONFIGURE RESPONSE 1 to acknowledge its reception.  She does not yet
   send the Capture Encodings specified, because at this stage, she
   hasn't processed Bob's answer SDP and thus hasn't negotiated the
   ability for Bob to receive these streams.

   On receiving SIP 200 OK 2 from Bob, Alice sends her SIP ACK (SIP ACK
   2).  She is now able to send the two streams of video Bob requested
   -- this is illustrated as MEDIA 2.

   The constraints of offer/answer meant that Bob could not include his
   Encoding information as new "m=" lines in SIP 200 OK 2.  As such, Bob
   now sends SIP INVITE 3 to generate a new offer.  Along with all the
   streams from SIP 200 OK 2, Bob also includes two new sendonly
   streams.  Each stream has a label corresponding to the Encoding IDs
   in his CLUE ADVERTISEMENT 2 message.  He also adds their "mid" values
   to the grouping attribute to show they are controlled by the CLUE
   data channel.  A snippet of the SDP showing the grouping attribute
   and the video "m=" lines are shown below (mid 100 represents the CLUE
   data channel, which is not shown):

      ...
      a=group:CLUE 11 12 14 15 100
      ...
      m=video 58722 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=sendrecv
      a=mid:10
      ...
      m=video 58724 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=recvonly
      a=mid:11
      m=video 58726 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=recvonly
      a=mid:12
      m=video 0 RTP/AVP 96
      a=mid:13
      m=video 58728 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016
      a=sendonly
      a=label:foo
      a=mid:14
      m=video 58730 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016
      a=sendonly
      a=label:bar
      a=mid:15

   Having received this, Alice now has all the information she needs to
   send her CLUE 'configure' message and her SDP Answer.  In CLUE
   CONFIGURE 2, she requests the two static Captures from Bob to be sent
   on Encodings "foo" and "bar".

   Alice also sends SIP 200 OK 3, matching two recvonly "m=" lines to
   Bob's new sendonly lines.  She includes their "mid" values in the
   grouping attribute to show they are controlled by the CLUE data
   channel.  Alice then deactivates the initial non-CLUE-controlled
   media, as bidirectional CLUE-controlled media is now available.  A
   snippet of the SDP showing the grouping attribute and the video "m="
   lines are shown below (mid 3 represents the data channel, not shown):

      ...
      a=group:CLUE 3 4 5 7 8
      ...
      m=video 0 RTP/AVP 96
      a=mid:2
      ...
      m=video 6004 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016
      a=sendonly
      a=mid:4
      a=label:enc1
      m=video 6006 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016
      a=sendonly
      a=mid:5
      a=label:enc2
      m=video 0 RTP/AVP 96
      a=mid:6
      m=video 6010 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=recvonly
      a=mid:7
      m=video 6012 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=recvonly
      a=mid:8

   Bob receives Alice's CLUE CONFIGURE 2 message and sends CLUE
   CONFIGURE RESPONSE 2 to acknowledge its reception.  Bob does not yet
   send the Capture Encodings specified, because he hasn't yet received
   and processed Alice's SDP Answer and negotiated the ability to send
   these streams.

   Finally, on receiving SIP 200 OK 3, Bob is now able to send the two
   streams of video Alice requested -- this is illustrated as MEDIA 3.

   Both sides of the call are now sending multiple video streams with
   their sources defined via CLUE negotiation.  As the call progresses,
   either side can send a new 'advertisement' or 'configure' message or
   the new SDP Offers/Answers to add, remove, or change what they have
   available or want to receive.

9.  Example: A Call between a CLUE-Capable and Non-CLUE Endpoint

   In this brief example, Alice is a CLUE-capable Endpoint making a call
   to Bob, who is not CLUE capable (i.e., is not able to use the CLUE
   protocol).

         +----------+                      +-----------+
         |  Alice   |                      |    Bob    |
         |          |                      |           |
         +----+-----+                      +-----+-----+
              |                                  |
              |                                  |
              | SIP INVITE 1                     |
              |--------------------------------->|
              |                                  |
              |                                  |
              |                         200 0K 1 |
              |<---------------------------------|
              |                                  |
              |                                  |
              | SIP ACK 1                        |
              |--------------------------------->|
              |                                  |
              |                                  |
              |                                  |
              |<########### MEDIA 1 ############>|
              |   1 video A->B, 1 video B->A     |
              |<################################>|
              |                                  |
              |                                  |
              |                                  |
              |                                  |
              v                                  v

   In SIP INVITE 1, Alice sends Bob a SIP INVITE including the basic
   audio and video capabilities and data channel in the SDP body as per
   [RFC8841].  Alice also includes the "sip.clue" media feature tag in
   the INVITE.  A snippet of the SDP showing the grouping attribute and
   the video "m=" line are shown below.  Alice has included a "CLUE"
   group and the mid corresponding to a data channel in the group (3).
   Note that Alice has chosen not to include any CLUE-controlled media
   in the initial offer -- the "mid" value of the video line is not
   included in the "CLUE" group.

      ...
      a=group:CLUE 3
      ...
      m=video 6002 RTP/AVP 96
      a=rtpmap:96 H264/90000
      a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
      a=sendrecv
      a=mid:2
      ...
      m=application 6100 UDP/DTLS/SCTP webrtc-datachannel
      a=sctp-port: 5000
      a=dcmap:2 subprotocol="CLUE";ordered=true
      a=mid:3

   Bob is not CLUE capable and hence does not recognize the "CLUE"
   semantic for the grouping attribute, nor does he support the data
   channel.  IN SIP 200 OK 1, he responds with an answer that includes
   audio and video, but with the data channel zeroed.

   From the lack of a CLUE group, Alice understands that Bob does not
   support CLUE, or does not wish to use it.  Both sides are now able to
   send a single audio and video stream to each other.  At this point,
   Alice begins to send her fallback video: in this case, it's likely a
   switched view from whichever camera shows the current loudest
   participant on her side.

10.  IANA Considerations

10.1.  New SDP Grouping Framework Attribute

   This document registers the following semantics with IANA in the
   "Semantics for the 'group' SDP Attribute" subregistry (under the
   "Session Description Protocol (SDP) Parameters" registry) per
   [RFC5888]:

   +===========================+=======+==============+===========+
   |         Semantics         | Token | Mux Category | Reference |
   +===========================+=======+==============+===========+
   | CLUE-controlled "m=" line | CLUE  | NORMAL       | RFC 8848  |
   +---------------------------+-------+--------------+-----------+

                               Table 1

10.2.  New SIP Media Feature Tag

   This specification registers a new media feature tag in the SIP
   [RFC3261] tree per the procedures defined in [RFC2506] and [RFC3840].

   Media feature tag name:  sip.clue

   ASN.1 Identifier:  30

   Summary of the media feature indicated by this tag:  This feature tag
      indicates that the device supports CLUE-controlled media.

   Values appropriate for use with this feature tag:  Boolean.

   The feature tag is intended primarily for use in the following
   applications, protocols, services, or negotiation mechanisms:
      This feature tag is most useful in a communications application
      for describing the capabilities of a device to use the CLUE
      control protocol to negotiate the use of multiple media streams.

   Related standards or documents:  RFC 8848

   Security Considerations:  Security considerations for this media
      feature tag are discussed in Section 11 of RFC 8848.

   Name(s) & email address(es) of person(s) to contact for further
   information:  Internet Engineering Steering Group <iesg@ietf.org>

   Intended usage:  COMMON

11.  Security Considerations

   CLUE makes use of a number of protocols and mechanisms, either
   defined by CLUE or long-standing.  The Security Considerations
   section of the CLUE Framework document [RFC8845] addresses the need
   to secure these mechanisms by following the recommendations of the
   individual protocols.

   Beyond the need to secure the constituent protocols, the use of CLUE
   does impose additional security concerns.  One area of increased risk
   involves the potential for a malicious party to subvert a CLUE-
   capable device to attack a third party by driving large volumes of
   media (particularly video) traffic at them by establishing a
   connection to the CLUE-capable device and directing the media to the
   victim.  While this is a risk for all media devices, a CLUE-capable
   device may allow the attacker to configure multiple media streams to
   be sent, significantly increasing the volume of traffic directed at
   the victim.

   This attack can be prevented by ensuring that the media recipient
   intends to receive the media packets.  As such, all CLUE-capable
   devices MUST support key negotiation and receiver intent assurance
   via DTLS / Secure Real-time Transport Protocol (SRTP) [RFC5763] on
   CLUE-controlled RTP "m=" lines, and they MUST use it or some other
   mechanism that provides receiver intent assurance.  All CLUE-
   controlled RTP "m" lines must be secured and implemented using
   mechanisms such as SRTP [RFC3711].  CLUE implementations MAY choose
   not to require the use of SRTP to secure legacy (non-CLUE-controlled)
   media for backwards compatibility with older SIP clients that are
   incapable of supporting it.

   CLUE also defines a new media feature tag that indicates CLUE
   support.  This tag may be present even in non-CLUE calls, which
   increases the metadata available about the sending device; this can
   help an attacker differentiate between multiple devices and identify
   otherwise anonymized users via the fingerprint of features their
   device supports.  To prevent this, SIP signaling used to set up CLUE
   sessions SHOULD always be encrypted using TLS [RFC5630].

   The CLUE protocol also carries additional information that could be
   used to help fingerprint a particular user or to identify the
   specific version of software being used.  The CLUE Framework
   [RFC8847] provides details about these issues and how to mitigate
   them.

12.  References

12.1.  Normative References

   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119,
              DOI 10.17487/RFC2119, March 1997,
              <https://www.rfc-editor.org/info/rfc2119>.

   [RFC3711]  Baugher, M., McGrew, D., Naslund, M., Carrara, E., and K.
              Norrman, "The Secure Real-time Transport Protocol (SRTP)",
              RFC 3711, DOI 10.17487/RFC3711, March 2004,
              <https://www.rfc-editor.org/info/rfc3711>.

   [RFC3840]  Rosenberg, J., Schulzrinne, H., and P. Kyzivat,
              "Indicating User Agent Capabilities in the Session
              Initiation Protocol (SIP)", RFC 3840,
              DOI 10.17487/RFC3840, August 2004,
              <https://www.rfc-editor.org/info/rfc3840>.

   [RFC4574]  Levin, O. and G. Camarillo, "The Session Description
              Protocol (SDP) Label Attribute", RFC 4574,
              DOI 10.17487/RFC4574, August 2006,
              <https://www.rfc-editor.org/info/rfc4574>.

   [RFC5763]  Fischl, J., Tschofenig, H., and E. Rescorla, "Framework
              for Establishing a Secure Real-time Transport Protocol
              (SRTP) Security Context Using Datagram Transport Layer
              Security (DTLS)", RFC 5763, DOI 10.17487/RFC5763, May
              2010, <https://www.rfc-editor.org/info/rfc5763>.

   [RFC5888]  Camarillo, G. and H. Schulzrinne, "The Session Description
              Protocol (SDP) Grouping Framework", RFC 5888,
              DOI 10.17487/RFC5888, June 2010,
              <https://www.rfc-editor.org/info/rfc5888>.

   [RFC8174]  Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC
              2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174,
              May 2017, <https://www.rfc-editor.org/info/rfc8174>.

   [RFC8831]  Jesup, R., Loreto, S., and M. Tüxen, "WebRTC Data
              Channels", RFC 8831, DOI 10.17487/RFC8831, January 2021,
              <https://www.rfc-editor.org/info/rfc8831>.

   [RFC8841]  Holmberg, C., Shpount, R., Loreto, S., and G. Camarillo,
              "Session Description Protocol (SDP) Offer/Answer
              Procedures for Stream Control Transmission Protocol (SCTP)
              over Datagram Transport Layer Security (DTLS) Transport",
              RFC 8841, DOI 10.17487/RFC8841, January 2021,
              <https://www.rfc-editor.org/info/rfc8841>.

   [RFC8843]  Holmberg, C., Alvestrand, H., and C. Jennings,
              "Negotiating Media Multiplexing Using the Session
              Description Protocol (SDP)", RFC 8843,
              DOI 10.17487/RFC8843, January 2021,
              <https://www.rfc-editor.org/info/rfc8843>.

   [RFC8845]  Duckworth, M., Ed., Pepperell, A., and S. Wenger,
              "Framework for Telepresence Multi-Streams", RFC 8845,
              DOI 10.17487/RFC8845, January 2021,
              <https://www.rfc-editor.org/info/rfc8845>.

   [RFC8846]  Presta, R. and S P. Romano, "An XML Schema for the
              Controlling Multiple Streams for Telepresence (CLUE) Data
              Model", RFC 8846, DOI 10.17487/RFC8846, January 2021,
              <http://www.rfc-editor.org/info/rfc8846>.

   [RFC8847]  Presta, R. and S P. Romano, "Protocol for Controlling
              Multiple Streams for Telepresence (CLUE)", RFC 8847,
              DOI 10.17487/RFC8847, January 2021,
              <https://www.rfc-editor.org/info/rfc8847>.

   [RFC8849]  Even, R. and J. Lennox, "Mapping RTP Streams to
              Controlling Multiple Streams for Telepresence (CLUE) Media
              Captures", RFC 8849, DOI 10.17487/RFC8849, January 2021,
              <https://www.rfc-editor.org/info/rfc8849>.

   [RFC8850]  Holmberg, C., "Controlling Multiple Streams for
              Telepresence (CLUE) Protocol Data Channel", RFC 8850,
              DOI 10.17487/RFC8850, January 2021,
              <https://www.rfc-editor.org/info/rfc8850>.

   [RFC8864]  Drage, K., Makaraju, M., Ejzak, R., Marcon, J., and R.
              Even, Ed., "Negotiation Data Channels Using the Session
              Description Protocol (SDP)", RFC 8864,
              DOI 10.17487/RFC8864, January 2021,
              <https://www.rfc-editor.org/info/rfc8864>.

12.2.  Informative References

   [RFC2506]  Holtman, K., Mutz, A., and T. Hardie, "Media Feature Tag
              Registration Procedure", BCP 31, RFC 2506,
              DOI 10.17487/RFC2506, March 1999,
              <https://www.rfc-editor.org/info/rfc2506>.

   [RFC3261]  Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston,
              A., Peterson, J., Sparks, R., Handley, M., and E.
              Schooler, "SIP: Session Initiation Protocol", RFC 3261,
              DOI 10.17487/RFC3261, June 2002,
              <https://www.rfc-editor.org/info/rfc3261>.

   [RFC3264]  Rosenberg, J. and H. Schulzrinne, "An Offer/Answer Model
              with Session Description Protocol (SDP)", RFC 3264,
              DOI 10.17487/RFC3264, June 2002,
              <https://www.rfc-editor.org/info/rfc3264>.

   [RFC3311]  Rosenberg, J., "The Session Initiation Protocol (SIP)
              UPDATE Method", RFC 3311, DOI 10.17487/RFC3311, October
              2002, <https://www.rfc-editor.org/info/rfc3311>.

   [RFC4566]  Handley, M., Jacobson, V., and C. Perkins, "SDP: Session
              Description Protocol", RFC 4566, DOI 10.17487/RFC4566,
              July 2006, <https://www.rfc-editor.org/info/rfc4566>.

   [RFC5630]  Audet, F., "The Use of the SIPS URI Scheme in the Session
              Initiation Protocol (SIP)", RFC 5630,
              DOI 10.17487/RFC5630, October 2009,
              <https://www.rfc-editor.org/info/rfc5630>.

   [RFC5761]  Perkins, C. and M. Westerlund, "Multiplexing RTP Data and
              Control Packets on a Single Port", RFC 5761,
              DOI 10.17487/RFC5761, April 2010,
              <https://www.rfc-editor.org/info/rfc5761>.

   [RFC6184]  Wang, Y.-K., Even, R., Kristensen, T., and R. Jesup, "RTP
              Payload Format for H.264 Video", RFC 6184,
              DOI 10.17487/RFC6184, May 2011,
              <https://www.rfc-editor.org/info/rfc6184>.

   [RFC8445]  Keranen, A., Holmberg, C., and J. Rosenberg, "Interactive
              Connectivity Establishment (ICE): A Protocol for Network
              Address Translator (NAT) Traversal", RFC 8445,
              DOI 10.17487/RFC8445, July 2018,
              <https://www.rfc-editor.org/info/rfc8445>.

Acknowledgements

   Besides the authors, the team focusing on this document consists of:
   Roni Even, Simon Pietro Romano, and Roberta Presta.

   Christian Groves, Jonathan Lennox, and Adam Roach have contributed
   detailed comments and suggestions.

Authors' Addresses

   Robert Hanton
   Cisco Systems

   Email: rohanse2@cisco.com


   Paul Kyzivat

   Email: pkyzivat@alum.mit.edu


   Lennard Xiao
   Beijing Chuangshiyoulian

   Email: lennard.xiao@outlook.com


   Christian Groves

   Email: cngroves.std@gmail.com