diff options
author | Thomas Voss <mail@thomasvoss.com> | 2024-11-27 20:54:24 +0100 |
---|---|---|
committer | Thomas Voss <mail@thomasvoss.com> | 2024-11-27 20:54:24 +0100 |
commit | 4bfd864f10b68b71482b35c818559068ef8d5797 (patch) | |
tree | e3989f47a7994642eb325063d46e8f08ffa681dc /doc/rfc/rfc4175.txt | |
parent | ea76e11061bda059ae9f9ad130a9895cc85607db (diff) |
doc: Add RFC documents
Diffstat (limited to 'doc/rfc/rfc4175.txt')
-rw-r--r-- | doc/rfc/rfc4175.txt | 1011 |
1 files changed, 1011 insertions, 0 deletions
diff --git a/doc/rfc/rfc4175.txt b/doc/rfc/rfc4175.txt new file mode 100644 index 0000000..dfce28c --- /dev/null +++ b/doc/rfc/rfc4175.txt @@ -0,0 +1,1011 @@ + + + + + + +Network Working Group L. Gharai +Request for Comments: 4175 USC/ISI +Category: Standards Track C. Perkins + University of Glasgow + September 2005 + + + RTP Payload Format for Uncompressed Video + +Status of This Memo + + This document specifies an Internet standards track protocol for the + Internet community, and requests discussion and suggestions for + improvements. Please refer to the current edition of the "Internet + Official Protocol Standards" (STD 1) for the standardization state + and status of this protocol. Distribution of this memo is unlimited. + +Copyright Notice + + Copyright (C) The Internet Society (2005). + +Abstract + + This memo specifies a packetization scheme for encapsulating + uncompressed video into a payload format for the Real-time Transport + Protocol, RTP. It supports a range of standard- and high-definition + video formats, including common television formats such as ITU + BT.601, and standards from the Society of Motion Picture and + Television Engineers (SMPTE), such as SMPTE 274M and SMPTE 296M. The + format is designed to be applicable and extensible to new video + formats as they are developed. + +1. Introduction + + This memo defines a scheme to packetize uncompressed, studio-quality + video streams for transport using RTP [RTP]. It supports a range of + standard and high-definition video formats, including ITU-R BT.601 + [601], SMPTE 274M [274] and SMPTE 296M [296]. + + Formats for uncompressed standard definition television are defined + by ITU Recommendation BT.601 [601] along with bit-serial and parallel + interfaces in Recommendation BT.656 [656]. These formats allow both + 625-line and 525-line operation, with 720 samples per digital active + line, 4:2:2 color sub-sampling, and 8- or 10-bit digital + representation. + + + + + + +Gharai & Perkins Standards Track [Page 1] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + The representation of uncompressed high-definition television is + specified in SMPTE standards 274M [274] and 296M [296]. SMPTE 274M + defines a family of scanning systems with an image format of + 1920x1080 pixels with progressive and interlaced scanning, while + SMPTE 296M defines systems with an image size of 1280x720 pixels and + progressive scanning. In progressive scanning, scan lines are + displayed in sequence from top to bottom of a full frame. In + interlaced scanning, a frame is divided into its odd and even scan + lines (called fields) and the two fields are displayed in succession. + SMPTE 274M and 296M define images with aspect ratios of 16:9, and + define the digital representation for RGB and YCbCr components. In + the case of YCbCr components, the Cb and Cr components are + horizontally sub-sampled by a factor of two (4:2:2 color encoding). + + Although these formats differ in their details, they are structurally + very similar. This memo specifies a payload format to encapsulate + these and other similar video formats for transport within RTP. + +2. Conventions Used in This Document + + The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", + "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this + document are to be interpreted as described in RFC 2119 [2119]. + +3. Payload Design + + Each scan line of digital video is packetized into one or more RTP + packets. If the data for a complete scan line exceeds the network + MTU, the scan line SHOULD be fragmented into multiple RTP packets, + each smaller than the MTU. A single RTP packet MAY contain data for + more than one scan line. Only the active samples are included in the + RTP payload: inactive samples and the contents of horizontal and + vertical blanking SHOULD NOT be transported. In instances where + ancillary data is being transmitted, the sender and receiver can + disambiguate between ancillary and video data via scan line numbers. + That is, the ancillary data will use scan line numbers that are not + within the scope of the video frame. + + Scan line numbers are included in the RTP payload header, along with + a field identifier for interlaced video. + + For SMPTE 296M format video, valid scan line numbers are from 26 + through 745, inclusive. For progressive scan SMPTE 274M format + video, valid scan lines are from scan line 42 through 1121, + inclusive. For interlaced scan SMPTE 274M format video, valid + scan line numbers for field one (F=0) are from 21 to 560 and valid + scan line numbers for the second field (F=1) are from 584 to 1123. + For ITU-R BT.601 format video, the blanking intervals defined in + + + +Gharai & Perkins Standards Track [Page 2] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + BT.656 are used: for 625 line video, lines 24 to 310 of field one + (F=0) and 337 to 623 of the second field (F=1) are valid; for 525 + line video, lines 21 to 263 of the first field, and 284 to 525 of + the second field are valid. Other formats (e.g., [372]) may + define different ranges of active lines. + + The payload header contains a 16-bit extension to the standard 16-bit + RTP sequence number, thereby extending the sequence number to 32 bits + and enabling the payload format to accommodate high data rates + without ambiguity. This is necessary as the 16-bit RTP sequence + number will roll over very quickly for high data rates. For example, + for a 1-Gbps video stream with packet sizes of at least 1000 octets, + the standard RTP packet will roll over in 0.5 seconds, which can be a + problem for detecting loss and out-of-order packets particularly in + instances where the round-trip time is greater than half a second. + The extended 32-bit number allows for a longer wrap-around time of + approximately nine hours. + + Each scan line comprises an integer number of pixels. Each pixel is + represented by a number of samples. Samples may be coded as 8-, 10-, + 12-, or 16-bit values. A sample may represent a color component or a + luminance component of the video. Color samples may be shared + between adjacent pixels. The sharing of color samples between + adjacent pixels is known as color sub-sampling. This is typically + done in the YCbCr color space for the purpose of reducing the size of + the image data. + + Pixels that share sample values MUST be transported together as a + "pixel group". If 10-bit or 12-bit samples are used, each pixel may + also comprise a non-integer number of octets. In this case, several + pixels MUST be combined into an octet-aligned pixel group for + transmission. These restrictions simplify the operation of receivers + by ensuring that the complete payload is octet aligned, and that + samples relating to a single pixel are not fragmented across multiple + packets [ALF]. + + For example, in YCbCr video with 4:1:1 color sub-sampling, each group + of 4 adjacent pixels comprises 6 samples, Y1 Y2 Y3 Y4 Cr Cb, with the + Cr and Cb values being shared between all 4 pixels. If samples are + 8-bit values, the result is a group of 4 pixels comprising 6 octets. + If, however, samples are 10-bit values, the resulting 60-bit group is + not octet aligned. To be both octet aligned and appropriately + framed, two groups of 4 adjacent pixels must be collected, thereby + becoming octet aligned on a 15-octet boundary. This length is + referred to as the pixel group size ("pgroup"). + + + + + + +Gharai & Perkins Standards Track [Page 3] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + Formally, the "pgroup" parameter is the size in octets of the + smallest grouping of pixels such that 1) the grouping comprises an + integer number of octets; and 2) if color sub-sampling is used, + samples are only shared within the grouping. When packetizing + digital active line content, video data MUST NOT be fragmented within + a pgroup. + + Video content is almost always associated with additional information + such as audio tracks, time code, etc. In professional digital video + applications, this data is commonly embedded in non-active portions + of the video stream (horizontal and vertical blanking periods) so + that precise and robust synchronization is maintained. This payload + format requires that applications using such synchronized ancillary + data SHOULD deliver it in separate RTP sessions that operate + concurrently with the video session. The normal RTP mechanisms + SHOULD be used to synchronize the media. + +4. RTP Packetization + + The standard RTP header is followed by a 2-octet payload header that + extends the RTP Sequence Number, and by a 6-octet payload header for + each line (or partial line) of video included. One or more lines, or + partial lines, of video data follow. This format makes the payload + header 32-bit aligned in the common case, where one scan line (or + fragment) of video is included in each RTP packet. + + For example, if two lines of video are encapsulated, the payload + format will be as shown in Figure 1. + + + + + + + + + + + + + + + + + + + + + + + +Gharai & Perkins Standards Track [Page 4] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + 0 1 2 3 + 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 + +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + | V |P|X| CC |M| PT | Sequence Number | + +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + | Time Stamp | + +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + | SSRC | + +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + | Extended Sequence Number | Length | + +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + |F| Line No |C| Offset | + +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + | Length |F| Line No | + +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + |C| Offset | . + +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ . + . . + . Two (partial) lines of video data . + . . + +---------------------------------------------------------------+ + + Figure 1: RTP Payload Format showing two (partial) lines of video + +4.1. The RTP Header + + The fields of the fixed RTP header have their usual meaning, with the + following additional notes: + + Payload Type (PT): 7 bits + + A dynamically allocated payload type field that designates the + payload as uncompressed video. + + Timestamp: 32 bits + + For progressive scan video, the timestamp denotes the sampling + instant of the frame to which the RTP packet belongs. Packets MUST + NOT include data from multiple frames, and all packets belonging to + the same frame MUST have the same timestamp. + + For interlaced video, the timestamp denotes the sampling instant of + the field to which the RTP packet belongs. Packets MUST NOT + include data from multiple fields, and all packets belonging to the + same field MUST have the same timestamp. Use of field timestamps, + rather than a frame timestamp and field indicator bit, is needed to + support reverse 3-2 pulldown. + + + + +Gharai & Perkins Standards Track [Page 5] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + A 90-kHz timestamp SHOULD be used in both cases. If the sampling + instant does not correspond to an integer value of the clock (as + may be the case when interleaving), the value SHALL be truncated to + the next lowest integer, with no ambiguity. + + Marker bit (M): 1 bit + + If progressive scan video is being transmitted, the marker bit + denotes the end of a video frame. If interlaced video is being + transmitted, it denotes the end of the field. The marker bit MUST + be set to 1 for the last packet of the video frame/field. It MUST + be set to 0 for other packets. + + Sequence Number: 16 bits + + The low-order bits for RTP sequence number. The standard 16-bit + sequence number is augmented with another 16 bits in the payload + header in order avoid problems due to wrap-around when operating at + high rate rates. + +4.2. Payload Header + + Extended Sequence Number: 16 bits + + The high order bits of the extended 32-bit sequence number, in + network byte order. + + Length: 16 bits + + Number of octets of data included from this scan line, in network + byte order. This MUST be a multiple of the pgroup value. + + Line No.: 15 bits + + Scan line number of encapsulated data, in network byte order. + Successive RTP packets MAY contains parts of the same scan line + (with an incremented RTP sequence number, but the same timestamp), + if it is necessary to fragment a line. + + Offset: 15 bits + + Offset of the first pixel of the payload data within the scan line. + If YCbCr format data is being transported, this is the pixel offset + of the luminance sample; if RGB format data is being transported, + it is the pixel offset of the red sample; if BGR format data is + being transported, it is the pixel offset of the blue sample. The + + + + + +Gharai & Perkins Standards Track [Page 6] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + value is in network byte order. The offset has a value of zero if + the first sample in the payload corresponds to the start of the + line, and increments by one for each pixel. + + Field Identification (F): 1 bit + + Identifies which field the scan line belongs to, for interlaced + data. F=0 identifies the first field and F=1 the second field. + For progressive scan data (e.g., SMPTE 296M format video), F MUST + always be set to zero. + + Continuation (C): 1 bit + + Determines if an additional scan line header follows the current + scan line header in the RTP packet. Set to 1 if an additional + header follows, implying that the RTP packet is carrying data for + more than one scan line. Set to 0 otherwise. Several scan lines + MAY be included in a single packet, up to the path MTU limit. The + only way to determine the number of scan lines included per packet + is to parse the payload headers. + +4.3. Payload Data + + Depending on the video format, each RTP packet can include either a + single complete scan line, a single fragment of a scan line, or one + (or more) complete scan lines and scan line fragments. The length of + each scan line or scan line fragment MUST be an integer multiple of + the pgroup size in octets. Scan lines SHOULD be fragmented so that + the resulting RTP packet is smaller than the path MTU. + + It is possible that the scan line length is not evenly divisible by + the number of pixels in a pgroup, so the final pixel data of a scan + line does not align to either an octet or a pgroup boundary. + Nonetheless, the payload MUST contain a whole number of pgroups; the + sender MUST fill the remaining bits of the final pgroup with zero and + the receiver MUST ignore the fill data. (In effect, the trailing edge + of the image is black-filled to a pgroup boundary.) + + For RGB format video, samples are packed in order Red-Green-Blue. + For BGR format video, samples are packed in order Blue-Green-Red. + For both formats, if 8-bit samples are used, the pgroup is 3 octets. + If 10-bit samples are used, samples from 4 adjacent pixels form 15- + octet pgroups. If 12-bit samples are used, samples from 2 adjacent + pixels form 9-octet pgroups. If 16-bit samples are used, each pixel + forms a separate 6-octet pgroup. + + + + + + +Gharai & Perkins Standards Track [Page 7] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + For RGBA format video, samples are packed in order Red-Green-Blue- + Alpha. For BGRA format video, samples are packed in order Blue- + Green-Red-Alpha. For 8-, 10-, 12-, or 16-bit samples, each pixel + forms its own pgroup, with octet sizes of 4, 5, 6, and 8, + respectively. + + If the video is in YCbCr format, the packing of samples into the + payload depends on the color sub-sampling used. + + For YCbCr 4:4:4 format video, samples are packed in order Cb-Y-Cr for + both interlaced and progressive frames. If 8-bit samples are used, + the pgroup is 3 octets. If 10-bit samples are used, samples from 4 + adjacent pixels form 15-octet pgroups. If 12-bit samples are used, + samples from 2 adjacent pixels form 9-octet pgroups. If 16-bit + samples are used, each pixel forms a separate 6-octet pgroup. + + For YCbCr 4:2:2 format video, the Cb and Cr components are + horizontally sub-sampled by a factor of two (each Cb and Cr sample + corresponds to two Y components). Samples are packed in order Cb0- + Y0-Cr0-Y1 for both interlaced and progressive scan lines. For 8-, + 10-, 12-, or 16-bit samples, the pgroup is formed from two adjacent + pixels (4, 5, 6, or 8 octets, respectively). + + For YCbCr 4:1:1 format video, the Cb and Cr components are + horizontally sub-sampled by a factor of four (each Cb and Cr sample + corresponds to four Y components). Samples are packed in order Cb0- + Y0-Y1-Cr0-Y2-Y3 for both interlaced and progressive scan lines. For + 8-, 10-, 12-, or 16-bit samples, the pgroup is formed from four + adjacent pixels (6, 15, 9, or 12 octets, respectively). + + For YCbCr 4:2:0 video, the Cb and Cr components are sub-sampled by a + factor of two both horizontally and vertically. Therefore, + chrominance samples are shared between certain adjacent lines. + Figure 2 shows the composition of luminance and chrominance samples + for a 6x6 pixel grid of 4:2:0 YCbCr video. The pixel group is a + group of four pixels arranged in a 2x2 matrix. The octet size of the + pgroup for progressive scan 4:2:0 video with samples sizes of 8, 10, + 12, and 16 bits is 6, 15, 9, and 12 octets, respectively. For + interlaced 4:2:0 video, the corresponding pgroups are 4, 5, 6, and 8 + octets. + + + + + + + + + + + +Gharai & Perkins Standards Track [Page 8] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + line 0: Y00 Y01 Y02 Y03 Y04 Y05 + Cb00 Cr00 Cb01 Cr01 Cb02 Cr02 + line 1: Y10 Y11 Y12 Y13 Y14 Y15 + + line 2: Y20 Y21 Y22 Y23 Y24 Y25 + Cb10 Cr10 Cb11 Cr11 Cb12 Cr12 + line 3: Y30 Y31 Y32 Y33 Y34 Y35 + + line 4: Y40 Y41 Y42 Y43 Y44 Y45 + Cb20 Cr20 Cb21 Cr21 Cb22 Cr22 + line 5: Y50 Y51 Y52 Y53 Y54 Y55 + + Figure 2: Chrominance/luminance composition in 4:2:0 YCbCr video + + When packetizing progressive scan 4:2:0 YCbCr video, samples from two + consecutive scan lines are included in each packet. The scan line + number in the payload header is set to that of the first scan line of + the pair: + + line 0/1: + Y00-Y01-Y10-Y11-Cb00-Cr00 Y02-Y03-Y12-Y13-Cb01-Cr01 + Y04-Y05-Y14-Y15-Cb02-Cr02 + + line 2/3: + Y20-Y21-Y30-Y31-Cb10-Cr10 Y22-Y23-Y32-Y33-Cb11-Cr11 + Y24-Y25-Y34-Y35-Cb12-Cr12 + + line 4/5: + Y40-Y41-Y50-Y51-Cb20-Cr20 Y42-Y43-Y52-Y53-Cb21-Cr21 + Y44-Y45-Y54-Y55-Cb22-Cr22 + + Figure 3: Packetization of progressive 4:2:0 YCbCr video + + For interlaced transport, chrominance samples are transported with + every other line. The first set of chrominance samples may be + transported with either the first line of field 0, or the first line + of field 1. Figure 4 illustrates the transport of chrominance + samples starting with the first line of field 0 (signaled by the + "top-field-first" MIME parameter). + + + + + + + + + + + + +Gharai & Perkins Standards Track [Page 9] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + field 0: + line 0: Y00-Y01-Cb00-Cr00 Y02-Y03-Cb01-Cr01 Y04-Y05-Cb02-Cr02 + line 2: Y20-Y21 Y22-Y23 Y24-Y25 + line 4: Y40-Y41-Cb20-Cr20 Y42-Y43-Cb21-Cr21 Y44-Y45-Cb22-Cr22 + + field 1: + line 1: Y10-Y11 Y12-Y13 Y14-Y15 + line 3: Y30-Y31-Cb10-Cr10 Y32-Y33-Cb11 Cr11 Y34-Y35-Cb12-Cr12 + line 5: Y50-Y51 Y52-Y53 Y54-Y55 + + Figure 4: Packetization of interlaced 4:2:0 YCbCr video with + top-field-first. + + Chrominance values may be sampled with different offsets relative to + luminance values. For instance, in Figure 2, chrominance values are + sampled at the same distance from neighboring luminance samples. It + is also possible for a chrominance sample to be co-sited with a + luminance sample, as in Figure 5: + + line 0: Y00-C Y01 Y02-C Y03 Y04-C Y05 + + line 1: Y10 Y11 Y12 Y13 Y14 Y15 + + line 2: Y20-C Y21 Y22-C Y23 Y24-C Y25 + + line 3: Y30 Y31 Y32 Y33 Y34 Y35 + + line 4: Y40-C Y41 Y42-C Y43 Y44-C Y45 + + line 5: Y50 Y51 Y52 Y53 Y54 Y55 + + Figure 5: Co-sited video sampling in 4:2:0 YCbCr video where C + designates a CbCr pair + + In general, chrominance values may be placed between luminance + samples or co-sited. Positions can be designated by an integer + numbering system starting from left to right and top to bottom. The + position matrices shown in Figures 6, 7, and 8 apply for 4:2:0, + 4:2:2, and 4:1:1 video, respectively: + + line N: Y[0] [1] Y[2] Y[0] [1] Y[2] + [3] [4] Y[5] [3] [4] [5] + line N+1: Y[6] [7] Y[8] Y[6] [7] Y[8] + + Figure 6: Chrominance position matrix for 4:2:0 YCbCr video + + + + + + +Gharai & Perkins Standards Track [Page 10] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + line N: Y[0] [1] Y[2] [3] Y[0] [1] Y[2] [3] + line N+1: Y[0] [1] Y[2] [3] Y[0] [1] Y[2] [3] + + Figure 7: Chrominance position matrix for 4:2:2 YCbCr video + + + line N: Y[0] [1] Y[2] [3] Y[4] [5] Y[6] + line N+1: Y[0] [1] Y[2] [3] Y[4] [5] Y[6] + + Figure 8: Chrominance position matrix for 4:1:1 YCbCr video + + Although these positions do not affect the packetization order of + chrominance and luminance samples, the information is needed for + interpolation prior to display and therefore should be signaled to + the receiver. + +5. RTCP Considerations + + RTCP SHOULD be used as specified in RFC 3550 [RTP]. It is to be + noted that the sender's octet count in SR packets and the cumulative + number of packets lost will wrap around quickly for high data rate + streams. This means that these two fields may not accurately + represent octet count and number of packets lost since the beginning + of transmission, as defined in RFC 3550. Therefore, for network + monitoring purposes, other means of keeping track of these variables + SHOULD be used. + +6. IANA Considerations + + The IANA has registered one new MIME subtype along with an associated + RTP Payload Format, and has created two sub-parameter registries, as + described in the following. + +6.1. MIME type registration + + MIME media type name: video + + MIME subtype name: raw + + Required parameters: + + rate: The RTP timestamp clock rate. Applications using this + payload format SHOULD use a value of 90000. + + sampling: Determines the color (sub-)sampling mode of the video + stream. Currently defined values are RGB, RGBA, BGR, BGRA, + YCbCr-4:4:4, YCbCr-4:2:2, YCbCr-4:2:0, and YCbCr-4:1:1. New values + may be registered as described in section 6.2 of RFC 4175. + + + +Gharai & Perkins Standards Track [Page 11] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + width: Determines the number of pixels per line. This is an + integer between 1 and 32767. + + height: Determines the number of lines per frame. This is an + integer between 1 and 32767. + + depth: Determines the number of bits per sample. This is an + integer with typical values including 8, 10, 12, and 16. + + colorimetry: This parameter defines the set of colorimetric + specifications and other transfer characteristics for the video + source, by reference to an external specification. Valid values + and their specification are: + + BT601-5 ITU Recommendation BT.601-5 [601] + BT709-2 ITU Recommendation BT.709-2 [709] + SMPTE240M SMPTE standard 240M [240] + + New values may be registered as described in section 6.2 of RFC + 4175. + + + Optional parameters: + + Interlace: If this OPTIONAL parameter is present, it indicates that + the video stream is interlaced. If absent, progressive scan is + implied. + + Top-field-first: If this OPTIONAL parameter is present, it + indicates that chrominance samples are packetized starting with the + first line of field 0. Its absence implies that chrominance + samples are packetized starting with the first line of field 1. + + chroma-position: This OPTIONAL parameter defines the position of + chrominance samples relative to luminance samples. It is either a + single integer or a comma separated pair of integers. Integer + values range from 0 to 8, as specified in Figures 6-8 of RFC 4175. + A single integer implies that Cb and Cr are co-sited. A comma + separated pair of integers designates the locations of Cb and Cr + samples, respectively. In its absence, a single value of zero is + assumed for color-subsampled video (chroma-position=0). + + gamma: An OPTIONAL floating point gamma correction value. + + + + + + + + +Gharai & Perkins Standards Track [Page 12] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + Encoding considerations: + + Uncompressed video is only transmitted over RTP as specified in RFC + 4175. No file format media type has been defined to go with this + transmission media type at this time. + + Security considerations: See section 9 of RFC 4175. + + Interoperability considerations: NONE. + + Published specification: RFC 4175. + + Applications which use this media type: Video communication. + + Additional information: None + + Person & email address to contact for further information: + + Ladan Gharai <ladan@isi.edu> + IETF Audio/Video Transport working group. + + Intended usage: COMMON + + Author: Ladan Gharai <ladan@isi.edu> + Change controller: IETF AVT Working Group + delegated from the IESG + +6.2. Parameter Registration + + New values of the "sampling" parameter MAY be registered with the + IANA provided they reference an RFC or other permanent and readily + available specification (the Specification Required policy of RFC + 2434 [2434]). A new registration MUST define the packing order of + samples and a valid combinations of color and sub-sampling modes. + + New values of the "colorimetry" parameter MAY be registered with the + IANA provided they reference an RFC or other permanent and readily + available specification if colorimetric parameters and other + applicable transfer characteristics (the Specification Required + policy of RFC 2434 [2434]). + +7. Mapping MIME Parameters into SDP + + The information carried in the MIME media type specification has a + specific mapping to fields in the Session Description Protocol (SDP) + [SDP], which is commonly used to describe RTP sessions. When SDP is + used to specify sessions transporting uncompressed video, the mapping + is as follows: + + + +Gharai & Perkins Standards Track [Page 13] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + - The MIME type ("video") goes in SDP "m=" as the media name. + + - The MIME subtype (payload format name) goes in SDP "a=rtpmap" as + the encoding name. + + - Remaining parameters go in the SDP "a=fmtp" attribute by copying + them directly from the MIME media type string as a semicolon- + separated list of parameter=value pairs. + + A sample SDP mapping for uncompressed video is as follows: + + m=video 30000 RTP/AVP 112 + a=rtpmap:112 raw/90000 + a=fmtp:112 sampling=YCbCr-4:2:2; width=1280; height=720; depth=10; + colorimetry=BT.709-2; chroma-position=1 + + In this example, a dynamic payload type 112 is used for uncompressed + video. The RTP sampling clock is 90 kHz. Note that the "a=fmtp:" + line has been wrapped to fit this page, and will be a single long + line in the SDP file. + +8. Security Considerations + + RTP packets using the payload format defined in this specification + are subject to the security considerations discussed in the RTP + specification [RTP] and any appropriate RTP profile. This implies + that confidentiality of the media streams is achieved by encryption. + + This payload type does not exhibit any significant non-uniformity in + the receiver side computational complexity for packet processing to + cause a potential denial-of-service threat. + + It is important to note that uncompressed video can have immense + bandwidth requirements (up to 270 Mbps for standard-definition video, + and approximately 1 Gbps for high-definition video). This is + sufficient to cause potential for denial-of-service if transmitted + onto most currently available Internet paths. + + Accordingly, if best-effort service is being used, users of this + payload format MUST monitor packet loss to ensure that the packet + loss rate is within acceptable parameters. Packet loss is considered + acceptable if a TCP flow across the same network path, and + experiencing the same network conditions, would achieve an average + throughput, measured on a reasonable timescale, that is not less than + the RTP flow is achieving. This condition can be satisfied by + implementing congestion control mechanisms to adapt the transmission + + + + + +Gharai & Perkins Standards Track [Page 14] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + rate (or the number of layers subscribed for a layered multicast + session), or by arranging for a receiver to leave the session if the + loss rate is unacceptably high. + + This payload format may also be used in networks that provide + quality-of-service guarantees. If enhanced service is being used, + receivers SHOULD monitor packet loss to ensure that the service that + was requested is actually being delivered. If it is not, then they + SHOULD assume that they are receiving best-effort service and behave + accordingly. + +9. Relation to RFC 2431 + + In comparison with RFC 2431, this memo specifies support for a wider + variety of uncompressed video, in terms of frame size, color sub- + sampling and sample sizes. Although [BT656] can transport up to 4096 + scan lines and 2048 pixels per line, our payload type can support up + to 32768 scan lines and pixels per line. Also, RFC 2431 only address + 4:2:2 YCbCr data, while this memo covers YCbCr, RGB, RGBA, BGR, BGRA, + and most common color sub-sampling schemes. Given the variety of + video types that we cover, this memo also assumes out-of-band + signaling for sample size and data types (RFC 2431 uses in band + signaling). + +10. Relation to RFC 3497 + + RFC 3497 [292RTP] specifies a RTP payload format for encapsulating + SMPTE 292M video. The SMPTE 292M standard defines a bit-serial + digital interface for local area High-Definition Television (HDTV) + transport. As a transport medium, SMPTE 292M utilizes 10-bit words + and a fixed 1.485 Gbps (and 1.485/1.001 Gbps) data rate. SMPTE 292M + is typically used in the broadcast industry for the transport of + other video formats such as SMPTE 260M, SMPTE 295M, SMPTE 274M, and + SMPTE 296M. + + RFC 3497 defines a circuit emulation for the transport of SMPTE 292M + over RTP. It is very specific to SMPTE 292 and has been designed to + be interoperable with existing broadcast equipment with a constant + rate of 1.485 Gbps. + + This memo defines a flexible native packetization scheme that can + packetize any uncompressed video, at varying data rates. In + addition, unlike RFC 3497, this memo only transports active video + pixels (i.e., horizontal and vertical blanking are not transported). + + + + + + + +Gharai & Perkins Standards Track [Page 15] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + +11. Acknowledgements + + The authors are grateful to Philippe Gentric, Chuck Harrison, Stephan + Wenger, and Dave Singer for their feedback. + + This memo is based upon work supported by the U.S. National Science + Foundation (NSF) under Grant No. 0230738. Any opinions, findings, + and conclusions or recommendations expressed in this material are + those of the authors and do not necessarily reflect the views of NSF. + +Normative References + + [RTP] Schulzrinne, H., Casner, S., Frederick, R., and V. Jacobson, + "RTP: A Transport Protocol for Real-Time Applications", STD + 64, RFC 3550, July 2003. + + [2119] Bradner, S., "Key words for use in RFCs to Indicate + Requirement Levels", BCP 14, RFC 2119, March 1997. + + [2434] Narten, T. and H. Alvestrand, "Guidelines for Writing an + IANA Considerations Section in RFCs", BCP 26, RFC 2434, + October 1998. + + [601] International Telecommunication Union, "Studio encoding + parameters of digital television for standard 4:3 and wide + screen 16:9 aspect ratios", Recommendation BT.601, October + 1995. + + [709] International Telecommunication Union, "Parameter Values for + HDTV Standards for Production and International Programme + Exchange", Recommendation BT.709-2 + + [240] Society of Motion Picture and Television Engineers, + "Television - Signal Parameters - 1125-Line High-Definition + Production", SMPTE 240M-1999. + +Informative References + + [274] Society of Motion Picture and Television Engineers, + "1920x1080 Scanning and Analog and Parallel Digital + Interfaces for Multiple Picture Rates", SMPTE 274M-1998. + + [296] Society of Motion Picture and Television Engineers, + "1280x720 Scanning, Analog and Digital Representation and + Analog Interfaces", SMPTE 296M-1998. + + + + + + +Gharai & Perkins Standards Track [Page 16] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + + [372] Society of Motion Picture and Television Engineers, "Dual + Link 292M Interface for 1920 x 1080 Picture Raster", SMPTE + 372M-2002. + + [ALF] Clark, D. D., and Tennenhouse, D. L., "Architectural + Considerations for a New Generation of Protocols", In + Proceedings of SIGCOMM '90 (Philadelphia, PA, Sept. 1990), + ACM. + + [SDP] Handley, M. and V. Jacobson, "SDP: Session Description + Protocol", RFC 2327, April 1998. + + [BT656] Tynan, D., "RTP Payload Format for BT.656 Video Encoding", + RFC 2431, October 1998. + + [292RTP] Gharai, L., Perkins, C., Goncher, G., and A. Mankin, "RTP + Payload Format for Society of Motion Picture and Television + Engineers (SMPTE) 292M Video", RFC 3497, March 2003. + + [656] International Telecommunication Union, "Interfaces for + Digital Component Video Signals in 525-line and 625-line + Television Systems Operating at the 4:2:2 Level of + Recommendation ITU-R BT.601 (Part A)", Recommendation + BT.656, April 1998. + +Authors' Addresses + + Ladan Gharai + USC Information Sciences Institute + 3811 N. Fairfax Drive, #200 + Arlington, VA 22203 + USA + + EMail: ladan@isi.edu + + + Colin Perkins + University of Glasgow + Department of Computing Science + 17 Lilybank Gardens + Glasgow G12 8QQ + United Kingdom + + EMail: csp@csperkins.org + + + + + + + +Gharai & Perkins Standards Track [Page 17] + +RFC 4175 RTP Payload Format for Uncompressed Video September 2005 + + +Full Copyright Statement + + Copyright (C) The Internet Society (2005). + + This document is subject to the rights, licenses and restrictions + contained in BCP 78, and except as set forth therein, the authors + retain all their rights. + + This document and the information contained herein are provided on an + "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS + OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET + ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED, + INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE + INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED + WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. + +Intellectual Property + + The IETF takes no position regarding the validity or scope of any + Intellectual Property Rights or other rights that might be claimed to + pertain to the implementation or use of the technology described in + this document or the extent to which any license under such rights + might or might not be available; nor does it represent that it has + made any independent effort to identify any such rights. Information + on the procedures with respect to rights in RFC documents can be + found in BCP 78 and BCP 79. + + Copies of IPR disclosures made to the IETF Secretariat and any + assurances of licenses to be made available, or the result of an + attempt made to obtain a general license or permission for the use of + such proprietary rights by implementers or users of this + specification can be obtained from the IETF on-line IPR repository at + http://www.ietf.org/ipr. + + The IETF invites any interested party to bring to its attention any + copyrights, patents or patent applications, or other proprietary + rights that may cover technology that may be required to implement + this standard. Please address the information to the IETF at ietf- + ipr@ietf.org. + +Acknowledgement + + Funding for the RFC Editor function is currently provided by the + Internet Society. + + + + + + + +Gharai & Perkins Standards Track [Page 18] + |