diff options
Diffstat (limited to 'doc/rfc/rfc7974.txt')
-rw-r--r-- | doc/rfc/rfc7974.txt | 1123 |
1 files changed, 1123 insertions, 0 deletions
diff --git a/doc/rfc/rfc7974.txt b/doc/rfc/rfc7974.txt new file mode 100644 index 0000000..aeef9c3 --- /dev/null +++ b/doc/rfc/rfc7974.txt @@ -0,0 +1,1123 @@ + + + + + + +Independent Submission B. Williams +Request for Comments: 7974 Akamai, Inc. +Category: Informational M. Boucadair +ISSN: 2070-1721 Orange + D. Wing + October 2016 + + + An Experimental TCP Option for Host Identification + +Abstract + + Recent RFCs have discussed issues with host identification in IP + address-sharing systems, such as address/prefix-sharing devices and + application-layer proxies. Potential solutions for revealing a host + identifier in shared address deployments have also been discussed. + This memo describes the design, deployment, and privacy + considerations for one such solution in operational use on the + Internet today that uses a TCP option to transmit a host identifier. + +Independent Submissions Editor Note + + This Informational document specifies an experimental TCP HOST_ID + option that is already fairly widely deployed. It discusses that + option's privacy considerations in considerable detail and highlights + the care providers need to exercise in any actual deployment. The + Independent Submissions Editor has chosen to publish this document in + the Independent Stream so that potential deployers and implementors + can understand all its details, so as to produce implementations that + will interwork properly with other (existing) deployments. + +IESG Note + + This proposal was previously proposed for adoption by the TCPM + working group and rejected as being an undesirable technical design + for both transport and privacy reasons. This document specifies a + new TCP option that uses the shared experimental options format. The + use of experimental TCP options is specified in [RFC6994] for TCP + options "that are not yet eligible for assigned codepoints". As this + proposal has been rejected by the IETF community, it is not eligible + for the registration of a TCP option codepoint. It should be further + noted that for experimental TCP options, it "is only appropriate to + use these values in explicitly-configured experiments; they MUST NOT + be shipped as defaults in implementations" [RFC4727]. The IESG also + carried out a review as described in [RFC5742] and concluded that + this proposal violates IETF principles expressed in [RFC7258] about + pervasive monitoring as an attack and should therefore not be + published without IETF review and IESG approval. (The process + + + +Williams, et al. Informational [Page 1] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + described in [RFC5742] nonetheless allows the Independent Submissions + Editor to publish, as has been chosen in this case.) Deployments of + this proprietary TCP option may be widely viewed as undermining + privacy and are likely to encounter issues with reliability of + transport. + +Status of This Memo + + This document is not an Internet Standards Track specification; it is + published for informational purposes. + + This is a contribution to the RFC Series, independently of any other + RFC stream. The RFC Editor has chosen to publish this document at + its discretion and makes no statement about its value for + implementation or deployment. Documents approved for publication by + the RFC Editor are not a candidate for any level of Internet + Standard; see Section 2 of RFC 7841. + + Information about the current status of this document, any errata, + and how to provide feedback on it may be obtained at + http://www.rfc-editor.org/info/rfc7974. + +Copyright Notice + + Copyright (c) 2016 IETF Trust and the persons identified as the + document authors. All rights reserved. + + This document is subject to BCP 78 and the IETF Trust's Legal + Provisions Relating to IETF Documents + (http://trustee.ietf.org/license-info) in effect on the date of + publication of this document. Please review these documents + carefully, as they describe your rights and restrictions with respect + to this document. + + + + + + + + + + + + + + + + + + +Williams, et al. Informational [Page 2] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + +Table of Contents + + 1. Introduction ....................................................4 + 1.1. Important Use Cases ........................................4 + 1.2. Document Goals .............................................6 + 2. Terminology .....................................................6 + 3. Option Format ...................................................7 + 4. Option Use ......................................................7 + 4.1. Option Values ..............................................7 + 4.2. Sending Host Requirements ..................................9 + 4.2.1. Alternative SYN Cookie Support ......................9 + 4.2.2. Persistent TCP Connections ..........................9 + 4.2.3. Packet Fragmentation ...............................10 + 4.3. Multiple In-Path HOST_ID Senders ..........................10 + 5. Option Interpretation ..........................................11 + 6. Interaction with Other TCP Options .............................12 + 6.1. Multipath TCP (MPTCP) .....................................12 + 6.2. Authentication Option (TCP-AO) ............................12 + 6.3. TCP Fast Open (TFO) .......................................13 + 7. Security Considerations ........................................13 + 8. Privacy Considerations .........................................14 + 9. Pervasive Monitoring (PM) Considerations .......................15 + 10. IANA Considerations ...........................................16 + 11. References ....................................................16 + 11.1. Normative References .....................................16 + 11.2. Informative References ...................................17 + Acknowledgements ..................................................20 + Authors' Addresses ................................................20 + + + + + + + + + + + + + + + + + + + + + + + +Williams, et al. Informational [Page 3] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + +1. Introduction + + A broad range of issues associated with address sharing have been + documented in [RFC6269] and [RFC7620]. In addition, [RFC6967] + provides an analysis of various solutions to the problem of revealing + the sending host's identifier (HOST_ID) information to the receiver, + indicating that a solution using a TCP [RFC793] option for this + purpose is among the possible approaches that could be applied with + limited performance impact and a high success ratio. The purpose of + this memo is to describe a TCP HOST_ID option that is currently + deployed on the public Internet using the TCP experimental option + codepoint, including discussion of related design, deployment, and + privacy considerations. + + Multiple documents have defined TCP options for the purpose of host + identification: [REVEAL], [HOSTID], and [OVERLAYPATH]. Specification + of multiple option formats to serve the purpose of host + identification increases the burden for potential implementers and + presents interoperability challenges as well, so the authors of those + documents have worked together to define a common TCP option that + supersedes the formats from those three documents. This memo + describes a version of that common TCP option format that is + currently in use on the public Internet. + + The option defined in this memo uses the TCP experimental option + codepoint sharing mechanism defined in [RFC6994]. One of the earlier + specifications, [OVERLAYPATH], is associated with unauthorized use of + a TCP option kind number, and moving to the TCP experimental option + codepoint has allowed the authors of that document to correct their + error. + +1.1. Important Use Cases + + The authors' implementations have primarily focused on the following + address-sharing use cases in which currently deployed systems insert + the HOST_ID option: + + Carrier-Grade NAT (CGN): As defined in [RFC6888], [RFC6333], and + other sources, a CGN allows multiple hosts connected to the public + Internet to share a single Internet routable IPv4 address. One + important characteristic of the CGN use case is that it modifies + IP packets in-path, but does not serve as the endpoint for the + associated TCP connections. + + Application Proxy: As defined in [RFC1919], an application proxy + splits a TCP connection into two segments, serving as an endpoint + for each of the connections and relaying data flows between the + connections. + + + +Williams, et al. Informational [Page 4] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + Overlay Network: An overlay network is an Internet-based system + providing security, optimization, or other services for data flows + that transit the system. A network-layer overlay will sometimes + act much like a CGN, in that packets transit the system with NAT + being applied at the edge of the overlay. A transport-layer or + application-layer overlay [RFC3135] will typically act much like + an application proxy, in that the TCP connection will be segmented + with the overlay network serving as an endpoint for each of the + TCP connections. + + In this set of sender use cases, the TCP option is applied to an + individual TCP packet either at the connection endpoint (e.g., an + application proxy or a transport-layer overlay network) or at an + address-sharing middlebox (e.g., a CGN or a network-layer overlay + network). See Section 4 for additional details about the types of + devices that add the option to a TCP packet, as well as existing + limitations on use of the option when it is inserted by an address- + sharing middlebox, including issues related to packet fragmentation. + + The existing receiver use cases considered by this memo include the + following: + + o Differentiating between attack and non-attack traffic when the + source of the attack is sharing an address with non-attack + traffic. + + o Application of per-subscriber policies for resource utilization, + etc., when multiple subscribers are sharing a common address. + + o Improving server-side load-balancing decisions by allowing the + load for multiple clients behind a shared address to be assigned + to different servers, even when session affinity is required at + the application layer. + + In all of the above cases, differentiation between address-sharing + clients is performed by a network function that does not process the + application-layer protocol (e.g., HTTP) or the security protocol + (e.g., TLS), because the action needs to be performed prior to + decryption or parsing the application layer. Due to this, a solution + implemented within the application layer or security protocol was + considered unable to fully meet the receiver-side requirements. At + the same time, as noted in [RFC6967], use of an IP option for this + purpose has a low success rate. For these reasons, using a TCP + option to deliver the host identifier was deemed by the authors to be + an effective way to satisfy these specific use cases. See Section 5 + for details about receiver-side interpretation of the option. + + + + + +Williams, et al. Informational [Page 5] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + +1.2. Document Goals + + Publication of this memo is intended to serve multiple purposes. + + First and foremost, this document intends to inform readers about a + mechanism that is in broad use on the public Internet. The authors + are each affiliated with companies that have implemented, tested, + and/or deployed systems that use the HOST_ID option on the public + Internet. Other systems might encounter packets that contain this + TCP option, and this document is intended to help others understand + the nature of the TCP option when it is encountered so they can make + informed decisions about how to handle it. + + The testing effort documented in [HOSTID] indicated that a TCP option + could be used for host identification purposes without significant + disruption of TCP connectivity to legacy servers and networks that do + not support the option. It also showed how mechanisms available in + existing TCP implementations could make use of such a TCP option for + diagnostics and/or packet filtering. The authors' use of the TCP + option on the public Internet has confirmed that it can be used + effectively for our use cases, but it has also uncovered some + interoperability issues associated with the option's use on the + public Internet, especially regarding interactions with other TCP + options that support new transport capability being specified within + the IETF. Section 6 discusses those interactions and limitations and + explains how our systems handle associated issues. + + Discussions within the IETF have raised privacy concerns about the + option's use, especially in regard to pervasive monitoring risks. + Existing uses of the option limit the nature of the HOST_ID values + that are used and the systems that insert them in order to mitigate + pervasive monitoring risks. Sections 8 and 9 discuss the authors' + assessments of the privacy and monitoring impact of this TCP option + in its current uses and suggest behavior for some external systems + when the option is encountered. Continued discussion following + publication of this memo is expected to allow further refinement of + requirements related to the values used to populate the option and + how those values can be interpreted by the receiver. There is a + trade-off between providing the expected functionality to the + receiver and protecting the privacy of the sender, and continued + assessment will be necessary in order to find the right balance. + +2. Terminology + + The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", + "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this + document are to be interpreted as described in [RFC2119]. + + + + +Williams, et al. Informational [Page 6] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + +3. Option Format + + When used for host identification, the TCP experimental option uses + the experiment identification mechanism described in [RFC6994] and + has the following format and content. + + 0 1 2 3 + 01234567 89012345 67890123 45678901 + +--------+--------+--------+--------+ + | Kind | Length | ExID | + +--------+--------+--------+--------+ + | HOST_ID ... + +--------+--- + + Kind: The option kind value is 253. + + Length: The length of the option is variable, based on the required + size of the host identifier (e.g., a 2-octet HOST_ID will require + a length of 6, while a 4-octet HOST_ID will require a length of + 8). + + ExID: The experiment ID value is 0x0348 (840). + + HOST_ID: The host identifier is a value that can be used to + differentiate among the various hosts sharing a common public IP + address. See below for further discussion of this value. + +4. Option Use + + This section describes requirements associated with the use of the + option, including expected option values, which hosts are allowed to + include the option, and segments that include the option. + +4.1. Option Values + + The information conveyed in the HOST_ID option is intended to + uniquely identify the sending host to the best capability of the + machine that adds the option to the segment, while at the same time + avoiding inclusion of information that does not assist this purpose. + In addition, the option is not intended to be used to expose + information about the sending host that could not be discovered by + observing segments in transit on some portion of the Internet path + between the sender and the receiver. Existing use cases have + different requirements for receiver-side functionality, so this + document attempts to provide a high degree of flexibility for the + machine that adds the option to TCP segments. + + + + + +Williams, et al. Informational [Page 7] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + The HOST_ID option value MUST correlate to IP addresses and/or TCP + port numbers that were changed by the inserting host/device (i.e., + some of the IP address and/or port number bits are used to generate + the HOST_ID). Example values that satisfy this requirement include + the following: + + Unique ID: An inserting host/device could maintain a pool of locally + unique ID values that are dynamically mapped to the unique source + IP address values in use behind the host/device as a result of + address sharing. This ID value would be meaningful only within + the context of a specific shared IP address due to the local + uniqueness characteristic. Such an ID value could be smaller than + an IP address (e.g., 16 bits) in order to conserve TCP option + space. This option is preferred because it does not increase IP + address visibility on the forward side of the address-sharing + system, and it SHOULD be used in cases where receiver-side + requirements can be met without direct inclusion of the original + IP address (e.g., some load-balancing uses). + + IP Address/Subnet: An inserting host/device could simply populate + the option value with the IP address value in use behind the host/ + device. In the case of IPv6 addresses, it could be difficult to + include the full address due to TCP option space constraints, so + the value would likely need to provide only a portion of the + address (e.g., the first 64 bits). + + IP Address and TCP Port: Some networks share public IP addresses + among multiple subscribers with a portion of the TCP port number + space being assigned to each subscriber [RFC6346]. When such a + system is behind an address-sharing host/device, inclusion of both + the IP address and the TCP port number will more uniquely identify + the sending host than just the IP address on its own. + + When multiple host identifiers are necessary (e.g., an IP address and + a port number), the HOST_ID option is included multiple times within + the packet, once for each identifier. While this approach + significantly increases option space utilization when multiple + identifiers are included, cases where only a single identifier is + included are expected to be more common; thus, it is beneficial to + optimize for those cases. Note that some middleboxes might reorder + TCP options, so this method could be problematic if such a middlebox + is in-path between the address-sharing system and the receiver. This + has not proven to be a problem for existing use cases. + + See Section 8 for discussion of privacy considerations related to + selection of HOST_ID values. + + + + + +Williams, et al. Informational [Page 8] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + +4.2. Sending Host Requirements + + The HOST_ID option MUST only be added by the sending host or any + device involved in the forwarding path that changes IP addresses and/ + or TCP port numbers (e.g., NAT44 [RFC3022], L2-Aware NAT, DS-Lite + Address Family Transition Router (AFTR) [RFC6333], IPv6-to-IPv6 + Network Prefix Translation (NPTv6) [RFC6296], NAT64 [RFC6146], Dual- + Stack Extra Lite [RFC6619], TCP Proxy, etc.). The HOST_ID option + MUST NOT be added or modified en route by any device that does not + modify IP addresses and/or TCP port numbers. + + The sending host or intermediary device cannot determine whether the + option value is used in a stateful manner by the receiver, nor can it + determine whether SYN cookies are in use by the receiver. For this + reason, the option MUST be included in all segments, both SYN and + non-SYN segments, until return segments from the receiver positively + indicate that the TCP connection is fully established on the receiver + (e.g., the return segment either includes or acknowledges data). + +4.2.1. Alternative SYN Cookie Support + + The authors have also considered an alternative approach to SYN + cookie support in which the receiving host (i.e., the host that + accepts the TCP connection) echoes the option back to the sender in + the SYN/ACK segment when a SYN cookie is being sent. This would + allow the host sending HOST_ID to determine whether further inclusion + of the option is necessary. This approach would have the benefit of + not requiring inclusion of the option in non-SYN segments if SYN + cookies had not been used. Unfortunately, this approach fails if the + responding host itself does not support the option, since an + intermediate node would have no way to determine that SYN cookies had + been used. + +4.2.2. Persistent TCP Connections + + Some types of middleboxes (e.g., application proxy) open and maintain + persistent TCP connections to regularly visited destinations in order + to minimize the burden of connection establishment. Such middleboxes + might use a single persistent TCP connection for multiple different + client hosts over the life of the persistent connection. + + This specification does not attempt to support the use of persistent + TCP connections for multiple client hosts due to the perceived + complexity of providing such support. Instead, the HOST_ID option is + only allowed to be used at connection initiation. An inserting host/ + device that supports both the HOST_ID option and multi-client + persistent TCP connections MUST NOT apply the HOST_ID option to TCP + connections that could be used for multiple clients over the life of + + + +Williams, et al. Informational [Page 9] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + the connection. If the HOST_ID option was sent during connection + initiation, the inserting host/device MUST NOT reuse the connection + for data flows originating from a client that would require a + different HOST_ID value. + +4.2.3. Packet Fragmentation + + In order to avoid the overhead associated with in-path IP + fragmentation, it is desirable for the inserting host/device to avoid + including the HOST_ID option when IP fragmentation might be required. + This is not a firm requirement though, because the HOST_ID option is + only included in the first few packets of a TCP connection; thus, + associated IP fragmentation will generally have minimal impact. The + option SHOULD NOT be included in packets if the resulting packet + would require local fragmentation. + + It can be difficult to determine whether local fragmentation would be + required. For example, in cases where multiple interfaces with + different MTUs are in use, a local routing decision has to be made + before the MTU can be determined, and in some systems, this decision + could be made after TCP option handling is complete. Additionally, + it could be true that inclusion of the option causes the packet to + violate the path's MTU but the path's MTU has not been learned yet on + the sending host/device. + + In existing deployed systems, the impact of IP fragmentation that + results from use of the option has been minimal. + +4.3. Multiple In-Path HOST_ID Senders + + The possibility exists that there could be multiple in-path hosts/ + devices configured to insert the HOST_ID option. For example, the + client's TCP packets might first traverse a CGN device on their way + to the edge of a public Internet overlay network. In order for the + HOST_ID value to most uniquely identify the sender, it needs to + represent both the identity observed by the CGN device (the + subscriber's internal IP address, e.g., Shared Address Space + [RFC6598]) and the identity observed by the overlay network (the + shared address of the CGN device). The mechanism for handling the + received HOST_ID value could vary depending upon the nature of the + new HOST_ID value to be inserted, as described below. + + The problem of multiple in-path HOST_ID senders has not been observed + in existing deployed systems. For this reason, existing + implementations do not consistently support this scenario. Some + systems do not propagate forward the received HOST_ID option value in + any way, while other systems follow the guidance described below. + + + + +Williams, et al. Informational [Page 10] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + An inserting host/device that uses the received packet's source IP + address as the HOST_ID value (possibly along with the port) MUST + propagate forward the HOST_ID value(s) from the received packet, + since the source IP address and port only represent the previous + in-path address-sharing device and do not represent the original + sender. In the CGN-plus-overlay example, this means that the overlay + will include both the CGN's HOST_ID value(s) and a HOST_ID with the + source IP address received by the overlay. + + An inserting host/device that sends a unique ID (as described in + Section 4.1) has two options for how to handle the HOST_ID value(s) + from the received packet: + + 1. A host/device that sends a unique ID MAY strip the received + HOST_ID option and insert its own option, provided that it uses + the received HOST_ID value as a differentiator for selecting the + unique ID. What this means in the CGN-plus-overlay example above + is that the overlay is allowed to drop the HOST_ID value inserted + by the CGN provided that the HOST_ID value selected by the + overlay represents both the CGN itself and the HOST_ID value + inserted by the CGN. + + 2. A host/device that sends a unique ID MAY instead select a unique + ID that represents only the previous in-path address-sharing + host/device and propagate forward the HOST_ID value inserted by + the previous host/device. In the CGN-plus-overlay example, this + means that the overlay would include both the CGN's HOST_ID value + and a HOST_ID with a unique ID of its own that was selected to + represent the CGN's shared address. + + An inserting host/device that sends a unique ID MUST use one of the + above two mechanisms. + +5. Option Interpretation + + Due to the variable nature of the option value, it is not possible + for the receiving machine to reliably determine the value type from + the option itself. For this reason, a receiving host/device SHOULD + interpret the option value as an opaque identifier. + + This specification allows the inserting host/device to provide + multiple HOST_ID options. The order of appearance of TCP options + could be modified by some middleboxes, so receivers SHOULD NOT rely + on option order to provide additional meaning to the individual + options. Instead, when multiple HOST_ID options are present, their + values SHOULD be concatenated together in the order in which they + appear in the packet and treated as a single large identifier. + + + + +Williams, et al. Informational [Page 11] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + For both of the receiver requirements discussed above, this + specification uses SHOULD rather than MUST because reliable + interpretation and ordering of options could be possible if the + inserting host and the interpreting host are under common + administrative control and integrity-protect communication between + the inserting host and the interpreting host. Mechanisms for + signaling the value type(s) and integrity protection are not provided + by this specification, and in their absence, the receiving host/ + device MUST interpret the option value(s) as a single opaque + identifier. + +6. Interaction with Other TCP Options + + This section details how the HOST_ID option functions in conjunction + with other TCP options. + +6.1. Multipath TCP (MPTCP) + + TCP provides for a maximum of 40 octets for TCP options. As + discussed in Appendix A of MPTCP [RFC6824], a typical SYN from + modern, popular operating systems contains several TCP options (MSS + (Maximum Segment Size), window scale, SACK (selective acknowledgment) + permitted, and timestamp), which consume 19-24 octets depending on + word alignment of the options. The initial SYN from a multipath TCP + client would consume an additional 16 octets. + + HOST_ID needs at least 6 octets to be useful, so 9-21 octets are + sufficient for many scenarios that benefit from HOST_ID. However, 4 + octets are not enough space for the HOST_ID option. Thus, a TCP SYN + containing all the typical TCP options (MSS, window scale, SACK + permitted, and timestamp) and also containing multipath capable or + multipath join as well as being word-aligned has insufficient space + to accommodate HOST_ID. This means something has to give. The + choices are either to avoid word alignment in that case (freeing 5 + octets) or avoid adding the HOST_ID option. Each of these approaches + is used in existing implementations and has been deemed acceptable + for the associated use case. + +6.2. Authentication Option (TCP-AO) + + The TCP Authentication Option (TCP-AO) [RFC5925] is incompatible with + address sharing due to the fact that it provides integrity protection + of the source IP address. For this reason, the only use cases where + it makes sense to combine TCP-AO and HOST_ID are those where the TCP- + AO-NAT extension [RFC6978] is in use. Injecting a HOST_ID TCP option + does not interfere with the use of TCP-AO-NAT because the TCP options + are not included in the Message Authentication Code (MAC) + calculation. + + + +Williams, et al. Informational [Page 12] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + +6.3. TCP Fast Open (TFO) + + The TFO option [RFC7413] uses a zero-length cookie (total option + length is 2 bytes) to request a TFO cookie for use on future + connections. The server-generated TFO cookie is required to be at + least 4 bytes long and allowed to be as long as 16 bytes (total + option length is 6 to 18 bytes). The cookie request form of the + option leaves enough room available in a SYN packet with the most + commonly used options to accommodate the HOST_ID option, but a valid + TFO cookie length longer than 13 bytes would prevent even the minimal + 6-byte HOST_ID option from being included in the header. + + There are multiple possibilities for allowing TFO and HOST_ID to be + supported for the same connection, including: + + o If the TFO implementation allows the cookie size to be + configurable, the configured cookie size can be specifically + selected to leave enough option space available in a typical TFO + SYN packet to allow inclusion of the HOST_ID option. + + o If the TFO implementation provides explicit support for the + HOST_ID option, it can be designed to use a shorter cookie length + when the HOST_ID option is present in the TFO cookie request SYN. + + Reducing the TFO cookie size in order to include the HOST_ID option + could have unacceptable security implications, so existing deployed + systems that use the HOST_ID option consider TFO and HOST_ID to be + mutually exclusive and do not support the use of both options on the + same TCP connection. + + It should also be noted that the presence of data in a TFO SYN + increases the likelihood that there will be no space available in the + SYN packet to support inclusion of the HOST_ID option without IP + fragmentation, even if there is enough room in the TCP option space. + This is an additional reason that the existing system considers TFO + and HOST_ID to be mutually exclusive. + +7. Security Considerations + + Security (including privacy) considerations common to all HOST_ID + solutions are discussed in [RFC6967]. + + The content of the HOST_ID option SHOULD NOT be used for purposes + that require a trust relationship between the sender and the receiver + (e.g., billing and/or subscriber policy enforcement). This + requirement uses SHOULD rather than MUST because reliable + interpretation of options could be possible if the inserting host and + the interpreting host are under common administrative control and + + + +Williams, et al. Informational [Page 13] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + integrity-protect communication between the inserting host and the + interpreting host. Mechanisms for signaling the value type(s) and + integrity protection are not provided by this specification, and in + their absence, the receiving host/device MUST NOT use the HOST_ID + value for purposes that require a trust relationship. + + Note that the above trust requirement applies equally to HOST_ID + option values propagated forward from a previous in-path host as + described in Section 4.3. In other words, if the trust mechanism + does not apply to all option values in the packet, then none of the + HOST_ID values can be considered trusted, and the receiving host/ + device MUST NOT use any of the HOST_ID values for purposes that + require a trust relationship. An inserting host/device that has such + a trust relationship MUST NOT propagate forward an untrusted HOST_ID + in such a way as to allow it to be considered trusted. + + When the receiving network uses the values provided by the option in + a way that does not require trust (e.g., maintaining session affinity + in a load-balancing system), then use of a mechanism to enforce the + trust relationship is OPTIONAL. + +8. Privacy Considerations + + Sending a TCP SYN across the public Internet necessarily discloses + the public IP address of the sending host. When an intermediate + address-sharing device is deployed on the public Internet, anonymity + of the hosts using the device will be increased, with hosts + represented by multiple source IP addresses on the ingress side of + the device using a single source IP address on the egress side. The + HOST_ID TCP option removes that increased anonymity, taking + information that was already visible in TCP packets on the public + Internet on the ingress side of the address-sharing device and making + it available on the egress side of the device as well. In some + cases, an explicit purpose of the address-sharing device is + anonymity, in which case use of the HOST_ID TCP option would be + incompatible with the purpose of the device. + + A NAT device used to provide interoperability between a local area + network (LAN) using private [RFC1918] IP addresses and the public + Internet is sometimes specifically intended to provide anonymity for + the LAN clients as described in the above paragraph. For this + reason, address-sharing devices at the border between a private LAN + and the public Internet MUST NOT insert the HOST_ID option. + + The HOST_ID option MUST NOT be used to provide client geographic or + network location information that was not publicly visible in IP + packets for the TCP flows processed by the inserting host. For + + + + +Williams, et al. Informational [Page 14] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + example, the client's IP address MAY be used as the HOST_ID option + value, but any geographic or network location information derived + from the client's IP address MUST NOT be used as the HOST_ID value. + + The HOST_ID option MAY provide differentiating information that is + locally unique such that individual TCP flows processed by the + inserting host can be reliably identified. The HOST_ID option MUST + NOT provide client identification information that was not publicly + visible in IP packets for the TCP flows processed by the inserting + host, such as subscriber information linked to the IP address. + + The HOST_ID value MUST be changed whenever the subscriber IP address + changes. This requirement ensures that the HOST_ID option does not + introduce a new globally unique identifier that persists across + subscriber IP address changes. + + The HOST_ID option MUST be stripped from IP packets traversing + middleboxes that provide network-based anonymity services. + +9. Pervasive Monitoring (PM) Considerations + + [RFC7258] provides the following guidance: "Those developing IETF + specifications need to be able to describe how they have considered + PM, and, if the attack is relevant to the work to be published, be + able to justify related design decisions." Legitimate concerns about + host identification have been raised within the IETF. The authors of + this memo have attempted to address those concerns by providing + details about the nature of the HOST_ID values and the types of + middleboxes that should and should not include the HOST_ID option in + TCP headers, which describes limitations already imposed by existing + deployed systems. This section is intended to highlight some + particularly important aspects of this design and the related + guidance/limitations that are relevant to the pervasive monitoring + discussion. + + When a generated identifier is used, this document prohibits the + address-sharing device from using globally unique or permanent + identifiers. Only locally unique identifiers are allowed. As with + persistent IP addresses, persistent HOST_ID values could facilitate + user tracking and are therefore prohibited. The specific + requirements for permissible HOST_ID values are discussed in Sections + 8 and 4.1. + + This specification does not target exposing a host beyond what the + original packet, issued from that host, would have already exposed on + the public Internet without introduction of the option. The option + is intended only to carry forward information that was conveyed to + the address-sharing device in the original packet, and HOST_ID option + + + +Williams, et al. Informational [Page 15] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + values that do not match this description are prohibited by + requirements discussed in Section 8. This design does not allow the + HOST_ID option to carry personally identifiable information, + geographic location identifiers, or any other information that is not + available in the wire format of the associated TCP/IP headers. + + This document's guidance on option values is followed in the existing + deployed system. Thus, the volatility of the information conveyed in + a HOST_ID option is similar to that of the public, subscriber IP + address. A distinct HOST_ID is used by the address-sharing function + when the host reboots or gets a new public IP address from the + subscriber network. + + The described TCP option allows network identification to a similar + level as the first 64 bits of an IPv6 address. That is, the server + can use the bits of the TCP option to help identify a host behind an + address-sharing device, in much the same way the server would use the + host's IPv6 network address if the client and server were using IPv6 + end to end. + + Some address-sharing middleboxes on the public Internet have the + express intention of providing originator anonymity. Publication of + this document can help such middleboxes recognize the associated risk + and take action to mitigate it (e.g., by stripping or modifying the + option value). + +10. IANA Considerations + + This document specifies a new TCP option (HOST_ID) that uses the + shared experimental options format [RFC6994], with ExID in network- + standard byte order. IANA has registered HOST_ID (0x0348) in the + "TCP Experimental Option Experiment Identifiers (TCP ExIDs)" + registry. + +11. References + +11.1. Normative References + + [RFC793] Postel, J., "Transmission Control Protocol", STD 7, + RFC 793, DOI 10.17487/RFC0793, September 1981, + <http://www.rfc-editor.org/info/rfc793>. + + [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate + Requirement Levels", BCP 14, RFC 2119, + DOI 10.17487/RFC2119, March 1997, + <http://www.rfc-editor.org/info/rfc2119>. + + + + + +Williams, et al. Informational [Page 16] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + [RFC4727] Fenner, B., "Experimental Values In IPv4, IPv6, ICMPv4, + ICMPv6, UDP, and TCP Headers", RFC 4727, + DOI 10.17487/RFC4727, November 2006, + <http://www.rfc-editor.org/info/rfc4727>. + + [RFC5742] Alvestrand, H. and R. Housley, "IESG Procedures for + Handling of Independent and IRTF Stream Submissions", + BCP 92, RFC 5742, DOI 10.17487/RFC5742, December 2009, + <http://www.rfc-editor.org/info/rfc5742>. + + [RFC6994] Touch, J., "Shared Use of Experimental TCP Options", + RFC 6994, DOI 10.17487/RFC6994, August 2013, + <http://www.rfc-editor.org/info/rfc6994>. + +11.2. Informative References + + [HOSTID] Abdo, E., Boucadair, M., and J. Queiroz, "HOST_ID TCP + Options: Implementation & Preliminary Test Results", Work + in Progress, draft-abdo-hostid-tcpopt-implementation-03, + July 2012. + + [OVERLAYPATH] + Williams, B., "Overlay Path Option for IP and TCP", Work + in Progress, draft-williams-overlaypath-ip-tcp-rfc-04, + June 2013. + + [REVEAL] Yourtchenko, A. and D. Wing, "Revealing hosts sharing an + IP address using TCP option", Work in Progress, + draft-wing-nat-reveal-option-03, December 2011. + + [RFC1918] Rekhter, Y., Moskowitz, B., Karrenberg, D., de Groot, G., + and E. Lear, "Address Allocation for Private Internets", + BCP 5, RFC 1918, DOI 10.17487/RFC1918, February 1996, + <http://www.rfc-editor.org/info/rfc1918>. + + [RFC1919] Chatel, M., "Classical versus Transparent IP Proxies", + RFC 1919, DOI 10.17487/RFC1919, March 1996, + <http://www.rfc-editor.org/info/rfc1919>. + + [RFC3022] Srisuresh, P. and K. Egevang, "Traditional IP Network + Address Translator (Traditional NAT)", RFC 3022, + DOI 10.17487/RFC3022, January 2001, + <http://www.rfc-editor.org/info/rfc3022>. + + + + + + + + +Williams, et al. Informational [Page 17] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + [RFC3135] Border, J., Kojo, M., Griner, J., Montenegro, G., and Z. + Shelby, "Performance Enhancing Proxies Intended to + Mitigate Link-Related Degradations", RFC 3135, + DOI 10.17487/RFC3135, June 2001, + <http://www.rfc-editor.org/info/rfc3135>. + + [RFC5925] Touch, J., Mankin, A., and R. Bonica, "The TCP + Authentication Option", RFC 5925, DOI 10.17487/RFC5925, + June 2010, <http://www.rfc-editor.org/info/rfc5925>. + + [RFC6146] Bagnulo, M., Matthews, P., and I. van Beijnum, "Stateful + NAT64: Network Address and Protocol Translation from IPv6 + Clients to IPv4 Servers", RFC 6146, DOI 10.17487/RFC6146, + April 2011, <http://www.rfc-editor.org/info/rfc6146>. + + [RFC6269] Ford, M., Ed., Boucadair, M., Durand, A., Levis, P., and + P. Roberts, "Issues with IP Address Sharing", RFC 6269, + DOI 10.17487/RFC6269, June 2011, + <http://www.rfc-editor.org/info/rfc6269>. + + [RFC6296] Wasserman, M. and F. Baker, "IPv6-to-IPv6 Network Prefix + Translation", RFC 6296, DOI 10.17487/RFC6296, June 2011, + <http://www.rfc-editor.org/info/rfc6296>. + + [RFC6333] Durand, A., Droms, R., Woodyatt, J., and Y. Lee, "Dual- + Stack Lite Broadband Deployments Following IPv4 + Exhaustion", RFC 6333, DOI 10.17487/RFC6333, August 2011, + <http://www.rfc-editor.org/info/rfc6333>. + + [RFC6346] Bush, R., Ed., "The Address plus Port (A+P) Approach to + the IPv4 Address Shortage", RFC 6346, + DOI 10.17487/RFC6346, August 2011, + <http://www.rfc-editor.org/info/rfc6346>. + + [RFC6598] Weil, J., Kuarsingh, V., Donley, C., Liljenstolpe, C., and + M. Azinger, "IANA-Reserved IPv4 Prefix for Shared Address + Space", BCP 153, RFC 6598, DOI 10.17487/RFC6598, April + 2012, <http://www.rfc-editor.org/info/rfc6598>. + + [RFC6619] Arkko, J., Eggert, L., and M. Townsley, "Scalable + Operation of Address Translators with Per-Interface + Bindings", RFC 6619, DOI 10.17487/RFC6619, June 2012, + <http://www.rfc-editor.org/info/rfc6619>. + + [RFC6824] Ford, A., Raiciu, C., Handley, M., and O. Bonaventure, + "TCP Extensions for Multipath Operation with Multiple + Addresses", RFC 6824, DOI 10.17487/RFC6824, January 2013, + <http://www.rfc-editor.org/info/rfc6824>. + + + +Williams, et al. Informational [Page 18] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + + [RFC6888] Perreault, S., Ed., Yamagata, I., Miyakawa, S., Nakagawa, + A., and H. Ashida, "Common Requirements for Carrier-Grade + NATs (CGNs)", BCP 127, RFC 6888, DOI 10.17487/RFC6888, + April 2013, <http://www.rfc-editor.org/info/rfc6888>. + + [RFC6967] Boucadair, M., Touch, J., Levis, P., and R. Penno, + "Analysis of Potential Solutions for Revealing a Host + Identifier (HOST_ID) in Shared Address Deployments", + RFC 6967, DOI 10.17487/RFC6967, June 2013, + <http://www.rfc-editor.org/info/rfc6967>. + + [RFC6978] Touch, J., "A TCP Authentication Option Extension for NAT + Traversal", RFC 6978, DOI 10.17487/RFC6978, July 2013, + <http://www.rfc-editor.org/info/rfc6978>. + + [RFC7258] Farrell, S. and H. Tschofenig, "Pervasive Monitoring Is an + Attack", BCP 188, RFC 7258, DOI 10.17487/RFC7258, May + 2014, <http://www.rfc-editor.org/info/rfc7258>. + + [RFC7413] Cheng, Y., Chu, J., Radhakrishnan, S., and A. Jain, "TCP + Fast Open", RFC 7413, DOI 10.17487/RFC7413, December 2014, + <http://www.rfc-editor.org/info/rfc7413>. + + [RFC7620] Boucadair, M., Ed., Chatras, B., Reddy, T., Williams, B., + and B. Sarikaya, "Scenarios with Host Identification + Complications", RFC 7620, DOI 10.17487/RFC7620, August + 2015, <http://www.rfc-editor.org/info/rfc7620>. + + + + + + + + + + + + + + + + + + + + + + + + +Williams, et al. Informational [Page 19] + +RFC 7974 Experimental TCP HOST_ID Option October 2016 + + +Acknowledgements + + Many thanks to W. Eddy, Y. Nishida, T. Reddy, M. Scharf, J. Touch, + A. Zimmermann, and A. Falk for their comments. + +Authors' Addresses + + Brandon Williams + Akamai, Inc. + 8 Cambridge Center + Cambridge, MA 02142 + United States of America + + Email: brandon.williams@akamai.com + + + Mohamed Boucadair + Orange + + Email: mohamed.boucadair@orange.com + + + Dan Wing + + Email: dwing-ietf@fuggles.com + + + + + + + + + + + + + + + + + + + + + + + + + + +Williams, et al. Informational [Page 20] + |