diff options
Diffstat (limited to 'doc/rfc/rfc8321.txt')
-rw-r--r-- | doc/rfc/rfc8321.txt | 1851 |
1 files changed, 1851 insertions, 0 deletions
diff --git a/doc/rfc/rfc8321.txt b/doc/rfc/rfc8321.txt new file mode 100644 index 0000000..4f16b12 --- /dev/null +++ b/doc/rfc/rfc8321.txt @@ -0,0 +1,1851 @@ + + + + + + +Internet Engineering Task Force (IETF) G. Fioccola, Ed. +Request for Comments: 8321 A. Capello +Category: Experimental M. Cociglio +ISSN: 2070-1721 L. Castaldelli + Telecom Italia + M. Chen + L. Zheng + Huawei Technologies + G. Mirsky + ZTE + T. Mizrahi + Marvell + January 2018 + + + Alternate-Marking Method for Passive and Hybrid Performance Monitoring + +Abstract + + This document describes a method to perform packet loss, delay, and + jitter measurements on live traffic. This method is based on an + Alternate-Marking (coloring) technique. A report is provided in + order to explain an example and show the method applicability. This + technology can be applied in various situations, as detailed in this + document, and could be considered Passive or Hybrid depending on the + application. + +Status of This Memo + + This document is not an Internet Standards Track specification; it is + published for examination, experimental implementation, and + evaluation. + + This document defines an Experimental Protocol for the Internet + community. This document is a product of the Internet Engineering + Task Force (IETF). It represents the consensus of the IETF + community. It has received public review and has been approved for + publication by the Internet Engineering Steering Group (IESG). Not + all documents approved by the IESG are a candidate for any level of + Internet Standard; see Section 2 of RFC 7841. + + Information about the current status of this document, any errata, + and how to provide feedback on it may be obtained at + https://www.rfc-editor.org/info/rfc8321. + + + + + + + +Fioccola, et al. Experimental [Page 1] + +RFC 8321 Alternate-Marking Method January 2018 + + +Copyright Notice + + Copyright (c) 2018 IETF Trust and the persons identified as the + document authors. All rights reserved. + + This document is subject to BCP 78 and the IETF Trust's Legal + Provisions Relating to IETF Documents + (https://trustee.ietf.org/license-info) in effect on the date of + publication of this document. Please review these documents + carefully, as they describe your rights and restrictions with respect + to this document. Code Components extracted from this document must + include Simplified BSD License text as described in Section 4.e of + the Trust Legal Provisions and are provided without warranty as + described in the Simplified BSD License. + +Table of Contents + + 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 3 + 1.1. Requirements Language . . . . . . . . . . . . . . . . . . 5 + 2. Overview of the Method . . . . . . . . . . . . . . . . . . . 5 + 3. Detailed Description of the Method . . . . . . . . . . . . . 6 + 3.1. Packet Loss Measurement . . . . . . . . . . . . . . . . . 6 + 3.1.1. Coloring the Packets . . . . . . . . . . . . . . . . 11 + 3.1.2. Counting the Packets . . . . . . . . . . . . . . . . 12 + 3.1.3. Collecting Data and Calculating Packet Loss . . . . . 13 + 3.2. Timing Aspects . . . . . . . . . . . . . . . . . . . . . 13 + 3.3. One-Way Delay Measurement . . . . . . . . . . . . . . . . 15 + 3.3.1. Single-Marking Methodology . . . . . . . . . . . . . 15 + 3.3.2. Double-Marking Methodology . . . . . . . . . . . . . 17 + 3.4. Delay Variation Measurement . . . . . . . . . . . . . . . 18 + 4. Considerations . . . . . . . . . . . . . . . . . . . . . . . 18 + 4.1. Synchronization . . . . . . . . . . . . . . . . . . . . . 19 + 4.2. Data Correlation . . . . . . . . . . . . . . . . . . . . 19 + 4.3. Packet Reordering . . . . . . . . . . . . . . . . . . . . 20 + 5. Applications, Implementation, and Deployment . . . . . . . . 21 + 5.1. Report on the Operational Experiment . . . . . . . . . . 22 + 5.1.1. Metric Transparency . . . . . . . . . . . . . . . . . 24 + 6. Hybrid Measurement . . . . . . . . . . . . . . . . . . . . . 24 + 7. Compliance with Guidelines from RFC 6390 . . . . . . . . . . 25 + 8. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 27 + 9. Security Considerations . . . . . . . . . . . . . . . . . . . 27 + 10. References . . . . . . . . . . . . . . . . . . . . . . . . . 28 + 10.1. Normative References . . . . . . . . . . . . . . . . . . 28 + 10.2. Informative References . . . . . . . . . . . . . . . . . 29 + Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . 32 + Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 32 + + + + + +Fioccola, et al. Experimental [Page 2] + +RFC 8321 Alternate-Marking Method January 2018 + + +1. Introduction + + Nowadays, most Service Providers' networks carry traffic with + contents that are highly sensitive to packet loss [RFC7680], delay + [RFC7679], and jitter [RFC3393]. + + In view of this scenario, Service Providers need methodologies and + tools to monitor and measure network performance with an adequate + accuracy, in order to constantly control the quality of experience + perceived by their customers. On the other hand, performance + monitoring provides useful information for improving network + management (e.g., isolation of network problems, troubleshooting, + etc.). + + A lot of work related to Operations, Administration, and Maintenance + (OAM), which also includes performance monitoring techniques, has + been done by Standards Developing Organizations (SDOs): [RFC7276] + provides a good overview of existing OAM mechanisms defined in the + IETF, ITU-T, and IEEE. In the IETF, a lot of work has been done on + fault detection and connectivity verification, while a minor effort + has been thus far dedicated to performance monitoring. The IPPM WG + has defined standard metrics to measure network performance; however, + the methods developed in this WG mainly refer to focus on Active + measurement techniques. More recently, the MPLS WG has defined + mechanisms for measuring packet loss, one-way and two-way delay, and + delay variation in MPLS networks [RFC6374], but their applicability + to Passive measurements has some limitations, especially for pure + connection-less networks. + + The lack of adequate tools to measure packet loss with the desired + accuracy drove an effort to design a new method for the performance + monitoring of live traffic, which is easy to implement and deploy. + The effort led to the method described in this document: basically, + it is a Passive performance monitoring technique, potentially + applicable to any kind of packet-based traffic, including Ethernet, + IP, and MPLS, both unicast and multicast. The method addresses + primarily packet loss measurement, but it can be easily extended to + one-way or two-way delay and delay variation measurements as well. + + The method has been explicitly designed for Passive measurements, but + it can also be used with Active probes. Passive measurements are + usually more easily understood by customers and provide much better + accuracy, especially for packet loss measurements. + + + + + + + + +Fioccola, et al. Experimental [Page 3] + +RFC 8321 Alternate-Marking Method January 2018 + + + RFC 7799 [RFC7799] defines Passive and Hybrid Methods of Measurement. + In particular, Passive Methods of Measurement are based solely on + observations of an undisturbed and unmodified packet stream of + interest; Hybrid Methods are Methods of Measurement that use a + combination of Active Methods and Passive Methods. + + Taking into consideration these definitions, the Alternate-Marking + Method could be considered Hybrid or Passive, depending on the case. + In the case where the marking method is obtained by changing existing + field values of the packets (e.g., the Differentiated Services Code + Point (DSCP) field), the technique is Hybrid. In the case where the + marking field is dedicated, reserved, and included in the protocol + specification, the Alternate-Marking technique can be considered as + Passive (e.g., Synonymous Flow Label as described in [SFL-FRAMEWORK] + or OAM Marking Bits as described in [PM-MM-BIER]). + + The advantages of the method described in this document are: + + o easy implementation: it can be implemented by using features + already available on major routing platforms, as described in + Section 5.1, or by applying an optimized implementation of the + method for both legacy and newest technologies; + + o low computational effort: the additional load on processing is + negligible; + + o accurate packet loss measurement: single packet loss granularity + is achieved with a Passive measurement; + + o potential applicability to any kind of packet-based or frame-based + traffic: Ethernet, IP, MPLS, etc., and both unicast and multicast; + + o robustness: the method can tolerate out-of-order packets, and it's + not based on "special" packets whose loss could have a negative + impact; + + o flexibility: all the timestamp formats are allowed, because they + are managed out of band. The format (the Network Time Protocol + (NTP) [RFC5905] or the IEEE 1588 Precision Time Protocol (PTP) + [IEEE-1588]) depends on the precision you want; and + + o no interoperability issues: the features required to experiment + and test the method (as described in Section 5.1) are available on + all current routing platforms. Both a centralized or distributed + solution can be used to harvest data from the routers. + + + + + + +Fioccola, et al. Experimental [Page 4] + +RFC 8321 Alternate-Marking Method January 2018 + + + The method doesn't raise any specific need for protocol extension, + but it could be further improved by means of some extension to + existing protocols. Specifically, the use of Diffserv bits for + coloring the packets could not be a viable solution in some cases: a + standard method to color the packets for this specific application + could be beneficial. + +1.1. Requirements Language + + The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", + "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and + "OPTIONAL" in this document are to be interpreted as described in BCP + 14 [RFC2119] [RFC8174] when, and only when, they appear in all + capitals, as shown here. + +2. Overview of the Method + + In order to perform packet loss measurements on a production traffic + flow, different approaches exist. The most intuitive one consists in + numbering the packets so that each router that receives the flow can + immediately detect a packet that is missing. This approach, though + very simple in theory, is not simple to achieve: it requires the + insertion of a sequence number into each packet, and the devices must + be able to extract the number and check it in real time. Such a task + can be difficult to implement on live traffic: if UDP is used as the + transport protocol, the sequence number is not available; on the + other hand, if a higher-layer sequence number (e.g., in the RTP + header) is used, extracting that information from each packet and + processing it in real time could overload the device. + + An alternate approach is to count the number of packets sent on one + end, count the number of packets received on the other end, and + compare the two values. This operation is much simpler to implement, + but it requires the devices performing the measurement to be in sync: + in order to compare two counters, it is required that they refer + exactly to the same set of packets. Since a flow is continuous and + cannot be stopped when a counter has to be read, it can be difficult + to determine exactly when to read the counter. A possible solution + to overcome this problem is to virtually split the flow in + consecutive blocks by periodically inserting a delimiter so that each + counter refers exactly to the same block of packets. The delimiter + could be, for example, a special packet inserted artificially into + the flow. However, delimiting the flow using specific packets has + some limitations. First, it requires generating additional packets + within the flow and requires the equipment to be able to process + those packets. In addition, the method is vulnerable to out-of-order + reception of delimiting packets and, to a lesser extent, to their + loss. + + + +Fioccola, et al. Experimental [Page 5] + +RFC 8321 Alternate-Marking Method January 2018 + + + The method proposed in this document follows the second approach, but + it doesn't use additional packets to virtually split the flow in + blocks. Instead, it "marks" the packets so that the packets + belonging to the same block will have the same color, whilst + consecutive blocks will have different colors. Each change of color + represents a sort of auto-synchronization signal that guarantees the + consistency of measurements taken by different devices along the path + (see also [IP-MULTICAST-PM] and [OPSAWG-P3M], where this technique + was introduced). + + Figure 1 represents a very simple network and shows how the method + can be used to measure packet loss on different network segments: by + enabling the measurement on several interfaces along the path, it is + possible to perform link monitoring, node monitoring, or end-to-end + monitoring. The method is flexible enough to measure packet loss on + any segment of the network and can be used to isolate the faulty + element. + + Traffic Flow + ========================================================> + +------+ +------+ +------+ +------+ + ---<> R1 <>-----<> R2 <>-----<> R3 <>-----<> R4 <>--- + +------+ +------+ +------+ +------+ + . . . . . . + . . . . . . + . <------> <-------> . + . Node Packet Loss Link Packet Loss . + . . + <---------------------------------------------------> + End-to-End Packet Loss + + Figure 1: Available Measurements + +3. Detailed Description of the Method + + This section describes, in detail, how the method operates. A + special emphasis is given to the measurement of packet loss, which + represents the core application of the method, but applicability to + delay and jitter measurements is also considered. + +3.1. Packet Loss Measurement + + The basic idea is to virtually split traffic flows into consecutive + blocks: each block represents a measurable entity unambiguously + recognizable by all network devices along the path. By counting the + number of packets in each block and comparing the values measured by + different network devices along the path, it is possible to measure + packet loss occurred in any single block between any two points. + + + +Fioccola, et al. Experimental [Page 6] + +RFC 8321 Alternate-Marking Method January 2018 + + + As discussed in the previous section, a simple way to create the + blocks is to "color" the traffic (two colors are sufficient), so that + packets belonging to different consecutive blocks will have different + colors. Whenever the color changes, the previous block terminates + and the new one begins. Hence, all the packets belonging to the same + block will have the same color and packets of different consecutive + blocks will have different colors. The number of packets in each + block depends on the criterion used to create the blocks: + + o if the color is switched after a fixed number of packets, then + each block will contain the same number of packets (except for any + losses); and + + o if the color is switched according to a fixed timer, then the + number of packets may be different in each block depending on the + packet rate. + + The following figure shows how a flow looks like when it is split in + traffic blocks with colored packets. + + A: packet with A coloring + B: packet with B coloring + + | | | | | + | | Traffic Flow | | + -------------------------------------------------------------------> + BBBBBBB AAAAAAAAAAA BBBBBBBBBBB AAAAAAAAAAA BBBBBBBBBBB AAAAAAA + -------------------------------------------------------------------> + ... | Block 5 | Block 4 | Block 3 | Block 2 | Block 1 + | | | | | + + Figure 2: Traffic Coloring + + Figure 3 shows how the method can be used to measure link packet loss + between two adjacent nodes. + + Referring to the figure, let's assume we want to monitor the packet + loss on the link between two routers: router R1 and router R2. + According to the method, the traffic is colored alternatively with + two different colors: A and B. Whenever the color changes, the + transition generates a sort of square-wave signal, as depicted in the + following figure. + + + + + + + + + +Fioccola, et al. Experimental [Page 7] + +RFC 8321 Alternate-Marking Method January 2018 + + + Color A ----------+ +-----------+ +---------- + | | | | + Color B +-----------+ +-----------+ + Block n ... Block 3 Block 2 Block 1 + <---------> <---------> <---------> <---------> <---------> + + Traffic Flow + ===========================================================> + Color ...AAAAAAAAAAA BBBBBBBBBBB AAAAAAAAAAA BBBBBBBBBBB AAAAAAA... + ===========================================================> + + Figure 3: Computation of Link Packet Loss + + Traffic coloring can be done by R1 itself if the traffic is not + already colored. R1 needs two counters, C(A)R1 and C(B)R1, on its + egress interface: C(A)R1 counts the packets with color A and C(B)R1 + counts those with color B. As long as traffic is colored as A, only + counter C(A)R1 will be incremented, while C(B)R1 is not incremented; + conversely, when the traffic is colored as B, only C(B)R1 is + incremented. C(A)R1 and C(B)R1 can be used as reference values to + determine the packet loss from R1 to any other measurement point down + the path. Router R2, similarly, will need two counters on its + ingress interface, C(A)R2 and C(B)R2, to count the packets received + on that interface and colored with A and B, respectively. When an A + block ends, it is possible to compare C(A)R1 and C(A)R2 and calculate + the packet loss within the block; similarly, when the successive B + block terminates, it is possible to compare C(B)R1 with C(B)R2, and + so on, for every successive block. + + Likewise, by using two counters on the R2 egress interface, it is + possible to count the packets sent out of the R2 interface and use + them as reference values to calculate the packet loss from R2 to any + measurement point down R2. + + Using a fixed timer for color switching offers better control over + the method: the (time) length of the blocks can be chosen large + enough to simplify the collection and the comparison of measures + taken by different network devices. It's preferable to read the + value of the counters not immediately after the color switch: some + packets could arrive out of order and increment the counter + associated with the previous block (color), so it is worth waiting + for some time. A safe choice is to wait L/2 time units (where L is + the duration for each block) after the color switch, to read the + still counter of the previous color, so the possibility of reading a + running counter instead of a still one is minimized. The drawback is + that the longer the duration of the block, the less frequent the + measurement can be taken. + + + + +Fioccola, et al. Experimental [Page 8] + +RFC 8321 Alternate-Marking Method January 2018 + + + The following table shows how the counters can be used to calculate + the packet loss between R1 and R2. The first column lists the + sequence of traffic blocks, while the other columns contain the + counters of A-colored packets and B-colored packets for R1 and R2. + In this example, we assume that the values of the counters are reset + to zero whenever a block ends and its associated counter has been + read: with this assumption, the table shows only relative values, + which is the exact number of packets of each color within each block. + If the values of the counters were not reset, the table would contain + cumulative values, but the relative values could be determined simply + by the difference from the value of the previous block of the same + color. + + The color is switched on the basis of a fixed timer (not shown in the + table), so the number of packets in each block is different. + + +-------+--------+--------+--------+--------+------+ + | Block | C(A)R1 | C(B)R1 | C(A)R2 | C(B)R2 | Loss | + +-------+--------+--------+--------+--------+------+ + | 1 | 375 | 0 | 375 | 0 | 0 | + | 2 | 0 | 388 | 0 | 388 | 0 | + | 3 | 382 | 0 | 381 | 0 | 1 | + | 4 | 0 | 377 | 0 | 374 | 3 | + | ... | ... | ... | ... | ... | ... | + | 2n | 0 | 387 | 0 | 387 | 0 | + | 2n+1 | 379 | 0 | 377 | 0 | 2 | + +-------+--------+--------+--------+--------+------+ + + Table 1: Evaluation of Counters for Packet Loss Measurements + + During an A block (blocks 1, 3, and 2n+1), all the packets are + A-colored; therefore, the C(A) counters are incremented to the number + seen on the interface, while C(B) counters are zero. Conversely, + during a B block (blocks 2, 4, and 2n), all the packets are + B-colored: C(A) counters are zero, while C(B) counters are + incremented. + + When a block ends (because of color switching), the relative counters + stop incrementing; it is possible to read them, compare the values + measured on routers R1 and R2, and calculate the packet loss within + that block. + + For example, looking at the table above, during the first block + (A-colored), C(A)R1 and C(A)R2 have the same value (375), which + corresponds to the exact number of packets of the first block (no + loss). Also, during the second block (B-colored), R1 and R2 counters + have the same value (388), which corresponds to the number of packets + of the second block (no loss). During the third and fourth blocks, + + + +Fioccola, et al. Experimental [Page 9] + +RFC 8321 Alternate-Marking Method January 2018 + + + R1 and R2 counters are different, meaning that some packets have been + lost: in the example, one single packet (382-381) was lost during + block three, and three packets (377-374) were lost during block four. + + The method applied to R1 and R2 can be extended to any other router + and applied to more complex networks, as far as the measurement is + enabled on the path followed by the traffic flow(s) being observed. + + It's worth mentioning two different strategies that can be used when + implementing the method: + + o flow-based: the flow-based strategy is used when only a limited + number of traffic flows need to be monitored. According to this + strategy, only a subset of the flows is colored. Counters for + packet loss measurements can be instantiated for each single flow, + or for the set as a whole, depending on the desired granularity. + A relevant problem with this approach is the necessity to know in + advance the path followed by flows that are subject to + measurement. Path rerouting and traffic load-balancing increase + the issue complexity, especially for unicast traffic. The problem + is easier to solve for multicast traffic, where load-balancing is + seldom used and static joins are frequently used to force traffic + forwarding and replication. + + o link-based: measurements are performed on all the traffic on a + link-by-link basis. The link could be a physical link or a + logical link. Counters could be instantiated for the traffic as a + whole or for each traffic class (in case it is desired to monitor + each class separately), but in the second case, a couple of + counters are needed for each class. + + As mentioned, the flow-based measurement requires the identification + of the flow to be monitored and the discovery of the path followed by + the selected flow. It is possible to monitor a single flow or + multiple flows grouped together, but in this case, measurement is + consistent only if all the flows in the group follow the same path. + Moreover, if a measurement is performed by grouping many flows, it is + not possible to determine exactly which flow was affected by packet + loss. In order to have measures per single flow, it is necessary to + configure counters for each specific flow. Once the flow(s) to be + monitored has been identified, it is necessary to configure the + monitoring on the proper nodes. Configuring the monitoring means + configuring the rule to intercept the traffic and configuring the + counters to count the packets. To have just an end-to-end + monitoring, it is sufficient to enable the monitoring on the first- + and last-hop routers of the path: the mechanism is completely + transparent to intermediate nodes and independent from the path + followed by traffic flows. On the contrary, to monitor the flow on a + + + +Fioccola, et al. Experimental [Page 10] + +RFC 8321 Alternate-Marking Method January 2018 + + + hop-by-hop basis along its whole path, it is necessary to enable the + monitoring on every node from the source to the destination. In case + the exact path followed by the flow is not known a priori (i.e., the + flow has multiple paths to reach the destination), it is necessary to + enable the monitoring system on every path: counters on interfaces + traversed by the flow will report packet count, whereas counters on + other interfaces will be null. + +3.1.1. Coloring the Packets + + The coloring operation is fundamental in order to create packet + blocks. This implies choosing where to activate the coloring and how + to color the packets. + + In case of flow-based measurements, the flow to monitor can be + defined by a set of selection rules (e.g., header fields) used to + match a subset of the packets; in this way, it is possible to control + the number of involved nodes, the path followed by the packets, and + the size of the flows. It is possible, in general, to have multiple + coloring nodes or a single coloring node that is easier to manage and + doesn't raise any risk of conflict. Coloring in multiple nodes can + be done, and the requirement is that the coloring must change + periodically between the nodes according to the timing considerations + in Section 3.2; so every node that is designated as a measurement + point along the path should be able to identify unambiguously the + colored packets. Furthermore, [MULTIPOINT-ALT-MM] generalizes the + coloring for multipoint-to-multipoint flow. In addition, it can be + advantageous to color the flow as close as possible to the source + because it allows an end-to-end measure if a measurement point is + enabled on the last-hop router as well. + + For link-based measurements, all traffic needs to be colored when + transmitted on the link. If the traffic had already been colored, + then it has to be re-colored because the color must be consistent on + the link. This means that each hop along the path must (re-)color + the traffic; the color is not required to be consistent along + different links. + + Traffic coloring can be implemented by setting a specific bit in the + packet header and changing the value of that bit periodically. How + to choose the marking field depends on the application and is out of + scope here. However, some applications are reported in Section 5. + + + + + + + + + +Fioccola, et al. Experimental [Page 11] + +RFC 8321 Alternate-Marking Method January 2018 + + +3.1.2. Counting the Packets + + For flow-based measurements, assuming that the coloring of the + packets is performed only by the source nodes, the nodes between + source and destination (included) have to count the colored packets + that they receive and forward: this operation can be enabled on every + router along the path or only on a subset, depending on which network + segment is being monitored (a single link, a particular metro area, + the backbone, or the whole path). Since the color switches + periodically between two values, two counters (one for each value) + are needed: one counter for packets with color A and one counter for + packets with color B. For each flow (or group of flows) being + monitored and for every interface where the monitoring is Active, a + couple of counters are needed. For example, in order to separately + monitor three flows on a router with four interfaces involved, 24 + counters are needed (two counters for each of the three flows on each + of the four interfaces). Furthermore, [MULTIPOINT-ALT-MM] + generalizes the counting for multipoint-to-multipoint flow. + + In case of link-based measurements, the behavior is similar except + that coloring and counting operations are performed on a link-by-link + basis at each endpoint of the link. + + Another important aspect to take into consideration is when to read + the counters: in order to count the exact number of packets of a + block, the routers must perform this operation when that block has + ended; in other words, the counter for color A must be read when the + current block has color B, in order to be sure that the value of the + counter is stable. This task can be accomplished in two ways. The + general approach suggests reading the counters periodically, many + times during a block duration, and comparing these successive + readings: when the counter stops incrementing, it means that the + current block has ended, and its value can be elaborated safely. + Alternatively, if the coloring operation is performed on the basis of + a fixed timer, it is possible to configure the reading of the + counters according to that timer: for example, reading the counter + for color A every period in the middle of the subsequent block with + color B is a safe choice. A sufficient margin should be considered + between the end of a block and the reading of the counter, in order + to take into account any out-of-order packets. + + + + + + + + + + + +Fioccola, et al. Experimental [Page 12] + +RFC 8321 Alternate-Marking Method January 2018 + + +3.1.3. Collecting Data and Calculating Packet Loss + + The nodes enabled to perform performance monitoring collect the value + of the counters, but they are not able to directly use this + information to measure packet loss, because they only have their own + samples. For this reason, an external Network Management System + (NMS) can be used to collect and elaborate data and to perform packet + loss calculation. The NMS compares the values of counters from + different nodes and can calculate if some packets were lost (even a + single packet) and where those packets were lost. + + The value of the counters needs to be transmitted to the NMS as soon + as it has been read. This can be accomplished by using SNMP or FTP + and can be done in Push Mode or Polling Mode. In the first case, + each router periodically sends the information to the NMS; in the + latter case, it is the NMS that periodically polls routers to collect + information. In any case, the NMS has to collect all the relevant + values from all the routers within one cycle of the timer. + + It would also be possible to use a protocol to exchange values of + counters between the two endpoints in order to let them perform the + packet loss calculation for each traffic direction. + + A possible approach for the performance measurement (PM) architecture + is explained in [COLORING], while [IP-FLOW-REPORT] introduces new + information elements of IP Flow Information Export (IPFIX) [RFC7011]. + +3.2. Timing Aspects + + This document introduces two color-switching methods: one is based on + a fixed number of packets, and the other is based on a fixed timer. + But the method based on a fixed timer is preferable because it is + more deterministic, and it will be considered in the rest of the + document. + + In general, clocks in network devices are not accurate and for this + reason, there is a clock error between the measurement points R1 and + R2. But, to implement the methodology, they must be synchronized to + the same clock reference with an accuracy of +/- L/2 time units, + where L is the fixed time duration of the block. So each colored + packet can be assigned to the right batch by each router. This is + because the minimum time distance between two packets of the same + color but that belong to different batches is L time units. + + + + + + + + +Fioccola, et al. Experimental [Page 13] + +RFC 8321 Alternate-Marking Method January 2018 + + + In practice, in addition to clock errors, the delay between + measurement points also affects the implementation of the methodology + because each packet can be delayed differently, and this can produce + out of order at batch boundaries. This means that, without + considering clock error, we wait L/2 after color switching to be sure + to take a still counter. + + In summary, we need to take into account two contributions: clock + error between network devices and the interval we need to wait to + avoid packets being out of order because of network delay. + + The following figure explains both issues. + + ...BBBBBBBBB | AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | BBBBBBBBB... + |<======================================>| + | L | + ...=========>|<==================><==================>|<==========... + | L/2 L/2 | + |<===>| |<===>| + d | | d + |<==========================>| + available counting interval + + Figure 4: Timing Aspects + + It is assumed that all network devices are synchronized to a common + reference time with an accuracy of +/- A/2. Thus, the difference + between the clock values of any two network devices is bounded by A. + + The guard band d is given by: + + d = A + D_max - D_min, + + where A is the clock accuracy, D_max is an upper bound on the network + delay between the network devices, and D_min is a lower bound on the + delay. + + The available counting interval is L - 2d that must be > 0. + + The condition that must be satisfied and is a requirement on the + synchronization accuracy is: + + d < L/2. + + + + + + + + +Fioccola, et al. Experimental [Page 14] + +RFC 8321 Alternate-Marking Method January 2018 + + +3.3. One-Way Delay Measurement + + The same principle used to measure packet loss can be applied also to + one-way delay measurement. There are three alternatives, as + described hereinafter. + + Note that, for all the one-way delay alternatives described in the + next sections, by summing the one-way delays of the two directions of + a path, it is always possible to measure the two-way delay (round- + trip "virtual" delay). + +3.3.1. Single-Marking Methodology + + The alternation of colors can be used as a time reference to + calculate the delay. Whenever the color changes (which means that a + new block has started), a network device can store the timestamp of + the first packet of the new block; that timestamp can be compared + with the timestamp of the same packet on a second router to compute + packet delay. When looking at Figure 2, R1 stores the timestamp + TS(A1)R1 when it sends the first packet of block 1 (A-colored), the + timestamp TS(B2)R1 when it sends the first packet of block 2 + (B-colored), and so on for every other block. R2 performs the same + operation on the receiving side, recording TS(A1)R2, TS(B2)R2, and so + on. Since the timestamps refer to specific packets (the first packet + of each block), we are sure that timestamps compared to compute delay + refer to the same packets. By comparing TS(A1)R1 with TS(A1)R2 (and + similarly TS(B2)R1 with TS(B2)R2, and so on), it is possible to + measure the delay between R1 and R2. In order to have more + measurements, it is possible to take and store more timestamps, + referring to other packets within each block. + + In order to coherently compare timestamps collected on different + routers, the clocks on the network nodes must be in sync. + Furthermore, a measurement is valid only if no packet loss occurs and + if packet misordering can be avoided; otherwise, the first packet of + a block on R1 could be different from the first packet of the same + block on R2 (for instance, if that packet is lost between R1 and R2 + or it arrives after the next one). + + The following table shows how timestamps can be used to calculate the + delay between R1 and R2. The first column lists the sequence of + blocks, while other columns contain the timestamp referring to the + first packet of each block on R1 and R2. The delay is computed as a + difference between timestamps. For the sake of simplicity, all the + values are expressed in milliseconds. + + + + + + +Fioccola, et al. Experimental [Page 15] + +RFC 8321 Alternate-Marking Method January 2018 + + + +-------+---------+---------+---------+---------+-------------+ + | Block | TS(A)R1 | TS(B)R1 | TS(A)R2 | TS(B)R2 | Delay R1-R2 | + +-------+---------+---------+---------+---------+-------------+ + | 1 | 12.483 | - | 15.591 | - | 3.108 | + | 2 | - | 6.263 | - | 9.288 | 3.025 | + | 3 | 27.556 | - | 30.512 | - | 2.956 | + | | - | 18.113 | - | 21.269 | 3.156 | + | ... | ... | ... | ... | ... | ... | + | 2n | 77.463 | - | 80.501 | - | 3.038 | + | 2n+1 | - | 24.333 | - | 27.433 | 3.100 | + +-------+---------+---------+---------+---------+-------------+ + + Table 2: Evaluation of Timestamps for Delay Measurements + + The first row shows timestamps taken on R1 and R2, respectively, and + refers to the first packet of block 1 (which is A-colored). Delay + can be computed as a difference between the timestamp on R2 and the + timestamp on R1. Similarly, the second row shows timestamps (in + milliseconds) taken on R1 and R2 and refers to the first packet of + block 2 (which is B-colored). By comparing timestamps taken on + different nodes in the network and referring to the same packets + (identified using the alternation of colors), it is possible to + measure delay on different network segments. + + For the sake of simplicity, in the above example, a single + measurement is provided within a block, taking into account only the + first packet of each block. The number of measurements can be easily + increased by considering multiple packets in the block: for instance, + a timestamp could be taken every N packets, thus generating multiple + delay measurements. Taking this to the limit, in principle, the + delay could be measured for each packet by taking and comparing the + corresponding timestamps (possible but impractical from an + implementation point of view). + +3.3.1.1. Mean Delay + + As mentioned before, the method previously exposed for measuring the + delay is sensitive to out-of-order reception of packets. In order to + overcome this problem, a different approach has been considered: it + is based on the concept of mean delay. The mean delay is calculated + by considering the average arrival time of the packets within a + single block. The network device locally stores a timestamp for each + packet received within a single block: summing all the timestamps and + dividing by the total number of packets received, the average arrival + time for that block of packets can be calculated. By subtracting the + average arrival times of two adjacent devices, it is possible to + calculate the mean delay between those nodes. When computing the + mean delay, the measurement error could be augmented by accumulating + + + +Fioccola, et al. Experimental [Page 16] + +RFC 8321 Alternate-Marking Method January 2018 + + + the measurement error of a lot of packets. This method is robust to + out-of-order packets and also to packet loss (only a small error is + introduced). Moreover, it greatly reduces the number of timestamps + (only one per block for each network device) that have to be + collected by the management system. On the other hand, it only gives + one measure for the duration of the block (for instance, 5 minutes), + and it doesn't give the minimum, maximum, and median delay values + [RFC6703]. This limitation could be overcome by reducing the + duration of the block (for instance, from 5 minutes to a few + seconds), which implicates a highly optimized implementation of the + method. + +3.3.2. Double-Marking Methodology + + The Single-Marking methodology for one-way delay measurement is + sensitive to out-of-order reception of packets. The first approach + to overcome this problem has been described before and is based on + the concept of mean delay. But the limitation of mean delay is that + it doesn't give information about the delay value's distribution for + the duration of the block. Additionally, it may be useful to have + not only the mean delay but also the minimum, maximum, and median + delay values and, in wider terms, to know more about the statistic + distribution of delay values. So, in order to have more information + about the delay and to overcome out-of-order issues, a different + approach can be introduced; it is based on a Double-Marking + methodology. + + Basically, the idea is to use the first marking to create the + alternate flow and, within this colored flow, a second marking to + select the packets for measuring delay/jitter. The first marking is + needed for packet loss and mean delay measurement. The second + marking creates a new set of marked packets that are fully identified + over the network, so that a network device can store the timestamps + of these packets; these timestamps can be compared with the + timestamps of the same packets on a second router to compute packet + delay values for each packet. The number of measurements can be + easily increased by changing the frequency of the second marking. + But the frequency of the second marking must not be too high in order + to avoid out-of-order issues. Between packets with the second + marking, there should be a security time gap (e.g., this gap could + be, at the minimum, the mean network delay calculated with the + previous methodology) to avoid out-of-order issues and also to have a + number of measurement packets that are rate independent. If a + second-marking packet is lost, the delay measurement for the + considered block is corrupted and should be discarded. + + + + + + +Fioccola, et al. Experimental [Page 17] + +RFC 8321 Alternate-Marking Method January 2018 + + + Mean delay is calculated on all the packets of a sample and is a + simple computation to be performed for a Single-Marking Method. In + some cases, the mean delay measure is not sufficient to characterize + the sample, and more statistics of delay extent data are needed, + e.g., percentiles, variance, and median delay values. The + conventional range (maximum-minimum) should be avoided for several + reasons, including stability of the maximum delay due to the + influence by outliers. RFC 5481 [RFC5481], Section 6.5 highlights + how the 99.9th percentile of delay and delay variation is more + helpful to performance planners. To overcome this drawback, the idea + is to couple the mean delay measure for the entire batch with a + Double-Marking Method, where a subset of batch packets is selected + for extensive delay calculation by using a second marking. In this + way, it is possible to perform a detailed analysis on these double- + marked packets. Please note that there are classic algorithms for + median and variance calculation, but they are out of the scope of + this document. The comparison between the mean delay for the entire + batch and the mean delay on these double-marked packets gives useful + information since it is possible to understand if the Double-Marking + measurements are actually representative of the delay trends. + +3.4. Delay Variation Measurement + + Similar to one-way delay measurement (both for Single Marking and + Double Marking), the method can also be used to measure the inter- + arrival jitter. We refer to the definition in RFC 3393 [RFC3393]. + The alternation of colors, for a Single-Marking Method, can be used + as a time reference to measure delay variations. In case of Double + Marking, the time reference is given by the second-marked packets. + Considering the example depicted in Figure 2, R1 stores the timestamp + TS(A)R1 whenever it sends the first packet of a block, and R2 stores + the timestamp TS(B)R2 whenever it receives the first packet of a + block. The inter-arrival jitter can be easily derived from one-way + delay measurement, by evaluating the delay variation of consecutive + samples. + + The concept of mean delay can also be applied to delay variation, by + evaluating the average variation of the interval between consecutive + packets of the flow from R1 to R2. + +4. Considerations + + This section highlights some considerations about the methodology. + + + + + + + + +Fioccola, et al. Experimental [Page 18] + +RFC 8321 Alternate-Marking Method January 2018 + + +4.1. Synchronization + + The Alternate-Marking technique does not require a strong + synchronization, especially for packet loss and two-way delay + measurement. Only one-way delay measurement requires network devices + to have synchronized clocks. + + Color switching is the reference for all the network devices, and the + only requirement to be achieved is that all network devices have to + recognize the right batch along the path. + + If the length of the measurement period is L time units, then all + network devices must be synchronized to the same clock reference with + an accuracy of +/- L/2 time units (without considering network + delay). This level of accuracy guarantees that all network devices + consistently match the color bit to the correct block. For example, + if the color is toggled every second (L = 1 second), then clocks must + be synchronized with an accuracy of +/- 0.5 second to a common time + reference. + + This synchronization requirement can be satisfied even with a + relatively inaccurate synchronization method. This is true for + packet loss and two-way delay measurement, but not for one-way delay + measurement, where clock synchronization must be accurate. + + Therefore, a system that uses only packet loss and two-way delay + measurement does not require synchronization. This is because the + value of the clocks of network devices does not affect the + computation of the two-way delay measurement. + +4.2. Data Correlation + + Data correlation is the mechanism to compare counters and timestamps + for packet loss, delay, and delay variation calculation. It could be + performed in several ways depending on the Alternate-Marking + application and use case. Some possibilities are to: + + o use a centralized solution using NMS to correlate data; and + + o define a protocol-based distributed solution by introducing a new + protocol or by extending the existing protocols (e.g., see RFC + 6374 [RFC6374] or the Two-Way Active Measurement Protocol (TWAMP) + as defined in RFC 5357 [RFC5357] or the One-Way Active Measurement + Protocol (OWAMP) as defined in RFC 4656 [RFC4656]) in order to + communicate the counters and timestamps between nodes. + + In the following paragraphs, an example data correlation mechanism is + explained and could be used independently of the adopted solutions. + + + +Fioccola, et al. Experimental [Page 19] + +RFC 8321 Alternate-Marking Method January 2018 + + + When data is collected on the upstream and downstream nodes, e.g., + packet counts for packet loss measurement or timestamps for packet + delay measurement, and is periodically reported to or pulled by other + nodes or an NMS, a certain data correlation mechanism SHOULD be in + use to help the nodes or NMS tell whether any two or more packet + counts are related to the same block of markers or if any two + timestamps are related to the same marked packet. + + The Alternate-Marking Method described in this document literally + splits the packets of the measured flow into different measurement + blocks; in addition, a Block Number (BN) could be assigned to each + such measurement block. The BN is generated each time a node reads + the data (packet counts or timestamps) and is associated with each + packet count and timestamp reported to or pulled by other nodes or + NMSs. The value of a BN could be calculated as the modulo of the + local time (when the data are read) and the interval of the marking + time period. + + When the nodes or NMS see, for example, the same BNs associated with + two packet counts from an upstream and a downstream node, + respectively, it considers that these two packet counts correspond to + the same block, i.e., these two packet counts belong to the same + block of markers from the upstream and downstream nodes. The + assumption of this BN mechanism is that the measurement nodes are + time synchronized. This requires the measurement nodes to have a + certain time synchronization capability (e.g., the Network Time + Protocol (NTP) [RFC5905] or the IEEE 1588 Precision Time Protocol + (PTP) [IEEE-1588]). Synchronization aspects are further discussed in + Section 4.1. + +4.3. Packet Reordering + + Due to ECMP, packet reordering is very common in an IP network. The + accuracy of a marking-based PM, especially packet loss measurement, + may be affected by packet reordering. Take a look at the following + example: + + Block : 1 | 2 | 3 | 4 | 5 |... + --------|---------|---------|---------|---------|---------|--- + Node R1 : AAAAAAA | BBBBBBB | AAAAAAA | BBBBBBB | AAAAAAA |... + Node R2 : AAAAABB | AABBBBA | AAABAAA | BBBBBBA | ABAAABA |... + + Figure 5: Packet Reordering + + In Figure 5, the packet stream for Node R1 isn't being reordered and + can be safely assigned to interval blocks, but the packet stream for + Node R2 is being reordered; so, looking at the packet with the marker + + + + +Fioccola, et al. Experimental [Page 20] + +RFC 8321 Alternate-Marking Method January 2018 + + + of "B" in block 3, there is no safe way to tell whether the packet + belongs to block 2 or block 4. + + In general, there is the need to assign packets with the marker of + "B" or "A" to the right interval blocks. Most of the packet + reordering occurs at the edge of adjacent blocks, and they are easy + to handle if the interval of each block is sufficiently large. Then, + it can be assumed that the packets with different markers belong to + the block that they are closer to. If the interval is small, it is + difficult and sometimes impossible to determine to which block a + packet belongs. + + To choose a proper interval is important, and how to choose a proper + interval is out of the scope of this document. But an implementation + SHOULD provide a way to configure the interval and allow a certain + degree of packet reordering. + +5. Applications, Implementation, and Deployment + + The methodology described in the previous sections can be applied in + various situations. Basically, the Alternate-Marking technique could + be used in many cases for performance measurement. The only + requirement is to select and mark the flow to be monitored; in this + way, packets are batched by the sender, and each batch is alternately + marked such that it can be easily recognized by the receiver. + + Some recent Alternate-Marking Method applications are listed below: + + o IP Flow Performance Measurement (IPFPM): this application of the + marking method is described in [COLORING]. As an example, in this + document, the last reserved bit of the Flag field of the IPv4 + header is proposed to be used for marking, while a solution for + IPv6 could be to leverage the IPv6 extension header for marking. + + o OAM Passive Performance Measurement: In [RFC8296], two OAM bits + from the Bit Index Explicit Replication (BIER) header are reserved + for the Passive performance measurement marking method. + [PM-MM-BIER] details the measurement for multicast service over + the BIER domain. In addition, the Alternate-Marking Method could + also be used in a Service Function Chaining (SFC) domain. Lastly, + the application of the marking method to Network Virtualization + over Layer 3 (NVO3) protocols is considered by [NVO3-ENCAPS]. + + o MPLS Performance Measurement: RFC 6374 [RFC6374] uses the Loss + Measurement (LM) packet as the packet accounting demarcation + point. Unfortunately, this gives rise to a number of problems + that may lead to significant packet accounting errors in certain + situations. [MPLS-FLOW] discusses the desired capabilities for + + + +Fioccola, et al. Experimental [Page 21] + +RFC 8321 Alternate-Marking Method January 2018 + + + MPLS flow identification in order to perform a better in-band + performance monitoring of user data packets. A method of + accomplishing identification is Synonymous Flow Labels (SFLs) + introduced in [SFL-FRAMEWORK], while [SYN-FLOW-LABELS] describes + performance measurements in RFC 6374 with SFL. + + o Active Performance Measurement: [ALT-MM-AMP] describes how to + extend the existing Active Measurement Protocol, in order to + implement the Alternate-Marking methodology. [ALT-MM-SLA] + describes an extension to the Cisco SLA Protocol Measurement-Type + UDP-Measurement. + + An example of implementation and deployment is explained in the next + section, just to clarify how the method can work. + +5.1. Report on the Operational Experiment + + The method described in this document, also called Packet Network + Performance Monitoring (PNPM), has been invented and engineered in + Telecom Italia. + + It is important to highlight that the general description of the + methodology in this document is a consequence of the operational + experiment. The fundamental elements of the technique have been + tested, and the lessons learned from the operational experiment + inspired the formalization of the Alternate-Marking Method as + detailed in the previous sections. + + The methodology has been used experimentally in Telecom Italia's + network and is applied to multicast IPTV channels or other specific + traffic flows with high QoS requirements (i.e., Mobile Backhauling + traffic realized with a VPN MPLS). + + This technology has been employed by leveraging functions and tools + available on IP routers, and it's currently being used to monitor + packet loss in some portions of Telecom Italia's network. The + application of this method for delay measurement has also been + evaluated in Telecom Italia's labs. + + This section describes how the experiment has been executed, + particularly, how the features currently available on existing + routing platforms can be used to apply the method, in order to give + an example of implementation and deployment. + + The operational test, described herein, uses the flow-based strategy, + as defined in Section 3. Instead, the link-based strategy could be + applied to a physical link or a logical link (e.g., an Ethernet VLAN + or an MPLS Pseudowire (PW)). + + + +Fioccola, et al. Experimental [Page 22] + +RFC 8321 Alternate-Marking Method January 2018 + + + The implementation of the method leverages the available router + functions, since the experiment has been done by a Service Provider + (as Telecom Italia is) on its own network. So, with current router + implementations, only QoS-related fields and features offer the + required flexibility to set bits in the packet header. In case a + Service Provider only uses the three most-significant bits of the + DSCP field (corresponding to IP Precedence) for QoS classification + and queuing, it is possible to use the two least-significant bits of + the DSCP field (bit 0 and bit 1) to implement the method without + affecting QoS policies. That is the approach used for the + experiment. One of the two bits (bit 0) could be used to identify + flows subject to traffic monitoring (set to 1 if the flow is under + monitoring, otherwise, it is set to 0), while the second (bit 1) can + be used for coloring the traffic (switching between values 0 and 1, + corresponding to colors A and B) and creating the blocks. + + The experiment considers a flow as all the packets sharing the same + source IP address or the same destination IP address, depending on + the direction. In practice, once the flow has been defined, traffic + coloring using the DSCP field can be implemented by configuring an + access-list on the router output interface. The access-list + intercepts the flow(s) to be monitored and applies a policy to them + that sets the DSCP field accordingly. Since traffic coloring has to + be switched between the two values over time, the policy needs to be + modified periodically. An automatic script is used to perform this + task on the basis of a fixed timer. The automatic script is loaded + on board of the router and automatizes the basic operations that are + needed to realize the methodology. + + After the traffic is colored using the DSCP field, all the routers on + the path can perform the counting. For this purpose, an access-list + that matches specific DSCP values can be used to count the packets of + the flow(s) being monitored. The same access-list can be installed + on all the routers of the path. In addition, network flow + monitoring, such as provided by IPFIX [RFC7011], can be used to + recognize timestamps of the first/last packet of a batch in order to + enable one of the alternatives to measure the delay as detailed in + Section 3.3. + + In Telecom Italia's experiment, the timer is set to 5 minutes, so the + sequence of actions of the script is also executed every 5 minutes. + This value has shown to be a good compromise between measurement + frequency and stability of the measurement (i.e., the possibility of + collecting all the measures referring to the same block). + + For this experiment, both counters and any other data are collected + by using the automatic script that sends these out to an NMS. The + NMS is responsible for packet loss calculation, performed by + + + +Fioccola, et al. Experimental [Page 23] + +RFC 8321 Alternate-Marking Method January 2018 + + + comparing the values of counters from the routers along the flow + path(s). A 5-minute timer for color switching is a safe choice for + reading the counters and is also coherent with the reporting window + of the NMS. + + Note that the use of the DSCP field for marking implies that the + method in this case works reliably only within a single management + and operation domain. + + Lastly, the Telecom Italia experiment scales up to 1000 flows + monitored together on a single router, while an implementation on + dedicated hardware scales more, but it was tested only in labs for + now. + +5.1.1. Metric Transparency + + Since a Service Provider application is described here, the method + can be applied to end-to-end services supplied to customers. So it + is important to highlight that the method MUST be transparent outside + the Service Provider domain. + + In Telecom Italia's implementation, the source node colors the + packets with a policy that is modified periodically via an automatic + script in order to alternate the DSCP field of the packets. The + nodes between source and destination (included) have to use an + access-list to count the colored packets that they receive and + forward. + + Moreover, the destination node has an important role: the colored + packets are intercepted and a policy restores and sets the DSCP field + of all the packets to the initial value. In this way, the metric is + transparent because outside the section of the network under + monitoring, the traffic flow is unchanged. + + In such a case, thanks to this restoring technique, network elements + outside the Alternate-Marking monitoring domain (e.g., the two + Provider Edge nodes of the Mobile Backhauling VPN MPLS) are totally + unaware that packets were marked. So this restoring technique makes + Alternate Marking completely transparent outside its monitoring + domain. + +6. Hybrid Measurement + + The method has been explicitly designed for Passive measurements, but + it can also be used with Active measurements. In order to have both + end-to-end measurements and intermediate measurements (Hybrid + measurements), two endpoints can exchange artificial traffic flows + and apply Alternate Marking over these flows. In the intermediate + + + +Fioccola, et al. Experimental [Page 24] + +RFC 8321 Alternate-Marking Method January 2018 + + + points, artificial traffic is managed in the same way as real traffic + and measured as specified before. So the application of the marking + method can also simplify the Active measurement, as explained in + [ALT-MM-AMP]. + +7. Compliance with Guidelines from RFC 6390 + + RFC 6390 [RFC6390] defines a framework and a process for developing + Performance Metrics for protocols above and below the IP layer (such + as IP-based applications that operate over reliable or datagram + transport protocols). + + This document doesn't aim to propose a new Performance Metric but + rather a new Method of Measurement for a few Performance Metrics that + have already been standardized. Nevertheless, it's worth applying + guidelines from [RFC6390] to the present document, in order to + provide a more complete and coherent description of the proposed + method. We used a combination of the Performance Metric Definition + template defined in Section 5.4 of [RFC6390] and the Dependencies + laid out in Section 5.5 of that document. + + o Metric Name / Metric Description: as already stated, this document + doesn't propose any new Performance Metrics. On the contrary, it + describes a novel method for measuring packet loss [RFC7680]. The + same concept, with small differences, can also be used to measure + delay [RFC7679] and jitter [RFC3393]. The document mainly + describes the applicability to packet loss measurement. + + o Method of Measurement or Calculation: according to the method + described in the previous sections, the number of packets lost is + calculated by subtracting the value of the counter on the source + node from the value of the counter on the destination node. Both + counters must refer to the same color. The calculation is + performed when the value of the counters is in a steady state. + The steady state is an intrinsic characteristic of the marking + method counters because the alternation of color makes the + counters associated with each color still one at a time for the + duration of a marking period. + + o Units of Measurement: the method calculates and reports the exact + number of packets sent by the source node and not received by the + destination node. + + o Measurement Point(s) with Potential Measurement Domain: the + measurement can be performed between adjacent nodes, on a per-link + basis, or along a multi-hop path, provided that the traffic under + measurement follows that path. In case of a multi-hop path, the + measurements can be performed both end-to-end and hop-by-hop. + + + +Fioccola, et al. Experimental [Page 25] + +RFC 8321 Alternate-Marking Method January 2018 + + + o Measurement Timing: the method has a constraint on the frequency + of measurements. This is detailed in Section 3.2, where it is + specified that the marking period and the guard band interval are + strictly related each other to avoid out-of-order issues. That is + because, in order to perform a measurement, the counter must be in + a steady state, and this happens when the traffic is being colored + with the alternate color. As an example, in the experiment of the + method, the time interval is set to 5 minutes, while other + optimized implementations can also use a marking period of a few + seconds. + + o Implementation: the experiment of the method uses two encodings of + the DSCP field to color the packets; this enables the use of + policy configurations on the router to color the packets and + accordingly configure the counter for each color. The path + followed by traffic being measured should be known in advance in + order to configure the counters along the path and be able to + compare the correct values. + + o Verification: both in the lab and in the operational network, the + methodology has been tested and experimented for packet loss and + delay measurements by using traffic generators together with + precision test instruments and network emulators. + + o Use and Applications: the method can be used to measure packet + loss with high precision on live traffic; moreover, by combining + end-to-end and per-link measurements, the method is useful to + pinpoint the single link that is experiencing loss events. + + o Reporting Model: the value of the counters has to be sent to a + centralized management system that performs the calculations; such + samples must contain a reference to the time interval they refer + to, so that the management system can perform the correct + correlation; the samples have to be sent while the corresponding + counter is in a steady state (within a time interval); otherwise, + the value of the sample should be stored locally. + + o Dependencies: the values of the counters have to be correlated to + the time interval they refer to; moreover, because the experiment + of the method is based on DSCP values, there are significant + dependencies on the usage of the DSCP field: it must be possible + to rely on unused DSCP values without affecting QoS-related + configuration and behavior; moreover, the intermediate nodes must + not change the value of the DSCP field not to alter the + measurement. + + o Organization of Results: the Method of Measurement produces + singletons. + + + +Fioccola, et al. Experimental [Page 26] + +RFC 8321 Alternate-Marking Method January 2018 + + + o Parameters: currently, the main parameter of the method is the + time interval used to alternate the colors and read the counters. + +8. IANA Considerations + + This document has no IANA actions. + +9. Security Considerations + + This document specifies a method to perform measurements in the + context of a Service Provider's network and has not been developed to + conduct Internet measurements, so it does not directly affect + Internet security nor applications that run on the Internet. + However, implementation of this method must be mindful of security + and privacy concerns. + + There are two types of security concerns: potential harm caused by + the measurements and potential harm to the measurements. + + o Harm caused by the measurement: the measurements described in this + document are Passive, so there are no new packets injected into + the network causing potential harm to the network itself and to + data traffic. Nevertheless, the method implies modifications on + the fly to a header or encapsulation of the data packets: this + must be performed in a way that doesn't alter the quality of + service experienced by packets subject to measurements and that + preserves stability and performance of routers doing the + measurements. One of the main security threats in OAM protocols + is network reconnaissance; an attacker can gather information + about the network performance by passively eavesdropping on OAM + messages. The advantage of the methods described in this document + is that the marking bits are the only information that is + exchanged between the network devices. Therefore, Passive + eavesdropping on data-plane traffic does not allow attackers to + gain information about the network performance. + + o Harm to the Measurement: the measurements could be harmed by + routers altering the marking of the packets or by an attacker + injecting artificial traffic. Authentication techniques, such as + digital signatures, may be used where appropriate to guard against + injected traffic attacks. Since the measurement itself may be + affected by routers (or other network devices) along the path of + IP packets intentionally altering the value of marking bits of + packets, as mentioned above, the mechanism specified in this + document can be applied just in the context of a controlled + domain; thus, the routers (or other network devices) are locally + administered and this type of attack can be avoided. In addition, + an attacker can't gain information about network performance from + + + +Fioccola, et al. Experimental [Page 27] + +RFC 8321 Alternate-Marking Method January 2018 + + + a single monitoring point; it must use synchronized monitoring + points at multiple points on the path, because they have to do the + same kind of measurement and aggregation that Service Providers + using Alternate Marking must do. + + The privacy concerns of network measurement are limited because the + method only relies on information contained in the header or + encapsulation without any release of user data. Although information + in the header or encapsulation is metadata that can be used to + compromise the privacy of users, the limited marking technique in + this document seems unlikely to substantially increase the existing + privacy risks from header or encapsulation metadata. It might be + theoretically possible to modulate the marking to serve as a covert + channel, but it would have a very low data rate if it is to avoid + adversely affecting the measurement systems that monitor the marking. + + Delay attacks are another potential threat in the context of this + document. Delay measurement is performed using a specific packet in + each block, marked by a dedicated color bit. Therefore, a + man-in-the-middle attacker can selectively induce synthetic delay + only to delay-colored packets, causing systematic error in the delay + measurements. As discussed in previous sections, the methods + described in this document rely on an underlying time synchronization + protocol. Thus, by attacking the time protocol, an attacker can + potentially compromise the integrity of the measurement. A detailed + discussion about the threats against time protocols and how to + mitigate them is presented in RFC 7384 [RFC7384]. + +10. References + +10.1. Normative References + + [IEEE-1588] + IEEE, "IEEE Standard for a Precision Clock Synchronization + Protocol for Networked Measurement and Control Systems", + IEEE Std 1588-2008. + + [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate + Requirement Levels", BCP 14, RFC 2119, + DOI 10.17487/RFC2119, March 1997, + <https://www.rfc-editor.org/info/rfc2119>. + + [RFC3393] Demichelis, C. and P. Chimento, "IP Packet Delay Variation + Metric for IP Performance Metrics (IPPM)", RFC 3393, + DOI 10.17487/RFC3393, November 2002, + <https://www.rfc-editor.org/info/rfc3393>. + + + + + +Fioccola, et al. Experimental [Page 28] + +RFC 8321 Alternate-Marking Method January 2018 + + + [RFC5905] Mills, D., Martin, J., Ed., Burbank, J., and W. Kasch, + "Network Time Protocol Version 4: Protocol and Algorithms + Specification", RFC 5905, DOI 10.17487/RFC5905, June 2010, + <https://www.rfc-editor.org/info/rfc5905>. + + [RFC7679] Almes, G., Kalidindi, S., Zekauskas, M., and A. Morton, + Ed., "A One-Way Delay Metric for IP Performance Metrics + (IPPM)", STD 81, RFC 7679, DOI 10.17487/RFC7679, January + 2016, <https://www.rfc-editor.org/info/rfc7679>. + + [RFC7680] Almes, G., Kalidindi, S., Zekauskas, M., and A. Morton, + Ed., "A One-Way Loss Metric for IP Performance Metrics + (IPPM)", STD 82, RFC 7680, DOI 10.17487/RFC7680, January + 2016, <https://www.rfc-editor.org/info/rfc7680>. + + [RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC + 2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174, + May 2017, <https://www.rfc-editor.org/info/rfc8174>. + +10.2. Informative References + + [ALT-MM-AMP] + Fioccola, G., Clemm, A., Bryant, S., Cociglio, M., + Chandramouli, M., and A. Capello, "Alternate Marking + Extension to Active Measurement Protocol", Work in + Progress, draft-fioccola-ippm-alt-mark-active-01, March + 2017. + + [ALT-MM-SLA] + Fioccola, G., Clemm, A., Cociglio, M., Chandramouli, M., + and A. Capello, "Alternate Marking Extension to Cisco SLA + Protocol RFC6812", Work in Progress, draft-fioccola-ippm- + rfc6812-alt-mark-ext-01, March 2016. + + [COLORING] Chen, M., Zheng, L., Mirsky, G., Fioccola, G., and T. + Mizrahi, "IP Flow Performance Measurement Framework", Work + in Progress, draft-chen-ippm-coloring-based-ipfpm- + framework-06, March 2016. + + [IP-FLOW-REPORT] + Chen, M., Zheng, L., and G. Mirsky, "IP Flow Performance + Measurement Report", Work in Progress, draft-chen-ippm- + ipfpm-report-01, April 2016. + + + + + + + + +Fioccola, et al. Experimental [Page 29] + +RFC 8321 Alternate-Marking Method January 2018 + + + [IP-MULTICAST-PM] + Cociglio, M., Capello, A., Bonda, A., and L. Castaldelli, + "A method for IP multicast performance monitoring", Work + in Progress, draft-cociglio-mboned-multicast-pm-01, + October 2010. + + [MPLS-FLOW] + Bryant, S., Pignataro, C., Chen, M., Li, Z., and G. + Mirsky, "MPLS Flow Identification Considerations", Work in + Progress, draft-ietf-mpls-flow-ident-06, December 2017. + + [MULTIPOINT-ALT-MM] + Fioccola, G., Cociglio, M., Sapio, A., and R. Sisto, + "Multipoint Alternate Marking method for passive and + hybrid performance monitoring", Work in Progress, + draft-fioccola-ippm-multipoint-alt-mark-01, October 2017. + + [NVO3-ENCAPS] + Boutros, S., Ganga, I., Garg, P., Manur, R., Mizrahi, T., + Mozes, D., Nordmark, E., Smith, M., Aldrin, S., and I. + Bagdonas, "NVO3 Encapsulation Considerations", Work in + Progress, draft-ietf-nvo3-encap-01, October 2017. + + [OPSAWG-P3M] + Capello, A., Cociglio, M., Castaldelli, L., and A. Bonda, + "A packet based method for passive performance + monitoring", Work in Progress, draft-tempia-opsawg-p3m-04, + February 2014. + + [PM-MM-BIER] + Mirsky, G., Zheng, L., Chen, M., and G. Fioccola, + "Performance Measurement (PM) with Marking Method in Bit + Index Explicit Replication (BIER) Layer", Work in + Progress, draft-ietf-bier-pmmm-oam-03, October 2017. + + [RFC4656] Shalunov, S., Teitelbaum, B., Karp, A., Boote, J., and M. + Zekauskas, "A One-way Active Measurement Protocol + (OWAMP)", RFC 4656, DOI 10.17487/RFC4656, September 2006, + <https://www.rfc-editor.org/info/rfc4656>. + + [RFC5357] Hedayat, K., Krzanowski, R., Morton, A., Yum, K., and J. + Babiarz, "A Two-Way Active Measurement Protocol (TWAMP)", + RFC 5357, DOI 10.17487/RFC5357, October 2008, + <https://www.rfc-editor.org/info/rfc5357>. + + [RFC5481] Morton, A. and B. Claise, "Packet Delay Variation + Applicability Statement", RFC 5481, DOI 10.17487/RFC5481, + March 2009, <https://www.rfc-editor.org/info/rfc5481>. + + + +Fioccola, et al. Experimental [Page 30] + +RFC 8321 Alternate-Marking Method January 2018 + + + [RFC6374] Frost, D. and S. Bryant, "Packet Loss and Delay + Measurement for MPLS Networks", RFC 6374, + DOI 10.17487/RFC6374, September 2011, + <https://www.rfc-editor.org/info/rfc6374>. + + [RFC6390] Clark, A. and B. Claise, "Guidelines for Considering New + Performance Metric Development", BCP 170, RFC 6390, + DOI 10.17487/RFC6390, October 2011, + <https://www.rfc-editor.org/info/rfc6390>. + + [RFC6703] Morton, A., Ramachandran, G., and G. Maguluri, "Reporting + IP Network Performance Metrics: Different Points of View", + RFC 6703, DOI 10.17487/RFC6703, August 2012, + <https://www.rfc-editor.org/info/rfc6703>. + + [RFC7011] Claise, B., Ed., Trammell, B., Ed., and P. Aitken, + "Specification of the IP Flow Information Export (IPFIX) + Protocol for the Exchange of Flow Information", STD 77, + RFC 7011, DOI 10.17487/RFC7011, September 2013, + <https://www.rfc-editor.org/info/rfc7011>. + + [RFC7276] Mizrahi, T., Sprecher, N., Bellagamba, E., and Y. + Weingarten, "An Overview of Operations, Administration, + and Maintenance (OAM) Tools", RFC 7276, + DOI 10.17487/RFC7276, June 2014, + <https://www.rfc-editor.org/info/rfc7276>. + + [RFC7384] Mizrahi, T., "Security Requirements of Time Protocols in + Packet Switched Networks", RFC 7384, DOI 10.17487/RFC7384, + October 2014, <https://www.rfc-editor.org/info/rfc7384>. + + [RFC7799] Morton, A., "Active and Passive Metrics and Methods (with + Hybrid Types In-Between)", RFC 7799, DOI 10.17487/RFC7799, + May 2016, <https://www.rfc-editor.org/info/rfc7799>. + + [RFC8296] Wijnands, IJ., Ed., Rosen, E., Ed., Dolganow, A., + Tantsura, J., Aldrin, S., and I. Meilik, "Encapsulation + for Bit Index Explicit Replication (BIER) in MPLS and Non- + MPLS Networks", RFC 8296, DOI 10.17487/RFC8296, January + 2018, <https://www.rfc-editor.org/info/rfc8296>. + + [SFL-FRAMEWORK] + Bryant, S., Chen, M., Li, Z., Swallow, G., Sivabalan, S., + and G. Mirsky, "Synonymous Flow Label Framework", Work in + Progress, draft-ietf-mpls-sfl-framework-00, August 2017. + + + + + + +Fioccola, et al. Experimental [Page 31] + +RFC 8321 Alternate-Marking Method January 2018 + + + [SYN-FLOW-LABELS] + Bryant, S., Chen, M., Li, Z., Swallow, G., Sivabalan, S., + Mirsky, G., and G. Fioccola, "RFC6374 Synonymous Flow + Labels", Work in Progress, draft-ietf-mpls-rfc6374-sfl-01, + December 2017. + +Acknowledgements + + The previous IETF specifications describing this technique were: + [IP-MULTICAST-PM] and [OPSAWG-P3M]. + + The authors would like to thank Alberto Tempia Bonda, Domenico + Laforgia, Daniele Accetta, and Mario Bianchetti for their + contribution to the definition and the implementation of the method. + + The authors would also thank Spencer Dawkins, Carlos Pignataro, Brian + Haberman, and Eric Vyncke for their assistance and their detailed and + precious reviews. + +Authors' Addresses + + Giuseppe Fioccola (editor) + Telecom Italia + Via Reiss Romoli, 274 + Torino 10148 + Italy + + Email: giuseppe.fioccola@telecomitalia.it + + + Alessandro Capello + Telecom Italia + Via Reiss Romoli, 274 + Torino 10148 + Italy + + Email: alessandro.capello@telecomitalia.it + + + Mauro Cociglio + Telecom Italia + Via Reiss Romoli, 274 + Torino 10148 + Italy + + Email: mauro.cociglio@telecomitalia.it + + + + + +Fioccola, et al. Experimental [Page 32] + +RFC 8321 Alternate-Marking Method January 2018 + + + Luca Castaldelli + Telecom Italia + Via Reiss Romoli, 274 + Torino 10148 + Italy + + Email: luca.castaldelli@telecomitalia.it + + + Mach(Guoyi) Chen + Huawei Technologies + + Email: mach.chen@huawei.com + + + Lianshu Zheng + Huawei Technologies + + Email: vero.zheng@huawei.com + + + Greg Mirsky + ZTE + United States of America + + Email: gregimirsky@gmail.com + + + Tal Mizrahi + Marvell + 6 Hamada St. + Yokneam + Israel + + Email: talmi@marvell.com + + + + + + + + + + + + + + + + +Fioccola, et al. Experimental [Page 33] + |