1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
|
Internet Engineering Task Force (IETF) X. Xu
Request for Comments: 7814 Huawei Technologies
Category: Informational C. Jacquenet
ISSN: 2070-1721 Orange
R. Raszuk
T. Boyes
Bloomberg LP
B. Fee
Extreme Networks
March 2016
Virtual Subnet: A BGP/MPLS IP VPN-Based Subnet Extension Solution
Abstract
This document describes a BGP/MPLS IP VPN-based subnet extension
solution referred to as "Virtual Subnet", which can be used for
building Layer 3 network virtualization overlays within and/or
between data centers.
Status of This Memo
This document is not an Internet Standards Track specification; it is
published for informational purposes.
This document is a product of the Internet Engineering Task Force
(IETF). It represents the consensus of the IETF community. It has
received public review and has been approved for publication by the
Internet Engineering Steering Group (IESG). Not all documents
approved by the IESG are a candidate for any level of Internet
Standard; see Section 2 of RFC 5741.
Information about the current status of this document, any errata,
and how to provide feedback on it may be obtained at
http://www.rfc-editor.org/info/rfc7814.
Xu, et al. Informational [Page 1]
^L
RFC 7814 Virtual Subnet March 2016
Copyright Notice
Copyright (c) 2016 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 3
2. Terminology . . . . . . . . . . . . . . . . . . . . . . . . . 4
3. Solution Description . . . . . . . . . . . . . . . . . . . . 5
3.1. Unicast . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.1.1. Intra-subnet Unicast . . . . . . . . . . . . . . . . 5
3.1.2. Inter-subnet Unicast . . . . . . . . . . . . . . . . 6
3.2. Multicast . . . . . . . . . . . . . . . . . . . . . . . . 8
3.3. Host Discovery . . . . . . . . . . . . . . . . . . . . . 9
3.4. ARP/ND Proxy . . . . . . . . . . . . . . . . . . . . . . 9
3.5. Host Mobility . . . . . . . . . . . . . . . . . . . . . . 9
3.6. Forwarding Table Scalability on Data-Center Switches . . 10
3.7. ARP/ND Cache Table Scalability on Default Gateways . . . 10
3.8. ARP/ND and Unknown Unicast Flood Avoidance . . . . . . . 10
3.9. Path Optimization . . . . . . . . . . . . . . . . . . . . 10
4. Limitations . . . . . . . . . . . . . . . . . . . . . . . . . 11
4.1. Non-support of Non-IP Traffic . . . . . . . . . . . . . . 11
4.2. Non-support of IP Broadcast and Link-Local Multicast . . 11
4.3. TTL and Traceroute . . . . . . . . . . . . . . . . . . . 11
5. Security Considerations . . . . . . . . . . . . . . . . . . . 12
6. References . . . . . . . . . . . . . . . . . . . . . . . . . 12
6.1. Normative References . . . . . . . . . . . . . . . . . . 12
6.2. Informative References . . . . . . . . . . . . . . . . . 13
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . 14
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 15
Xu, et al. Informational [Page 2]
^L
RFC 7814 Virtual Subnet March 2016
1. Introduction
For business continuity purposes, Virtual Machine (VM) migration
across data centers is commonly used in situations such as data-
center maintenance, migration, consolidation, expansion, or disaster
avoidance. The IETF community has recognized that IP renumbering of
servers (i.e., VMs) after the migration is usually complex and
costly. To allow the migration of a VM from one data center to
another without IP renumbering, the subnet on which the VM resides
needs to be extended across these data centers.
To achieve subnet extension across multiple cloud data centers in a
scalable way, the following requirements and challenges must be
considered:
a. VPN Instance Space Scalability: In a modern cloud data-center
environment, thousands or even tens of thousands of tenants could
be hosted over a shared network infrastructure. For security and
performance isolation purposes, these tenants need to be isolated
from one another.
b. Forwarding Table Scalability: With the development of server
virtualization technologies, it's not uncommon for a single cloud
data center to contain millions of VMs. This number already
implies a big challenge to the forwarding table scalability of
data-center switches. Provided multiple data centers of such
scale were interconnected at Layer 2, this challenge would become
even worse.
c. ARP/ND Cache Table Scalability: [RFC6820] notes that the Address
Resolution Protocol (ARP) / Neighbor Discovery (ND) cache tables
maintained by default gateways within cloud data centers can
raise scalability issues. Therefore, mastering the size of the
ARP/ND cache tables is critical as the number of data centers to
be connected increases.
d. ARP/ND and Unknown Unicast Flooding: It's well-known that the
flooding of ARP/ND broadcast/multicast messages as well as
unknown unicast traffic within large Layer 2 networks is likely
to affect network and host performance. When multiple data
centers that each host millions of VMs are interconnected at
Layer 2, the impact of such flooding would become even worse. As
such, it becomes increasingly important to avoid the flooding of
ARP/ND broadcast/multicast as well as unknown unicast traffic
across data centers.
Xu, et al. Informational [Page 3]
^L
RFC 7814 Virtual Subnet March 2016
e. Path Optimization: A subnet usually indicates a location in the
network. However, when a subnet has been extended across
multiple geographically dispersed data-center locations, the
location semantics of such a subnet is not retained any longer.
As a result, traffic exchanged between a specific user and a
server that would be located in different data centers may first
be forwarded through a third data center. This suboptimal
routing would obviously result in unnecessary consumption of the
bandwidth resources between data centers. Furthermore, in the
case where traditional Virtual Private LAN Service (VPLS)
technology [RFC4761] [RFC4762] is used for data-center
interconnect, return traffic from a server may be forwarded to a
default gateway located in a different data center due to the
configuration of a virtual router redundancy group. This
suboptimal routing would also unnecessarily consume the bandwidth
resources between data centers.
This document describes a BGP/MPLS IP VPN-based subnet extension
solution referred to as "Virtual Subnet", which can be used for data-
center interconnection while addressing all of the aforementioned
requirements and challenges. Here, the BGP/MPLS IP VPN means both
BGP/MPLS IPv4 VPN [RFC4364] and BGP/MPLS IPv6 VPN [RFC4659]. In
addition, since Virtual Subnet is built mainly on proven technologies
such as BGP/MPLS IP VPN and ARP/ND proxy [RFC925] [RFC1027]
[RFC4389], those service providers that provide Infrastructure as a
Service (IaaS) cloud services can rely upon their existing BGP/MPLS
IP VPN infrastructure and take advantage of their BGP/MPLS VPN
operational experience to interconnect data centers.
Although Virtual Subnet is described in this document as an approach
for data-center interconnection, it can be used within data centers
as well.
Note that the approach described in this document is not intended to
achieve an exact emulation of Layer 2 connectivity, and therefore it
can only support a restricted Layer 2 connectivity service model with
limitations that are discussed in Section 4. The discussion about
where this service model can apply is outside the scope of this
document.
2. Terminology
This memo makes use of the terms defined in [RFC4364].
Xu, et al. Informational [Page 4]
^L
RFC 7814 Virtual Subnet March 2016
3. Solution Description
3.1. Unicast
3.1.1. Intra-subnet Unicast
+--------------------+
+------------------+ | | +------------------+
|VPN_A:192.0.2.1/24| | | |VPN_A:192.0.2.1/24|
| \ | | | | / |
| +------+ \ ++---+-+ +-+---++/ +------+ |
| |Host A+-----+ PE-1 | | PE-2 +----+Host B| |
| +------+\ ++-+-+-+ +-+-+-++ /+------+ |
| 192.0.2.2/24 | | | | | | 192.0.2.3/24 |
| | | | | | | |
| DC West | | | IP/MPLS Backbone | | | DC East |
+------------------+ | | | | +------------------+
| +--------------------+ |
| |
VRF_A : V VRF_A : V
+------------+---------+--------+ +------------+---------+--------+
| Prefix |Next hop |Protocol| | Prefix |Next hop |Protocol|
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.1/32|127.0.0.1| Direct | |192.0.2.1/32|127.0.0.1| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.2/32|192.0.2.2| Direct | |192.0.2.2/32| PE-1 | IBGP |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.3/32| PE-2 | IBGP | |192.0.2.3/32|192.0.2.3| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.0/24|192.0.2.1| Direct | |192.0.2.0/24|192.0.2.1| Direct |
+------------+---------+--------+ +------------+---------+--------+
Figure 1: Intra-subnet Unicast Example
As shown in Figure 1, two hosts (i.e., Hosts A and B) belonging to
the same subnet (i.e., 192.0.2.0/24) are located in different data
centers (i.e., DC West and DC East), respectively. PE routers (i.e.,
PE-1 and PE-2) that are used for interconnecting these two data
centers create host routes for their own local hosts respectively and
then advertise these routes by means of the BGP/MPLS IP VPN
signaling. Meanwhile, an ARP proxy is enabled on Virtual Routing and
Forwarding (VRF) attachment circuits of these PE routers.
Let's now assume that Host A sends an ARP request for Host B before
communicating with Host B. Upon receiving the ARP request, PE-1
acting as an ARP proxy returns its own MAC address as a response.
Host A then sends IP packets for Host B to PE-1. PE-1 tunnels such
packets towards PE-2, which in turn forwards them to Host B. Thus,
Xu, et al. Informational [Page 5]
^L
RFC 7814 Virtual Subnet March 2016
Hosts A and B can communicate with each other as if they were located
within the same subnet.
3.1.2. Inter-subnet Unicast
+--------------------+
+------------------+ | | +------------------+
|VPN_A:192.0.2.1/24| | | |VPN_A:192.0.2.1/24|
| \ | | | | / |
| +------+ \ ++---+-+ +-+---++/ +------+ |
| |Host A+-------+ PE-1 | | PE-2 +-+----+Host B| |
| +------+\ ++-+-+-+ +-+-+-++ | /+------+ |
| 192.0.2.2/24 | | | | | | | 192.0.2.3/24 |
| GW=192.0.2.4 | | | | | | | GW=192.0.2.4 |
| | | | | | | | +------+ |
| | | | | | | +----+ GW +-- |
| | | | | | | /+------+ |
| | | | | | | 192.0.2.4/24 |
| | | | | | | |
| DC West | | | IP/MPLS Backbone | | | DC East |
+------------------+ | | | | +------------------+
| +--------------------+ |
| |
VRF_A : V VRF_A : V
+------------+---------+--------+ +------------+---------+--------+
| Prefix |Next hop |Protocol| | Prefix |Next hop |Protocol|
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.1/32|127.0.0.1| Direct | |192.0.2.1/32|127.0.0.1| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.2/32|192.0.2.2| Direct | |192.0.2.2/32| PE-1 | IBGP |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.3/32| PE-2 | IBGP | |192.0.2.3/32|192.0.2.3| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.4/32| PE-2 | IBGP | |192.0.2.4/32|192.0.2.4| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.0/24|192.0.2.1| Direct | |192.0.2.0/24|192.0.2.1| Direct |
+------------+---------+--------+ +------------+---------+--------+
| 0.0.0.0/0 | PE-2 | IBGP | | 0.0.0.0/0 |192.0.2.4| Static |
+------------+---------+--------+ +------------+---------+--------+
Figure 2: Inter-subnet Unicast Example (1)
As shown in Figure 2, only one data center (i.e., DC East) is
deployed with a default gateway (i.e., GW). PE-2, which is connected
to GW, would either be configured with or have learned a default
route from GW with the next hop being pointed at GW. Meanwhile, this
route is distributed to other PE routers (i.e., PE-1) as per normal
operation as described in [RFC4364]. Assume Host A sends an ARP
Xu, et al. Informational [Page 6]
^L
RFC 7814 Virtual Subnet March 2016
request for its default gateway (i.e., 192.0.2.4) prior to
communicating with a destination host outside of its subnet. Upon
receiving this ARP request, PE-1 acting as an ARP proxy returns its
own MAC address as a response. Host A then sends a packet for Host B
to PE-1. PE-1 tunnels such a packet towards PE-2 according to the
default route learned from PE-2, which in turn forwards that packet
to GW.
+--------------------+
+------------------+ | | +------------------+
|VPN_A:192.0.2.1/24| | | |VPN_A:192.0.2.1/24|
| \ | | | | / |
| +------+ \ ++---+-+ +-+---++/ +------+ |
| |Host A+----+--+ PE-1 | | PE-2 +-+----+Host B| |
| +------+\ | ++-+-+-+ +-+-+-++ | /+------+ |
| 192.0.2.2/24 | | | | | | | | 192.0.2.3/24 |
| GW=192.0.2.4 | | | | | | | | GW=192.0.2.4 |
| +------+ | | | | | | | | +------+ |
|--+ GW-1 +----+ | | | | | | +----+ GW-2 +-- |
| +------+\ | | | | | | /+------+ |
| 192.0.2.4/24 | | | | | | 192.0.2.4/24 |
| | | | | | | |
| DC West | | | IP/MPLS Backbone | | | DC East |
+------------------+ | | | | +------------------+
| +--------------------+ |
| |
VRF_A : V VRF_A : V
+------------+---------+--------+ +------------+---------+--------+
| Prefix |Next hop |Protocol| | Prefix |Next hop |Protocol|
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.1/32|127.0.0.1| Direct | |192.0.2.1/32|127.0.0.1| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.2/32|192.0.2.2| Direct | |192.0.2.2/32| PE-1 | IBGP |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.3/32| PE-2 | IBGP | |192.0.2.3/32|192.0.2.3| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.4/32|192.0.2.4| Direct | |192.0.2.4/32|192.0.2.4| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.0/24|192.0.2.1| Direct | |192.0.2.0/24|192.0.2.1| Direct |
+------------+---------+--------+ +------------+---------+--------+
| 0.0.0.0/0 |192.0.2.4| Static | | 0.0.0.0/0 |192.0.2.4| Static |
+------------+---------+--------+ +------------+---------+--------+
Figure 3: Inter-subnet Unicast Example (2)
As shown in Figure 3, in the case where each data center is deployed
with a default gateway, hosts will get ARP responses directly from
their local default gateways, rather than from their local PE routers
when sending ARP requests for their default gateways.
Xu, et al. Informational [Page 7]
^L
RFC 7814 Virtual Subnet March 2016
+------+
+------+ PE-3 +------+
+------------------+ | +------+ | +------------------+
|VPN_A:192.0.2.1/24| | | |VPN_A:192.0.2.1/24|
| \ | | | | / |
| +------+ \ ++---+-+ +-+---++/ +------+ |
| |Host A+-------+ PE-1 | | PE-2 +------+Host B| |
| +------+\ ++-+-+-+ +-+-+-++ /+------+ |
| 192.0.2.2/24 | | | | | | 192.0.2.3/24 |
| GW=192.0.2.1 | | | | | | GW=192.0.2.1 |
| | | | | | | |
| DC West | | | IP/MPLS Backbone | | | DC East |
+------------------+ | | | | +------------------+
| +--------------------+ |
| |
VRF_A : V VRF_A : V
+------------+---------+--------+ +------------+---------+--------+
| Prefix |Next hop |Protocol| | Prefix |Next hop |Protocol|
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.1/32|127.0.0.1| Direct | |192.0.2.1/32|127.0.0.1| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.2/32|192.0.2.2| Direct | |192.0.2.2/32| PE-1 | IBGP |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.3/32| PE-2 | IBGP | |192.0.2.3/32|192.0.2.3| Direct |
+------------+---------+--------+ +------------+---------+--------+
|192.0.2.0/24|192.0.2.1| Direct | |192.0.2.0/24|192.0.2.1| Direct |
+------------+---------+--------+ +------------+---------+--------+
| 0.0.0.0/0 | PE-3 | IBGP | | 0.0.0.0/0 | PE-3 | IBGP |
+------------+---------+--------+ +------------+---------+--------+
Figure 4: Inter-subnet Unicast Example (3)
Alternatively, as shown in Figure 4, PE routers themselves could be
configured as default gateways for their locally connected hosts as
long as these PE routers have routes to reach outside networks.
3.2. Multicast
To support IP multicast between hosts of the same Virtual Subnet,
Multicast VPN (MVPN) technologies [RFC6513] could be used without any
change. For example, PE routers attached to a given VPN join a
default provider multicast distribution tree that is dedicated to
that VPN. Ingress PE routers, upon receiving multicast packets from
their local hosts, forward them towards remote PE routers through the
corresponding default provider multicast distribution tree. Within
this context, the IP multicast doesn't include link-local multicast.
Xu, et al. Informational [Page 8]
^L
RFC 7814 Virtual Subnet March 2016
3.3. Host Discovery
PE routers should be able to dynamically discover their local hosts
and keep the list of these hosts up-to-date in a timely manner to
ensure the availability and accuracy of the corresponding host routes
originated from them. PE routers could accomplish local host
discovery by some traditional host-discovery mechanisms using ARP or
ND protocols.
3.4. ARP/ND Proxy
Acting as an ARP or ND proxy, a PE router should only respond to an
ARP request or Neighbor Solicitation (NS) message for a target host
when it has a best route for that target host in the associated VRF
and the outgoing interface of that best route is different from the
one over which the ARP request or NS message is received. In the
scenario where a given VPN site (i.e., a data center) is multihomed
to more than one PE router via an Ethernet switch or an Ethernet
network, the Virtual Router Redundancy Protocol (VRRP) [RFC5798] is
usually enabled on these PE routers. In this case, only the PE
router being elected as the VRRP Master is allowed to perform the
ARP/ND proxy function.
3.5. Host Mobility
During the VM migration process, the PE router to which the moving VM
is now attached would create a host route for that host upon
receiving a notification message of VM attachment (e.g., a gratuitous
ARP or unsolicited NA message). The PE router to which the moving VM
was previously attached would withdraw the corresponding host route
when noticing the detachment of that VM. Meanwhile, the latter PE
router could optionally broadcast a gratuitous ARP or send an
unsolicited NA message on behalf of that host with the source MAC
address being one of its own. In this way, the ARP/ND entry of this
host that moved and that has been cached on any local host would be
updated accordingly. In the case where there is no explicit VM
detachment notification mechanism, the PE router could also use the
following trick to detect the VM detachment: upon learning a route
update for a local host from a remote PE router for the first time,
the PE router could immediately check whether that local host is
still attached to it by some means (e.g., ARP/ND PING and/or ICMP
PING). It is important to ensure that the same MAC and IP are
associated to the default gateway active in each data center, as the
VM would most likely continue to send packets to the same default
gateway address after having migrated from one data center to
another. One possible way to achieve this goal is to configure the
Xu, et al. Informational [Page 9]
^L
RFC 7814 Virtual Subnet March 2016
same VRRP group on each location to ensure that the default gateway
active in each data center shares the same virtual MAC and virtual IP
addresses.
3.6. Forwarding Table Scalability on Data-Center Switches
In a Virtual Subnet environment, the MAC learning domain associated
with a given Virtual Subnet that has been extended across multiple
data centers is partitioned into segments, and each segment is
confined within a single data center. Therefore, data-center
switches only need to learn local MAC addresses, rather than learning
both local and remote MAC addresses.
3.7. ARP/ND Cache Table Scalability on Default Gateways
When default gateway functions are implemented on PE routers as shown
in Figure 4, the ARP/ND cache table on each PE router only needs to
contain ARP/ND entries of local hosts. As a result, the ARP/ND cache
table size would not grow as the number of data centers to be
connected increases.
3.8. ARP/ND and Unknown Unicast Flood Avoidance
In a Virtual Subnet environment, the flooding domain associated with
a given Virtual Subnet that was extended across multiple data
centers, is partitioned into segments and each segment is confined
within a single data center. Therefore, the performance impact on
networks and servers imposed by the flooding of ARP/ND broadcast/
multicast and unknown unicast traffic is minimized.
3.9. Path Optimization
As shown in Figure 4, to optimize the forwarding path for the traffic
between cloud users and cloud data centers, PE routers located in
cloud data centers (i.e., PE-1 and PE-2), which are also acting as
default gateways, propagate host routes for their own local hosts to
remote PE routers that are attached to cloud user sites (i.e., PE-3).
As such, traffic from cloud user sites to a given server on the
Virtual Subnet that has been extended across data centers would be
forwarded directly to the data-center location where that server
resides, since traffic is now forwarded according to the host route
for that server, rather than the subnet route. Furthermore, for
traffic coming from cloud data centers and forwarded to cloud user
sites, each PE router acting as a default gateway would forward
traffic according to the longest-match route in the corresponding
VRF. As a result, traffic from data centers to cloud user sites is
forwarded along an optimal path as well.
Xu, et al. Informational [Page 10]
^L
RFC 7814 Virtual Subnet March 2016
4. Limitations
4.1. Non-support of Non-IP Traffic
Although most traffic within and across data centers is IP traffic,
there may still be a few legacy clustering applications that rely on
non-IP communications (e.g., heartbeat messages between cluster
nodes). Since Virtual Subnet is strictly based on L3 forwarding,
those non-IP communications cannot be supported in the Virtual Subnet
solution. In order to support those few non-IP traffic (if present)
in the environment where the Virtual Subnet solution has been
deployed, the approach following the idea of "route all IP traffic,
bridge non-IP traffic" could be considered. In other words, all IP
traffic including both intra- and inter-subnet, would be processed
according to the Virtual Subnet design, while non-IP traffic would be
forwarded according to a particular Layer 2 VPN approach. Such a
unified L2/L3 VPN approach requires ingress PE routers to classify
packets received from hosts before distributing them to the
corresponding L2 or L3 VPN forwarding processes. Note that more and
more cluster vendors are offering clustering applications based on
Layer 3 interconnection.
4.2. Non-support of IP Broadcast and Link-Local Multicast
As illustrated before, intra-subnet traffic across PE routers is
forwarded at Layer 3 in the Virtual Subnet solution. Therefore, IP
broadcast and link-local multicast traffic cannot be forwarded across
PE routers in the Virtual Subnet solution. In order to support the
IP broadcast and link-local multicast traffic in the environment
where the Virtual Subnet solution has been deployed, the unified L2/
L3 overlay approach as described in Section 4.1 could be considered
as well. That is, IP broadcast and link-local multicast messages
would be forwarded at Layer 2 while routable IP traffic would be
processed according to the Virtual Subnet design.
4.3. TTL and Traceroute
As mentioned before, intra-subnet traffic is forwarded at Layer 3 in
the Virtual Subnet context. Since it doesn't require any change to
the Time-To-Live (TTL) handling mechanism of the BGP/MPLS IP VPN,
when doing a traceroute operation on one host for another host
(assuming that these two hosts are within the same subnet but are
attached to different sites), the traceroute output would reflect the
fact that these two hosts within the same subnet are actually
connected via a Virtual Subnet, rather than a Layer 2 connection
since the PE routers to which those two hosts are connected would be
displayed in the traceroute output. In addition, for any other
applications that generate intra-subnet traffic with TTL set to 1,
Xu, et al. Informational [Page 11]
^L
RFC 7814 Virtual Subnet March 2016
these applications may not work properly in the Virtual Subnet
context, unless special TTL processing and loop-prevention mechanisms
for such context have been implemented. Details about such special
TTL processing and loop-prevention mechanisms are outside the scope
of this document.
5. Security Considerations
Since the BGP/MPLS IP VPN signaling is reused without any change,
those security considerations as described in [RFC4364] are
applicable to this document. Meanwhile, since security issues
associated with the NDP are inherited due to the use of NDP proxy,
those security considerations and recommendations as described in
[RFC6583] are applicable to this document as well.
Inter-data-center traffic often carries highly sensitive information
at higher layers that is not directly understood (parsed) within an
egress or ingress PE. For example, migrating a VM will often mean
moving private keys and other sensitive configuration information.
For this reason, inter-data-center traffic should always be protected
for both confidentiality and integrity using a strong security
mechanism such as IPsec [RFC4301]. In the future, it may be feasible
to protect that traffic within the MPLS layer [MPLS-SEC] though at
the time of writing, the mechanism for that is not sufficiently
mature to recommend. Exactly how such security mechanisms are
deployed will vary from case to case, so securing the inter-data-
center traffic may or may not involve deploying security mechanisms
on the ingress/egress PEs or further "inside" the data centers
concerned. Note though that if security is not deployed on the
egress/ingress PEs, there is a substantial risk that some sensitive
traffic may be sent in the clear and will therefore be vulnerable to
pervasive monitoring [RFC7258] or other attacks.
6. References
6.1. Normative References
[RFC925] Postel, J., "Multi-LAN address resolution", RFC 925,
DOI 10.17487/RFC0925, October 1984,
<http://www.rfc-editor.org/info/rfc925>.
[RFC1027] Carl-Mitchell, S. and J. Quarterman, "Using ARP to
implement transparent subnet gateways", RFC 1027,
DOI 10.17487/RFC1027, October 1987,
<http://www.rfc-editor.org/info/rfc1027>.
Xu, et al. Informational [Page 12]
^L
RFC 7814 Virtual Subnet March 2016
[RFC4364] Rosen, E. and Y. Rekhter, "BGP/MPLS IP Virtual Private
Networks (VPNs)", RFC 4364, DOI 10.17487/RFC4364, February
2006, <http://www.rfc-editor.org/info/rfc4364>.
[RFC4389] Thaler, D., Talwar, M., and C. Patel, "Neighbor Discovery
Proxies (ND Proxy)", RFC 4389, DOI 10.17487/RFC4389, April
2006, <http://www.rfc-editor.org/info/rfc4389>.
6.2. Informative References
[MPLS-SEC] Farrel, A. and S. Farrell, "Opportunistic Security in MPLS
Networks", Work in Progress, draft-ietf-mpls-
opportunistic-encrypt-01, March 2016.
[RFC4301] Kent, S. and K. Seo, "Security Architecture for the
Internet Protocol", RFC 4301, DOI 10.17487/RFC4301,
December 2005, <http://www.rfc-editor.org/info/rfc4301>.
[RFC4659] De Clercq, J., Ooms, D., Carugi, M., and F. Le Faucheur,
"BGP-MPLS IP Virtual Private Network (VPN) Extension for
IPv6 VPN", RFC 4659, DOI 10.17487/RFC4659, September 2006,
<http://www.rfc-editor.org/info/rfc4659>.
[RFC4761] Kompella, K., Ed. and Y. Rekhter, Ed., "Virtual Private
LAN Service (VPLS) Using BGP for Auto-Discovery and
Signaling", RFC 4761, DOI 10.17487/RFC4761, January 2007,
<http://www.rfc-editor.org/info/rfc4761>.
[RFC4762] Lasserre, M., Ed. and V. Kompella, Ed., "Virtual Private
LAN Service (VPLS) Using Label Distribution Protocol (LDP)
Signaling", RFC 4762, DOI 10.17487/RFC4762, January 2007,
<http://www.rfc-editor.org/info/rfc4762>.
[RFC5798] Nadas, S., Ed., "Virtual Router Redundancy Protocol (VRRP)
Version 3 for IPv4 and IPv6", RFC 5798,
DOI 10.17487/RFC5798, March 2010,
<http://www.rfc-editor.org/info/rfc5798>.
[RFC6513] Rosen, E., Ed. and R. Aggarwal, Ed., "Multicast in MPLS/
BGP IP VPNs", RFC 6513, DOI 10.17487/RFC6513, February
2012, <http://www.rfc-editor.org/info/rfc6513>.
[RFC6583] Gashinsky, I., Jaeggli, J., and W. Kumari, "Operational
Neighbor Discovery Problems", RFC 6583,
DOI 10.17487/RFC6583, March 2012,
<http://www.rfc-editor.org/info/rfc6583>.
Xu, et al. Informational [Page 13]
^L
RFC 7814 Virtual Subnet March 2016
[RFC6820] Narten, T., Karir, M., and I. Foo, "Address Resolution
Problems in Large Data Center Networks", RFC 6820,
DOI 10.17487/RFC6820, January 2013,
<http://www.rfc-editor.org/info/rfc6820>.
[RFC7258] Farrell, S. and H. Tschofenig, "Pervasive Monitoring Is an
Attack", BCP 188, RFC 7258, DOI 10.17487/RFC7258, May
2014, <http://www.rfc-editor.org/info/rfc7258>.
Acknowledgements
Thanks to Susan Hares, Yongbing Fan, Dino Farinacci, Himanshu Shah,
Nabil Bitar, Giles Heron, Ronald Bonica, Monique Morrow, Rajiv Asati,
Eric Osborne, Thomas Morin, Martin Vigoureux, Pedro Roque Marque, Joe
Touch, Wim Henderickx, Alia Atlas, and Stephen Farrell for their
valuable comments and suggestions on this document. Thanks to Loa
Andersson for his WG LC review on this document. Thanks to Alvaro
Retana for his AD review on this document. Thanks to Ronald Bonica
for his RtgDir review. Thanks to Donald Eastlake 3rd for his Sec-DIR
review of this document. Thanks to Jouni Korhonen for the OPS-Dir
review of this document. Thanks to Roni Even for the Gen-ART review
of this document. Thanks to Sabrina Tanamal for the IANA review of
this document.
Xu, et al. Informational [Page 14]
^L
RFC 7814 Virtual Subnet March 2016
Authors' Addresses
Xiaohu Xu
Huawei Technologies
No.156 Beiqing Rd
Beijing 100095
China
Email: xuxiaohu@huawei.com
Christian Jacquenet
Orange
4 rue du Clos Courtel
Cesson-Sevigne, 35512
France
Email: christian.jacquenet@orange.com
Robert Raszuk
Bloomberg LP
731 Lexington Avenue
New York City, NY 10022
United States
Email: robert@raszuk.net
Truman Boyes
Bloomberg LP
Email: tboyes@bloomberg.net
Brendan Fee
Extreme Networks
Email: bfee@extremenetworks.com
Xu, et al. Informational [Page 15]
^L
|