From d3e3e3 at gmail.com Thu Sep 3 12:08:30 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Thu, 3 Sep 2009 15:08:30 -0400 Subject: [rbridge] Review of draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: References: Message-ID: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> Hi Dan, Thanks for your efforts in providing these detailed comments. On Sun, Aug 30, 2009 at 8:41 AM, Romascanu, Dan (Dan) wrote: > I was asked by the TRILL WG chairs to perform a review of > draft-ietf-trill-rbridge-protocol-13.txt. Below please find my findings. > > I appreciate the work put by the editors and WG in designing this > solution and writing the document. The editors are to be saluted for > writing an I-D which is in most parts clear and well organized. > > I do believe that this work and the document as it stands still have a > number of problems that need to be solved. > > 1. I have serious reservations that this document can directly go to > Proposed Standard. For reasons detailed below there is a lot of > incertitude in what concerns the interoperability or coexistence with > the layer 2 protocols designed in the IEEE. Many of these are still work > in progress. For this good reason in many places the document is forced > to make assumptions and cannot make but statements like 'Rbridges are > generally compatible with IEEE ...' (section 2.4.2). Has the WG > discussed the option of issuing the first version of this document as > Experimental? It is true that many IEEE 802.1 standards efforts are "work in progress". But with the ever multiplying number of efforts in 802.1, this situation seems permanent. To wait for 802.1 standards efforts to end would be to wait forever. There is no actual question concerning the interoperability of TRILL with current 802.1 protocols. Section 2 of the draft is a broad overview. That is the reason it says "... RBridges are generally compatible with IEEE [802.1Q-2005] customer bridges ..." in Section 2.4.2. If all the details were given in Section 2, it wouldn't be a broad introductory overview. To avoid misleading readers, that phrase should probably be changed to "... RBridges are compatible with IEEE [802.1Q-2005] customer bridges except as discussed in this document ...". I cannot recall any status other than Proposed Standard ever being suggested in the TRILL working group for this document. The TRILL protocol has had more review by routing and bridging network experts than most IETF protocols at the Proposed Standard level. TRILL has been implemented in software, tested, and the results of that testing included back in the protocol design. Multiple vendors are very interested in TRILL and I predict that RBridge products will ship in 2010. > 2. It looks to me like the motivation of the standard is frozen in time > into a period of evolution of the IEEE 802.1 technology trying to solve > some problems that were in part dealt with by the IEEE in the meantime. > Out of the four spanning tree bullets listed in the introduction as > problems that TRILL solves the second and the fourth seem to have been > dealt with by the IEEE 802.1aq and the IEEE 802.1ak respectively. It appears to me that this comment does not suggest any changes in the draft. But, as a response, the TRILL WG was created by the IETF to produce a standard subsequent to substantial discussion of the fact that, after it rejected TRILL technology, IEEE 802.1 then started an effort with similar goals. (And after the IEEE 802 Executive Committee stated that it had no objection to the IETF pursuing development of a TRILL standard.) > 3. All the VLAN discussions in the specification mention only the > C-VLANs. What about provider bridges? Can these be migrated to Rbridges? > The explanation in Appendix E is not sufficient, the compatibility with > provider bridges needs to be dealt in the core part of the I-D. The TRILL protocol specified in this draft is, as stated in the draft, a customer bridge standard. The draft is careful to refer to 802.1Q-2005 all over the place. An IEEE 802.1Q-2005 bridge is a purely customer bridge. This TRILL protocol specification does not try to solve the provider bridge problem. (I believe it would be relatively easy to extend TRILL to do that but it should be done in a separate document.) Provider bridges (and provider backbone bridges) cannot be replace by customer bridges, whether those customer bridges are 802.1 customer bridges or RBridges based on this spec. On the other hand, as stated in Appendix E, provider bridges (and/or provider backbone bridges) can be used to transparently interconnect parts of an RBridge campus. Since RBridges are compatible with provider bridges and provider backbone bridges, as I understand the word compatible, I don't understand what there is to "deal" with. However, an appropriately reworded version of the paragraph just above this paragraph can certainly be added to the main body of the draft. > 4. There needs to be more discussion about the IEEE protocols that do > not define forwarding or tagging. If TRILL makes a claim that the best > migration policy is to replace IEEE bridges by Rbridges it needs to > define behavior relative to protocols like IEEE 802.1AE Media Access > Control (MAC) Security, IEEE 802.1AB Station and Media Access Control > Connectivity Discovery, and IEEE 802.1aj Two Port Media Relay, as > bridges and end stations that are being replaced may support any or a > combination of these. This document does not claim that the best migration policy is incremental replacement, just that a network of 802.1Q-2005 customer bridges will, with the differences described in the document, operate correctly if some or all are replaced by RBridges. 802.1 protocols such as 802.1AE, 802.1AB, and 802.1X and 802.3 protocols such as Link Aggregation are logically implemented within ports. It makes no difference whether those ports are ports on 802.1D or 802.1Q or RBridge or IP router devices. (And, assuming they continue to develop in the direction it appears they are going, this will also be true of 802.1Qaz and 802.1Qbb.) This layering is covered in overview Section 2.4.1 and in more detail in Section 4.9.2 of the draft. As far as I can see, the draft makes RBridge "behavior relative to" this type of protocol, which is that they are orthogonal, clear. It would be true that if someone is incrementally replacing 802.1Q-2005 bridges that are using one or more of these, for want of a better term, "port level" protocols, such as 802.1AE, with RBridges, then obviously they need to use RBridges that also have the same port level protocols configured the same way. A statement to that effect could be added to the draft. 802.1aj is a work in progress. Although referred to as a "type of bridge", as currently specified 802.1aj devices are, as far as I can tell, a fancy type of repeater. For example, to quote from 802.1aj Draft D4.0, "A MAC Relay is transparent to all frame-based media independent protocols except those explicitly addressed to this device." (Of course, 802.1aj repeaters have a management interface and any node, whether a bridge or end station, could have independent functionality added to it to interact with that management interface, but I don't think of that as a change in the bridge or end station functionality.) A statement could be added to the draft saying "RBridges are compatible with IEEE 802.1aj devices as currently specified, in the same sense that IEEE 802.1Q-2005 bridges are compatible with such devices." or the like. > 5. For all the reasons above I think it is required that this document > is also reviewed in detail by IEEE 802.1. I suggest that a formal review > be required via a liaison letter before or during IETF Last Call. A review by IEEE 802.1 has been done. The TRILL draft has been modified based on that review (see item 2 of "Changes from -12 to -13" in Appendix Z). > 6. It would be good to add a section that provides some estimates about > the operational impact of the deployment of TRILL. For example a claim > is made that TRILL will allow internal forwarding tables to be > substantially smaller than in conventional bridges (Abstract). How much > smaller? Is there an estimation of the level of extra traffic created by > TRILL? Any recommendations on tuning to make it more efficient? The reason that forwarding tables are expected to be smaller is explained in the draft. From the explanation given, it seems fairly clear that the amount smaller primarily depends on the ratio between the number of RBridges and the number of end stations. What "extra" traffic do you refer to? The normal IS-IS control traffic? If so, no, the working group has not produced an estimate of how much IS-IS control traffic would be added. Nor has it produced an estimate of how much the automatic use of shortest paths would decrease traffic on the average link. There has been no work yet (although, of course, this could be added to a future Charter revision) on tuning of parameters. My opinion is that it is premature to do so before TRILL is actually deployed in a variety of environments. > 7. The claim made in the introduction section about TRILL avoiding the > problem of the 'significant configuration' required by the IP routers is > not substantiated but what follows. My reading of section 4.8.1 is that > although management configuration of ports and VLAN parameters is not > mandatory, it is highly recommended, and in its absence the protocol > cannot be expected to perform efficiently. Routers require subnet address configuration. Bridges, including RBridges, do not require any similar address configuration. This is what is being referred to and this can be clarified in the draft. Need for VLAN configuration depends on application. I do not see how it makes sense to make the blanket claim that "management configuration of ports and VLAN parameters" is "highly recommended". It is needed for some applications and totally unnecessary for others. > 8. Speaking about configuration parameters I could not locate the place > where the default interval between TRILL hello messages is defined and > how it can be changed. Maybe I missed it, if not please add this. This is deferred to the IS-IS standard and/or IETF ISIS WG. The Holding Time for a port is an IS-IS configuration parameter. The number of Hellos per Holding Time, facilities for adding timing jitter to avoid self-synchronization, etc., are all specified in IS-IS. It would be reasonable to add a statement such as "TRILL Hellos should be sent with the same timing as IS-IS LAN Hellos." or the like. > 9. In section 2, page 13, there is a recommendation to support SNMPv3 > over IP. I think this actually should be mapping of SNMPv3 over UDP over > IPv4 (RFC 3417) and mapping of SNMP over UDP over IPv6 (RFC 3419) OK, that recommendation can be made more precise and explicit as suggested. > 10. I find confusing the claim in section 2.4.2 > 'If the bridges replaced by > RBridges were zero configuration bridges, then their RBridge > replacements will not require configuration." > > I do not know what 'zero configuration' bridges are. I do not see an > Rbridge behaving better than a IEEE 802 bridge without configuration. > See also my comment #7. "Zero configuration" means no configuration has been done, i.e., it is run as out of the box with default settings, as in the former IETF ZEROCONF working group. That phrase can be replaced with something clearer. Is there any particular wording you would suggest? By definition "unmanaged bridges", a well known product category, are zero configuration because there is no way to configure them. A "managed bridge" would be zero configuration if its configuration features were not used. All the quoted text is saying is that default configuration 802.1Q-2005 bridges can be incrementally replaced by default configuration RBridges. As I say, this can be clarified. > 11. In section 4.1.1 > > " As specified in [802.1Q-2005], the priority field contains an > unsigned value from 0 through 7 where 1 indicates the lowest > priority, 7 the highest priority, and the default priority zero is > considered to be higher than priority 1 but lower than priority 2." > > This does not seem accurate, see Annex G in IEEE 802.1D-2005 Indeed, the draft does not reflect the 802.1D-2004 standard, which specifies priority 2 as "spare". However, the draft's target is 802.1Q-2005 bridges. 802.1Q-2005 is specifically cited in the quoted text above. In Appendix G of 802.1Q-2005 (page 282), priorities are specified as described in this draft. In particular, in 802.1Q-2005, priority 2 is specified as "Excellent Effort" and is the priority between 0 and 3. > 12. Section 4.3.1 - it would be good to justify where the figure 1470 > comes from. Also note that an end-station would not support such an MTU > unless it supports IEEE 802.3as, so it would be good to explain how > problems are avoided when an Rbridge replaces an IEEE 802.1 bridge that > connects to end-stations. The problem was TRILL IS-IS frames too long to go through classic 1512 byte limit (called "basic frame" in 802.3as-2006) or possibly 1516 byte (called "Q-tagged frame" in 802.3as-2006) limit devices between RBridges. I believe what 802.3as-2006 adds to previous 802.3 standards is an "envelope frame" with a limit of 1996 bytes, which doesn't seem relevant as it is much larger than 1470. (I'm including the source and destination MAC addresses in these lengths. The lengths given in 802.3as-2006 are 12 bytes smaller but do not include those addresses.) The problem is in being sure that IS-IS control messages (except, of course, MTU size test messages that exceed the MTU) can get through inter-RBridge links and the draft can be clarified to say so. End stations are not relevant to the problem although, I would think, they would mostly support the classic "basic frame" size or larger and thus be able to handle 1470. 1470 was an engineering compromise between TRILL WG members as a value which, with reasonable additional headers/tags/encapsulation/..., would be extremely unlikely to run into MTU problems in real world use of TRILL on IEEE 802.3 links. Some thought this default should be a little lower or higher. In any case, this parameter can be configured and the lowest configured value for an RBridge in the campus controls. > 13. I am confused about the need for section 5. The first part that > talks about assignment of multicast addresses seems redundant with > content in the IANA/IEEE considerations section. The second part tries > to list parameters that are configurable per Rbridge and per port? Than > many of the parameters listed in 4.8 and 4.9 are missing. I guess Section 5 is an artifact of the way the document evolved. It would indeed be an improvement to remove the first part and either remove the second part and the whole section or make the second part complete. > 14. Section 7.1 - I do not think that there is a need for registries of > the version numbers or Header reserved bits - these would be defined in > later versions of the protocol. For the TRILL Nicknames registry, the > appropriate RFC 5226 policy term is 'RFC Required' rather than 'RFC > Publication'. OK, that section can be modified as suggest From d3e3e3 at gmail.com Thu Sep 3 18:44:27 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Thu, 3 Sep 2009 21:44:27 -0400 Subject: [rbridge] Fwd: Review of draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> References: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> Message-ID: <1028365c0909031844v433114d0r1129b8f70b081312@mail.gmail.com> My apologies. Somehow my previous try at posting this got truncated, Below should be the full response. Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com ---------- Forwarded message ---------- From: Donald Eastlake Date: Thu, Sep 3, 2009 at 3:08 PM Subject: Re: Review of draft-ietf-trill-rbridge-protocol-13.txt To: "Romascanu, Dan (Dan)" , rbridge at postel.org Cc: Erik Nordmark , Ralph Droms , ext Bernard Aboba Hi Dan, Thanks for your efforts in providing these detailed comments. On Sun, Aug 30, 2009 at 8:41 AM, Romascanu, Dan (Dan) wrote: > I was asked by the TRILL WG chairs to perform a review of > draft-ietf-trill-rbridge-protocol-13.txt. Below please find my findings. > > I appreciate the work put by the editors and WG in designing this > solution and writing the document. The editors are to be saluted for > writing an I-D which is in most parts clear and well organized. > > I do believe that this work and the document as it stands still have a > number of problems that need to be solved. > > 1. I have serious reservations that this document can directly go to > Proposed Standard. For reasons detailed below there is a lot of > incertitude in what concerns the interoperability or coexistence with > the layer 2 protocols designed in the IEEE. Many of these are still work > in progress. For this good reason in many places the document is forced > to make assumptions and cannot make but statements like 'Rbridges are > generally compatible with IEEE ...' (section 2.4.2). Has the WG > discussed the option of issuing the first version of this document as > Experimental? It is true that many IEEE 802.1 standards efforts are "work in progress". But with the ever multiplying number of efforts in 802.1, this situation seems permanent. To wait for 802.1 standards efforts to end would be to wait forever. There is no actual question concerning the interoperability of TRILL with current 802.1 protocols. Section 2 of the draft is a broad overview. That is the reason it says "... RBridges are generally compatible with IEEE [802.1Q-2005] customer bridges ..." in Section 2.4.2. If all the details were given in Section 2, it wouldn't be a broad introductory overview. To avoid misleading readers, that phrase should probably be changed to "... RBridges are compatible with IEEE [802.1Q-2005] customer bridges except as discussed in this document ...". I cannot recall any status other than Proposed Standard ever being suggested in the TRILL working group for this document. The TRILL protocol has had more review by routing and bridging network experts than most IETF protocols at the Proposed Standard level. TRILL has been implemented in software, tested, and the results of that testing included back in the protocol design. Multiple vendors are very interested in TRILL and I predict that RBridge products will ship in 2010. > 2. It looks to me like the motivation of the standard is frozen in time > into a period of evolution of the IEEE 802.1 technology trying to solve > some problems that were in part dealt with by the IEEE in the meantime. > Out of the four spanning tree bullets listed in the introduction as > problems that TRILL solves the second and the fourth seem to have been > dealt with by the IEEE 802.1aq and the IEEE 802.1ak respectively. It appears to me that this comment does not suggest any changes in the draft. But, as a response, the TRILL WG was created by the IETF to produce a standard subsequent to substantial discussion of the fact that, after it rejected TRILL technology, IEEE 802.1 then started an effort with similar goals. (And after the IEEE 802 Executive Committee stated that it had no objection to the IETF pursuing development of a TRILL standard.) > 3. All the VLAN discussions in the specification mention only the > C-VLANs. What about provider bridges? Can these be migrated to Rbridges? > The explanation in Appendix E is not sufficient, the compatibility with > provider bridges needs to be dealt in the core part of the I-D. The TRILL protocol specified in this draft is, as stated in the draft, a customer bridge standard. The draft is careful to refer to 802.1Q-2005 all over the place. An IEEE 802.1Q-2005 bridge is a purely customer bridge. This TRILL protocol specification does not try to solve the provider bridge problem. (I believe it would be relatively easy to extend TRILL to do that but it should be done in a separate document.) ?Provider bridges (and provider backbone bridges) cannot be replace by customer bridges, whether those customer bridges are 802.1 customer bridges or RBridges based on this spec. On the other hand, as stated in Appendix E, provider bridges (and/or provider backbone bridges) can be used to transparently interconnect parts of an RBridge campus. Since RBridges are compatible with provider bridges and provider backbone bridges, as I understand the word compatible, I don't understand what there is to "deal" with. However, an appropriately reworded version of the paragraph just above this paragraph can certainly be added to the main body of the draft. > 4. There needs to be more discussion about the IEEE protocols that do > not define forwarding or tagging. If TRILL makes a claim that the best > migration policy is to replace IEEE bridges by Rbridges it needs to > define behavior relative to protocols like IEEE 802.1AE Media Access > Control (MAC) Security, IEEE 802.1AB Station and Media Access Control > Connectivity Discovery, and IEEE 802.1aj Two Port Media Relay, as > bridges and end stations that are being replaced may support any or a > combination of these. This document does not claim that the best migration policy is incremental replacement, just that a network of 802.1Q-2005 customer bridges will, with the differences described in the document, operate correctly if some or all are replaced by RBridges. 802.1 protocols such as 802.1AE, 802.1AB, and 802.1X and 802.3 protocols such as Link Aggregation are logically implemented within ports. It makes no difference whether those ports are ports on 802.1D or 802.1Q or RBridge or IP router devices. (And, assuming they continue to develop in the direction it appears they are going, this will also be true of 802.1Qaz and 802.1Qbb.) This layering is covered in overview Section 2.4.1 and in more detail in Section 4.9.2 of the draft. As far as I can see, the draft makes RBridge "behavior relative to" this type of protocol, which is that they are orthogonal, clear. It would be true that if someone is incrementally replacing 802.1Q-2005 bridges that are using one or more of these, for want of a better term, "port level" protocols, such as 802.1AE, with RBridges, then obviously they need to use RBridges that also have the same port level protocols configured the same way. A statement to that effect could be added to the draft. 802.1aj is a work in progress. ?Although referred to as a "type of bridge", as currently specified 802.1aj devices are, as far as I can tell, a fancy type of repeater. For example, to quote from 802.1aj Draft D4.0, "A MAC Relay is transparent to all frame-based media independent protocols except those explicitly addressed to this device." (Of course, 802.1aj repeaters have a management interface and any node, whether a bridge or end station, could have independent functionality added to it to interact with that management interface, but I don't think of that as a change in the bridge or end station functionality.) A statement could be added to the draft saying "RBridges are compatible with IEEE 802.1aj devices as currently specified, in the same sense that IEEE 802.1Q-2005 bridges are compatible with such devices." or the like. > 5. For all the reasons above I think it is required that this document > is also reviewed in detail by IEEE 802.1. I suggest that a formal review > be required via a liaison letter before or during IETF Last Call. A review by IEEE 802.1 has been done. The TRILL draft has been modified based on that review (see item 2 of "Changes from -12 to -13" in Appendix Z). > 6. It would be good to add a section that provides some estimates about > the operational impact of the deployment of TRILL. For example a claim > is made that TRILL will allow internal forwarding tables to be > substantially smaller than in conventional bridges (Abstract). How much > smaller? Is there an estimation of the level of extra traffic created by > TRILL? Any recommendations on tuning to make it more efficient? The reason that forwarding tables are expected to be smaller is explained in the draft. From the explanation given, it seems fairly clear that the amount smaller primarily depends on the ratio between the number of RBridges and the number of end stations. What "extra" traffic do you refer to? The normal IS-IS control traffic? If so, no, the working group has not produced an estimate of how much IS-IS control traffic would be added. Nor has it produced an estimate of how much the automatic use of shortest paths would decrease traffic on the average link. There has been no work yet (although, of course, this could be added to a future Charter revision) on tuning of parameters. My opinion is that it is premature to do so before TRILL is actually deployed in a variety of environments. > 7. The claim made in the introduction section about TRILL avoiding the > problem of the 'significant configuration' required by the IP routers is > not substantiated but what follows. My reading of section 4.8.1 is that > although management configuration of ports and VLAN parameters is not > mandatory, it is highly recommended, and in its absence the protocol > cannot be expected to perform efficiently. Routers require subnet address configuration. Bridges, including RBridges, do not require any similar address configuration. This is what is being referred to and this can be clarified in the draft. Need for VLAN configuration depends on application. I do not see how it makes sense to make the blanket claim that "management configuration of ports and VLAN parameters" is "highly recommended". It is needed for some applications and totally unnecessary for others. > 8. Speaking about configuration parameters I could not locate the place > where the default interval between TRILL hello messages is defined and > how it can be changed. Maybe I missed it, if not please add this. This is deferred to the IS-IS standard and/or IETF ISIS WG. The Holding Time for a port is an IS-IS configuration parameter. The number of Hellos per Holding Time, facilities for adding timing jitter to avoid self-synchronization, etc., are all specified in IS-IS. It would be reasonable to add a statement such as "TRILL Hellos should be sent with the same timing as IS-IS LAN Hellos." or the like. > 9. In section 2, page 13, there is a recommendation to support SNMPv3 > over IP. I think this actually should be mapping of SNMPv3 over UDP over > IPv4 (RFC 3417) and mapping of SNMP over UDP over IPv6 (RFC 3419) OK, that recommendation can be made more precise and explicit as suggested. > 10. I find confusing the claim in section 2.4.2 > ? 'If the bridges replaced by > ? RBridges were zero configuration bridges, then their RBridge > ? replacements will not require configuration." > > I do not know what 'zero configuration' bridges are. I do not see an > Rbridge behaving better than a IEEE 802 bridge without configuration. > See also my comment #7. "Zero configuration" means no configuration has been done, i.e., it is run as out of the box with default settings, as in the former IETF ZEROCONF working group. That phrase can be replaced with something clearer. Is there any particular wording you would suggest? By definition "unmanaged bridges", a well known product category, are zero configuration because there is no way to configure them. A "managed bridge" would be zero configuration if its configuration features were not used. All the quoted text is saying is that default configuration 802.1Q-2005 bridges can be incrementally replaced by default configuration RBridges. As I say, this can be clarified. > 11. In section 4.1.1 > > " ?As specified in [802.1Q-2005], the priority field contains an > ? unsigned value from 0 through 7 where 1 indicates the lowest > ? priority, 7 the highest priority, and the default priority zero is > ? considered to be higher than priority 1 but lower than priority 2." > > This does not seem accurate, see Annex G in IEEE 802.1D-2005 Indeed, the draft does not reflect the 802.1D-2004 standard, which specifies priority 2 as "spare". However, the draft's target is 802.1Q-2005 bridges. 802.1Q-2005 is specifically cited in the quoted text above. In Appendix G of 802.1Q-2005 (page 282), priorities are specified as described in this draft. In particular, in 802.1Q-2005, priority 2 is specified as "Excellent Effort" and is the priority between 0 and 3. > 12. Section 4.3.1 - it would be good to justify where the figure 1470 > comes from. Also note that an end-station would not support such an MTU > unless it supports IEEE 802.3as, so it would be good to explain how > problems are avoided when an Rbridge replaces an IEEE 802.1 bridge that > connects to end-stations. The problem was TRILL IS-IS frames too long to go through classic 1512 byte limit (called "basic frame" in 802.3as-2006) or possibly 1516 byte (called "Q-tagged frame" in 802.3as-2006) limit devices between RBridges. I believe what 802.3as-2006 adds to previous 802.3 standards is an "envelope frame" with a limit of 1996 bytes, which doesn't seem relevant as it is much larger than 1470. (I'm including the source and destination MAC addresses in these lengths. The lengths given in 802.3as-2006 are 12 bytes smaller but do not include those addresses.) The problem is in being sure that IS-IS control messages (except, of course, MTU size test messages that exceed the MTU) can get through inter-RBridge links and the draft can be clarified to say so. End stations are not relevant to the problem although, I would think, they would mostly support the classic "basic frame" size or larger and thus be able to handle 1470. 1470 was an engineering compromise between TRILL WG members as a value which, with reasonable additional headers/tags/encapsulation/..., would be extremely unlikely to run into MTU problems in real world use of TRILL on IEEE 802.3 links. Some thought this default should be a little lower or higher. In any case, this parameter can be configured and the lowest configured value for an RBridge in the campus controls. > 13. I am confused about the need for section 5. The first part that > talks about assignment of multicast addresses seems redundant with > content in the IANA/IEEE considerations section. The second part tries > to list parameters that are configurable per Rbridge and per port? Than > many of the parameters listed in 4.8 and 4.9 are missing. I guess Section 5 is an artifact of the way the document evolved. It would indeed be an improvement to remove the first part and either remove the second part and the whole section or make the second part complete. > 14. Section 7.1 - I do not think that there is a need for registries of > the version numbers or Header reserved bits - these would be defined in > later versions of the protocol. For the TRILL Nicknames registry, the > appropriate RFC 5226 policy term is 'RFC Required' rather than 'RFC > Publication'. OK, that section can be modified as suggested. Thanks for the correction re RFC 5226 policy terms. > 15. Normative References - [802.3] has a 2008 edition Thanks for pointing out the update. > 16. Annex E will certainly evolve in time until publication, as the IEEE > 802.1 documents are also in evolution. At this point in time I would > suggest to add IEEE 802.1AB, IEEE 802.1AE and IEEE 802.1aj. Also IEEE > 802.1aq, IEEE 802.1Qaw and IEEE 802.1Qay have been approved in the > meantime. Well, if something like Appendix E is included in a draft, it will always just be a snapshot. To avoid continuing change later in the process, perhaps it should be frozen as of a specific date and that date cited in the Appendix. Perhaps the date the draft is forwarded to the TRILL WG Area Director. Appendix E is currently limited to completed and in process amendments to IEEE 802.1Q-2005. As such it should include 802.1aj. It could be expanded to include other relevant 802.1 standards and 802.1 standards efforts. Indeed, 802.1Qaw was approved on 25 July 2009 and 802.1Qay was approved on 5 August 2009, both after the date of this draft, and their status should be updated. However, it looks to me like 802.1aq is still a work in progress. > I hope this helps. > > Regards, > Dan Note: In a number of cases above it says something "could" be changed or added in the draft. That word is used because some discussion may be required in the working group as to what is best. Thanks, Donald (responding as the document editor) From d3e3e3 at gmail.com Fri Sep 4 14:29:57 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Fri, 4 Sep 2009 17:29:57 -0400 Subject: [rbridge] Editorial change to Section 4.2.4.4 TRILL LSP Information Message-ID: <1028365c0909041429o465dcdd5h730a32731a95f0f@mail.gmail.com> Hi, Unless there is objection, I'd like to make Section 4.2.4.4 a bit more rigorous by changing it as described below. I believe this is an editorial change. (1) State at the beginning of the section that (1a) it summarizes all contents of the TRILL LSP that are mentioned in the draft, (1b) unless the item is stated to be optional in the section, the item MUST be included, and (1c) other TLVs MAY be included unless their inclusion is prohibited elsewhere in the draft. (2) Add two items to the list: (2a) "Optionally, the largrest TRILL IS-IS frame that the RBridge can handle using the originatingLSPBufferSize TLV #14 (see Section 4.3)"; (2b) "Optionally, the Authentication TLV #10.". Thanks, Doanld ============================= ?Donald E. Eastlake 3rd ? +1-508-634-2066 (home) ?155 Beaver Street ?Milford, MA 01757 USA ?d3e3e3 at gmail.com From Radia.Perlman at sun.com Fri Sep 4 21:03:44 2009 From: Radia.Perlman at sun.com (Radia Perlman) Date: Fri, 04 Sep 2009 21:03:44 -0700 Subject: [rbridge] Editorial change to Section 4.2.4.4 TRILL LSP Information In-Reply-To: <1028365c0909041429o465dcdd5h730a32731a95f0f@mail.gmail.com> References: <1028365c0909041429o465dcdd5h730a32731a95f0f@mail.gmail.com> Message-ID: <4AA1E320.60307@sun.com> Seems a reasonable clarification. Radia Donald Eastlake wrote: > Hi, > > Unless there is objection, I'd like to make Section 4.2.4.4 a bit more > rigorous by changing it as described below. I believe this is an > editorial change. > > (1) State at the beginning of the section that (1a) it summarizes all > contents of the TRILL LSP that are mentioned in the draft, (1b) unless > the item is stated to be optional in the section, the item MUST be > included, and (1c) other TLVs MAY be included unless their inclusion > is prohibited elsewhere in the draft. > > (2) Add two items to the list: (2a) "Optionally, the largrest TRILL > IS-IS frame that the RBridge can handle using the > originatingLSPBufferSize TLV #14 (see Section 4.3)"; (2b) "Optionally, > the Authentication TLV #10.". > > Thanks, > Doanld > ============================= > Donald E. Eastlake 3rd +1-508-634-2066 (home) > 155 Beaver Street > Milford, MA 01757 USA > d3e3e3 at gmail.com > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > From Radia.Perlman at sun.com Fri Sep 4 21:06:04 2009 From: Radia.Perlman at sun.com (Radia Perlman) Date: Fri, 04 Sep 2009 21:06:04 -0700 Subject: [rbridge] Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? Message-ID: <4AA1E3AC.1040007@sun.com> If there's still a chance for minor, mostly editorial changes... In section 4.6.2, which is about how to handle receipt of a TRILL-encapsulated frame: >> 8. The port on which the frame was received is checked and the frame >> discarded if there is no TRILL IS-IS adjacency on that port. I'd like to delete that, since I don't think that check is necessary, and I was reading a paper about an architecture where endnodes are smart and have some sort of directory to look up the destination, so that the endnode can do the TRILL encapsulation, which would work fine (as long as we delete that extra check), and might be more secure than having RBridges just learn where endnodes are. And smart endnodes could coexist with endnodes and RBridges that work the way we've always imagined it....anyway, I just would like to get rid of the extra check, since I don't think it adds anything, and it does preclude some flexibility that might be useful for some applications. There's another place with that check: >>4.6.2.4 Known Unicast TRILL Data Frames >> The port on which the frame was received is checked and the frame >> discarded if there is no TRILL IS-IS adjacency on that port. Again, just deleting that sentence I think would be good. From Radia.Perlman at sun.com Sat Sep 5 15:28:09 2009 From: Radia.Perlman at sun.com (Radia Perlman) Date: Sat, 05 Sep 2009 15:28:09 -0700 Subject: [rbridge] Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AA1E3AC.1040007@sun.com> References: <4AA1E3AC.1040007@sun.com> Message-ID: <4AA2E5F9.6000906@sun.com> And another case where it might be useful for an endnode to do the TRILL encapsulation is so that a management station (presumably an endnode) can communicate with an RBridge. Radia Radia Perlman wrote: > If there's still a chance for minor, mostly editorial changes... > > In section 4.6.2, which is about how to handle receipt of a > TRILL-encapsulated frame: > > >> 8. The port on which the frame was received is checked and the frame > >> discarded if there is no TRILL IS-IS adjacency on that port. > > I'd like to delete that, since I don't think that check is necessary, and > I was reading a paper about an architecture where endnodes are smart > and have some sort of directory to look up the destination, so that > the endnode can do the TRILL encapsulation, which would work fine (as > long as we delete that extra check), and > might be more secure than having RBridges just learn where endnodes are. > And smart endnodes could coexist with endnodes and RBridges that work > the way we've always imagined it....anyway, I > just would like to get rid of the extra check, since I don't think it > adds anything, and it does preclude some flexibility that might be useful > for some applications. > > There's another place with that check: > > >>4.6.2.4 Known Unicast TRILL Data Frames > > >> The port on which the frame was received is checked and the frame > >> discarded if there is no TRILL IS-IS adjacency on that port. > > Again, just deleting that sentence I think would be good. > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > From d3e3e3 at gmail.com Sat Sep 5 17:51:42 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Sat, 5 Sep 2009 20:51:42 -0400 Subject: [rbridge] Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AA2E5F9.6000906@sun.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> Message-ID: <1028365c0909051751t5a82eacg510694aa93eaa414@mail.gmail.com> A management station that just wants to use SNMP can just be a management station. If it wants to get LSPs, etc., I would think it might as well also be a full fledged RBridge that happens to have only one adjacency... Donald ============================= On Sat, Sep 5, 2009 at 6:28 PM, Radia Perlman wrote: > And another case where it might be useful for an endnode to do the TRILL > encapsulation is so that a > management station (presumably an endnode) can communicate with an RBridge. > > Radia > > > Radia Perlman wrote: >> If there's still a chance for minor, mostly editorial changes... >> >> In section 4.6.2, which is about how to handle receipt of a >> TRILL-encapsulated frame: >> >> ?>> ?8. The port on which the frame was received is checked and the frame >> ?>> ? ? discarded if there is no TRILL IS-IS adjacency on that port. >> >> I'd like to delete that, since I don't think that check is necessary, and >> I was reading a paper about an architecture where endnodes are smart >> and have some sort of directory to look up the destination, so that >> the endnode can do the TRILL encapsulation, which would work fine (as >> long as we delete that extra check), and >> might be more secure than having RBridges just learn where endnodes are. >> And smart endnodes could coexist with endnodes and RBridges that work >> the way we've always imagined it....anyway, I >> just would like to get rid of the extra check, since I don't think it >> adds anything, and it does preclude some flexibility that might be useful >> for some applications. >> >> There's another place with that check: >> >> ?>>4.6.2.4 Known Unicast TRILL Data Frames >> >> ?>> ? The port on which the frame was received is checked and the frame >> ?>> ? discarded if there is no TRILL IS-IS adjacency on that port. >> >> Again, just deleting that sentence I think would be good. >> >> _______________________________________________ >> rbridge mailing list >> rbridge at postel.org >> http://mailman.postel.org/mailman/listinfo/rbridge >> > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > From d3e3e3 at gmail.com Tue Sep 8 11:43:26 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Tue, 8 Sep 2009 14:43:26 -0400 Subject: [rbridge] Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AA1E3AC.1040007@sun.com> References: <4AA1E3AC.1040007@sun.com> Message-ID: <1028365c0909081143n527925eerc3e1d289bc486d1d@mail.gmail.com> In looking at Section 4.6, I think I found another problem. How to handle multiple ports on an RBridge that all connect to the same link is, I believe, correctly described in Section 4.4.4. However, the step-by-step frame processing text in Section 4.6 does not correctly mirror Section 4.4.4. In particular, it only includes some text for this special case for multicast and broadcast native frames. In fact, some special treatment is needed in this case for all native frames and for multi-destination TRILL data frames. (No special treatment is needed for known unicast TRILL data frames as they will only be accepted on one port in any case and no new special treatment is needed for layer 2 or TRILL control frames.) So, unless there is objection, I plan to change Section 4.6 to correctly follow the provisions of Section 4.4.4. Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com On Sat, Sep 5, 2009 at 12:06 AM, Radia Perlman wrote: > If there's still a chance for minor, mostly editorial changes... > > In section 4.6.2, which is about how to handle receipt of a > TRILL-encapsulated frame: > > ?>> ?8. The port on which the frame was received is checked and the frame > ?>> ? ? discarded if there is no TRILL IS-IS adjacency on that port. > > I'd like to delete that, since I don't think that check is necessary, and > I was reading a paper about an architecture where endnodes are smart > and have some sort of directory to look up the destination, so that > the endnode can do the TRILL encapsulation, which would work fine (as > long as we delete that extra check), and > might be more secure than having RBridges just learn where endnodes are. > And smart endnodes could coexist with endnodes and RBridges that work > the way we've always imagined it....anyway, I > just would like to get rid of the extra check, since I don't think it > adds anything, and it does preclude some flexibility that might be useful > for some applications. > > There's another place with that check: > > ?>>4.6.2.4 Known Unicast TRILL Data Frames > > ?>> ? The port on which the frame was received is checked and the frame > ?>> ? discarded if there is no TRILL IS-IS adjacency on that port. > > Again, just deleting that sentence I think would be good. > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > From swaminathan.ramalingam at gmail.com Tue Sep 8 23:46:36 2009 From: swaminathan.ramalingam at gmail.com (Ramalingam, Swaminathan (Swami)) Date: Wed, 9 Sep 2009 12:16:36 +0530 Subject: [rbridge] Wide Metric TLV and SPF Message-ID: TRILL mandates to use wide metric TLV (RFC 5305) and not to use TLV type 2 (traditional metric). Distribution tree calculation talks about only cost based tree calculation (i.e. just metric based). I assume this is just to use wide range of metric value or is there any plan that TRILL can use CSPF based path calculation (I don't see any mentioning of this in the draft version 13). It probably make sense if TRILL can use TE capability (parameters) to do the path calculation, optionally. Also I don't see any mentioning of sub-TLV usage for TRILL purpose. -Swami From d3e3e3 at gmail.com Wed Sep 9 07:19:55 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Wed, 9 Sep 2009 10:19:55 -0400 Subject: [rbridge] Wide Metric TLV and SPF In-Reply-To: References: Message-ID: <1028365c0909090719x67d816f5l399e922ea77516a7@mail.gmail.com> Hi, On Wed, Sep 9, 2009 at 2:46 AM, Ramalingam, Swaminathan (Swami) wrote: > TRILL mandates to use wide metric TLV (RFC 5305) and not to use TLV > type 2 (traditional metric). Distribution tree calculation talks about > only cost based tree calculation (i.e. just metric based). I assume > this is just to use wide range of metric value or is there any plan > that TRILL can use CSPF based path calculation (I don't see any > mentioning of this in the draft version 13). It probably make sense if This may be due to the attention paid in TRILL design to zero configuration operation. Any Constrained Shortest Path First (which I assume is what you mean by CSPF) path calculations based on some policy would require the use of that policy at all RBridges of interest (possibly all RBridges in the campus) to avoid loops. Some policies could also result in partition from the point of view of traffic transmission. I don't see anything in the draft prohibiting use of such policy. > TRILL can use TE capability (parameters) to do the path calculation, > optionally. Also I don't see any mentioning of sub-TLV usage for TRILL > purpose. "sub-TLVs" are mentioned in several places in the draft. If you are talking about sub-TLVs associated with adjacencies in the IS Extended Reachability TLV, that seems like the only reasonable place to put the tested MTU for an adjacency and the like, as mentioned in Section 4.3. As discussed on this list, that will probably be added to item 1 in Section 4.2.4.4 in the next version of the draft. I don't think there is anything in the draft prohibiting other sub-TLVs. > -Swami Thanks, Donald From d3e3e3 at gmail.com Wed Sep 9 17:15:39 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Wed, 9 Sep 2009 20:15:39 -0400 Subject: [rbridge] Fwd: Nomcom 2009-10: Important Reminder: Call for Nominations, Local Office hours, Nominee Questionnaires available In-Reply-To: <20090909143800.E2CD928C13E@core3.amsl.com> References: <20090909143800.E2CD928C13E@core3.amsl.com> Message-ID: <1028365c0909091715p66963dcayc70739d8ee30cbe5@mail.gmail.com> We have been requested to forward this to the working group mailing list. Donald ---------- Forwarded message ---------- From: Mary Barnes Date: Wed, Sep 9, 2009 at 10:37 AM Subject: Nomcom 2009-10: Important Reminder: Call for Nominations, Local Office hours, Nominee Questionnaires available To: IETF Announcement list Cc: ietf at ietf.org Hi all, This is a 2nd reminder of the Call for Nominations that is underway. ?We *really* need more nominations in order to properly execute the task of selecting candidates for the open positions. At this time, the number of nominations for all the positions is about 1/2 of what is necessary for the Nomcom to do a conscientious job of evaluating the nominees and we are over 2/3 of the way through the nominations period. Nomcom cannot do their job without this important input from the community. So, please consider making nominations for the open positions - it takes just a few minutes of your time - the details are below. ?Right now, we just need the names/email addresses for the nominees - we'll be soliciting feedback later. Also, consider that over 1/2 the nominees are not able to accept the nominations for a variety of reasons - e.g., lack of funding, lack of sponsor support, etc. Thus, please consider nominating more than one individual for each of the positions. As well as the reminder for the call for nominations, this email also serves as a reminder for: 2) Local Office Hours 3) Questionnaires available for nominees Best Regards, Mary Barnes nomcom-chair at ietf.org mary.h.barnes at gmail.com mary.barnes at nortel.com ========================================================================= 1) Call for Nominations ------------------------ The nomination period ends in less than 2 weeks on Sept. 18th, 2009. We appreciate the folks that have taken the time to make nominations thus far. But, we do need more nominations. Please use the online tool to nominate - it's fast, easy and secure: https://wiki.tools.ietf.org/group/nomcom/09/nominate Details on the open positions, as well as other details and options for making nominations are available on the Nomcom homepage: https://wiki.tools.ietf.org/group/nomcom/09/ Please consider that the sooner you make the nominations, the more time your nominee(s) will have to complete the necessary questionnaire (item 3 below). ?As well, please consider nominating more than one person for a particular position. You will have the opportunity to provide additional feedback later and it's important to consider that not all nominees will be able to accept the nomination. 2) Local office hours ----------------------- In order facilitate additional feedback, the Nomcom has decided to make themselves available for office areas at various geographic locations for 3 weeks in September, starting on the 8th. Below please find the list of locations where Nomcom members will be available for these f2f meetings . Unless dates are identified below, the Nomcom member is generally available for part of the day during the weeks of Sept 8-11, Sept 14-18 and Sept 21-25. Also, languages other than English in which the Nomcom member is fluent are identfied. Please contact a Nomcom member in a specific geographic location to arrange a convenient meeting time and place. Most Nomcom members are flexible as to meeting locations - i.e., we can travel to your office, meet at our offices or somewhere in between. As a reminder folks can always contact any Nomcom member to provide feedback at anytime - i.e., you don't need to participate in these f2f sessions to provide feedback. Belgium: ? ? Dimitri Papadimitriou - dimitri.papadimitriou at alcatel-lucent.be (Sept 21-25) (Languages: French) Boston, Mass, USA: ? ? Stephen Kent - kent at bbn.com ?(Sept 16-18) Boulder, CO, USA: ? ? Wassim Haddad - wmhaddad at gmail.com (Sept 14-18) Dallas/Ft. Worth, TX, USA: ? ? Mary Barnes ?- mary.h.barnes at gmail.com ? ? Lucy Yong - lucyyong at huawei.com ?(Languages: Chinese) Helsinki, Finland: ? ? ?Simo Veikkolainen - simo.veikkolainen at nokia.com (Languages: Finnish) Ithaca, NY, USA: ? ? ?Scott Brim - scott.brim at gmail.com Montevideo, Uruguay: ? ? ?Roque Gagliano - roque at lacnic.net (Sept 14-18, 21-25) (Languages: Spanish, Portuguese) Montreal, Quebec, Canada ? ? Wassim Haddad - wmhaddad at gmail.com (Sept 8-11) ? ? -- Can also be available in Ottawa if folks are interested Paris, France: ? ? ?Dimitri Papadimitriou - dimitri.papadimitriou at alcatel-lucent.be (Sept 15-18) ?(Languages: French) San Diego, CA, USA: ? ? ?Randall Gellens - rg+ietf at qualcomm.com ? ? ?Dave Crocker - dcrocker at bbiw.net ?(Sept 16-18) Silicon Valley/SF Bay, ?CA, USA: ? ? ?Dave Crocker - dcrocker at bbiw.net ?(Sept 8-11, Sept 14-15, Sept 21-25) ? ? ?Dorothy Gellert - Dorothy.gellert at gmail.com 3) Questionnaires available for nominees: For folks that have been notified that they have been nominated for any of the positions, the questionnaires are now available on the Nomcom09 tools website: https://wiki.tools.ietf.org/group/nomcom/09/iab-questionnaire https://wiki.tools.ietf.org/group/nomcom/09/iaoc-questionnaire https://wiki.tools.ietf.org/group/nomcom/09/iesg-questionnaire If you have any questions, please let me know. I will be contacting everyone individually, as well as sending reminders. _______________________________________________ Ietf mailing list Ietf at ietf.org https://www.ietf.org/mailman/listinfo/ietf From cait at asomi.com Thu Sep 10 11:01:24 2009 From: cait at asomi.com (Caitlin Bestler) Date: Thu, 10 Sep 2009 11:01:24 -0700 Subject: [rbridge] Review of draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> References: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> Message-ID: <4AA93EF4.4070706@asomi.com> Donald Eastlake wrote: > Hi Dan, > > Thanks for your efforts in providing these detailed comments. > > On Sun, Aug 30, 2009 at 8:41 AM, Romascanu, Dan (Dan) wrote: >> I was asked by the TRILL WG chairs to perform a review of >> draft-ietf-trill-rbridge-protocol-13.txt. Below please find my findings. >> >> I appreciate the work put by the editors and WG in designing this >> solution and writing the document. The editors are to be saluted for >> writing an I-D which is in most parts clear and well organized. >> >> I do believe that this work and the document as it stands still have a >> number of problems that need to be solved. >> >> 1. I have serious reservations that this document can directly go to >> Proposed Standard. For reasons detailed below there is a lot of >> incertitude in what concerns the interoperability or coexistence with >> the layer 2 protocols designed in the IEEE. Many of these are still work >> in progress. For this good reason in many places the document is forced >> to make assumptions and cannot make but statements like 'Rbridges are >> generally compatible with IEEE ...' (section 2.4.2). Has the WG >> discussed the option of issuing the first version of this document as >> Experimental? > > It is true that many IEEE 802.1 standards efforts are "work in > progress". But with the ever multiplying number of efforts in 802.1, > this situation seems permanent. To wait for 802.1 standards efforts to > end would be to wait forever. > > There is no actual question concerning the interoperability of TRILL > with current 802.1 protocols. > > Section 2 of the draft is a broad overview. That is the reason it says > "... RBridges are generally compatible with IEEE [802.1Q-2005] > customer bridges ..." in Section 2.4.2. If all the details were given > in Section 2, it wouldn't be a broad introductory overview. To avoid > misleading readers, that phrase should probably be changed to > "... RBridges are compatible with IEEE [802.1Q-2005] customer bridges > except as discussed in this document ...". > > I cannot recall any status other than Proposed Standard ever being > suggested in the TRILL working group for this document. The TRILL > protocol has had more review by routing and bridging network experts > than most IETF protocols at the Proposed Standard level. TRILL has > been implemented in software, tested, and the results of that testing > included back in the protocol design. Multiple vendors are very > interested in TRILL and I predict that RBridge products will ship in > 2010. > I believe these issues are already adequaetely covered in the draft by the specification of an RBridges use of the EISS and ISS interfaces in section 2.4.1. Yes, there is a lot of evolution in 802.1 protocols. But that is exactly why the EISS interface is defined. 802.1 takes great care to preserve this interface and define new protocol features below it. One of the inherent strengths of IP/Ethernet networking is that it is constantly evolving. Just look at the 802.1 amendments and IETF RFCs adopted over the past decade. This constant evolution keeps the protocols dealing with current problems rather than becoming relics. TRILL's layering is well defined to take advantage of protocol evolution above and below it without becoming unduly interdependent with those layers. Of course many implementations will effectively merge their TRILL and 802.1 layers, but anyone taking on such a merger would presumably be fully aware of the set of 802.1 protocols they would be required to support. -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From Donald.Fedyk at alcatel-lucent.com Thu Sep 10 16:45:06 2009 From: Donald.Fedyk at alcatel-lucent.com (FEDYK Don) Date: Thu, 10 Sep 2009 18:45:06 -0500 Subject: [rbridge] Review of draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> References: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> Message-ID: <292BD6E016C29243BB231D329D626C8C02D236E5@USDALSMBS05.ad3.ad.alcatel.com> Hi Donald I too would like to commend Dan for his careful and comprehensive feedback. But I have a few corrections to your comments. Inline: > -----Original Message----- > > Hi Dan, > > Thanks for your efforts in providing these detailed comments. > > On Sun, Aug 30, 2009 at 8:41 AM, Romascanu, Dan > (Dan) wrote: > > I was asked by the TRILL WG chairs to perform a review of > > draft-ietf-trill-rbridge-protocol-13.txt. Below please find > my findings. > > > > I appreciate the work put by the editors and WG in designing this > > solution and writing the document. The editors are to be > saluted for > > writing an I-D which is in most parts clear and well organized. > > > > I do believe that this work and the document as it stands > still have a > > number of problems that need to be solved. > > > > 1. I have serious reservations that this document can > directly go to > > Proposed Standard. For reasons detailed below there is a lot of > > incertitude in what concerns the interoperability or > coexistence with > > the layer 2 protocols designed in the IEEE. Many of these are still > > work in progress. For this good reason in many places the > document is > > forced to make assumptions and cannot make but statements like > > 'Rbridges are generally compatible with IEEE ...' (section > 2.4.2). Has > > the WG discussed the option of issuing the first version of this > > document as Experimental? > > It is true that many IEEE 802.1 standards efforts are "work > in progress". But with the ever multiplying number of efforts > in 802.1, this situation seems permanent. To wait for 802.1 > standards efforts to end would be to wait forever. This is not an accurate representation of the work in 802.1. Several projects have completed this year. The 802.1 Working group has considerable scope. > > There is no actual question concerning the interoperability > of TRILL with current 802.1 protocols. > That is not the way I read the liaison referenced at the end of this message. > Section 2 of the draft is a broad overview. That is the > reason it says "... RBridges are generally compatible with > IEEE [802.1Q-2005] customer bridges ..." in Section 2.4.2. If > all the details were given in Section 2, it wouldn't be a > broad introductory overview. To avoid misleading readers, > that phrase should probably be changed to "... RBridges are > compatible with IEEE [802.1Q-2005] customer bridges except as > discussed in this document ...". > > I cannot recall any status other than Proposed Standard ever > being suggested in the TRILL working group for this document. > The TRILL protocol has had more review by routing and > bridging network experts than most IETF protocols at the > Proposed Standard level. How can you substantiate that comment? Trill was one of the smaller Groups that I have attended. It was not scheduled in the Routing area at all. Certainly some routing experts reviewed Trill. But many I know were reviewing the use of IS-IS by Trill and not evaluating Trill. > TRILL has been implemented in > software, tested, and the results of that testing included > back in the protocol design. Multiple vendors are very > interested in TRILL and I predict that RBridge products will > ship in 2010. > > > 2. It looks to me like the motivation of the standard is frozen in > > time into a period of evolution of the IEEE 802.1 > technology trying to > > solve some problems that were in part dealt with by the > IEEE in the meantime. > > Out of the four spanning tree bullets listed in the introduction as > > problems that TRILL solves the second and the fourth seem > to have been > > dealt with by the IEEE 802.1aq and the IEEE 802.1ak respectively. > > It appears to me that this comment does not suggest any > changes in the draft. But, as a response, the TRILL WG was > created by the IETF to produce a standard subsequent to > substantial discussion of the fact that, after it rejected > TRILL technology, IEEE 802.1 then started an effort with > similar goals. (And after the IEEE 802 Executive Committee > stated that it had no objection to the IETF pursuing > development of a TRILL standard.) Trill was created by individuals in the IETF that had goals that were different than IEEE 802.1. IEEE is a an open standards body just like the IETF. We are open to good ideas on control planes and shortest path tree computation. But we work on bridging data plane and the properties and tools of that data plane. The Trill data plan is not Ethernet and it is not IP either. So Trill is different. > > > 3. All the VLAN discussions in the specification mention only the > > C-VLANs. What about provider bridges? Can these be migrated > to Rbridges? > > The explanation in Appendix E is not sufficient, the compatibility > > with provider bridges needs to be dealt in the core part of the I-D. > > The TRILL protocol specified in this draft is, as stated in > the draft, a customer bridge standard. The draft is careful > to refer to > 802.1Q-2005 all over the place. An IEEE 802.1Q-2005 bridge is > a purely customer bridge. > > This TRILL protocol specification does not try to solve the > provider bridge problem. (I believe it would be relatively > easy to extend TRILL to do that but it should be done in a > separate document.) Provider bridges (and provider backbone > bridges) cannot be replace by customer bridges, whether those > customer bridges are 802.1 customer bridges or RBridges based > on this spec. On the other hand, as stated in Appendix E, > provider bridges (and/or provider backbone bridges) can be > used to transparently interconnect parts of an RBridge campus. > > Since RBridges are compatible with provider bridges and > provider backbone bridges, as I understand the word > compatible, I don't understand what there is to "deal" with. > However, an appropriately reworded version of the paragraph > just above this paragraph can certainly be added to the main > body of the draft. > > > 4. There needs to be more discussion about the IEEE > protocols that do > > not define forwarding or tagging. If TRILL makes a claim > that the best > > migration policy is to replace IEEE bridges by Rbridges it needs to > > define behavior relative to protocols like IEEE 802.1AE > Media Access > > Control (MAC) Security, IEEE 802.1AB Station and Media > Access Control > > Connectivity Discovery, and IEEE 802.1aj Two Port Media Relay, as > > bridges and end stations that are being replaced may > support any or a > > combination of these. > > This document does not claim that the best migration policy > is incremental replacement, just that a network of > 802.1Q-2005 customer bridges will, with the differences > described in the document, operate correctly if some or all > are replaced by RBridges. > > 802.1 protocols such as 802.1AE, 802.1AB, and 802.1X and > 802.3 protocols such as Link Aggregation are logically > implemented within ports. It makes no difference whether > those ports are ports on 802.1D or 802.1Q or RBridge or IP > router devices. (And, assuming they continue to develop in > the direction it appears they are going, this will also be > true of 802.1Qaz and 802.1Qbb.) This layering is covered in > overview Section 2.4.1 and in more detail in Section 4.9.2 of > the draft. As far as I can see, the draft makes RBridge > "behavior relative to" this type of protocol, which is that > they are orthogonal, clear. > > It would be true that if someone is incrementally replacing > 802.1Q-2005 bridges that are using one or more of these, for > want of a better term, "port level" protocols, such as > 802.1AE, with RBridges, then obviously they need to use > RBridges that also have the same port level protocols > configured the same way. A statement to that effect could be > added to the draft. > > 802.1aj is a work in progress. Although referred to as a > "type of bridge", as currently specified 802.1aj devices are, > as far as I can tell, a fancy type of repeater. For example, > to quote from 802.1aj Draft D4.0, "A MAC Relay is transparent > to all frame-based media independent protocols except those > explicitly addressed to this device." (Of course, 802.1aj > repeaters have a management interface and any node, whether a > bridge or end station, could have independent functionality > added to it to interact with that management interface, but I > don't think of that as a change in the bridge or end station > functionality.) A statement could be added to the draft > saying "RBridges are compatible with IEEE 802.1aj devices as > currently specified, in the same sense that IEEE 802.1Q-2005 > bridges are compatible with such devices." or the like. > > > 5. For all the reasons above I think it is required that > this document > > is also reviewed in detail by IEEE 802.1. I suggest that a formal > > review be required via a liaison letter before or during > IETF Last Call. > > A review by IEEE 802.1 has been done. The TRILL draft has > been modified based on that review (see item 2 of "Changes > from -12 to -13" > in Appendix Z). As one who participated in the Review we only reviewed the documents with respect to the IEEE 802.1 sections of the document. The IEEE did not comment on Trill anymore than it would comment on say Fiber Channel or IP. The review is public for everyone to read right here. http://www.ieee802.org/1/files/public/docs2009/liaison-to-trill-wg-0309. pdf Regards, Don From eric.gray at ericsson.com Fri Sep 11 03:46:34 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Fri, 11 Sep 2009 06:46:34 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> Message-ID: Radia, Sorry, but that doesn't make any sense. Management is typically (possibly exclusively) done using IP. As such, it would be the case that management would be done by an IP station, which probably would also be an Ethernet end station. As such, it would be located at the "edge" of an RBridge campus, and would presumably communicate either directly with "adjacent" RBridges, or it's frames would be "ingressed" by an "adjacent" RBridge for delivery to a remote RBridge. There is nothing about this that requires a management station to be able to TRILL-encapsulate frames. If someone wants to build an end-station that can do this, then they can build a one port RBridge and there is nothing we, or anyone, can do to stop them. -- Eric -----Original Message----- From: rbridge-bounces at postel.org [mailto:rbridge-bounces at postel.org] On Behalf Of Radia Perlman Sent: Saturday, September 05, 2009 6:28 PM To: TRILL/RBridge Working Group Subject: Re: [rbridge] Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? And another case where it might be useful for an endnode to do the TRILL encapsulation is so that a management station (presumably an endnode) can communicate with an RBridge. Radia Radia Perlman wrote: > If there's still a chance for minor, mostly editorial changes... > > In section 4.6.2, which is about how to handle receipt of a > TRILL-encapsulated frame: > > >> 8. The port on which the frame was received is checked and the frame > >> discarded if there is no TRILL IS-IS adjacency on that port. > > I'd like to delete that, since I don't think that check is necessary, and > I was reading a paper about an architecture where endnodes are smart > and have some sort of directory to look up the destination, so that > the endnode can do the TRILL encapsulation, which would work fine (as > long as we delete that extra check), and > might be more secure than having RBridges just learn where endnodes are. > And smart endnodes could coexist with endnodes and RBridges that work > the way we've always imagined it....anyway, I > just would like to get rid of the extra check, since I don't think it > adds anything, and it does preclude some flexibility that might be useful > for some applications. > > There's another place with that check: > > >>4.6.2.4 Known Unicast TRILL Data Frames > > >> The port on which the frame was received is checked and the frame > >> discarded if there is no TRILL IS-IS adjacency on that port. > > Again, just deleting that sentence I think would be good. > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > _______________________________________________ rbridge mailing list rbridge at postel.org http://mailman.postel.org/mailman/listinfo/rbridge From d3e3e3 at gmail.com Fri Sep 11 06:26:41 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Fri, 11 Sep 2009 09:26:41 -0400 Subject: [rbridge] My Affiliation Message-ID: <1028365c0909110626t352a189eh6591114fe6ca5e1e@mail.gmail.com> It has been suggested that working group chairs whose affiliation is not obvious from their email address should post that information to their working group, especially when it changes. I used to be affiliated with Motorola but am now affiliated with Stellar Switches. Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com From Radia.Perlman at Sun.COM Fri Sep 11 10:25:38 2009 From: Radia.Perlman at Sun.COM (Radia Perlman) Date: Fri, 11 Sep 2009 10:25:38 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> Message-ID: <4AAA8812.5030802@sun.com> The problem with managing via IP is that all the RBridges would need IP addresses. I guess that's plausible without configuration because of DHCP, but in an environment where the TRILL campus is broken, it might be better to talk to something using lower level stuff. I believe SNMP can work over Ethernet. But the real question is why do we need that check in the first place. If there aren't good reasons for it, it's one less thing to do when forwarding a packet. Radia Eric Gray wrote: > Radia, > > Sorry, but that doesn't make any sense. > > Management is typically (possibly exclusively) done using > IP. As such, it would be the case that management would be done > by an IP station, which probably would also be an Ethernet end > station. As such, it would be located at the "edge" of an > RBridge campus, and would presumably communicate either directly > with "adjacent" RBridges, or it's frames would be "ingressed" > by an "adjacent" RBridge for delivery to a remote RBridge. > > There is nothing about this that requires a management > station to be able to TRILL-encapsulate frames. > > If someone wants to build an end-station that can do this, > then they can build a one port RBridge and there is nothing we, > or anyone, can do to stop them. > > -- > Eric > > -----Original Message----- > From: rbridge-bounces at postel.org [mailto:rbridge-bounces at postel.org] On Behalf Of Radia Perlman > Sent: Saturday, September 05, 2009 6:28 PM > To: TRILL/RBridge Working Group > Subject: Re: [rbridge] Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? > > And another case where it might be useful for an endnode to do the TRILL > encapsulation is so that a > management station (presumably an endnode) can communicate with an RBridge. > > Radia > > > Radia Perlman wrote: > >> If there's still a chance for minor, mostly editorial changes... >> >> In section 4.6.2, which is about how to handle receipt of a >> TRILL-encapsulated frame: >> >> >> 8. The port on which the frame was received is checked and the frame >> >> discarded if there is no TRILL IS-IS adjacency on that port. >> >> I'd like to delete that, since I don't think that check is necessary, and >> I was reading a paper about an architecture where endnodes are smart >> and have some sort of directory to look up the destination, so that >> the endnode can do the TRILL encapsulation, which would work fine (as >> long as we delete that extra check), and >> might be more secure than having RBridges just learn where endnodes are. >> And smart endnodes could coexist with endnodes and RBridges that work >> the way we've always imagined it....anyway, I >> just would like to get rid of the extra check, since I don't think it >> adds anything, and it does preclude some flexibility that might be useful >> for some applications. >> >> There's another place with that check: >> >> >>4.6.2.4 Known Unicast TRILL Data Frames >> >> >> The port on which the frame was received is checked and the frame >> >> discarded if there is no TRILL IS-IS adjacency on that port. >> >> Again, just deleting that sentence I think would be good. >> >> _______________________________________________ >> rbridge mailing list >> rbridge at postel.org >> http://mailman.postel.org/mailman/listinfo/rbridge >> >> > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > From eric.gray at ericsson.com Fri Sep 11 14:11:47 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Fri, 11 Sep 2009 17:11:47 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AAA8812.5030802@sun.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> Message-ID: Radia, Is it possible that - if you forward RBridge/TRILL encapsulated frames that you don't know - for certain - come from an RBridge you know about - you might get a persistent loop? I certainly believe that it is something of a break in the forwarding model if it is possible for an RBridge to even receive RBridge/TRILL encapsulated frames from an entity that is not peered with the receiving RBridge. -- Eric -----Original Message----- From: Radia.Perlman at Sun.COM [mailto:Radia.Perlman at Sun.COM] Sent: Friday, September 11, 2009 1:26 PM To: Eric Gray Cc: TRILL/RBridge Working Group Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? Importance: High The problem with managing via IP is that all the RBridges would need IP addresses. I guess that's plausible without configuration because of DHCP, but in an environment where the TRILL campus is broken, it might be better to talk to something using lower level stuff. I believe SNMP can work over Ethernet. But the real question is why do we need that check in the first place. If there aren't good reasons for it, it's one less thing to do when forwarding a packet. Radia Eric Gray wrote: > Radia, > > Sorry, but that doesn't make any sense. > > Management is typically (possibly exclusively) done using > IP. As such, it would be the case that management would be done > by an IP station, which probably would also be an Ethernet end > station. As such, it would be located at the "edge" of an > RBridge campus, and would presumably communicate either directly > with "adjacent" RBridges, or it's frames would be "ingressed" > by an "adjacent" RBridge for delivery to a remote RBridge. > > There is nothing about this that requires a management > station to be able to TRILL-encapsulate frames. > > If someone wants to build an end-station that can do this, > then they can build a one port RBridge and there is nothing we, > or anyone, can do to stop them. > > -- > Eric > > -----Original Message----- > From: rbridge-bounces at postel.org [mailto:rbridge-bounces at postel.org] On Behalf Of Radia Perlman > Sent: Saturday, September 05, 2009 6:28 PM > To: TRILL/RBridge Working Group > Subject: Re: [rbridge] Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? > > And another case where it might be useful for an endnode to do the TRILL > encapsulation is so that a > management station (presumably an endnode) can communicate with an RBridge. > > Radia > > > Radia Perlman wrote: > >> If there's still a chance for minor, mostly editorial changes... >> >> In section 4.6.2, which is about how to handle receipt of a >> TRILL-encapsulated frame: >> >> >> 8. The port on which the frame was received is checked and the frame >> >> discarded if there is no TRILL IS-IS adjacency on that port. >> >> I'd like to delete that, since I don't think that check is necessary, and >> I was reading a paper about an architecture where endnodes are smart >> and have some sort of directory to look up the destination, so that >> the endnode can do the TRILL encapsulation, which would work fine (as >> long as we delete that extra check), and >> might be more secure than having RBridges just learn where endnodes are. >> And smart endnodes could coexist with endnodes and RBridges that work >> the way we've always imagined it....anyway, I >> just would like to get rid of the extra check, since I don't think it >> adds anything, and it does preclude some flexibility that might be useful >> for some applications. >> >> There's another place with that check: >> >> >>4.6.2.4 Known Unicast TRILL Data Frames >> >> >> The port on which the frame was received is checked and the frame >> >> discarded if there is no TRILL IS-IS adjacency on that port. >> >> Again, just deleting that sentence I think would be good. >> >> _______________________________________________ >> rbridge mailing list >> rbridge at postel.org >> http://mailman.postel.org/mailman/listinfo/rbridge >> >> > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > From cait at asomi.com Fri Sep 11 14:53:23 2009 From: cait at asomi.com (Caitlin Bestler) Date: Fri, 11 Sep 2009 14:53:23 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> Message-ID: <4AAAC6D3.6010201@asomi.com> Eric Gray wrote: > Radia, > > Is it possible that - if you forward RBridge/TRILL > encapsulated frames that you don't know - for certain - come > from an RBridge you know about - you might get a persistent > loop? I certainly believe that it is something of a break > in the forwarding model if it is possible for an RBridge to > even receive RBridge/TRILL encapsulated frames from an entity > that is not peered with the receiving RBridge. > Wouldn't the Hop Count still prevent loops? From Radia.Perlman at Sun.COM Fri Sep 11 16:32:03 2009 From: Radia.Perlman at Sun.COM (Radia Perlman) Date: Fri, 11 Sep 2009 16:32:03 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AAAC6D3.6010201@asomi.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> Message-ID: <4AAADDF3.9080004@sun.com> Right. As Caitlin said...encapsulated packets have a hop count...also, an endnode is initiating it, not forwarding it, so I don't see how it could cause a loop. Radia Caitlin Bestler wrote: > Eric Gray wrote: >> Radia, >> >> Is it possible that - if you forward RBridge/TRILL encapsulated >> frames that you don't know - for certain - come >> from an RBridge you know about - you might get a persistent loop? I >> certainly believe that it is something of a break >> in the forwarding model if it is possible for an RBridge to even >> receive RBridge/TRILL encapsulated frames from an entity that is not >> peered with the receiving RBridge. >> > > Wouldn't the Hop Count still prevent loops? > From eric.gray at ericsson.com Fri Sep 11 16:44:04 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Fri, 11 Sep 2009 19:44:04 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AAAC6D3.6010201@asomi.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> Message-ID: Caitlin/Radia, Hop Count - like TTL - doesn't prevent loops, it just reduces their severity (by limiting the impact of individual PDUs to some finite number of hops). In this case, since we don't know the sender of the PDU, we have no firmer basis than dewy-eyed optimism to believe the hop count will be set to a usefully limiting value. If we could be certain - for the case that Radia mentioned - that the sender was an end station, then we (maybe) would have some justification for believing no loop could occur. But the fact is that - just because the question Radia had asked was based on the possibility that the unknown sender could be an end station does not mean that is the only case we need to consider. -- Eric -----Original Message----- From: Caitlin Bestler [mailto:cait at asomi.com] Sent: Friday, September 11, 2009 5:53 PM To: Eric Gray Cc: Radia.Perlman at Sun.COM; TRILL/RBridge Working Group Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? Importance: High Eric Gray wrote: > Radia, > > Is it possible that - if you forward RBridge/TRILL > encapsulated frames that you don't know - for certain - come > from an RBridge you know about - you might get a persistent > loop? I certainly believe that it is something of a break > in the forwarding model if it is possible for an RBridge to > even receive RBridge/TRILL encapsulated frames from an entity > that is not peered with the receiving RBridge. > Wouldn't the Hop Count still prevent loops? From cait at asomi.com Fri Sep 11 20:51:02 2009 From: cait at asomi.com (Caitlin Bestler) Date: Fri, 11 Sep 2009 20:51:02 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> Message-ID: <4AAB1AA6.2020208@asomi.com> Eric Gray wrote: > Caitlin/Radia, > > Hop Count - like TTL - doesn't prevent loops, it just reduces > their severity (by limiting the impact of individual PDUs to > some finite number of hops). > > In this case, since we don't know the sender of the PDU, we > have no firmer basis than dewy-eyed optimism to believe the > hop count will be set to a usefully limiting value. > > If we could be certain - for the case that Radia mentioned > - that the sender was an end station, then we (maybe) would > have some justification for believing no loop could occur. > > But the fact is that - just because the question Radia had > asked was based on the possibility that the unknown sender > could be an end station does not mean that is the only case > we need to consider. > > -- > Eric > Those are certainly valid reasons why an RBridge might want to drop encapsulated frames from unknown sources (specifically, that they MAY do so). But the real question is whether there is a justification for stating that they MUST do so. The approach that Radia is presenting is classic "Be liberal in what you accept". It is optimized to be resilient to minor discovery snafus. The argument for rejecting it is that if whomever is the true source of this frame had done their job properly then the problem would not exist at all, or they are an attacker. In either case they should not be accomodated. Now if there was an effective Denial of Service attack that this would strategy would enable then rejecting the frame would make sense. But this doesn't strike me as a very effective attack. Simply limiting forgiveness to periods when the queue depths were out of the Yellow or Red zones would short circuit any DoS attack with this technique. So I'd lean towards being liberal in what is accepted. -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From bernard_aboba at hotmail.com Sun Sep 13 01:11:02 2009 From: bernard_aboba at hotmail.com (Bernard Aboba) Date: Sun, 13 Sep 2009 01:11:02 -0700 Subject: [rbridge] Comments on draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: <292BD6E016C29243BB231D329D626C8C02D236E5@USDALSMBS05.ad3.ad.alcatel.com> References: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> <292BD6E016C29243BB231D329D626C8C02D236E5@USDALSMBS05.ad3.ad.alcatel.com> Message-ID: Review of draft-ietf-trill-rbridge-protocol-13.txt Bernard Aboba Summary Overall, I found this document to be very easy to read. From an architectural perspective, I believe that the authors have done a good job of articulating where TRILL fits in the IEEE 802 architecture. In particular, the document makes it clear that TRILL is designed as a substitute for (R)STP, but does not represent a re-thinking of Ethernet or the IEEE 802 architecture in general. Of course, it is possible or even likely that some unforseen interactions could arise, but my overall assessment is that the authors have been thoughtful in their design, utilizing encapsulation to enable maximum compatibility between Rbridges and legacy bridges. Of course, this thoughtful approach to backward compatibility also means that Rbridges will have more of an evolutionary impact than perhaps was first envisaged. For example, an IEEE 802.11 AP connected to an infrastructure based on Rbridges will still have some of the same limitations with respect to restrictions on multiple Associations within an SSID as would exist if the AP were to be connected to a conventional bridged Ethernet. My major areas of concern with this document relates to potential MTU issues as well as potential compatibility issues with the extensions enumerated in the Appendix. There are also a few instances in which the document is not as clear as it could be, or includes optional capabilities that IMHO should either be made mandatory to implement or be eliminated. However, I do not think that these concerns are sufficient to require the document to be published as Experimental. If and when TRILL is deployed, I expect that a number of issues will be found and that a -bis document will eventually need to be prepared. That is par for the course nowadays, and if this were to preclude consideration for Proposed Standard status, then we wouldn't have many new protocol specifications under consideration for PS. In my view, the more relevant question with respect to status is whether the protocol is sufficiently well specified so as to preclude the introduction of widespread interoperability problems. Where a document could potentially introduce such problems and where initial deployment is likely to result in major changes to the design, an Experimental status would be warranted. In retrospect, such a Status would have been more appropriate for protocols such as SIP whose continual "evolution" has lead to persistent interoperabiltiy problems a decade after its initial introduction. However, TRILL is based on a mature routing protocol (IS-IS) with demonstrated interoperability, with some modest enhancements. Other than a few optional capabilities which could be made mandatory or eliminated, and some instances where the text needs to be clarified, the specification is relatively straightforward, and so it seems unlikely that we will see numerous major interoperability issues between TRILL implementations, on the scale of what we have seen with SIP. More likely is the potential for compatibility issues between TRILL and existing legacy bridges. However, rather than requiring the authors to enumerate alll such potential issues and provide solutions prior to publication (which could create years of delay), a more practical approach would be for the document to more clearly enumerate the scenarios believed to be most suitable for initial deployment. Timers STP has been superceded by RSTP in order to improve convergence times. Looking through this spec, I'm not clear whether TRILL was designed to compete with RSTP convergence times, and if so, what the default values should be. Optional functionality By IETF standards, there is only a modest amount of optional functionality in this spec, but what there is (ESADI and options) doesn't seem compelling to me. Is this functionality really necessary, or is it in there only to provide "value add" (e.g. opportunities for non-interoperability)? RBridge nicknames I understand why nicknames are needed (to avoid making the MTU issues even worse). However, we have learned with RFC 3927 that collisions within 16-bit spaces can be painful (e.g. collision probabilities can be quite high if you have a substantial number of Rbridges in the network). Overall, I think that nickname collisions (along with optional functionality such as ESADI and Options) represent one of the potential weaknesses in the spec, particularly since Rbridges are most attractive in situations (such as datacenters) where the number of RBridges could be large. Some tweaks in the algorithms are suggested below. Global uniqueness of MAC addresses We are now seeing situations in which some of the conventional assumptions made by IEEE 802.1 have broken down. One of these is the widespread deployment of virtualization within datacenters. In these scenarios, multiple MAC addresses may be assigned to a single (virtualized) NIC. To limit a potential explosion in demand for MAC addresses, MAC addresses can be assigned by management software from a vendor OUI, and as a result, MAC addresses are not guaranteed to be unique across VLANs. In reading through the document, it seems clear to me that the intent is for traffic to be routed by VLAN and MAC address, so that end stations are not required to have globally unique MAC addresses (although Rbridge MAC addresses do need to be globally unique). However, there are a number of instances in which the text is not as clear as it could be on this point (see below for detailed comments). Note that the ability of TRILL to handle non-globally unique endstation MAC addresses is IMHO a major advantage as compared to single-spanning tree switches, or even "Q in Q" provider bridges. Some of these advantages might be worth calling out. For example: 1. TRILL encapsulation potentially shields legacy bridges from learning MAC addresses which might cause problems (e.g. single spanning tree implementations). 2. TRILL encapsulation can shield "Q in Q" provider bridges from exposure to MAC address duplication which could occur when a provider needs to handle traffic from customers with their own distinct datacenters utilizing virtualization. 3. Greenfield TRILL deployments only require end station MAC addresses to be unique per VLAN. As a result, TRILL is virtualization-friendly, and the prohibitions on virtualization described in documents such as IEEE 802.1X-REV are not unnecessary in an Rbridge deployment. MTU issues While support for PMTU discovery is quite common within TCP implementations, the same is not true for UDP. Legacy implementations that lack support for IEEE 802.1AB-REV, and automatically set an Ethernet interface MTU to 1500 are quite widespread. In greenfield Rbridge installations designed to support a larger MTU between Rbridges, this should be a solveable problem. It also should be addressable in situations where Rbridges are installed alongside relatively new switches that support MTUs of 1530 or greater. However, I do wonder what issues could arise in situations where Rbridges are installed alongside switches that only support Ethernet frames of 1512 or 1516 octets. Reading the document, it was not clear whether MTU probing was "mandatory to implement" but I think this functionality should be mandatory, if only for diagnostic purposes. I also think that the algorithm for MTU determination could be better specified, perhaps by incorporating elements from RFC 4821. Port protocols There are a number of IEEE 802.1 protocols (such as IEEE 802.1X, IEEE 802.1AB, etc.) that relate to the state of a port and utilize a non-forwardable multicast address. The draft states that an Rbridge will not forward frames sent to a non-forwardable multicast address. This cleanly segments the problem in the sense that TRILL is put forward purely as a substitute for the spanning tree protocol, leaving the operation of port protocols intact. In such a model, the operation of IEEE 802.1X-2004 and IEEE 802.1AB-REV should not be affected; these protocols, if implemented at the edge Rbridge, should operate largely as they do today from the point of view of the end station. However, there are some corner cases in which limitations could arise. One example is an edge device such as wired VOIP handset with one or more wired Ethernet ports. These devices often do not implement "port protocols" such as IEEE 802.1X-2004, but instead operate like a TPMR, passing them through to the switch. I do not see this as a problem with the Rbridge specification per se, because the spec is attempting to mimic the behavior of a conventional switch in this (and other cases). However, it might be worth a sentence explicitly stating that -- and noting that potential extensions to other cases such as Virtual RBridges or Provider Rbridges are not excluded, but are left to future work. I would also note that IEEE 802.1X-REV goes beyond "port access control" to defining "pseudo" and "virtual" ports. Virtual ports are created via MACSEC (IEEE 802.1AE) and pseudo ports involve MAC-based authentication state, so as to allow IEEE 802.1X supplicants to coexist on shared media. Among other things, pseudo and virtual ports introduce the notion of IEEE 802.1X traffic that could be destined to a unicast address (as in IEEE 802.11i) rather than to a non-forwardable multicast address. As is stated in the document, the location of TRILL in the IEEE 802.1 architecture makes it transparent to these extensions; however in places the document language is outdated and could be cleaned up (see below for examples). Detailed Comments Abstract The design supports VLANs and optimization of the distribution of multi-destination frames based on VLAN and IP derived multicast groups. It also allows forwarding tables to be sized according to the number of RBridges (rather than the number of end nodes), which allows internal forwarding tables to be substantially smaller than in conventional bridges. [BA] Since core bridges can forward based on VLAN tags and not MAC addresses, this claim seems somewhat exaggerated. Section 1 IEEE 802.1 bridges avoid these problems by transparently gluing many physical links into what appears to IP to be a single LAN [802.1D]. However, 802.1 bridge forwarding using the spanning tree protocol has some disadvantages: [BA] Throughout the document, I get the sense that TRILL is aiming at a target that is somewhat frozen in time. Today most new bridge implementations support RSTP, which offers considerably faster convergence. Shortest path bridging is progressing, etc. Overall, I'd like the document to be clearer about what advantages apply to classic STP, and which ones also apply to enhancements such as RSTP, shortest path bridging, etc. For example, it seems to me that TRILL has advantages over single spanning tree implementations in terms of ability to customize forwarding per VLAN. Based on the document, it seems that it may have some disadvantages in terms of convergence times and initialization behavior, as compared with RSTP. So overall, I think that Section 1 could do a better job of articulating the pros/cons of TRILL. In particular, it appears to me that some of the pros described in the problem statement document have not been realized, and some addition pros (such as virtualization support) have arisen. in most cases they can incrementally replace IEEE [BA] The use of "most" here is somewhat vague. I might say "as described in Appendix A, they can incrementally replace IEEE..." While they can be applied to a variety of link protocols, this specification focuses on IEEE [802.3] links. [BA] This would seem to suggest that TRILL could be used on Token Ring. You might want to specifically exclude this usage (or interconnection with source routing bridges, for that matter). Section 1.2 Section 2: general RBridge description Section 3: the TRILL header Section 4: other TRILL protocol details In case of conflict, the order of precedence of these section is as follows, with those appearing earlier in this list having precedence over those that appear later: 4 > 3 > 2 [BA] "this list" is ambiguous. Do you mean the above list, in which case, Section 2 would take precedence over Section 4? Or do you mean Section 4 takes precedence over 2? I think you need to make this more clear. Section 1.3 1.3 Terminology and Notation in this document "TRILL" is the protocol specified herein while an "RBridge" is a devices that implement that protocol. The second letter in Rbridge is case insensitive. Both Rbridge and RBridge are correct. [BA] s/devices/device/ In this document, the term "link", unless otherwise qualified, means "bridged LAN", that is to say, the combination of one or more [802.3] links with zero or more brides, hubs, repeaters, or the like. The [BA] Do you really want to be combining IEEE 802.3 links with brides (or grooms)? Suggest changing "brides" to "bridges". The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119]. [BA] You might want to move this to earlier in the section (or give it its own sub-section). Section 2 RBridges run a link state protocol amongst themselves. This gives them enough information to compute pair-wise optimal paths for unicast, and calculate distribution trees for delivery of frames either to unknown MAC destinations or to multicast/broadcast groups. [RBridges] [RP1999] [BA] I think you mean "unknown VLAN/MAC destination pairs" here, no? used, within a campus, even by an RBridge that lacks an IP or other Layer 3 transport stack or which has zero configuration and thus no Layer 3 address, by transporting SNMP with Ethernet [RFC4789]. [BA] The term "zero configuration" is ambiguous here, since it might be construed to refer to an RFC 3927 "zero config" IP address within 169.254/16. In any case, a switch obtaining a dynamic IP address can still be "zero configuration", so I'm not sure what the point is. Section 2.1 An RBridge, RB1, which is the VLAN-x forwarder on any of its links MUST learn the location of VLAN-x end nodes, both on the links for which it is VLAN-x forwarder, and on other links in the campus. RB1 learns the port and Layer 2 (MAC) addresses of end nodes on links for [BA] Later on it is made clear that we are talking about learning the VLAN as well as the port and MAC address. It's important to be consistent on this point throughout the document. Also, it can be more secure because not only might the enrollment be authenticated (for example by cryptographically based EAP methods via [802.1X]), but ESADI also supports cryptographic authentication of its messages [RFC5304]. [BA] Elsewhere the document makes it clear that IEEE 802.1X is implemented below TRILL so that they don't interact. Here you seem to be saying that ESADI and EAP could be related. If IEEE 802.1X operating on a port or source MAC does not allow non-802.1X frames to pass, then ESADI should not be announcing the unauthenticated source MAC, right? Note that where 802.1X frames are sent to a unicast address, they *will* be forwarded, and learning will occur; however, filtering will occur at the edge so as to prevent incoming or outgoing frames to/from unauthenticated MAC addresses. The potential for different behavior of learning and ESADI concerns me. Advertising end nodes using ESADI is optional, as is learning from these announcements. [BA] Does it make any sense to support Advertising but not learning, or learning and not advertising? If both are optional, then these combinations are possible. In general, I'm not a fan of optional functionality. ESADI is optional, and it's not clear to me that the benefits outweigh the potential headaches. Is ESADI really necessary, particularly given the claims made about scaling with the number of Rbridges, not end nodes? Section 2.2 2. elimination of the need for original source and destination MAC address learning in transit RBridges; [BA] The specification doesn't discuss "destination MAC address learning". Is this a typo? 3. direction of frames towards the egress RBridge (this enables forwarding tables of RBridges to be sized with the number of RBridges rather than the total number of end nodes); and, [BA] I am unclear about the situations in which this claim applies. Are we only talking about core Rbridges, or edge ones as well? What about scaling properties when ESADI is implemented? 2.2.1 Known-Unicast These frames have a unicast inner MAC destination address (Inner.MacDA) and are those for which ingress RBridge knows the egress RBridge for that destination MAC address. [BA] I think you mean for "VLAN/destination MAC address pair", no? 2.2.2 1. unicast frames for which the destination is unknown: the Inner.MacDA is unicast, but the ingress RBridge does not know its location; [BA] do you mean "does not include an entry for the VLAN/destination MAC address pair"? 3. multicast frames for which the Layer 2 destination address is not derived from an IP multicast address: the Inner.MacDA is multicast, and not from the set of Layer 2 multicast addresses derived from IPv4 or IPv6 multicast addresses; [BA] Does this work for *all* multicast addresses not derived from an IP multicast address (e.g. addresses used in provider bridging)? Section 3.3 3.3 Reserved (R) The two R bits are reserved for future use in extensions to this version zero of the TRILL protocol. They MUST be initially set to zero, transparently copied by transit RBridges, and ignored on receipt. [BA] From this sentence, I'm not clear who is ignoring the R bits. Does the sentence just apply to transit RBridges or all RBridges? Section 3.5 Note: Most RBridge implementations are expected to be optimized for the simplest and most common cases of frame forwarding and processing. The inclusion of any options may, and the inclusion of complex or lengthy options very likely will, cause frame processing using a "slow path" with markedly inferior performance to "fast path" processing. Limited slow path throughput may cause such frames to be lost. [BA] This makes a very good case for removing options from this specification. Do we really need this?? This seems like it will bring with it all the issues that options have in IPv4, and then some. Section 3.6 Although the RBridge MAY decrease the hop count by more than 1, under the circumstances described above, the RBridge forwarding a frame MUST decrease the hop count by at least 1, and discards the frame if it cannot do so because the hop count is 0. [BA] The MAY here seems dangerous. Could an implementation decrement hop count by 10? This seems like one of those situations where the spec should be more strict (e.g. SHOULD decrement by 1). Allowing a wide variety of behavior seems like it would be courting trouble. Is there value in allowing variation, and if so, please explain what that value is. Section 3.7.3 o Nickname values MAY be configured. An RBridge that has been configured with one or more nickname values will have priority for those nickname values over all Rbridges with non-configured nicknames. [BA] RFC 3927 does not permit static assignment of link scope addresses because it was fearned that this would lead to implementations ignoring collision detection. I realize that configured nicknames get priority, but it still seems like a good idea for an Rbridge to test for conflict before configuring the nickname, so as to avoid a potential conflict, no? o Each RBridge is responsible for ensuring that its nickname or each of its nicknames is unique. If RB1 chooses nickname x, and RB1 discovers, through receipt of RB2's LSP, that RB2 has also chosen x, then the RBridge with the numerically higher priority keeps the nickname, or if there is a tie in priority, the RBridge with the numerically higher IS-IS System ID keeps the nickname, and the other RBridge MUST select a new nickname. This can require an RBridge with a configured nickname to select a replacement nickname. [BA] Given that a configured nickname might need to select a replacement, what is the value of supporting configuration? Also, this would suggest that even configured nicknames need to test for uniqueness prior to configuration, rather than relying on increased priority due to aging (which is undefined in the spec) to avoid forcing nodes that have been using a nickname for a long time to change. Also, just because RB1 gets RB2's LSP doesn't mean that RB2 simultaneously has RB1's LSP. So one side can assume that the other will give up the nickname, but that might not happen for a while. Do you want to encourage RB1 to send its LSP right away after it detects a collision so that the information asymmetry is quickly corrected? o To minimize the probability of nickname collisions, when an RBridge selects a new nickname, it does so by randomly hashing some of its parameters, e.g., interface MAC addresses, time and date, and other entropy sources such as those given in [RFC4086]. There is no reason for all Rbridges to use the same algorithm for selecting nicknames. [BA] Randomness isn't required to reduce collision probabilities. It's only necessary for the distribution within the space to be uniform. RFC 3927 doesn't require randomness because it was felt that this would just increase the difficulty of debugging with no net benefit. I'd suggest that you rethink this. An RBridge MAY request multiple nicknames so that it can be the root of multiple trees for multipathing of multi-destination frames. These trees would all be shortest path trees from the RBridge but, since the tree number is used in tie breaking when there are multiple equal cost paths (see Section 4.5.1), the different trees will likely utilize different links. [BA] In a deployment with a substantial number of Rbridges, the collision probability will already be high. If each Rbridge has multiple nicknames, it will impact scaling in a negative way. Have you thought this through? For example, in a situation where each Rbridge has 16 nicknames, you might start seeing high collision probabilities with as few as 16 Rbridges. I'd suggest that this practice be NOT RECOMMENDED. ection 4.1.1 Frames with the same source address, destination address, VLAN, and priority that are received on the same port as each other and are transmitted on the same port MUST be transmitted in the order received unless the RBridge classifies the frames into more fine grained flows, in which case this ordering requirement applies to each such flow. (Such frames might not be sent out the same port if multipath is implemented. See Appendix C.) [BA] Do you mean "granular"? Section 4.2.4.3 o Loop avoidance: - Inhibiting itself for a configurable time from zero to 30 seconds, which defaults to 30 second, after it sees a root bridge change on the link (see Section 4.9.3.2). [BA] The 30 second default might make sense where RBridges are deployed alongisde STP, but is this default needed where the legacy bridges run RSTP? Overall, I'm curious as to how the failover times in TRILL will compare to RSTP. The spec doesn't say much about this. 4.2.5.2 TRILL ESADI Information The information in ESADI is the list of local end station MAC addresses known to the originating RBridge and, for each such address, a one octet unsigned "confidence" rating in the range 0-254 (see Section 4.8). In order to make it practical to optionally provide for VLAN ID translation, as specified in a separate document, TRILL ESADI frames MUST NOT contain the VLAN ID in the body of the frame after the Inner.VLAN tag. [BA] Does VLAN ID translation really require support for ESADI? Section 4.3.1 Sz is determined by having each RBridge (optionally) advertise, in its LSP, its assumption of the value of the campus-wide Sz. This LSP element is known in IS-IS as the originatingLSPBufferSize, TLV #14. The default and minimum value for Sz, and the implicitly advertised value of Sz if the TLV is absent, is 1470 bytes. [BA] Given the potential headaches that can be caused by MTU issues, I wonder whether the spec couldn't be tightened. If implementations don't advertise Sz, then it seems like we could end up with a 1470 octet MTU limitation on endstations, even where this might not be necessary (e.g. MTU discovery could enable a larger MTU). This seems like it could cause headaches where it wouldn't be necessary. Would it make sense to require Sz to be advertised where MTU discovery finds a larger MTU size than 1470? Would it make sense to require Sz to be advertised where MTU discovery finds a larger MTU size than 1470? Section 4.3.2 There are two new TRILL IS-IS message types for use between pairs of RBridge neighbors to test the bidirectional packet size capacity of their connection. These messages are: -- MTU-probe -- MTU-ack Both the MTU-probe and the MTU-ack are padded to the size being tested. [BA] This section doesn't say whether support for MTU-probe is mandatory. The way it is written, I'd be concerned that implementers would not support it, and that lack of MTU discovery capability would cause problems in deployments. This section might also benefit from a more detailed specification of the MTU discovery algorith (such as incorporating elements from the Packetization Layer Path MTU RFC). Section 4.6 Source address information ( { VLAN, Outer.MacSA, port } ) is learned from any frame with a unicast sources address (see Section 4.8). [BA] Good to see this clearly stated here. As noted earlier, there are places in the document where this is not as clear. Section 4.6.1.1 4. If a unicast destination address is unknown, RB1 handles the frame as described in Section 4.6.1.2 for a broadcast frame except that the Inner.MacDA is the original native frame's unicast destination address. [BA] Do you mean "if a VLAN/unicast destination address pair is unknown"? Section 4.6.2.4 If RBn is a transit RBridge the hop count is decremented by one and the frame forwarded to the next hop RBridge towards the egress RBridge. The Inner.VLAN and ingress nickname are not examined by a transit RBridge when it forwards a known unicast TRILL data frame. [BA] Elsewhere it says that the hop count MAY be decremented by more than one. Section 4.8.1 3. By Layer 2 registration protocols learning the { source MAC, VLAN, port } of end stations registering at a local port. [BA] Are you referring to IEEE 802.11 association here? This won't tell you the VLAN of the end-station (this might not be determined until after authentication). Section 4.8 Although outside the scope of this specification, there are some Layer 2 features in which a set of VLANs has shared learning, where one of the VLANs is the "primary" and the other VLANs in the group are "secondaries". [BA] One concern about this section is that it might be construed to permit trees shared between VLANs. You might make it clear that this is not the intent. Section 4.9.1 [BA] Given the earlier discussion of "zero config" you might put in a sentence or two indicating that the configuration bits are only needed for special circumstances. You might also state what the default setting of the bits is. Section 4.9.2 Low-level control frames are handled in the lower level port/link control logic in the same way as in an [802.1Q-2005] bridge. This can optionally include a variety of 802.1 or link specific protocols such as link layer discovery, link aggregation (Clause 43 of [802.3]), MAC security [802.1AE], or port based access control [802.1X]. While handled at a low level, these frames may affect higher level processing. For example, a Layer 2 registration protocol may affect the confidence in learned addresses. The upper interface [BA] IEEE 802.1X-REV is no longer purely "port based", since it supports "pseudo" and "virtual" ports as well. Section 6 IEEE 802.1 port admission and link security mechanisms, such as [802.1X] and [802.1AE], can also be used. These are best thought of as being implemented within a port and are outside the scope of TRILL (just as they are generally out of scope for bridging standards [802.1D] and 802.1Q); however, TRILL can make use of secure registration through the confidence level communicated in optional TRILL ESADI (see Section 4.8). [BA] Neither IEEE 802.1AE nor IEEE 802.1X-REV are based on physical ports. Instead, I'd refer to the diagrams which make it clear that these mechanisms operate below TRILL. Section A.3.4 The spanning tree solution does quite well in this particular case. But it depends on both RB1 and RB2 having implemented the optional feature of being able to configure a port to emit BPDUs as described in Section A.3.3 above. It also makes the bridged LAN whose partition [BA] This somewhat begs the question about whether being able to emit BPDUs should be optional, recommended or mandatory. Appendix C When multipathing is used, frames that follow different paths will be subject to different delays and may be re-ordered. While some traffic may be order/delay insensitive, typically most traffic consists of flows of frames where re-ordering within a flow is damaging. How to determine flows or what granularity flows should have is beyond the scope of this document but, as an example, under many circumstances it would be safe to consider all the frames flowing between a particular pair of end station ports to be a flow. [BA] I think you have to say more here, given that Ethernet invariants include ordering requirements. Certainly considering all frames between a pair of end stations to be a flow would be conservative, but this isn't the Ethernet requirement, right? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.postel.org/pipermail/rbridge/attachments/20090913/1e5bf003/attachment-0001.html From roger.lapuh at nortel.com Mon Sep 14 14:04:45 2009 From: roger.lapuh at nortel.com (Roger Lapuh) Date: Mon, 14 Sep 2009 22:04:45 +0100 Subject: [rbridge] Review of draft-ietf-trill-rbridge-protocol-13.txt Message-ID: <78F2E9EF10EF9447A0FDAF9E6036E24E1172DC84@zharhxm0.corp.nortel.com> Hi Don (Eastlake) I know you were not trying to imply this, but I got the impression from your comments that others might infer that the current link-state based IEEE 802.1 project could take a long time and will not see any market validation soon. Thus I wanted to point out that a pre-standard version of 802.1aq (SPBm) has been shipping and is generally available since 2008 and is deployed in several carrier - as well as enterprise customer (data center) networks. It has been proven to be a scalable and robust solution with great potentials to extend beyond L2 virtualization. With that in mind there are several vendors implementing it as well. For broad market adoption, it makes sense for any new link-state Ethernet approach to provide maximum interoperability with existing IEEE standards. Also, while going through the TRILL draft I was wondering how well you perceive end-stations and applications will handle the out-of-ordered (Unicast) packets that you point out within the TRILL design in section 4.5 page 48? regards, Roger Nortel ---------- Forwarded message ---------- From: FEDYK Don Date: Thu, Sep 10, 2009 at 7:45 PM Subject: Re: [rbridge] Review of draft-ietf-trill-rbridge-protocol-13.txt To: Donald Eastlake , "Romascanu, Dan (Dan)" , rbridge at postel.org Cc: ext Bernard Aboba , Ralph Droms Hi Donald I too would like to commend Dan for his careful and comprehensive feedback. But I have a few corrections to your comments. Inline: > -----Original Message----- > > Hi Dan, > > Thanks for your efforts in providing these detailed comments. > > On Sun, Aug 30, 2009 at 8:41 AM, Romascanu, Dan > (Dan) wrote: > > I was asked by the TRILL WG chairs to perform a review of > > draft-ietf-trill-rbridge-protocol-13.txt. Below please find > my findings. > > > > I appreciate the work put by the editors and WG in designing this > > solution and writing the document. The editors are to be > saluted for > > writing an I-D which is in most parts clear and well organized. > > > > I do believe that this work and the document as it stands > still have a > > number of problems that need to be solved. > > > > 1. I have serious reservations that this document can > directly go to > > Proposed Standard. For reasons detailed below there is a lot of > > incertitude in what concerns the interoperability or > coexistence with > > the layer 2 protocols designed in the IEEE. Many of these are still > > work in progress. For this good reason in many places the > document is > > forced to make assumptions and cannot make but statements like > > 'Rbridges are generally compatible with IEEE ...' (section > 2.4.2). Has > > the WG discussed the option of issuing the first version of this > > document as Experimental? > > It is true that many IEEE 802.1 standards efforts are "work > in progress". But with the ever multiplying number of efforts > in 802.1, this situation seems permanent. To wait for 802.1 > standards efforts to end would be to wait forever. This is not an accurate representation of the work in 802.1. Several projects have completed this year. The 802.1 Working group has considerable scope. > > There is no actual question concerning the interoperability > of TRILL with current 802.1 protocols. > That is not the way I read the liaison referenced at the end of this message. > Section 2 of the draft is a broad overview. That is the > reason it says "... RBridges are generally compatible with > IEEE [802.1Q-2005] customer bridges ..." in Section 2.4.2. If > all the details were given in Section 2, it wouldn't be a > broad introductory overview. To avoid misleading readers, > that phrase should probably be changed to "... RBridges are > compatible with IEEE [802.1Q-2005] customer bridges except as > discussed in this document ...". > > I cannot recall any status other than Proposed Standard ever > being suggested in the TRILL working group for this document. > The TRILL protocol has had more review by routing and > bridging network experts than most IETF protocols at the > Proposed Standard level. How can you substantiate that comment? Trill was one of the smaller Groups that I have attended. It was not scheduled in the Routing area at all. Certainly some routing experts reviewed Trill. But many I know were reviewing the use of IS-IS by Trill and not evaluating Trill. > TRILL has been implemented in > software, tested, and the results of that testing included > back in the protocol design. Multiple vendors are very > interested in TRILL and I predict that RBridge products will > ship in 2010. > > > 2. It looks to me like the motivation of the standard is frozen in > > time into a period of evolution of the IEEE 802.1 > technology trying to > > solve some problems that were in part dealt with by the > IEEE in the meantime. > > Out of the four spanning tree bullets listed in the introduction as > > problems that TRILL solves the second and the fourth seem > to have been > > dealt with by the IEEE 802.1aq and the IEEE 802.1ak respectively. > > It appears to me that this comment does not suggest any > changes in the draft. But, as a response, the TRILL WG was > created by the IETF to produce a standard subsequent to > substantial discussion of the fact that, after it rejected > TRILL technology, IEEE 802.1 then started an effort with > similar goals. (And after the IEEE 802 Executive Committee > stated that it had no objection to the IETF pursuing > development of a TRILL standard.) Trill was created by individuals in the IETF that had goals that were different than IEEE 802.1. IEEE is a an open standards body just like the IETF. We are open to good ideas on control planes and shortest path tree computation. But we work on bridging data plane and the properties and tools of that data plane. The Trill data plan is not Ethernet and it is not IP either. So Trill is different. > > > 3. All the VLAN discussions in the specification mention only the > > C-VLANs. What about provider bridges? Can these be migrated > to Rbridges? > > The explanation in Appendix E is not sufficient, the compatibility > > with provider bridges needs to be dealt in the core part of the I-D. > > The TRILL protocol specified in this draft is, as stated in > the draft, a customer bridge standard. The draft is careful > to refer to > 802.1Q-2005 all over the place. An IEEE 802.1Q-2005 bridge is > a purely customer bridge. > > This TRILL protocol specification does not try to solve the > provider bridge problem. (I believe it would be relatively > easy to extend TRILL to do that but it should be done in a > separate document.) Provider bridges (and provider backbone > bridges) cannot be replace by customer bridges, whether those > customer bridges are 802.1 customer bridges or RBridges based > on this spec. On the other hand, as stated in Appendix E, > provider bridges (and/or provider backbone bridges) can be > used to transparently interconnect parts of an RBridge campus. > > Since RBridges are compatible with provider bridges and > provider backbone bridges, as I understand the word > compatible, I don't understand what there is to "deal" with. > However, an appropriately reworded version of the paragraph > just above this paragraph can certainly be added to the main > body of the draft. > > > 4. There needs to be more discussion about the IEEE > protocols that do > > not define forwarding or tagging. If TRILL makes a claim > that the best > > migration policy is to replace IEEE bridges by Rbridges it needs to > > define behavior relative to protocols like IEEE 802.1AE > Media Access > > Control (MAC) Security, IEEE 802.1AB Station and Media > Access Control > > Connectivity Discovery, and IEEE 802.1aj Two Port Media Relay, as > > bridges and end stations that are being replaced may > support any or a > > combination of these. > > This document does not claim that the best migration policy > is incremental replacement, just that a network of > 802.1Q-2005 customer bridges will, with the differences > described in the document, operate correctly if some or all > are replaced by RBridges. > > 802.1 protocols such as 802.1AE, 802.1AB, and 802.1X and > 802.3 protocols such as Link Aggregation are logically > implemented within ports. It makes no difference whether > those ports are ports on 802.1D or 802.1Q or RBridge or IP > router devices. (And, assuming they continue to develop in > the direction it appears they are going, this will also be > true of 802.1Qaz and 802.1Qbb.) This layering is covered in > overview Section 2.4.1 and in more detail in Section 4.9.2 of > the draft. As far as I can see, the draft makes RBridge > "behavior relative to" this type of protocol, which is that > they are orthogonal, clear. > > It would be true that if someone is incrementally replacing > 802.1Q-2005 bridges that are using one or more of these, for > want of a better term, "port level" protocols, such as > 802.1AE, with RBridges, then obviously they need to use > RBridges that also have the same port level protocols > configured the same way. A statement to that effect could be > added to the draft. > > 802.1aj is a work in progress. Although referred to as a > "type of bridge", as currently specified 802.1aj devices are, > as far as I can tell, a fancy type of repeater. For example, > to quote from 802.1aj Draft D4.0, "A MAC Relay is transparent > to all frame-based media independent protocols except those > explicitly addressed to this device." (Of course, 802.1aj > repeaters have a management interface and any node, whether a > bridge or end station, could have independent functionality > added to it to interact with that management interface, but I > don't think of that as a change in the bridge or end station > functionality.) A statement could be added to the draft > saying "RBridges are compatible with IEEE 802.1aj devices as > currently specified, in the same sense that IEEE 802.1Q-2005 > bridges are compatible with such devices." or the like. > > > 5. For all the reasons above I think it is required that > this document > > is also reviewed in detail by IEEE 802.1. I suggest that a formal > > review be required via a liaison letter before or during > IETF Last Call. > > A review by IEEE 802.1 has been done. The TRILL draft has > been modified based on that review (see item 2 of "Changes > from -12 to -13" > in Appendix Z). As one who participated in the Review we only reviewed the documents with respect to the IEEE 802.1 sections of the document. The IEEE did not comment on Trill anymore than it would comment on say Fiber Channel or IP. The review is public for everyone to read right here. http://www.ieee802.org/1/files/public/docs2009/liaison-to-trill-wg-0309. pdf Regards, Don _______________________________________________ rbridge mailing list rbridge at postel.org http://mailman.postel.org/mailman/listinfo/rbridge -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.postel.org/pipermail/rbridge/attachments/20090914/e5b740b3/attachment.html From cait at asomi.com Mon Sep 14 21:23:14 2009 From: cait at asomi.com (Caitlin Bestler) Date: Mon, 14 Sep 2009 21:23:14 -0700 Subject: [rbridge] Review of draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: <78F2E9EF10EF9447A0FDAF9E6036E24E1172DC84@zharhxm0.corp.nortel.com> References: <78F2E9EF10EF9447A0FDAF9E6036E24E1172DC84@zharhxm0.corp.nortel.com> Message-ID: <4AAF16B2.9040008@asomi.com> Roger Lapuh wrote: > Hi Don (Eastlake) > > I know you were not trying to imply this, but I got the impression from > your comments that others might infer that the current link-state > based IEEE 802.1 project could take a long time and will not see any > market validation soon. Thus I wanted to point out that a pre-standard > version of 802.1aq (SPBm) has been shipping and is generally available > since 2008 and is deployed in several carrier - as well as enterprise > customer (data center) networks. It has been proven to be a scalable and > robust solution with great potentials to extend beyond L2 > virtualization. With that in mind there are several vendors implementing > it as well. > > For broad market adoption, it makes sense for any new link-state > Ethernet approach to provide maximum interoperability with existing IEEE > standards. > > Also, while going through the TRILL draft I was wondering how well you > perceive end-stations and applications will handle the out-of-ordered > (Unicast) packets that you point out within the TRILL design in section > 4.5 page 48? > > > There are two major points that need to be made in regard to 802.1 projects as they relate to Trill/RBridges. First, 802.1 will always have work in progress. This is not some sort of slam that implies 802.1 cannot finish things, but rather a recognition that one of 802.1 strengths is that it continually evolves. 802.1 networking will not stabilize until it has been obsolete for several years. Deployed RBRidges will of course have to track these changes, but only because they almost certainly *include* an 802.1Q Bridge. Layering is nice, but extra boxes take space and power. But by positioning TRILL as an EISS client (with some minor ISS client usage as noted) insulates TRILL from 802.1 protocol evolution, and provides clear guidance on how to deal with each new evolution in the protocol -- because new 802.1 protocols seldom disturb the EISS interface and would be very explicit about how to deal with compatability if a new extension to the EISS interface was required. Second, Shortest Path Briding may address some of the same problem-space as TRILL, but it takes a distinctly different solution. SPB, like most current work in 802.1, is based upon creating "clouds" with special characteristics out of a set of 802.1 Bridges that are directly connected with each other. Essentially any new feature grows out from a core. TRILL, by contrast can be deployed from the edges. It does not care what type of 802.1 Bridges provide connectivity between RBridges, just that the connectivity exists. A SPB-cloud, however, requires an unbroken chain of SPB Bridges. This distinction alone, in my opinion, justifies keeping TRILL as a distinct solution whatever the state of deployment of SPB or SPBB is. From d3e3e3 at gmail.com Tue Sep 15 10:37:18 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Tue, 15 Sep 2009 13:37:18 -0400 Subject: [rbridge] Link Aggregation Reference Message-ID: <1028365c0909151037j4a0fe460k47d1ef75173e2f9f@mail.gmail.com> The current base protocol draft refers to Clause 43 of IEEE 802.3 for Link Aggregation. However, this IEEE standard is moving from 802.3 to 802.1 and, in fact, IEEE 802.1AX-2008 has already been adopted and is even available for free download, having been adopted more than six months ago. It is my understanding that this material will be removed from IEEE 802.3 the next time it is revised, so IEEE 802.1AX-2008 seem like a better standard to refer to. Unless there is some objection, references to Clause 43 of IEEE 802.3 in the base protocol specification draft will be replaced by references to IEEE 802.1AX-2008. Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com From Radia.Perlman at Sun.COM Tue Sep 15 14:54:27 2009 From: Radia.Perlman at Sun.COM (Radia Perlman) Date: Tue, 15 Sep 2009 14:54:27 -0700 Subject: [rbridge] Review of draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: <4AAF16B2.9040008@asomi.com> References: <78F2E9EF10EF9447A0FDAF9E6036E24E1172DC84@zharhxm0.corp.nortel.com> <4AAF16B2.9040008@asomi.com> Message-ID: <4AB00D13.4070104@sun.com> To reinforce what Caitlin said, TRILL is layered above 802, and as such it's similar to IP in both its interaction with Ethernet and its independence from it. TRILL devices are, as Caitlin said, like endnodes to layer 2. There might be new features (such as congestion feedback) that would be useful to exploit. There are sometimes subtle issues in adjacent layers that should be thought through (e.g., VRRP moving a MAC address around), so it's really good to have experts checking out if there are any subtle issues. And TRILL has uncovered some (like the need to consider what VLAN tag(s) to use in IS-IS Hellos, and to be defensive against VLAN translation). But if there really were any work going on in IEEE that would break TRILL, it would also likely break many other IETF protocols as well. Radia Caitlin Bestler wrote: > Roger Lapuh wrote: > >> Hi Don (Eastlake) >> >> I know you were not trying to imply this, but I got the impression from >> your comments that others might infer that the current link-state >> based IEEE 802.1 project could take a long time and will not see any >> market validation soon. Thus I wanted to point out that a pre-standard >> version of 802.1aq (SPBm) has been shipping and is generally available >> since 2008 and is deployed in several carrier - as well as enterprise >> customer (data center) networks. It has been proven to be a scalable and >> robust solution with great potentials to extend beyond L2 >> virtualization. With that in mind there are several vendors implementing >> it as well. >> >> For broad market adoption, it makes sense for any new link-state >> Ethernet approach to provide maximum interoperability with existing IEEE >> standards. >> >> Also, while going through the TRILL draft I was wondering how well you >> perceive end-stations and applications will handle the out-of-ordered >> (Unicast) packets that you point out within the TRILL design in section >> 4.5 page 48? >> >> >> >> > > There are two major points that need to be made in regard to 802.1 > projects as they relate to Trill/RBridges. > > First, 802.1 will always have work in progress. This is not some sort > of slam that implies 802.1 cannot finish things, but rather a > recognition that one of 802.1 strengths is that it continually evolves. > 802.1 networking will not stabilize until it has been obsolete for > several years. Deployed RBRidges will of course have to track these > changes, but only because they almost certainly *include* an 802.1Q > Bridge. Layering is nice, but extra boxes take space and power. > > But by positioning TRILL as an EISS client (with some minor ISS client > usage as noted) insulates TRILL from 802.1 protocol evolution, and > provides clear guidance on how to deal with each new evolution in > the protocol -- because new 802.1 protocols seldom disturb the EISS > interface and would be very explicit about how to deal with > compatability if a new extension to the EISS interface was > required. > > > Second, Shortest Path Briding may address some of the same problem-space > as TRILL, but it takes a distinctly different solution. SPB, like most > current work in 802.1, is based upon creating "clouds" with special > characteristics out of a set of 802.1 Bridges that are directly > connected with each other. Essentially any new feature grows out > from a core. > > TRILL, by contrast can be deployed from the edges. It does not care > what type of 802.1 Bridges provide connectivity between RBridges, just > that the connectivity exists. A SPB-cloud, however, requires an unbroken > chain of SPB Bridges. > > This distinction alone, in my opinion, justifies keeping TRILL as a > distinct solution whatever the state of deployment of SPB or SPBB is. > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > From eric.gray at ericsson.com Wed Sep 16 06:41:58 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Wed, 16 Sep 2009 09:41:58 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AAB1AA6.2020208@asomi.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> Message-ID: Caitlin, Sorry, but I do not understand your arguments. In particular, when you say "In either case they should not be accomodated" are you saying that the PDUs they wish to forward will not be forwarded, or do you mean to say that the protocol does not need to worry about these cases? -- Eric -----Original Message----- From: Caitlin Bestler [mailto:cait at asomi.com] Sent: Friday, September 11, 2009 11:51 PM To: Eric Gray Cc: Radia.Perlman at Sun.COM; TRILL/RBridge Working Group Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? Importance: High Eric Gray wrote: > Caitlin/Radia, > > Hop Count - like TTL - doesn't prevent loops, it just reduces > their severity (by limiting the impact of individual PDUs to > some finite number of hops). > > In this case, since we don't know the sender of the PDU, we > have no firmer basis than dewy-eyed optimism to believe the > hop count will be set to a usefully limiting value. > > If we could be certain - for the case that Radia mentioned > - that the sender was an end station, then we (maybe) would > have some justification for believing no loop could occur. > > But the fact is that - just because the question Radia had > asked was based on the possibility that the unknown sender > could be an end station does not mean that is the only case > we need to consider. > > -- > Eric > Those are certainly valid reasons why an RBridge might want to drop encapsulated frames from unknown sources (specifically, that they MAY do so). But the real question is whether there is a justification for stating that they MUST do so. The approach that Radia is presenting is classic "Be liberal in what you accept". It is optimized to be resilient to minor discovery snafus. The argument for rejecting it is that if whomever is the true source of this frame had done their job properly then the problem would not exist at all, or they are an attacker. In either case they should not be accomodated. Now if there was an effective Denial of Service attack that this would strategy would enable then rejecting the frame would make sense. But this doesn't strike me as a very effective attack. Simply limiting forgiveness to periods when the queue depths were out of the Yellow or Red zones would short circuit any DoS attack with this technique. So I'd lean towards being liberal in what is accepted. -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From cait at asomi.com Wed Sep 16 09:02:57 2009 From: cait at asomi.com (Caitlin Bestler) Date: Wed, 16 Sep 2009 09:02:57 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> Message-ID: <4AB10C31.3060405@asomi.com> Eric Gray wrote: > Caitlin, > > Sorry, but I do not understand your arguments. > > In particular, when you say "In either case they > should not be accomodated" are you saying that the PDUs > they wish to forward will not be forwarded, or do you > mean to say that the protocol does not need to worry > about these cases? > I was explaining my understanding of the rationale for having a MUST NOT forward requirement. Basically, the reasoning is that there is no identified need for these frames to be forwarded that cannot be solved by having implementers read the specs more carefully. Therefore, *any* risk of even short-lived loops would not be warranted. However, the countering philosophy is one that has served the Internet well: be liberal in what you accept. In this specific case I believe it applies, because removing the requirement does not impede the ability of RBridges to later deal with any DoS attack that is developed, but does allow the flexibility of being tolerant of bring-up configuration issues. To be specific, if there were any language in the draft covering this I think it should be that these frames MAY be forwarded. Just deleting the language would have the same effect, so my minimalist side would prefer that. I would definitely be opposed to stating that these frames MUST or SHOULD be forwarded, because as cited in the above rationale, they shouldn't have existed in the first place. > -- > Eric > > -----Original Message----- > From: Caitlin Bestler [mailto:cait at asomi.com] > Sent: Friday, September 11, 2009 11:51 PM > To: Eric Gray > Cc: Radia.Perlman at Sun.COM; TRILL/RBridge Working Group > Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? > Importance: High > > Eric Gray wrote: >> Caitlin/Radia, >> >> Hop Count - like TTL - doesn't prevent loops, it just reduces >> their severity (by limiting the impact of individual PDUs to >> some finite number of hops). >> >> In this case, since we don't know the sender of the PDU, we >> have no firmer basis than dewy-eyed optimism to believe the >> hop count will be set to a usefully limiting value. >> >> If we could be certain - for the case that Radia mentioned >> - that the sender was an end station, then we (maybe) would >> have some justification for believing no loop could occur. >> >> But the fact is that - just because the question Radia had >> asked was based on the possibility that the unknown sender >> could be an end station does not mean that is the only case >> we need to consider. >> >> -- >> Eric >> > > Those are certainly valid reasons why an RBridge might want to > drop encapsulated frames from unknown sources (specifically, that > they MAY do so). > > But the real question is whether there is a justification for > stating that they MUST do so. The approach that Radia is presenting > is classic "Be liberal in what you accept". It is optimized to > be resilient to minor discovery snafus. > > The argument for rejecting it is that if whomever is the true > source of this frame had done their job properly then the > problem would not exist at all, or they are an attacker. > In either case they should not be accomodated. > > > Now if there was an effective Denial of Service attack that > this would strategy would enable then rejecting the frame > would make sense. But this doesn't strike me as a very effective > attack. Simply limiting forgiveness to periods when the queue > depths were out of the Yellow or Red zones would short circuit > any DoS attack with this technique. So I'd lean towards being > liberal in what is accepted. > > > > -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From eric.gray at ericsson.com Wed Sep 16 09:12:32 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Wed, 16 Sep 2009 12:12:32 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AB10C31.3060405@asomi.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> Message-ID: Caitlin, Forwarding is special, even in the "Postel" sense. Being conservative in what you send and liberal in what you receive is a two-edged sword with both edges applying to forwarding. Perhaps a better decision model would be based on handling litter. Just because you think it is okay to pick it up, doesn't mean I think it is okay to pass it along. In this case, if your implementation passes it along, the recipient is now missing the bit of differentiation that you had because that implementation will have received it from an RBridge it *does* know about. -- Eric -----Original Message----- From: Caitlin Bestler [mailto:cait at asomi.com] Sent: Wednesday, September 16, 2009 12:03 PM To: Eric Gray Cc: Radia.Perlman at Sun.COM; TRILL/RBridge Working Group Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? Importance: High Eric Gray wrote: > Caitlin, > > Sorry, but I do not understand your arguments. > > In particular, when you say "In either case they > should not be accomodated" are you saying that the PDUs > they wish to forward will not be forwarded, or do you > mean to say that the protocol does not need to worry > about these cases? > I was explaining my understanding of the rationale for having a MUST NOT forward requirement. Basically, the reasoning is that there is no identified need for these frames to be forwarded that cannot be solved by having implementers read the specs more carefully. Therefore, *any* risk of even short-lived loops would not be warranted. However, the countering philosophy is one that has served the Internet well: be liberal in what you accept. In this specific case I believe it applies, because removing the requirement does not impede the ability of RBridges to later deal with any DoS attack that is developed, but does allow the flexibility of being tolerant of bring-up configuration issues. To be specific, if there were any language in the draft covering this I think it should be that these frames MAY be forwarded. Just deleting the language would have the same effect, so my minimalist side would prefer that. I would definitely be opposed to stating that these frames MUST or SHOULD be forwarded, because as cited in the above rationale, they shouldn't have existed in the first place. > -- > Eric > > -----Original Message----- > From: Caitlin Bestler [mailto:cait at asomi.com] > Sent: Friday, September 11, 2009 11:51 PM > To: Eric Gray > Cc: Radia.Perlman at Sun.COM; TRILL/RBridge Working Group > Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? > Importance: High > > Eric Gray wrote: >> Caitlin/Radia, >> >> Hop Count - like TTL - doesn't prevent loops, it just reduces >> their severity (by limiting the impact of individual PDUs to >> some finite number of hops). >> >> In this case, since we don't know the sender of the PDU, we >> have no firmer basis than dewy-eyed optimism to believe the >> hop count will be set to a usefully limiting value. >> >> If we could be certain - for the case that Radia mentioned >> - that the sender was an end station, then we (maybe) would >> have some justification for believing no loop could occur. >> >> But the fact is that - just because the question Radia had >> asked was based on the possibility that the unknown sender >> could be an end station does not mean that is the only case >> we need to consider. >> >> -- >> Eric >> > > Those are certainly valid reasons why an RBridge might want to > drop encapsulated frames from unknown sources (specifically, that > they MAY do so). > > But the real question is whether there is a justification for > stating that they MUST do so. The approach that Radia is presenting > is classic "Be liberal in what you accept". It is optimized to > be resilient to minor discovery snafus. > > The argument for rejecting it is that if whomever is the true > source of this frame had done their job properly then the > problem would not exist at all, or they are an attacker. > In either case they should not be accomodated. > > > Now if there was an effective Denial of Service attack that > this would strategy would enable then rejecting the frame > would make sense. But this doesn't strike me as a very effective > attack. Simply limiting forgiveness to periods when the queue > depths were out of the Yellow or Red zones would short circuit > any DoS attack with this technique. So I'd lean towards being > liberal in what is accepted. > > > > -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From ddutt at cisco.com Wed Sep 16 09:41:04 2009 From: ddutt at cisco.com (Dinesh G Dutt) Date: Wed, 16 Sep 2009 09:41:04 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AB10C31.3060405@asomi.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> Message-ID: <4AB11520.200@cisco.com> I agree, Dinesh Caitlin Bestler wrote: > Eric Gray wrote: > >> Caitlin, >> >> Sorry, but I do not understand your arguments. >> >> In particular, when you say "In either case they >> should not be accomodated" are you saying that the PDUs >> they wish to forward will not be forwarded, or do you >> mean to say that the protocol does not need to worry >> about these cases? >> >> > > I was explaining my understanding of the rationale for > having a MUST NOT forward requirement. Basically, the > reasoning is that there is no identified need for these > frames to be forwarded that cannot be solved by having > implementers read the specs more carefully. Therefore, > *any* risk of even short-lived loops would not be > warranted. > > However, the countering philosophy is one that has > served the Internet well: be liberal in what you accept. > In this specific case I believe it applies, because > removing the requirement does not impede the ability > of RBridges to later deal with any DoS attack that > is developed, but does allow the flexibility of > being tolerant of bring-up configuration issues. > > To be specific, if there were any language in the > draft covering this I think it should be that these > frames MAY be forwarded. Just deleting the language > would have the same effect, so my minimalist side > would prefer that. > > I would definitely be opposed to stating that these > frames MUST or SHOULD be forwarded, because as cited > in the above rationale, they shouldn't have existed > in the first place. > > > >> -- >> Eric >> >> -----Original Message----- >> From: Caitlin Bestler [mailto:cait at asomi.com] >> Sent: Friday, September 11, 2009 11:51 PM >> To: Eric Gray >> Cc: Radia.Perlman at Sun.COM; TRILL/RBridge Working Group >> Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? >> Importance: High >> >> Eric Gray wrote: >> >>> Caitlin/Radia, >>> >>> Hop Count - like TTL - doesn't prevent loops, it just reduces >>> their severity (by limiting the impact of individual PDUs to >>> some finite number of hops). >>> >>> In this case, since we don't know the sender of the PDU, we >>> have no firmer basis than dewy-eyed optimism to believe the >>> hop count will be set to a usefully limiting value. >>> >>> If we could be certain - for the case that Radia mentioned >>> - that the sender was an end station, then we (maybe) would >>> have some justification for believing no loop could occur. >>> >>> But the fact is that - just because the question Radia had >>> asked was based on the possibility that the unknown sender >>> could be an end station does not mean that is the only case >>> we need to consider. >>> >>> -- >>> Eric >>> >>> >> Those are certainly valid reasons why an RBridge might want to >> drop encapsulated frames from unknown sources (specifically, that >> they MAY do so). >> >> But the real question is whether there is a justification for >> stating that they MUST do so. The approach that Radia is presenting >> is classic "Be liberal in what you accept". It is optimized to >> be resilient to minor discovery snafus. >> >> The argument for rejecting it is that if whomever is the true >> source of this frame had done their job properly then the >> problem would not exist at all, or they are an attacker. >> In either case they should not be accomodated. >> >> >> Now if there was an effective Denial of Service attack that >> this would strategy would enable then rejecting the frame >> would make sense. But this doesn't strike me as a very effective >> attack. Simply limiting forgiveness to periods when the queue >> depths were out of the Yellow or Red zones would short circuit >> any DoS attack with this technique. So I'd lean towards being >> liberal in what is accepted. >> >> >> >> >> > > > -- We make our world significant by the courage of our questions and by the depth of our answers. - Carl Sagan From cait at asomi.com Wed Sep 16 10:20:24 2009 From: cait at asomi.com (Caitlin Bestler) Date: Wed, 16 Sep 2009 10:20:24 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> Message-ID: <4AB11E58.5060601@asomi.com> Eric Gray wrote: > Caitlin, > > Forwarding is special, even in the "Postel" sense. Being > conservative in what you send and liberal in what you receive is > a two-edged sword with both edges applying to forwarding. > > Perhaps a better decision model would be based on handling > litter. Just because you think it is okay to pick it up, doesn't > mean I think it is okay to pass it along. In this case, if your > implementation passes it along, the recipient is now missing the > bit of differentiation that you had because that implementation > will have received it from an RBridge it *does* know about. > I understand that rationale. It is the basis of most 802.1 enhancements and fundamental to all network-oriented security (as opposed to endpoint driven). For example, an XYZ-capable Bridge does not admit frames to a controlled priority unless the frames come from an End Station it knows of or another Bridge that it knows is XYZ-capable. In this way any end station receiving a frame on an XYZ-controlled priority knows something about the end station that originated it and the bridges the frame traversed. But exactly what information is being provided by the fact that a frame is TRILL encapsulated? The end station does not see the encapsulated frame, and I do not see how de-encapsulating is impacted in any way. A specific example of a malformed or malicious frame that would cause damage would justify a MUST NOT forward, but I have not thought of any. Now, if a group of RBridges wanted to extend any given 802.1 cloud then they would have to follow the same rules on priority admission. But without such a special extension, the RBridge is merely adjacent to two 802.1 XYZ clouds, and is the originator of the frame when it forwards it. Just as a router, it would have to follow any rules that applied to an end station participating in an 802.1 XYZ cloud. -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From eric.gray at ericsson.com Wed Sep 16 10:32:31 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Wed, 16 Sep 2009 13:32:31 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AB11E58.5060601@asomi.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> <4AB11E58.5060601@asomi.com> Message-ID: Caitlin, IMO, I think you're focusing on the wrong information. It is not the fact that an item was TRILL encapsulated that is lost in transiting a supposed "ingress" RBridge, it is the fact that it was received by the RBridge already TRILL-encapsulated from somewhere unknown to that Rbridge. The significance of that information is that the there is an apparent topological anomaly that is being hidden by the RBridge that initially forwarded the TRILL-encapsulated frame. Since there is no way to include this information in subsequent forwarding, the initial RBridge ought not to be allowed to forward the questionable frame. Therefore, "MUST NOT forward" is the correct behavior. -- Eric -----Original Message----- From: Caitlin Bestler [mailto:cait at asomi.com] Sent: Wednesday, September 16, 2009 1:20 PM To: Eric Gray Cc: Group; Radia.Perlman at Sun.COM; TRILL/RBridge at ISI.EDU Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? Importance: High Eric Gray wrote: > Caitlin, > > Forwarding is special, even in the "Postel" sense. Being > conservative in what you send and liberal in what you receive is > a two-edged sword with both edges applying to forwarding. > > Perhaps a better decision model would be based on handling > litter. Just because you think it is okay to pick it up, doesn't > mean I think it is okay to pass it along. In this case, if your > implementation passes it along, the recipient is now missing the > bit of differentiation that you had because that implementation > will have received it from an RBridge it *does* know about. > I understand that rationale. It is the basis of most 802.1 enhancements and fundamental to all network-oriented security (as opposed to endpoint driven). For example, an XYZ-capable Bridge does not admit frames to a controlled priority unless the frames come from an End Station it knows of or another Bridge that it knows is XYZ-capable. In this way any end station receiving a frame on an XYZ-controlled priority knows something about the end station that originated it and the bridges the frame traversed. But exactly what information is being provided by the fact that a frame is TRILL encapsulated? The end station does not see the encapsulated frame, and I do not see how de-encapsulating is impacted in any way. A specific example of a malformed or malicious frame that would cause damage would justify a MUST NOT forward, but I have not thought of any. Now, if a group of RBridges wanted to extend any given 802.1 cloud then they would have to follow the same rules on priority admission. But without such a special extension, the RBridge is merely adjacent to two 802.1 XYZ clouds, and is the originator of the frame when it forwards it. Just as a router, it would have to follow any rules that applied to an end station participating in an 802.1 XYZ cloud. -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From eric.gray at ericsson.com Wed Sep 16 10:35:55 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Wed, 16 Sep 2009 13:35:55 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> <4AB11E58.5060601@asomi.com> Message-ID: Forwarded as there was an error in one address in the prior mail. Apologies if tyou receive 2 copies... ==================================================================== Caitlin, IMO, I think you're focusing on the wrong information. It is not the fact that an item was TRILL encapsulated that is lost in transiting a supposed "ingress" RBridge, it is the fact that it was received by the RBridge already TRILL-encapsulated from somewhere unknown to that Rbridge. The significance of that information is that the there is an apparent topological anomaly that is being hidden by the RBridge that initially forwarded the TRILL-encapsulated frame. Since there is no way to include this information in subsequent forwarding, the initial RBridge ought not to be allowed to forward the questionable frame. Therefore, "MUST NOT forward" is the correct behavior. -- Eric -----Original Message----- From: Caitlin Bestler [mailto:cait at asomi.com] Sent: Wednesday, September 16, 2009 1:20 PM To: Eric Gray Cc: Group; Radia.Perlman at Sun.COM; TRILL/RBridge at ISI.EDU Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? Importance: High Eric Gray wrote: > Caitlin, > > Forwarding is special, even in the "Postel" sense. Being > conservative in what you send and liberal in what you receive is > a two-edged sword with both edges applying to forwarding. > > Perhaps a better decision model would be based on handling > litter. Just because you think it is okay to pick it up, doesn't > mean I think it is okay to pass it along. In this case, if your > implementation passes it along, the recipient is now missing the > bit of differentiation that you had because that implementation > will have received it from an RBridge it *does* know about. > I understand that rationale. It is the basis of most 802.1 enhancements and fundamental to all network-oriented security (as opposed to endpoint driven). For example, an XYZ-capable Bridge does not admit frames to a controlled priority unless the frames come from an End Station it knows of or another Bridge that it knows is XYZ-capable. In this way any end station receiving a frame on an XYZ-controlled priority knows something about the end station that originated it and the bridges the frame traversed. But exactly what information is being provided by the fact that a frame is TRILL encapsulated? The end station does not see the encapsulated frame, and I do not see how de-encapsulating is impacted in any way. A specific example of a malformed or malicious frame that would cause damage would justify a MUST NOT forward, but I have not thought of any. Now, if a group of RBridges wanted to extend any given 802.1 cloud then they would have to follow the same rules on priority admission. But without such a special extension, the RBridge is merely adjacent to two 802.1 XYZ clouds, and is the originator of the frame when it forwards it. Just as a router, it would have to follow any rules that applied to an end station participating in an 802.1 XYZ cloud. -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From cait at asomi.com Wed Sep 16 11:19:36 2009 From: cait at asomi.com (Caitlin Bestler) Date: Wed, 16 Sep 2009 11:19:36 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> <4AB11E58.5060601@asomi.com> Message-ID: <4AB12C38.3000901@asomi.com> Eric Gray wrote: > > Caitlin, > > IMO, I think you're focusing on the wrong information. It is > not the fact that an item was TRILL encapsulated that is lost in > transiting a supposed "ingress" RBridge, it is the fact that it was > received by the RBridge already TRILL-encapsulated from somewhere > unknown to that Rbridge. > > The significance of that information is that the there is an > apparent topological anomaly that is being hidden by the RBridge > that initially forwarded the TRILL-encapsulated frame. Since there > is no way to include this information in subsequent forwarding, the > initial RBridge ought not to be allowed to forward the questionable > frame. Therefore, "MUST NOT forward" is the correct behavior. > That is a strong argument for "MUST NOT ignore the anomaly by simply forwarding the frame". However, dropping the frame is not the only method of resolving the anomaly. I see no reason why the exact mechanism needs a MUST. In my opinion the only justification for MUST DROP would be if the non-dropped frame could cause further damage, and *any* method of resolving the anomaly is acceptable (although customers who buy RBridges are likely to have preferences). -- Caitlin Bestler cait at asomi.com - http://www.asomi.com/CaitlinBestlerResume.html From Radia.Perlman at Sun.COM Wed Sep 16 13:16:13 2009 From: Radia.Perlman at Sun.COM (Radia Perlman) Date: Wed, 16 Sep 2009 13:16:13 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AB12C38.3000901@asomi.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> <4AB11E58.5060601@asomi.com> <4AB12C38.3000901@asomi.com> Message-ID: <4AB1478D.8050402@sun.com> Given that it might be controversial to remove the check entirely (and have it as MUST NOT check), I propose we make it optional, with a default. I also think we should clean up the wording in the spec, because I think it is weird, in the shared media case where R1 has two neighbors: R2 (an IS-IS adjacency) and E (an endnode), that (as I interpret the current wording), R1 will accept encapsulated frames from E if R2 is up, but stop accepting them from E if R2 goes down. So I propose we change 4.6.2: >> 8. The port on which the frame was received is checked and the frame >> discarded if there is no TRILL IS-IS adjacency on that port. to something like: An RBridge MAY choose to reject TRILL-encapsulated frames from a neighbor that it does not have an IS-IS adjacency with. The default behavior is that it will reject such frames, although it may be configured to skip that check. ****** Note that by changing from "receiving from a port" to "receiving from a neighbor" I'm attempting to say that R1 not only checks the port, but the source MAC address in the outer header, to avoid the weird case I mentioned above. I also don't have strong opinions about which way the default should be, but given that I was advocating getting rid of the check, I'm being politic by having the default being to reject such frames. :-) Radia From d3e3e3 at gmail.com Wed Sep 16 19:44:52 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Wed, 16 Sep 2009 22:44:52 -0400 Subject: [rbridge] Review of draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: <78F2E9EF10EF9447A0FDAF9E6036E24E1172DC84@zharhxm0.corp.nortel.com> References: <78F2E9EF10EF9447A0FDAF9E6036E24E1172DC84@zharhxm0.corp.nortel.com> Message-ID: <1028365c0909161944u7d6ab76dubecc8429cec24c5c@mail.gmail.com> On Mon, Sep 14, 2009 at 5:04 PM, Roger Lapuh wrote: > Hi Don (Eastlake) > > I know you were not trying to imply this, but I got the impression from your > comments?that others might infer that the current link-state based??IEEE > 802.1 project could take a long time ... I'm sorry if my remarks were not clear. I did not mean to say anything about the length of time it takes to complete standards efforts in IEEE 802.1. In fact, as far as I can tell, there are roughly equal complaints in the IETF and in IEEE 802 that standards efforts take too long. My point was that, as Caitlin said, if you were to wait until all current 802.1 efforts that might in some way bear on TRILL were completed, you would find that additional 802.1 efforts of that type had been started in the mean time. And if you were to then wait for those to be completed, you would find that yet other new 802.1 efforts of that type had been started. And so on. So, as I said, "to wait for 802.1 standards efforts to end", meaning to wait until there are no 802.1 standards activities in process, "would be to wait forever." In any evert, given that TRILL is layered above 802.1, it is mostly independent of IEEE. if there were problems with future IEEE work, then it is likely that all the IP protocols would be impacted. >... > > Also, while going through the TRILL draft I was wondering how well you > perceive end-stations and applications will handle the out-of-ordered > (Unicast) packets that you point out within the TRILL design in section 4.5 > page 48? That seems like an odd question. If you have applications where such occasional out-of-order unicast frames were a problem you would configure your RBridge campus so as to make sure they were of extremely low probability. For example by calculating a distribution tree per RBridge or configuring addresses to eliminate unknown unicast frames for connected end-stations or other techniques. In fact, a few words should probably be added to the draft about this. Thanks, Donald > > regards, > > Roger > Nortel From eric.gray at ericsson.com Thu Sep 17 09:48:06 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Thu, 17 Sep 2009 12:48:06 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AB1478D.8050402@sun.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> <4AB11E58.5060601@asomi.com> <4AB12C38.3000901@asomi.com> <4AB1478D.8050402@sun.com> Message-ID: Radia, From the perspective of protocol specification, it is somewhat incorrect to say "X MAY do 'p' (and) the default behavior is that X does 'p'." Normative use of MAY in the first part of the example above means that the ability to "do 'p'" is optional. The second part of the same example, however, implies that ability to "do 'p'" is mandatory, since 'p' is the default behavior. Similarly, it is not clear what "reject" means in a forwarding context. I agree that changing "from a port" (which is ambiguous) to "from a neighbor" is an improvement. Hence I would suggest the following alternative wording: "By default, an RBridge MUST NOT forward TRILL-encapsulated frames from a neighbor that it does not have an IS-IS adjacency with. RBridges MAY be configured to allow forwarding of these frames in cases where it is known that a non-peering device (such as an end-station) is configured to originate TRILL-encapsulated frames that can be safely forwarded." It's conceivable that "that can be safely forwarded" is ambiguous, and I am okay with eliding that part of the above text. However, doing so may also result in a comment (from the IESG, for instance) that the specification needs to be (at least somewhat) explicit about when it is okay to allow this kind of forwarding to occur. -- Eric -----Original Message----- From: Radia.Perlman at Sun.COM [mailto:Radia.Perlman at Sun.COM] Sent: Wednesday, September 16, 2009 4:16 PM To: Caitlin Bestler Cc: Eric Gray; Group Subject: Re: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? Given that it might be controversial to remove the check entirely (and have it as MUST NOT check), I propose we make it optional, with a default. I also think we should clean up the wording in the spec, because I think it is weird, in the shared media case where R1 has two neighbors: R2 (an IS-IS adjacency) and E (an endnode), that (as I interpret the current wording), R1 will accept encapsulated frames from E if R2 is up, but stop accepting them from E if R2 goes down. So I propose we change 4.6.2: >> 8. The port on which the frame was received is checked and the frame >> discarded if there is no TRILL IS-IS adjacency on that port. to something like: An RBridge MAY choose to reject TRILL-encapsulated frames from a neighbor that it does not have an IS-IS adjacency with. The default behavior is that it will reject such frames, although it may be configured to skip that check. ****** Note that by changing from "receiving from a port" to "receiving from a neighbor" I'm attempting to say that R1 not only checks the port, but the source MAC address in the outer header, to avoid the weird case I mentioned above. I also don't have strong opinions about which way the default should be, but given that I was advocating getting rid of the check, I'm being politic by having the default being to reject such frames. :-) Radia From Radia.Perlman at Sun.COM Thu Sep 17 13:24:36 2009 From: Radia.Perlman at Sun.COM (Radia Perlman) Date: Thu, 17 Sep 2009 13:24:36 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> <4AB11E58.5060601@asomi.com> <4AB12C38.3000901@asomi.com> <4AB1478D.8050402@sun.com> Message-ID: <4AB29B04.8030607@sun.com> Eric's wording is OK with me: > > "By default, an RBridge MUST NOT forward TRILL-encapsulated frames from > a neighbor that it does not have an IS-IS adjacency with. RBridges MAY > be configured to allow forwarding of these frames in cases where it is > known that a non-peering device (such as an end-station) is configured > to originate TRILL-encapsulated frames that can be safely forwarded." > From ddutt at cisco.com Thu Sep 17 14:03:44 2009 From: ddutt at cisco.com (Dinesh G Dutt) Date: Thu, 17 Sep 2009 14:03:44 -0700 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AB29B04.8030607@sun.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> <4AB11E58.5060601@asomi.com> <4AB12C38.3000901@asomi.com> <4AB1478D.8050402@sun.com> <4AB29B04.8030607@sun.com> Message-ID: <4AB2A430.2070007@cisco.com> I'm fine with Eric's wording too, Dinesh Radia Perlman wrote: > Eric's wording is OK with me: > > > >> "By default, an RBridge MUST NOT forward TRILL-encapsulated frames from >> a neighbor that it does not have an IS-IS adjacency with. RBridges MAY >> be configured to allow forwarding of these frames in cases where it is >> known that a non-peering device (such as an end-station) is configured >> to originate TRILL-encapsulated frames that can be safely forwarded." >> >> > > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > > -- We make our world significant by the courage of our questions and by the depth of our answers. - Carl Sagan From carlsonj at workingcode.com Fri Sep 18 04:13:10 2009 From: carlsonj at workingcode.com (James Carlson) Date: Fri, 18 Sep 2009 07:13:10 -0400 Subject: [rbridge] Resend: Getting rid of unnecessary and limiting check on TRILL-encapsulated frames? In-Reply-To: <4AB2A430.2070007@cisco.com> References: <4AA1E3AC.1040007@sun.com> <4AA2E5F9.6000906@sun.com> <4AAA8812.5030802@sun.com> <4AAAC6D3.6010201@asomi.com> <4AAB1AA6.2020208@asomi.com> <4AB10C31.3060405@asomi.com> <4AB11E58.5060601@asomi.com> <4AB12C38.3000901@asomi.com> <4AB1478D.8050402@sun.com> <4AB29B04.8030607@sun.com> <4AB2A430.2070007@cisco.com> Message-ID: <4AB36B46.8040700@workingcode.com> Dinesh G Dutt wrote: > I'm fine with Eric's wording too, > > Dinesh > Radia Perlman wrote: >> Eric's wording is OK with me: >> >> >> >>> "By default, an RBridge MUST NOT forward TRILL-encapsulated frames from >>> a neighbor that it does not have an IS-IS adjacency with. RBridges MAY >>> be configured to allow forwarding of these frames in cases where it is >>> known that a non-peering device (such as an end-station) is configured >>> to originate TRILL-encapsulated frames that can be safely forwarded." It looks good to me, too. -- James Carlson 42.703N 71.076W From eric.gray at ericsson.com Fri Sep 18 08:38:30 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Fri, 18 Sep 2009 11:38:30 -0400 Subject: [rbridge] DRB Priority Message-ID: I have one or two dumb questions. At several points in the base protocol specification, it states that DRB election depends on (priority, MAC) This information is presumably carried in TRILL-Hello messages. The MAC SA of a message sender is easy enough, but where is the priority carried? Since it is not explicitly listed as part of the information that MUST appear in all TRILL- Hello messages, I assume it is part of the fixed header of IS-IS LAN Hello messages. I further assume that the format and contents of the fixed header is defined in ISO/IEC 10589:2002 (since it does not make sense that this would be defined in either of the other two references related to IS-IS in sections 8 and 9 - i.e. - IS-IS Cryptographic Authentication, or IS-IS extensions for Traffic Engineering). That's a "reaching assumption" because I cannot find anywhere that the draft equates DRB priority to IS-IS DR priority. If that assumption is true, why is there not an explicit reference near the text in section 4.4.2 referring to the "fixed header" of "an IS-IS LAN Hello" message? If the assumption is not true, where would I look to find out where the priority field is defined, and how it is carried in a TRILL-Hello message? -- Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.postel.org/pipermail/rbridge/attachments/20090918/d5abe38f/attachment.html From d3e3e3 at gmail.com Fri Sep 18 10:41:59 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Fri, 18 Sep 2009 13:41:59 -0400 Subject: [rbridge] DRB Priority In-Reply-To: References: Message-ID: <1028365c0909181041q6bb733aejcbd6567b37d40480@mail.gmail.com> Hi Eric, On Fri, Sep 18, 2009 at 11:38 AM, Eric Gray wrote: > I have?one or two?dumb questions. > > At several points in the base protocol specification, it states that DRB > election depends > on (priority, MAC)? This information is presumably carried in TRILL-Hello > messages. Yes. > The MAC SA of a message sender is easy enough, but where is the priority > carried? > > Since it is not explicitly listed as part of the information that MUST > appear in all TRILL- > Hello messages, I assume it is part of the fixed header of IS-IS LAN Hello > messages. > I further assume that the format and contents of the fixed header is defined > in ISO/IEC > 10589:2002 (since it does not make sense that this would be defined in > either of the > other two references related to IS-IS in sections 8 and 9 - i.e. - IS-IS > Cryptographic > Authentication, or IS-IS extensions for Traffic Engineering). Yes, the 7-bit priority field in the IS-IS LAN Hello fixed header in specified ISO 10589 on page 66, Section 9.5. > That's a "reaching assumption" because I cannot find anywhere that the draft > equates > DRB priority to IS-IS DR priority. DRB (Designated RBridge) is the the TRILL dialect for DIS (Designated Intermediate System). They name the same thing. Probably a few words should be added so that it is clear that the DRB determination priority is the IS-IS DIS determination priority. > If that?assumption is true, why is there not an explicit reference near the > text in section > 4.4.2 referring to the "fixed header" of?"an IS-IS LAN Hello" message? ? When I look at draft -13, the very first paragraph of Section 4.4.2 is as follows: The TRILL-Hello has a new IS-IS message type. It starts with the same fixed header as an IS-IS LAN Hello. > If the assumption is not true, where?would I look to find out where the > priority field is > defined, and how it is carried in a TRILL-Hello message? > > -- > Eric Thanks, Donald From eric.gray at ericsson.com Fri Sep 18 11:10:35 2009 From: eric.gray at ericsson.com (Eric Gray) Date: Fri, 18 Sep 2009 14:10:35 -0400 Subject: [rbridge] DRB Priority In-Reply-To: <1028365c0909181041q6bb733aejcbd6567b37d40480@mail.gmail.com> References: <1028365c0909181041q6bb733aejcbd6567b37d40480@mail.gmail.com> Message-ID: Donald, Thanks! There is one reference to ISO 10589 that is driectly related to DRB election, so I guess that it makes sense that DRB priority is included. Like you said, it could be a little clearer. Size was the information I was looking for. I don't know that I actually have access to the ISO standard itself, but I needed to know the size of the field. Thanks again... -- Eric -----Original Message----- From: Donald Eastlake [mailto:d3e3e3 at gmail.com] Sent: Friday, September 18, 2009 1:42 PM To: Eric Gray Cc: TRILL Subject: Re: [rbridge] DRB Priority Importance: High Hi Eric, On Fri, Sep 18, 2009 at 11:38 AM, Eric Gray wrote: > I have?one or two?dumb questions. > > At several points in the base protocol specification, it states that DRB > election depends > on (priority, MAC)? This information is presumably carried in TRILL-Hello > messages. Yes. > The MAC SA of a message sender is easy enough, but where is the priority > carried? > > Since it is not explicitly listed as part of the information that MUST > appear in all TRILL- > Hello messages, I assume it is part of the fixed header of IS-IS LAN Hello > messages. > I further assume that the format and contents of the fixed header is defined > in ISO/IEC > 10589:2002 (since it does not make sense that this would be defined in > either of the > other two references related to IS-IS in sections 8 and 9 - i.e. - IS-IS > Cryptographic > Authentication, or IS-IS extensions for Traffic Engineering). Yes, the 7-bit priority field in the IS-IS LAN Hello fixed header in specified ISO 10589 on page 66, Section 9.5. > That's a "reaching assumption" because I cannot find anywhere that the draft > equates > DRB priority to IS-IS DR priority. DRB (Designated RBridge) is the the TRILL dialect for DIS (Designated Intermediate System). They name the same thing. Probably a few words should be added so that it is clear that the DRB determination priority is the IS-IS DIS determination priority. > If that?assumption is true, why is there not an explicit reference near the > text in section > 4.4.2 referring to the "fixed header" of?"an IS-IS LAN Hello" message? ? When I look at draft -13, the very first paragraph of Section 4.4.2 is as follows: The TRILL-Hello has a new IS-IS message type. It starts with the same fixed header as an IS-IS LAN Hello. > If the assumption is not true, where?would I look to find out where the > priority field is > defined, and how it is carried in a TRILL-Hello message? > > -- > Eric Thanks, Donald From d3e3e3 at gmail.com Sun Sep 20 07:32:26 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Sun, 20 Sep 2009 10:32:26 -0400 Subject: [rbridge] Comments on draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: References: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> <292BD6E016C29243BB231D329D626C8C02D236E5@USDALSMBS05.ad3.ad.alcatel.com> Message-ID: <1028365c0909200732m77156393m9b72c6c5192e742f@mail.gmail.com> Hi Bernard, Thanks for your very thorough review and detailed comments. [DE] My responses are below marked with "[DE]" at the left margin and in a few cases I've inserted addition "[BA]"s in front of your remarks for clarity. [BA] Here is my review. Review of draft-ietf-trill-rbridge-protocol-13.txt Bernard Aboba Summary Overall, I found this document to be very easy to read. From an architectural perspective, I believe that the authors have done a good job of articulating where TRILL fits in the IEEE 802 architecture. In particular, the document makes it clear that TRILL is designed as a substitute for (R)STP, but does not represent a re-thinking of Ethernet or the IEEE 802 architecture in general. Of course, it is possible or even likely that some unforeseen interactions could arise, but my overall assessment is that the authors have been thoughtful in their design, utilizing encapsulation to enable maximum compatibility between Rbridges and legacy bridges. Of course, this thoughtful approach to backward compatibility also means that Rbridges will have more of an evolutionary impact than perhaps was first envisaged. For example, an IEEE 802.11 AP connected to an infrastructure based on Rbridges will still have some of the same limitations with respect to restrictions on multiple Associations within an SSID as would exist if the AP were to be connected to a conventional bridged Ethernet. My major areas of concern with this document relates to potential MTU issues as well as potential compatibility issues with the extensions enumerated in the Appendix. There are also a few instances in which the document is not as clear as it could be, or includes optional capabilities that IMHO should either be made mandatory to implement or be eliminated. [DE] On MTU and optional capabilities, see below. However, I do not think that these concerns are sufficient to require the document to be published as Experimental. If and when TRILL is deployed, I expect that a number of issues will be found and that a -bis document will eventually need to be prepared. That is par for the course nowadays, and if this were to preclude consideration for Proposed Standard status, then we wouldn't have many new protocol specifications under consideration for PS. [DE] Thanks. In my view, the more relevant question with respect to status is whether the protocol is sufficiently well specified so as to preclude the introduction of widespread interoperability problems. Where a document could potentially introduce such problems and where initial deployment is likely to result in major changes to the design, an Experimental status would be warranted. In retrospect, such a Status would have been more appropriate for protocols such as SIP whose continual "evolution" has lead to persistent interoperability problems a decade after its initial introduction. However, TRILL is based on a mature routing protocol (IS-IS) with demonstrated interoperability, with some modest enhancements. Other than a few optional capabilities which could be made mandatory or eliminated, and some instances where the text needs to be clarified, the specification is relatively straightforward, and so it seems unlikely that we will see numerous major interoperability issues between TRILL implementations, on the scale of what we have seen with SIP. More likely is the potential for compatibility issues between TRILL and existing legacy bridges. However, rather than requiring the authors to enumerate all such potential issues and provide solutions prior to publication (which could create years of delay), a more practical approach would be for the document to more clearly enumerate the scenarios believed to be most suitable for initial deployment. [DE] The document could say that the use of optimal paths and multi-pathing are of more benefit the more mesh-like the network is. Timers STP has been superseded by RSTP in order to improve convergence times. Looking through this spec, I'm not clear whether TRILL was designed to compete with RSTP convergence times, and if so, what the default values should be. [DE] RSTP was standardized by 802.1w-2001, long before the proposal of TRILL. TRILL, as described in this specification, is the application of link state router technology to the VLAN aware customer bridging problem. RBridges are true routers, swapping the out MAC addresses on each hop as well as decrementing a hop count. I think it is better to view TRILL as a different technical approach with different characteristics than bridging rather than as some competition based just on the metric of convergence time. See further remarks below. Optional functionality By IETF standards, there is only a modest amount of optional functionality in this spec, but what there is (ESADI and options) doesn't seem compelling to me. Is this functionality really necessary, or is it in there only to provide "value add" (e.g. opportunities for non-interoperability)? [DE] See responses below. RBridge nicknames I understand why nicknames are needed (to avoid making the MTU issues even worse). However, we have learned with RFC 3927 that collisions within 16-bit spaces can be painful (e.g. collision probabilities can be quite high if you have a substantial number of Rbridges in the network). Overall, I think that nickname collisions (along with optional functionality such as ESADI and Options) represent one of the potential weaknesses in the spec, particularly since Rbridges are most attractive in situations (such as datacenters) where the number of RBridges could be large. Some tweaks in the algorithms are suggested below. [DE] Nicknames not only help with MTU, they also simplify fast path forwarding lookup, making the table indexes narrower. [DE] Thanks for providing a pointer to RFC 3927. [DE] There are significant differences between nickname collisions and RFC 3927 link local IPv4 address collisions. With RFC 3927 addresses, you have to pick an address to try from across the entire range, since you don't know which are in use. Furthermore, two hosts may simultaneously detect a collision and both re-try with new values. With nicknames, you pick a new nickname only from among those that were free, since you can see all the nicknames that are in use in the link state. And in case of a collision detected simultaneously by two RBridges, they can both see each other's priority and only one will retry. [DE] RFC3927 recommends against more than 1300 hosts on a link, saying that a new host connecting and picking an address at random has a 98% of avoiding collision. If an RBridge joins a campus and there is at least one nickname free, either the RBridge will wait until it has the link state and then assert the free nickname or, if it immediately asserts a nickname that collides, either it or, depending on priority, the RBridge that used to have that nickname, will switch directly to the free nickname. Of course, as far as I know, no one is planning to run a campus with anything like that many nicknames in use but, because of these differences, nickname resolution should be fine with an order of mangnitude more allocations than the RFC 3927 recommendation. And, since nicknames are only needed by RBridges as opposed to RFC 3927 IP addresses which are needed by end stations, there will normally be far fewer nicknames needed than IP addresses for a particular network size (see comments below in reference to more than one nickname per RBridge). Global uniqueness of MAC addresses We are now seeing situations in which some of the conventional assumptions made by IEEE 802.1 have broken down. One of these is the widespread deployment of virtualization within datacenters. In these scenarios, multiple MAC addresses may be assigned to a single (virtualized) NIC. To limit a potential explosion in demand for MAC addresses, MAC addresses can be assigned by management software from a vendor OUI, and as a result, MAC addresses are not guaranteed to be unique across VLANs. In reading through the document, it seems clear to me that the intent is for traffic to be routed by VLAN and MAC address, so that end stations are not required to have globally unique MAC addresses (although Rbridge MAC addresses do need to be globally unique). However, there are a number of instances in which the text is not as clear as it could be on this point (see below for detailed comments). [DE] Yes, as per below, there are a number of cases where it currently says "MAC address", or the like, and VLAN needs to be added so it says "VLAN and MAC address", or the like. Note that the ability of TRILL to handle non-globally unique endstation MAC addresses is IMHO a major advantage as compared to single-spanning tree switches, or even "Q in Q" provider bridges. Some of these advantages might be worth calling out. For example: 1. TRILL encapsulation potentially shields legacy bridges from learning MAC addresses which might cause problems (e.g. single spanning tree implementations). 2. TRILL encapsulation can shield "Q in Q" provider bridges from exposure to MAC address duplication which could occur when a provider needs to handle traffic from customers with their own distinct datacenters utilizing virtualization. 3. Greenfield TRILL deployments only require end station MAC addresses to be unique per VLAN. As a result, TRILL is virtualization-friendly, and the prohibitions on virtualization described in documents such as IEEE 802.1X-REV are not necessary in an Rbridge deployment. MTU issues While support for PMTU discovery is quite common within TCP implementations, the same is not true for UDP. Legacy implementations that lack support for IEEE 802.1AB-REV, and automatically set an Ethernet interface MTU to 1500 are quite widespread. In greenfield Rbridge installations designed to support a larger MTU between Rbridges, this should be a solvable problem. It also should be addressable in situations where Rbridges are installed alongside relatively new switches that support MTUs of 1530 or greater. However, I do wonder what issues could arise in situations where Rbridges are installed alongside switches that only support Ethernet frames of 1512 or 1516 octets. Reading the document, it was not clear whether MTU probing was "mandatory to implement" but I think this functionality should be mandatory, if only for diagnostic purposes. I also think that the algorithm for MTU determination could be better specified, perhaps by incorporating elements from RFC 4821. [DE] There are two issues for MTU: MTU for end station data and MTU for TRILL IS-IS control frames. [DE] There needs to be a campus wide lower limit for the MTU of inter-RBridge links to be sure of correct operation so that TRILL IS-IS control frames can get through. Each RBridge advertises its requested MTU for this purpose, the lowest value wins (but not less than 1470), and all RBridges SHOULD test the inter-RBridge links to see that their MTU meets or exceed this value. MTU for end station data frames is a whole different thing. The draft needs to make this clear. I'll suggest some wording changes on the mailing list. Port protocols There are a number of IEEE 802.1 protocols (such as IEEE 802.1X, IEEE 802.1AB, etc.) that relate to the state of a port and utilize a non-forwardable multicast address. The draft states that an Rbridge will not forward frames sent to a non-forwardable multicast address. This cleanly segments the problem in the sense that TRILL is put forward purely as a substitute for the spanning tree protocol, leaving the operation of port protocols intact. In such a model, the operation of IEEE 802.1X-2004 and IEEE 802.1AB-REV should not be affected; these protocols, if implemented at the edge Rbridge, should operate largely as they do today from the point of view of the end station. However, there are some corner cases in which limitations could arise. One example is an edge device such as wired VOIP handset with one or more wired Ethernet ports. These devices often do not implement "port protocols" such as IEEE 802.1X-2004, but instead operate like a TPMR, passing them through to the switch. I do not see this as a problem with the Rbridge specification per se, because the spec is attempting to mimic the behavior of a conventional switch in this (and other cases). However, it might be worth a sentence explicitly stating that -- and noting that potential extensions to other cases such as Virtual RBridges or Provider Rbridges are not excluded, but are left to future work. [DE] Yes, something about "extensions to support provider bridging services are left for future work" or the like, since this spec covers only customer bridging services, seems reasonable. I would also note that IEEE 802.1X-REV goes beyond "port access control" to defining "pseudo" and "virtual" ports. Virtual ports are created via MACSEC (IEEE 802.1AE) and pseudo ports involve MAC-based authentication state, so as to allow IEEE 802.1X supplicants to coexist on shared media. Among other things, pseudo and virtual ports introduce the notion of IEEE 802.1X traffic that could be destined to a unicast address (as in IEEE 802.11i) rather than to a non-forwardable multicast address. As is stated in the document, the location of TRILL in the IEEE 802.1 architecture makes it transparent to these extensions; however in places the document language is outdated and could be cleaned up (see below for examples). [DE] See discussion below re pseudo/virtual ports. Detailed Comments Abstract The design supports VLANs and optimization of the distribution of multi-destination frames based on VLAN and IP derived multicast groups. It also allows forwarding tables to be sized according to the number of RBridges (rather than the number of end nodes), which allows internal forwarding tables to be substantially smaller than in conventional bridges. [BA] Since core bridges can forward based on VLAN tags and not MAC addresses, this claim seems somewhat exaggerated. [DE] Well, perhaps that statement in the abstract should be limited to unicast forwarding information since RBridge multi-destination frame forwarding SHOULD also prune distribution based on looking at the VLAN and destination MAC address (although this pruning is only an optimization so things will work if it is not done). However, I'm not aware of any 802.1 VLAN aware bridging standards that have a bridge forward known unicast customer data frames based only on VLAN, ignoring MAC address. As far as I know, VLAN aware bridge forwarding lookups are 60-bits wide, a 12-bit VLAN plus 48-bit MAC address. Section 1 IEEE 802.1 bridges avoid these problems by transparently gluing many physical links into what appears to IP to be a single LAN [802.1D]. However, 802.1 bridge forwarding using the spanning tree protocol has some disadvantages: [BA] Throughout the document, I get the sense that TRILL is aiming at a target that is somewhat frozen in time. Today most new bridge implementations support RSTP, which offers considerably faster convergence. Shortest path bridging is progressing, etc. Overall, I'd like the document to be clearer about what advantages apply to classic STP, and which ones also apply to enhancements such as RSTP, shortest path bridging, etc. [DE] It seems to me that it is a good thing that the TRILL working group has had consistent goals, rather than constantly changing it goals. RSTP was standardized is 2001 by 802.1w, long before the TRILL effort started. See Section 2.3 of RFC 5556 (TRILL Problem and Applicability Statement) for some relevant comments. [BA] For example, it seems to me that TRILL has advantages over single spanning tree implementations in terms of ability to customize forwarding per VLAN. Based on the document, it seems that it may have some disadvantages in terms of convergence times and initialization behavior, as compared with RSTP. So overall, I think that Section 1 could do a better job of articulating the pros/cons of TRILL. In particular, it appears to me that some of the pros described in the problem statement document have not been realized, and some addition pros (such as virtualization support) have arisen. [DE] Convergence is a bit more complex that it seems at first and depends on engineering, implementation, which aspects of convergence are included, and how they are measured, as well as protocol. For example, one might expect RBridges engineered for rapid fail-over to also implement BFD for rapid failure detection. How would that be factored in? See also Section 2.3 of RFC 5556. in most cases they can incrementally replace IEEE [BA] The use of "most" here is somewhat vague. I might say "as described in Appendix A, they can incrementally replace IEEE..." [DE] Yes, even though Sections 1 and 2 are supposed to be general overview section, it seems like the use of general words like "most" causes some to jump to the conclusion that a more precise description isn't known, even though it is. The wording should be improved. While they can be applied to a variety of link protocols, this specification focuses on IEEE [802.3] links. [BA] This would seem to suggest that TRILL could be used on Token Ring. You might want to specifically exclude this usage (or interconnection with source routing bridges, for that matter). [DE] The next link type of interest among TRILL working group members beyond 802.3 seems to be PPP and Jim Carlson is working on a draft for that. But I don't see any reason to rule out yet other types of links. TRILL was intended to be used with a variety of link types. You would expect to have a separate document specifying how to handle each link type. I can't see any reason you couldn't use a Token Ring (802.5) link to interconnect some number of end stations, RBridge, and/or bridge ports. From the point of view of bridges, an RBridge is pretty much an end station. So, while I could be wrong, I would guess that a document specifying how an RBridge Token Ring port would work would say that it would handle receipt/transmission of frames with a token ring functional address the same as other token ring end stations. [DE] Wording should be added to the draft saying that a separate specification would be expected for each link type. [DE] Source routing bridges are something I know even less about. But RBridges look like end stations to bridges and terminate spanning tree. So, I would imagine that if they handled route explorer frames as if they were an end station, then RBridges would work with source routed bridges also. Section 1.2 Section 2: general RBridge description Section 3: the TRILL header Section 4: other TRILL protocol details In case of conflict, the order of precedence of these section is as follows, with those appearing earlier in this list having precedence over those that appear later: 4 > 3 > 2 [BA] "this list" is ambiguous. Do you mean the above list, in which case, Section 2 would take precedence over Section 4? Or do you mean Section 4 takes precedence over 2? I think you need to make this more clear. [DE] Thanks. Section 4 has highest precedence and 2 has lowest. The ambiguity in wording needs to be fixed. Section 1.3 1.3 Terminology and Notation in this document "TRILL" is the protocol specified herein while an "RBridge" is a devices that implement that protocol. The second letter in Rbridge is case insensitive. Both Rbridge and RBridge are correct. [BA] s/devices/device/ [DE] Yup, thanks. In this document, the term "link", unless otherwise qualified, means "bridged LAN", that is to say, the combination of one or more [802.3] links with zero or more brides, hubs, repeaters, or the like. The [BA] Do you really want to be combining IEEE 802.3 links with brides (or grooms)? Suggest changing "brides" to "bridges". [DE] Well, in some ways RBridges are the marriage of Routers and Bridges but you are probably right about the correction. The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119]. [BA] You might want to move this to earlier in the section (or give it its own sub-section). [DE] OK, it can be moved to the beginning of the section. Section 2 RBridges run a link state protocol amongst themselves. This gives them enough information to compute pair-wise optimal paths for unicast, and calculate distribution trees for delivery of frames either to unknown MAC destinations or to multicast/broadcast groups. [RBridges] [RP1999] [BA] I think you mean "unknown VLAN/MAC destination pairs" here, no? [DE] Yes, this is one of the cases where VLAN needs to be added to MAC address. used, within a campus, even by an RBridge that lacks an IP or other Layer 3 transport stack or which has zero configuration and thus no Layer 3 address, by transporting SNMP with Ethernet [RFC4789]. [BA] The term "zero configuration" is ambiguous here, since it might be construed to refer to an RFC 3927 "zero config" IP address within 169.254/16. In any case, a switch obtaining a dynamic IP address can still be "zero configuration", so I'm not sure what the point is. [DE] OK. The term "zero configuration" or an equivalent do not seem necessary here. And, in general, I think all remaining occurrences of "zero configuration" should be removed or replaced with "default configuration" or the like. Section 2.1 An RBridge, RB1, which is the VLAN-x forwarder on any of its links MUST learn the location of VLAN-x end nodes, both on the links for which it is VLAN-x forwarder, and on other links in the campus. RB1 learns the port and Layer 2 (MAC) addresses of end nodes on links for [BA] Later on it is made clear that we are talking about learning the VLAN as well as the port and MAC address. It's important to be consistent on this point throughout the document. [DE] Yes, even though it is speaking of the "VLAN-x forwarder", so there is an implicaiton that it is learning the MAC within VLAN-x, this should be made more explicit. Also, it can be more secure because not only might the enrollment be authenticated (for example by cryptographically based EAP methods via [802.1X]), but ESADI also supports cryptographic authentication of its messages [RFC5304]. [BA] Elsewhere the document makes it clear that IEEE 802.1X is implemented below TRILL so that they don't interact. Here you seem to be saying that ESADI and EAP could be related. If IEEE 802.1X operating on a port or source MAC does not allow non-802.1X frames to pass, then ESADI should not be announcing the unauthenticated source MAC, right? Note that where 802.1X frames are sent to a unicast address, they *will* be forwarded, and learning will occur; however, filtering will occur at the edge so as to prevent incoming or outgoing frames to/from unauthenticated MAC addresses. The potential for different behavior of learning and ESADI concerns me. [DE] Yes, 802.1X is implemented below TRILL but it mentions several places in the draft that information can flow between TRILL and such lower level protocols. For example, the RBridge's confidence in a locally learned address can be influenced by 802.1X authentication. [DE] The wording should be clarified but it isn't saying that 802.1X and ESADI are related. It is saying that address learning can be more secure for two reasons. The first reason is that addresses can be learned in conjunction with a cryptographically secured Layer 2 authentication protocol of which 802.1X is just an example. The second reason is that addresses can be securely transmitted through cryptographically secured ESADI messages. These two reasons are mostly orthogonal. Advertising end nodes using ESADI is optional, as is learning from these announcements. [BA] Does it make any sense to support Advertising but not learning, or learning and not advertising? If both are optional, then these combinations are possible. [DE] Learning from observing frames at the data level is the bridging mind-set way of doing things. Learning through the control plane, i.e. ESADI, is how you would do it coming from a router mind-set. For interoperability with default configuration, at least one of these two techniques needs to be mandatory to implement and enabled by default. In this case, data plane learning prevailed in the working group and is mandatory to implement and enabled by default. [DE] All combinations of ESADI advertising/learning are reasonable and no combination cause any significant interoperability problems if data plane learning is left enabled, as is the default. Things get more dicey is you disable data plane learning. But some users demand the ability to do things like disabling all learning and having only statically configured addressing information. [BA] In general, I'm not a fan of optional functionality. ESADI is optional, and it's not clear to me that the benefits outweigh the potential headaches. Is ESADI really necessary, particularly given the claims made about scaling with the number of Rbridges, not end nodes? [DE] If an end station is unplugged from one RBridge and plugged into another then, depending on circumstances, frames addressed to that end station can be black holed. This is, they can be sent to the older RBridge that the end station used to be connected to until cached address information at some remote RBridge(s) times out, possibly for a number of minutes or longer. With ESADI, the link interruption and establishment from the unplugging and plugging can cause immediate updates to be sent. [DE] The forwarding tables of transit RBridges scale with the number of RBridges rather than the number of end nodes since transit RBridges don't need to learn end station addresses. But ingress and egress RBridges do need to learn end station addresses for the VLANs for which they are an appointed forwarder on one or more ports so their tables related to encapsulation/decapsulation do scale with the number of VLAN scoped addresses. Whether you use data plane or control plan address learning doesn't have that much effect on scaling. Section 2.2 2. elimination of the need for original source and destination MAC address learning in transit RBridges; [BA] The specification doesn't discuss "destination MAC address learning". Is this a typo? [DE] Yeah, or maybe best to replace "original source and destination MAC" with "end station VLAN and MAC". 3. direction of frames towards the egress RBridge (this enables forwarding tables of RBridges to be sized with the number of RBridges rather than the total number of end nodes); and, [BA] I am unclear about the situations in which this claim applies. Are we only talking about core Rbridges, or edge ones as well? What about scaling properties when ESADI is implemented? [DE] Well, it says "forwarding tables". (1) There is an encapsulation process at the ingress RBridge. Ingress RBridges have to learn MAC addresses and VLANs of remote end stations in the VLANs for which they are ingressing native frames. This encapsulation database grows with the number of such remote end stations. (2) The ingress RBridge and each transit RBridge forwards the encapsulated data frame. This is what uses the forwarding table which scales with the number of RBridges. (3) There is a decapsulation process at the egress RBridge. Egress RBridges have to learn MAC addresses and VLANs of local end stations in the VLANs for which they are egressing native frames. This decapsulation database grows with the number of such local end stations. [DE] ESADI has only to do with the transport of addressing information, not the amount of such information any particular RBridge needs. So implementing ESADI has little effect on scaling. 2.2.1 Known-Unicast These frames have a unicast inner MAC destination address (Inner.MacDA) and are those for which ingress RBridge knows the egress RBridge for that destination MAC address. [BA] I think you mean for "VLAN/destination MAC address pair", no? [DE] Yes, another case where VLAN needs to be added. 2.2.2 1. unicast frames for which the destination is unknown: the Inner.MacDA is unicast, but the ingress RBridge does not know its location; [BA] do you mean "does not include an entry for the VLAN/destination MAC address pair"? [DE] Should have VLAN added and the wording should be tweaked. 3. multicast frames for which the Layer 2 destination address is not derived from an IP multicast address: the Inner.MacDA is multicast, and not from the set of Layer 2 multicast addresses derived from IPv4 or IPv6 multicast addresses; [BA] Does this work for *all* multicast addresses not derived from an IP multicast address (e.g. addresses used in provider bridging)? [DE] Frames addressed to the small number of special bridging/link/TRILL reserved addresses are handled specially. That exception should be added. Section 3.3 3.3 Reserved (R) The two R bits are reserved for future use in extensions to this version zero of the TRILL protocol. They MUST be initially set to zero, transparently copied by transit RBridges, and ignored on receipt. [BA] From this sentence, I'm not clear who is ignoring the R bits. Does the sentence just apply to transit RBridges or all RBridges? [DE] Good point. Should probably say "They MUST be set to zero when the TRILL header is added by in ingress RBridge, transparently copied but otherwise ignored by transit RBridges, and ignored by egress RBridges. Section 3.5 Note: Most RBridge implementations are expected to be optimized for the simplest and most common cases of frame forwarding and processing. The inclusion of any options may, and the inclusion of complex or lengthy options very likely will, cause frame processing using a "slow path" with markedly inferior performance to "fast path" processing. Limited slow path throughput may cause such frames to be lost. [BA] This makes a very good case for removing options from this specification. Do we really need this?? This seems like it will bring with it all the issues that options have in IPv4, and then some. [DE] There was some controversy about options and the above warning is probably over alarmist. The hard size limit on the options area was based on input from ASIC engineers. The base protocol draft contains only the minimum hooks for options and the working group consensus has been to include these hooks. To revisit this question at this point would cause substantial delay. Section 3.6 Although the RBridge MAY decrease the hop count by more than 1, under the circumstances described above, the RBridge forwarding a frame MUST decrease the hop count by at least 1, and discards the frame if it cannot do so because the hop count is 0. [BA] The MAY here seems dangerous. Could an implementation decrement hop count by 10? This seems like one of those situations where the spec should be more strict (e.g. SHOULD decrement by 1). Allowing a wide variety of behavior seems like it would be courting trouble. Is there value in allowing variation, and if so, please explain what that value is. [DE] The big fear is that, even with a hop count, multi-destination frames (multicast, etc.) in some sort of temporary loop can spawn multiple copies when they go through a distribution tree fork saturating your network. Known unicast frames are much less dangerous and TRILL considers the TTL mechanism adequate to keep them under control. It is because of this concern with multi-destination frames that they are subject to the mandatory stringent reverse path forwarding check (RPFC), which should, in conjunction with their hop count, keep multi-destination frames under control. [DE] So, assume, say, a TRILL encapsulated broadcast frames arrives at RBridge X via port-1 with a hop count of 13 while on a distribution tree such that RBridge X will forward two copies of the frame, one out of port-2 and one out of port-3. Assume that the most distant RBridge down the port-2 branch of this distribution tree is 12 hops away. But the most distant RBridge down the port-3 branch is only 2 hops away. The RBridge MUST decrease the hop count by at least one, which in this case, still leaves just enough to complete the distribution on the max-12 hop branch. But the language you are referring to permits the RBridge to reduce the hop count by more, up to 10 in this case, for the max-2 hop branch out of port-3, as long as it still has enough hop count left in that copy for complete distribution. But it always has to reduce it by at least one even if that doesn't leave a big enough hop count for complete distribution. This is an optional extra safety measure to control multi-destination frames, the most dangerous kind of frame. Section 3.7.3 o Nickname values MAY be configured. An RBridge that has been configured with one or more nickname values will have priority for those nickname values over all Rbridges with non-configured nicknames. [BA] RFC 3927 does not permit static assignment of link scope addresses because it was feared that this would lead to implementations ignoring collision detection. I realize that configured nicknames get priority, but it still seems like a good idea for an Rbridge to test for conflict before configuring the nickname, so as to avoid a potential conflict, no? [DE] A number of working group members believe that some customers insist on being able to configure everything and would want all the RBridges in their campus to have pre-assigned nicknames. If it isn't provided for in the spec, it will be implemented in a variety of proprietary ways. Under the provisions of this section, I don't see how it can do any harm. The worst someone could do is manually configure all their RBridges with the same nickname. Then the nickname procedure would sort this all out and would force all the but the highest nickname priority RBridge to change nickname, regardless of the configuration. o Each RBridge is responsible for ensuring that its nickname or each of its nicknames is unique. If RB1 chooses nickname x, and RB1 discovers, through receipt of RB2's LSP, that RB2 has also chosen x, then the RBridge with the numerically higher priority keeps the nickname, or if there is a tie in priority, the RBridge with the numerically higher IS-IS System ID keeps the nickname, and the other RBridge MUST select a new nickname. This can require an RBridge with a configured nickname to select a replacement nickname. [BA] Given that a configured nickname might need to select a replacement, what is the value of supporting configuration? [DE] So that, if you properly configure it, you know what nicknames particular RBridges will have. Of course, if you improperly configure them, that is, configure conflicts, then your configuration efforts were not very useful. [BA] Also, this would suggest that even configured nicknames need to test for uniqueness prior to configuration, rather than relying on increased priority due to aging (which is undefined in the spec) to avoid forcing nodes that have been using a nickname for a long time to change. [DE] I believe the spec makes it clear that every RBridge in the campus has to check that their nickname isn't duplicated elsewhere by a higher nickname priority RBridge. The time to make that check and the only time it makes sense to do such a check is when you receive an LSP that changes what you think some other RBridges nickname and/or nickname priority are. [BA] Also, just because RB1 gets RB2's LSP doesn't mean that RB2 simultaneously has RB1's LSP. So one side can assume that the other will give up the nickname, but that might not happen for a while. Do you want to encourage RB1 to send its LSP right away after it detects a collision so that the information asymmetry is quickly corrected? [DE] This really seems to me to be something best left to implementations and IS-IS. Generally speaking, you send LSPs when something changes or you get a sequence number PDU from your neighbor indicating that their link state is incomplete or out of date, in which case you send them what they are missing or have only an older copy of. Sure, there can be time delays, but there is no reason in your example above to think that R1 and R2 are directly connected. IS-IS reliable flooding assures that every RBridge will end up with a complete copy of the core link state no matter how long ago an LSP changed. Spontaneously sending a redundant copy of your LSP that has already been sent won't speed things up. Spontaneously sending your LSP right away when you change something in it is good, but that's what you normally do anyway. o To minimize the probability of nickname collisions, when an RBridge selects a new nickname, it does so by randomly hashing some of its parameters, e.g., interface MAC addresses, time and date, and other entropy sources such as those given in [RFC4086]. There is no reason for all Rbridges to use the same algorithm for selecting nicknames. [BA] Randomness isn't required to reduce collision probabilities. It's only necessary for the distribution within the space to be uniform. RFC 3927 doesn't require randomness because it was felt that this would just increase the difficulty of debugging with no net benefit. I'd suggest that you rethink this. [DE] Well, you need a unique seed to start with although you could, as RFC 3927 suggests, use a pseudo-random number generator thereafter. For nicknames, the distribution isn't over the space of all nicknames but only over the nicknames that appear to be available based on the link state database held by the RBridge selecting a new nickname. An RBridge MAY request multiple nicknames so that it can be the root of multiple trees for multipathing of multi-destination frames. These trees would all be shortest path trees from the RBridge but, since the tree number is used in tie breaking when there are multiple equal cost paths (see Section 4.5.1), the different trees will likely utilize different links. [BA] In a deployment with a substantial number of Rbridges, the collision probability will already be high. If each Rbridge has multiple nicknames, it will impact scaling in a negative way. Have you thought this through? For example, in a situation where each Rbridge has 16 nicknames, you might start seeing high collision probabilities with as few as 16 Rbridges. I'd suggest that this practice be NOT RECOMMENDED. [DE] It is expected that very few RBridges in a campus would have multiple nicknames. This would have to be configured by the network manager since one nickname is the default. If a network manager was using this feature, they would pick a few RBridges that, becasue of the network topology, were particularly good places from which to calculate multiple different shortest path distribution trees. Such trees need separate nicknames so traffic can be multipathed across them. [DE] 16 RBridges each with 16 nicknames isn't going to cause much of a collision probability, at least in my opinion. That's only 256 nicknames. For example, assume a huge campus with 10,000 RBridges with random nicknames assigned to those RBridges. This would mean that about ~1540 RBridges would have an initial colliding nickname so the ~770 lower priority of these RBridges would pick new nikcnames from the ~54,230 available nicknames. I think this would be expected to result in ~758 of them picking a nickname not picked by any of the other RBridges and ~12 experiencing a second collision. So ~6 of the lower priority of these second colliders would pick a new nickname out of then ~53,466 available nicknames with an expected probability of 99.93% that all 6 would pick non-conflicting names on their second try. Less than one in a thousand times, you would have to go to a third round. Is that sort of behavior OK? I suppose it depends on the application but I think it would be fine for the initial start-up of most data centers. Section 4.1.1 Frames with the same source address, destination address, VLAN, and priority that are received on the same port as each other and are transmitted on the same port MUST be transmitted in the order received unless the RBridge classifies the frames into more fine grained flows, in which case this ordering requirement applies to each such flow. (Such frames might not be sent out the same port if multipath is implemented. See Appendix C.) [BA] Do you mean "granular"? [DE] I don't see much difference between "fine grained" and "finely granular" but I think the first, which is the current wording, reads better. Section 4.2.4.3 o Loop avoidance: - Inhibiting itself for a configurable time from zero to 30 seconds, which defaults to 30 second, after it sees a root bridge change on the link (see Section 4.9.3.2). [BA] The 30 second default might make sense where RBridges are deployed alongside STP, but is this default needed where the legacy bridges run RSTP? Overall, I'm curious as to how the failover times in TRILL will compare to RSTP. The spec doesn't say much about this. [DE] 30 seconds was chosen as the default for safety. It's configurable so, for example, if you know that all your bridges are RSTP and in the same room with low transmission delays, you can configure it down. Although an RBridge can see if it is receiving RSTP BPDUs from immediately adjacent bridges, it would be very hard for an RBridge to assure itself that RSTP is being used on all the links interior to an attached bridged LAN. Failover times are more complex that might seem at first, especially if you include questions related to the updating of learned addresses, and depend on the engineering and implemention of the specific devices, the way convergence is measured, and the specific circumstances, as well as on protocols. 4.2.5.2 TRILL ESADI Information The information in ESADI is the list of local end station MAC addresses known to the originating RBridge and, for each such address, a one octet unsigned "confidence" rating in the range 0-254 (see Section 4.8). In order to make it practical to optionally provide for VLAN ID translation, as specified in a separate document, TRILL ESADI frames MUST NOT contain the VLAN ID in the body of the frame after the Inner.VLAN tag. [BA] Does VLAN ID translation really require support for ESADI? [DE] What the draft tries to say here is that, to support VLAN ID translation, TRILL ESADI frames, if used, are subject to a restriction. It says nothing about a requirement to support ESADI. This comment was included so that, when designing the encoding of or extending ESADI, nothing would be done that would break VLAN translation. The wording should be clarified. Section 4.3.1 Sz is determined by having each RBridge (optionally) advertise, in its LSP, its assumption of the value of the campus-wide Sz. This LSP element is known in IS-IS as the originatingLSPBufferSize, TLV #14. The default and minimum value for Sz, and the implicitly advertised value of Sz if the TLV is absent, is 1470 bytes. [BA] Given the potential headaches that can be caused by MTU issues, I wonder whether the spec couldn't be tightened. If implementations don't advertise Sz, then it seems like we could end up with a 1470 octet MTU limitation on endstations, even where this might not be necessary (e.g. MTU discovery could enable a larger MTU). This seems like it could cause headaches where it wouldn't be necessary. [DE] This needs to be clarified. The whole MTU feature was motivated by problems with TRILL IS-IS frames on inter-RBridge links. The only thing Sz limits is the size of PDUs generated for TRILL IS-IS (except for MTU-probe/ack PDUs). This is needed to assure proper operation. It doesn't really have anything to do with MTU on links to end stations. There is no way provided in TRILL or IS-IS to communicate an MTU to an end station. See more below. [BA] Would it make sense to require Sz to be advertised where MTU discovery finds a larger MTU size than 1470? [DE] What do you mean "advertised"? Each RBridge calculates Sz, which is the maximum size for all TRILL IS-IS PDUs including LSP (link state PDUs) but, of course, excluding MTU-probes/acks that can be bigger than Sz for testing. Each RBridge calculates this by calculating the minimum originatingLSPBufferSize advertised in the link state by any RBridge but not less than 1470. There is also a facility for advertising the MTU of links as determined by the MTU probe and ack. This is advertised with other link information in the link state database. Section 4.3.2 There are two new TRILL IS-IS message types for use between pairs of RBridge neighbors to test the bidirectional packet size capacity of their connection. These messages are: -- MTU-probe -- MTU-ack Both the MTU-probe and the MTU-ack are padded to the size being tested. [BA] This section doesn't say whether support for MTU-probe is mandatory. The way it is written, I'd be concerned that implementers would not support it, and that lack of MTU discovery capability would cause problems in deployments. [BA] This section might also benefit from a more detailed specification of the MTU discovery algorithm (such as incorporating elements from the Packetization Layer Path MTU RFC). [DE] This isn't a path MTU determination. It is a link MTU determination only applied to Inter-RBridge links. With 802.3 links, which is what this draft is aimed at, as long as Sz is at the default 1470 bytes, TRILL IS-IS PDUs necessary for proper operation should get through. But the draft says that RBridges SHOULD check. Section 4.6 Source address information ( { VLAN, Outer.MacSA, port } ) is learned from any frame with a unicast sources address (see Section 4.8). [BA] Good to see this clearly stated here. As noted earlier, there are places in the document where this is not as clear. [DE] Yes, hopefully all such places will be fixed. Section 4.6.1.1 4. If a unicast destination address is unknown, RB1 handles the frame as described in Section 4.6.1.2 for a broadcast frame except that the Inner.MacDA is the original native frame's unicast destination address. [BA] Do you mean "if a VLAN/unicast destination address pair is unknown"? [DE] Yes, hopefully all such places will be fixed. Section 4.6.2.4 If RBn is a transit RBridge the hop count is decremented by one and the frame forwarded to the next hop RBridge towards the egress RBridge. The Inner.VLAN and ingress nickname are not examined by a transit RBridge when it forwards a known unicast TRILL data frame. [BA] Elsewhere it says that the hop count MAY be decremented by more than one. [DE] The only case where an RBridge is permitted to decrease the hop count by more than one is when it forwards a multi-destination frame onto a branch of the frame's distribution tree. As discussed above, in that case, it can reduce the hop count to the distance to the most remote RBridge in the distribution tree branch. Section 4.6.2.4 that you are commenting on concerns the handling of known unicast frames. RBridges are not permitted to decrease their hop count by more than 1. Section 4.8.1 3. By Layer 2 registration protocols learning the { source MAC, VLAN, port } of end stations registering at a local port. [BA] Are you referring to IEEE 802.11 association here? This won't tell you the VLAN of the end-station (this might not be determined until after authentication). [DE] This provision is not meant to be limited to any particular existing Layer 2 registration protocol, whether 802.11 or 802.16 or whatever, and is intended to include Layer 2 registration protocols specified in the future, where appropriate. "IEEE 802.11 association and authentication" is used as an example in Section 2.1 and Section 4.2.4.3. "Authentication" is specifically included in the current draft wording where 802.11 is used as an example. Section 4.8 Although outside the scope of this specification, there are some Layer 2 features in which a set of VLANs has shared learning, where one of the VLANs is the "primary" and the other VLANs in the group are "secondaries". [BA] One concern about this section is that it might be construed to permit trees shared between VLANs. You might make it clear that this is not the intent. [DE] This section relates to VLAN/MAC address learning where the separate identities of some number of multiple VLAN are merged, that is, what 802.1 calls SVL (Shared VLAN Learning), and that should probably be clarified. However, I'm not sure what "trees" you are talking about. If you mean TRILL multi-destination frame distribution trees, they are always shared across all VLANs. Section 4.9.1 [BA] Given the earlier discussion of "zero config" you might put in a sentence or two indicating that the configuration bits are only needed for special circumstances. You might also state what the default setting of the bits is. [DE] I'm not sure that's true. I can easily see a network manager adopting a policy (and a configuration where this policy is reasonable), that all ports in their RBridge campus will be configured as either trunk or access. Would that be "special circumstances"? But giving default values is probably a good idea. Section 4.9.2 Low-level control frames are handled in the lower level port/link control logic in the same way as in an [802.1Q-2005] bridge. This can optionally include a variety of 802.1 or link specific protocols such as link layer discovery, link aggregation (Clause 43 of [802.3]), MAC security [802.1AE], or port based access control [802.1X]. While handled at a low level, these frames may affect higher level processing. For example, a Layer 2 registration protocol may affect the confidence in learned addresses. The upper interface [BA] IEEE 802.1X-REV is no longer purely "port based", since it supports "pseudo" and "virtual" ports as well. [DE] It seems to me that the right thing is to add a definition of "port" to Section 1.3 that makes it clear that the unadorned word "port" includes pseudo and virtual ports. In addition, some references, where appropriate, to something as being implemented "in ports" could be changed to saying implemented "below the EISS layer" or the like. Section 6 IEEE 802.1 port admission and link security mechanisms, such as [802.1X] and [802.1AE], can also be used. These are best thought of as being implemented within a port and are outside the scope of TRILL (just as they are generally out of scope for bridging standards [802.1D] and 802.1Q); however, TRILL can make use of secure registration through the confidence level communicated in optional TRILL ESADI (see Section 4.8). [BA] Neither IEEE 802.1AE nor IEEE 802.1X-REV are based on physical ports. Instead, I'd refer to the diagrams which make it clear that these mechanisms operate below TRILL. [DE] OK. See also response immediately above. Section A.3.4 The spanning tree solution does quite well in this particular case. But it depends on both RB1 and RB2 having implemented the optional feature of being able to configure a port to emit BPDUs as described in Section A.3.3 above. It also makes the bridged LAN whose partition [BA] This somewhat begs the question about whether being able to emit BPDUs should be optional, recommended or mandatory. [DE] The text in Appendix A should be changed to make it clear that it is only talking about spanning tree BPDUs. As per Section 4.9.3.3 of the draft, there are conditions under which RBridges SHOULD send topology change BPDUs and RBridges MAY send spanning tree BPDUs. The implementation requirement key words appear in the body of the draft. Appendix C When multipathing is used, frames that follow different paths will be subject to different delays and may be re-ordered. While some traffic may be order/delay insensitive, typically most traffic consists of flows of frames where re-ordering within a flow is damaging. How to determine flows or what granularity flows should have is beyond the scope of this document but, as an example, under many circumstances it would be safe to consider all the frames flowing between a particular pair of end station ports to be a flow. [BA] I think you have to say more here, given that Ethernet invariants include ordering requirements. Certainly considering all frames between a pair of end stations to be a flow would be conservative, but this isn't the Ethernet requirement, right? [DE] It seems like there are two possible responses to your comment. [DE] One is to add more detail and complexity. The next step in that direction would probably be to add priority and VLAN so it would say "... all the frames of the same priority and in the same VLAN flowing between ...". But I think that would invite further complaints leading to further details and yet more complexity in what is supposed to just be a dead simple, ultra-conservative example. [DE] The alternative, which I prefer, is to simplify. Just say "How to determine flows or what granularity they should have is beyond the scope of this document." dropping the example. Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com From d3e3e3 at gmail.com Mon Sep 21 05:51:50 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Mon, 21 Sep 2009 08:51:50 -0400 Subject: [rbridge] Comments on draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: References: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> <292BD6E016C29243BB231D329D626C8C02D236E5@USDALSMBS05.ad3.ad.alcatel.com> <1028365c0909200732m77156393m9b72c6c5192e742f@mail.gmail.com> Message-ID: <1028365c0909210551j6b89b06ej3e066a85c36f2e5b@mail.gmail.com> Hi Bernard, On Sun, Sep 20, 2009 at 3:50 PM, Bernard Aboba wrote: > >... > >> [DE] There are significant differences between nickname collisions and >> RFC 3927 link local IPv4 address collisions. With RFC 3927 addresses, >> you have to pick an address to try from across the entire range, since >> you don't know which are in use. Furthermore, two hosts may >> simultaneously detect a collision and both re-try with new >> values. With nicknames, you pick a new nickname only from among those >> that were free, since you can see all the nicknames that are in use in >> the link state. And in case of a collision detected simultaneously by >> two RBridges, they can both see each other's priority and only one >> will retry. > > [BA]? Yes, this does make a very big difference.? I wasn't clear about this > based on my reading the spec.? Maybe it would be worth including some > wording explaining the above? [DE] Sure, how about the following: "To minimize the probability of nickname collisions, an RBridge selects a nickname randomly from the apparently available nicknames, based on its copy of the link state. This random selection can be by the RBridge hashing some of its parameters, e.g., SystemID, time and date, and other entropy sources, such as those given in [RFC4086], each time or by the RBridge using such hashing to create a seed and making any selections based on pseudo-random numbers generated from that seed [RFC4086]. The random numbers or seed and the algorithm used SHOULD make uniformly distributed selections over the available nicknames. Convergence to a nickname-collision free campus is accelerated by selecting new nicknames only from those that appear to be available and by having the highest priority nickname involved in a nickname conflict retain its value. There is no reason for all Rbridges to use the same algorithm for selecting nicknames." > >... > >> [DE] Well, perhaps that statement in the abstract should be limited to >> unicast forwarding information since RBridge multi-destination frame >> forwarding SHOULD also prune distribution based on looking at the VLAN >> and destination MAC address (although this pruning is only an >> optimization so things will work if it is not done). However, I'm not >> aware of any 802.1 VLAN aware bridging standards that have a bridge >> forward known unicast customer data frames based only on VLAN, >> ignoring MAC address. As far as I know, VLAN aware bridge forwarding >> lookups are 60-bits wide, a 12-bit VLAN plus 48-bit MAC address. > > [BA] The lookups will typically include the MAC address because switches > are typically designed so that they could be connected to edge ports.? But > a switch that was only to be used in the core could do the lookups based > purely on the VLAN and still function correctly.? My point was that core > state is not required to be dependent on the number of MAC addresses, > either for Rbridges or conventional bridges. [DE] OK. >> [DE] Convergence is a bit more complex that it seems at first and >> depends on engineering, implementation, which aspects of convergence >> are included, and how they are measured, as well as protocol. For >> example, one might expect RBridges engineered for rapid fail-over to >> also implement BFD for rapid failure detection. How would that be >> factored in? See also Section 2.3 of RFC 5556. > > [BA] Agreed.? RSTP was not mentioned, so that this made me wonder > whether it was somehow not being factored in.? The spec might say > upfront that the term "spanning tree" applies to both STP and RSTP, > and that Rbridges can make use of IS-IS implementations engineered > for rapid convergence. [DE] OK. > >... > >> [DE] If an end station is unplugged from one RBridge and plugged into >> another then, depending on circumstances, frames addressed to that end >> station can be black holed. This is, they can be sent to the older >> RBridge that the end station used to be connected to until cached >> address information at some remote RBridge(s) times out, possibly for >> a number of minutes or longer. With ESADI, the link interruption and >> establishment from the unplugging and plugging can cause immediate >> updates to be sent. > > [BA] Thanks for the explanation.? This might be useful to include in the > document. [DE] OK. > >... > >> [DE] So, assume, say, a TRILL encapsulated broadcast frames arrives at >> RBridge X via port-1 with a hop count of 13 while on a distribution >> tree such that RBridge X will forward two copies of the frame, one out >> of port-2 and one out of port-3. Assume that the most distant RBridge >> down the port-2 branch of this distribution tree is 12 hops away. But >> the most distant RBridge down the port-3 branch is only 2 hops >> away. The RBridge MUST decrease the hop count by at least one, which >> in this case, still leaves just enough to complete the distribution on >> the max-12 hop branch. But the language you are referring to permits >> the RBridge to reduce the hop count by more, up to 10 in this case, >> for the max-2 hop branch out of port-3, as long as it still has enough >> hop count left in that copy for complete distribution. But it always >> has to reduce it by at least one even if that doesn't leave a big >> enough hop count for complete distribution. This is an optional extra >> safety measure to control multi-destination frames, the most dangerous >> kind of frame. > > [BA] From the wording, I wasn't clear that the MAY only applied to > multi-destination frames, not unicast. [DE] OK, some words to make this clear should be added. >> [DE] Well, you need a unique seed to start with although you could, as >> RFC 3927 suggests, use a pseudo-random number generator >> thereafter. For nicknames, the distribution isn't over the space of >> all nicknames but only over the nicknames that appear to be available >> based on the link state database held by the RBridge selecting a new >> nickname. > > [BA] This wasn't clear to me from the document.? Might it make sense to > state that? [DE] I think you correctly interpreted the current wording as assuming a random number generation each time. See suggested new wording above for the relevant bullet point in the draft. >> [DE] What the draft tries to say here is that, to support VLAN ID >> translation, TRILL ESADI frames, if used, are subject to a >> restriction. It says nothing about a requirement to support >> ESADI. This comment was included so that, when designing the encoding >> of or extending ESADI, nothing would be done that would break VLAN >> translation. The wording should be clarified. > > [BA] OK.? This wasn't clear to me from the existing wording. [DE] Wording should be added to clarify this. >> [DE] This needs to be clarified. The whole MTU feature was motivated >> by problems with TRILL IS-IS frames on inter-RBridge links. The only >> thing Sz limits is the size of PDUs generated for TRILL IS-IS (except >> for MTU-probe/ack PDUs). This is needed to assure proper operation. It >> doesn't really have anything to do with MTU on links to end stations. >> There is no way provided in TRILL or IS-IS to communicate an MTU to an >> end station. See more below. > > [BA] OK.? I wasn't clear that this wasn't about end station MTU. [DE] OK. I think there was some text and a diagram in the -12 draft that can possibly be re-instated to help this. > > ... > >> [DE] The alternative, which I prefer, is to simplify. Just say "How to >> determine flows or what granularity they should have is beyond the >> scope of this document." dropping the example. > > [BA] I agree that getting into a lot of detail is probably not a good idea. > However, you could refer to an existing document (such as Link Aggregation) > that discusses the issue. [DE] OK, that's a reasonable solution. Thanks, Donald From d3e3e3 at gmail.com Mon Sep 21 05:59:48 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Mon, 21 Sep 2009 08:59:48 -0400 Subject: [rbridge] Fwd: Comments on draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: References: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> <292BD6E016C29243BB231D329D626C8C02D236E5@USDALSMBS05.ad3.ad.alcatel.com> <1028365c0909200732m77156393m9b72c6c5192e742f@mail.gmail.com> Message-ID: <1028365c0909210559n78551092vcbc81d848661d4cc@mail.gmail.com> For some reason Bernard's message below does not seem to be in the mailing list archives so I'm forwarding another copy to the list. As a result, the message below and my response to it will probably appear out of order in the list... Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com ---------- Forwarded message ---------- From: Bernard Aboba Date: Sun, Sep 20, 2009 at 3:50 PM Subject: RE: Comments on draft-ietf-trill-rbridge-protocol-13.txt To: d3e3e3 at gmail.com Cc: rbridge at postel.org, rdroms at cisco.com, Jari Arkko , "dromasca at avaya.com" > [DE] The document could say that the use of optimal paths and > multi-pathing are of more benefit the more mesh-like the network is. [BA] That might be helpful. > [DE] RSTP was standardized by 802.1w-2001, long before the proposal of > TRILL. TRILL, as described in this specification, is the application > of link state router technology to the VLAN aware customer bridging > problem. RBridges are true routers, swapping the out MAC addresses on > each hop as well as decrementing a hop count. I think it is better to > view TRILL as a different technical approach with different > characteristics than bridging rather than as some competition based > just on the metric of convergence time. See further remarks below. [BA] That's reasonable -- and your later comments make it clear that TRILL implementations could support rapid convergence. > [DE] There are significant differences between nickname collisions and > RFC 3927 link local IPv4 address collisions. With RFC 3927 addresses, > you have to pick an address to try from across the entire range, since > you don't know which are in use. Furthermore, two hosts may > simultaneously detect a collision and both re-try with new > values. With nicknames, you pick a new nickname only from among those > that were free, since you can see all the nicknames that are in use in > the link state. And in case of a collision detected simultaneously by > two RBridges, they can both see each other's priority and only one > will retry. [BA]? Yes, this does make a very big difference.? I wasn't clear about this based on my reading the spec.? Maybe it would be worth including some wording explaining the above? > [DE] Yes, as per below, there are a number of cases where it currently > says "MAC address", or the like, and VLAN needs to be added so it says > "VLAN and MAC address", or the like. [BA] OK. > [DE] Yes, something about "extensions to support provider bridging > services are left for future work" or the like, since this spec covers > only customer bridging services, seems reasonable. [BA] OK > [DE] Well, perhaps that statement in the abstract should be limited to > unicast forwarding information since RBridge multi-destination frame > forwarding SHOULD also prune distribution based on looking at the VLAN > and destination MAC address (although this pruning is only an > optimization so things will work if it is not done). However, I'm not > aware of any 802.1 VLAN aware bridging standards that have a bridge > forward known unicast customer data frames based only on VLAN, > ignoring MAC address. As far as I know, VLAN aware bridge forwarding > lookups are 60-bits wide, a 12-bit VLAN plus 48-bit MAC address. [BA] The lookups will typically include the MAC address because switches are typically designed so that they could be connected to edge ports.? But a switch that was only to be used in the core could do the lookups based purely on the VLAN and still function correctly.? My point was that core state is not required to be dependent on the number of MAC addresses, either for Rbridges or conventional bridges. > [DE] Convergence is a bit more complex that it seems at first and > depends on engineering, implementation, which aspects of convergence > are included, and how they are measured, as well as protocol. For > example, one might expect RBridges engineered for rapid fail-over to > also implement BFD for rapid failure detection. How would that be > factored in? See also Section 2.3 of RFC 5556. [BA] Agreed.? RSTP was not mentioned, so that this made me wonder whether it was somehow not being factored in.? The spec might say upfront that the term "spanning tree" applies to both STP and RSTP, and that Rbridges can make use of IS-IS implementations engineered for rapid convergence. > [DE] Yes, even though Sections 1 and 2 are supposed to be general > overview section, it seems like the use of general words like "most" > causes some to jump to the conclusion that a more precise description > isn't known, even though it is. The wording should be improved. [BA] OK > [DE] Wording should be added to the draft saying that a separate > specification would be expected for each link type. [BA] OK > [DE] Source routing bridges are something I know even less about. But > RBridges look like end stations to bridges and terminate spanning > tree. So, I would imagine that if they handled route explorer frames > as if they were an end station, then RBridges would work with source > routed bridges also. [BA] I don't know much about source routing bridges either, so I'll leave discussion of that to someone who does.? But if compatibility with those aren't an explicit goal, it might make sense to say so. > [DE] Thanks. Section 4 has highest precedence and 2 has lowest. The > ambiguity in wording needs to be fixed. [BA] OK > [DE] OK. The term "zero configuration" or an equivalent do not seem > necessary here. And, in general, I think all remaining occurrences of > "zero configuration" should be removed or replaced with "default > configuration" or the like. [BA] OK > [DE] Yes, even though it is speaking of the "VLAN-x forwarder", so > there is an implicaiton that it is learning the MAC within VLAN-x, > this should be made more explicit. [BA] OK > [DE] If an end station is unplugged from one RBridge and plugged into > another then, depending on circumstances, frames addressed to that end > station can be black holed. This is, they can be sent to the older > RBridge that the end station used to be connected to until cached > address information at some remote RBridge(s) times out, possibly for > a number of minutes or longer. With ESADI, the link interruption and > establishment from the unplugging and plugging can cause immediate > updates to be sent. [BA] Thanks for the explanation.? This might be useful to include in the document. > [DE] Yeah, or maybe best to replace "original source and destination > MAC" with "end station VLAN and MAC". [BA] That makes sense. > [DE] Frames addressed to the small number of special > bridging/link/TRILL reserved addresses are handled specially. That > exception should be added. [BA] OK. > [DE] Good point. Should probably say "They MUST be set to zero when > the TRILL header is added by in ingress RBridge, transparently copied > but otherwise ignored by transit RBridges, and ignored by egress > RBridges. [BA] OK > [DE] There was some controversy about options and the above warning is > probably over alarmist. The hard size limit on the options area was > based on input from ASIC engineers. The base protocol draft contains > only the minimum hooks for options and the working group consensus has > been to include these hooks. To revisit this question at this point > would cause substantial delay. [BA] I thought the wording was fine, actually. > [DE] So, assume, say, a TRILL encapsulated broadcast frames arrives at > RBridge X via port-1 with a hop count of 13 while on a distribution > tree such that RBridge X will forward two copies of the frame, one out > of port-2 and one out of port-3. Assume that the most distant RBridge > down the port-2 branch of this distribution tree is 12 hops away. But > the most distant RBridge down the port-3 branch is only 2 hops > away. The RBridge MUST decrease the hop count by at least one, which > in this case, still leaves just enough to complete the distribution on > the max-12 hop branch. But the language you are referring to permits > the RBridge to reduce the hop count by more, up to 10 in this case, > for the max-2 hop branch out of port-3, as long as it still has enough > hop count left in that copy for complete distribution. But it always > has to reduce it by at least one even if that doesn't leave a big > enough hop count for complete distribution. This is an optional extra > safety measure to control multi-destination frames, the most dangerous > kind of frame. [BA] From the wording, I wasn't clear that the MAY only applied to multi-destination frames, not unicast. > [DE] Well, you need a unique seed to start with although you could, as > RFC 3927 suggests, use a pseudo-random number generator > thereafter. For nicknames, the distribution isn't over the space of > all nicknames but only over the nicknames that appear to be available > based on the link state database held by the RBridge selecting a new > nickname. [BA] This wasn't clear to me from the document.? Might it make sense to state that? > [DE] What the draft tries to say here is that, to support VLAN ID > translation, TRILL ESADI frames, if used, are subject to a > restriction. It says nothing about a requirement to support > ESADI. This comment was included so that, when designing the encoding > of or extending ESADI, nothing would be done that would break VLAN > translation. The wording should be clarified. [BA] OK.? This wasn't clear to me from the existing wording. > [DE] This needs to be clarified. The whole MTU feature was motivated > by problems with TRILL IS-IS frames on inter-RBridge links. The only > thing Sz limits is the size of PDUs generated for TRILL IS-IS (except > for MTU-probe/ack PDUs). This is needed to assure proper operation. It > doesn't really have anything to do with MTU on links to end stations. > There is no way provided in TRILL or IS-IS to communicate an MTU to an > end station. See more below. [BA] OK.? I wasn't clear that this wasn't about end station MTU. > [DE] The only case where an RBridge is permitted to decrease the hop > count by more than one is when it forwards a multi-destination frame > onto a branch of the frame's distribution tree. As discussed above, in > that case, it can reduce the hop count to the distance to the most > remote RBridge in the distribution tree branch. Section 4.6.2.4 that > you are commenting on concerns the handling of known unicast > frames. RBridges are not permitted to decrease their hop count by more > than 1. [BA] This makes sense -- I just wasn't clear about this based on my reading of the text. > [DE] It seems to me that the right thing is to add a definition of > "port" to Section 1.3 that makes it clear that the unadorned word > "port" includes pseudo and virtual ports. In addition, some > references, where appropriate, to something as being implemented "in > ports" could be changed to saying implemented "below the EISS layer" > or the like. [BA] OK. > [DE] The text in Appendix A should be changed to make it clear that it > is only talking about spanning tree BPDUs. As per Section 4.9.3.3 of > the draft, there are conditions under which RBridges SHOULD send > topology change BPDUs and RBridges MAY send spanning tree BPDUs. The > implementation requirement key words appear in the body of the draft. [BA] OK. > [DE] The alternative, which I prefer, is to simplify. Just say "How to > determine flows or what granularity they should have is beyond the > scope of this document." dropping the example. [BA] I agree that getting into a lot of detail is probably not a good idea. However, you could refer to an existing document (such as Link Aggregation) that discusses the issue. From d3e3e3 at gmail.com Tue Sep 22 11:08:26 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Tue, 22 Sep 2009 14:08:26 -0400 Subject: [rbridge] Comments on draft-ietf-trill-rbridge-protocol-13.txt In-Reply-To: References: <1028365c0909031208m46af5055n5376f67fc1ea6f49@mail.gmail.com> <292BD6E016C29243BB231D329D626C8C02D236E5@USDALSMBS05.ad3.ad.alcatel.com> <1028365c0909200732m77156393m9b72c6c5192e742f@mail.gmail.com> <1028365c0909210551j6b89b06ej3e066a85c36f2e5b@mail.gmail.com> Message-ID: <1028365c0909221108x7f58cc4bs65f69065e44a0e40@mail.gmail.com> Looks like we are converging :-) See below On Tue, Sep 22, 2009 at 12:14 AM, Bernard Aboba wrote: >> [DE] Sure, how about the following: >> >> "To minimize the probability of nickname collisions, an RBridge >> selects a nickname randomly from the apparently available nicknames, >> based on its copy of the link state. This random selection can be by >> the RBridge hashing some of its parameters, e.g., SystemID, time and >> date, and other entropy sources, such as those given in [RFC4086], >> each time or by the RBridge using such hashing to create a seed and >> making any selections based on pseudo-random numbers generated from >> that seed [RFC4086]. The random numbers or seed and the algorithm used >> SHOULD make uniformly distributed selections over the available >> nicknames. Convergence to a nickname-collision free campus is >> accelerated by selecting new nicknames only from those that appear to >> be available and by having the highest priority nickname involved in a >> nickname conflict retain its value. There is no reason for all >> Rbridges to use the same algorithm for selecting nicknames." > > [BA] Looks good. OK, thanks. >> > [BA] OK.? I wasn't clear that this wasn't about end station MTU. >> >> [DE] OK. I think there was some text and a diagram in the -12 draft >> that can possibly be re-instated to help this. > > [BA]? OK.? Overall, the draft left me wondering whether there would > be any data MTU issues with legacy bridges that might only support > frames of 1518 octets and if so, in what circumstances. I'd also like > the draft to clearly state the assumptions that are being made with > respect to support of standards such as IEEE 802.3as (Frame > Expansion). [DE] On MTU issues, problems due to support of only 1512 or 1516 bytes (called "basic frame" and "Q-tagged frame" in 802.3as) would, as far as I can tell, only occur in fairly ancient legacy equipment. Indeed it was an ancient hub that supported only the basic frame size that first revealed the problem for IS-IS LAN Hellos when they were VLAN tagged, pushing them over 1512. This was the impetus for creating TRILL Hellos. It is my impression that almost all modern Ethernet equipment, even the cheapest off-the-shelf stuff at your local home and office retail outlet, support some non-standard larger size -- at least a couple dozen extra bytes over the basic frame. I'm not sure the 802.3as added standard 1996-byte "envelope frame" has much relationship to reality. The 802.3as concept, as I understand it, is that data should stay at its current 1500 byte size and the additional headroom in the envelope frame should be available for additional tags, tunneling headers, etc. I actually see no reason to believe this theory will work given that I believe the original Ethernet recommendation was to limit data to 1024 bytes and the additional space in 1500 bytes was supposed to be available for such other purposes. And, in fact, most data center high end Ethernet and bridging hardware supports jumbo frames, commonly around 9K bytes although there are other values. And one should note that Fiber Channel over Ethernet frames will substantially exceed the "envelope frame" size. Anyway, the assumption that should be clearly stated in the draft is that TRILL is intended to work as long as the inter-RBridge links are big enough that 1470-byte TRILL IS-IS frames can make it through after the addition of a VLAN tag and whatever other tags/headers are needed on the link. This is the size limit for TRILL Hellos and if they get through, operation is safe. As long as the links in the campus meet this criterion, which should be true even with classic basic frame devices (whether bridges or hubs or whatever) in the inter-RBridge links, things will work fine. The campus wide agreement on Sz and the MTU-probe/ack are intended to enable setting a larger size for link state PDUs and similar TRILL IS-IS frames. But, as stated in the draft, measured MTU for inter-RBridge links can be reported in the link state, along with the cost and other information about the adjacency. And, as also stated in the draft, they could thus be used in traffic engineering routes for larger frames not supported by all the RBridges or links in the campus. But TRILL does not attempt to provide an end-to-end MTU reporting service or the like and anticipates that appropriate end-to-end path MTU determination will be used when needed by end stations. This should probably also be stated in the draft. Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com From d3e3e3 at gmail.com Tue Sep 22 21:15:14 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Wed, 23 Sep 2009 00:15:14 -0400 Subject: [rbridge] Granularity of Tree Root Priority Message-ID: <1028365c0909222115ied514e4ld018b358315492cb@mail.gmail.com> Currently, an RBridge has a single priority for its nickname or nicknames to be distribution tree root(s). This made lots of sense when each RBridge could only have one nickname. Now that selected RBridges can be configured to have more than one nickname, I think it would make sense to have this priority be per nickname. That way you can get the current effect by configuring them all for a particular RBridge to be the same priority, if that is what you want. But you can also configure to different priorities the different nicknames (or nickname slots if you want to look at it that way) for an RBridge that has multiple nicknames. That way, for example, if the number of distribution trees is constrained, you can be sure they are spread across multiple RBridges rather than tending to have all or nothing of the potentially trees rooted at each RBridge with multiple nicknames. Also, it would probably be a good idea to say that a nickname configured with zero, the lowest priority, is saying that it does not want to be a tree root and that it won't be one (except that if all RBridge nicknames in a campus are configured with zero tree priority, the highest priority based on tie-breakers is still a root so you will always have at least one distribution tree). Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com From Radia.Perlman at Sun.COM Tue Sep 22 22:27:13 2009 From: Radia.Perlman at Sun.COM (Radia Perlman) Date: Tue, 22 Sep 2009 22:27:13 -0700 Subject: [rbridge] Granularity of Tree Root Priority In-Reply-To: <1028365c0909222115ied514e4ld018b358315492cb@mail.gmail.com> References: <1028365c0909222115ied514e4ld018b358315492cb@mail.gmail.com> Message-ID: <4AB9B1B1.9080900@sun.com> I don't think this matters much either way. On the one hand, the only real excuse for R1 having multiple nicknames is because someone thought R1 was well positioned, in terms of being central to the topology, or whatever, to be a root, and want to be able to use multiple shortest-path trees from R1 as the distribution trees. So that would argue that a single priority would be enough -- if one of R1's nicknames is not going to be one of the selected tree roots, then R1 shouldn't be using that extra nickname. In other words, if R1 is configured to take 6 nicknames, it's because the network administrator wants R1 to be the root of 6 trees. On the other hand, it's really just a matter of how one organizes the information in the LSP, right? Whether the priority is associated with the nickname or the RBridge? I don't think it complicates processing, or adds a lot of overhead to an LSP. So I don't think it's harmful to make the priority be per nickname. It might be nice to add words to the spec to discourage people from thinking that lots of RBridges ought to get lots of nicknames. And am I forgetting some other reason for an RBridge having multiple nicknames? Radia Donald Eastlake wrote: > Currently, an RBridge has a single priority for its nickname or > nicknames to be distribution tree root(s). This made lots of sense > when each RBridge could only have one nickname. > > Now that selected RBridges can be configured to have more than one > nickname, I think it would make sense to have this priority be per > nickname. That way you can get the current effect by configuring them > all for a particular RBridge to be the same priority, if that is what > you want. But you can also configure to different priorities the > different nicknames (or nickname slots if you want to look at it that > way) for an RBridge that has multiple nicknames. That way, for > example, if the number of distribution trees is constrained, you can > be sure they are spread across multiple RBridges rather than tending > to have all or nothing of the potentially trees rooted at each RBridge > with multiple nicknames. > > Also, it would probably be a good idea to say that a nickname > configured with zero, the lowest priority, is saying that it does not > want to be a tree root and that it won't be one (except that if all > RBridge nicknames in a campus are configured with zero tree priority, > the highest priority based on tie-breakers is still a root so you will > always have at least one distribution tree). > > Thanks, > Donald > ============================= > Donald E. Eastlake 3rd +1-508-634-2066 (home) > 155 Beaver Street > Milford, MA 01757 USA > d3e3e3 at gmail.com > _______________________________________________ > rbridge mailing list > rbridge at postel.org > http://mailman.postel.org/mailman/listinfo/rbridge > From d3e3e3 at gmail.com Wed Sep 23 08:30:58 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Wed, 23 Sep 2009 11:30:58 -0400 Subject: [rbridge] Granularity of Tree Root Priority In-Reply-To: <4AB9B1B1.9080900@sun.com> References: <1028365c0909222115ied514e4ld018b358315492cb@mail.gmail.com> <4AB9B1B1.9080900@sun.com> Message-ID: <1028365c0909230830q1405798fybcc28210e0d8e5b8@mail.gmail.com> Hi Radia, On Wed, Sep 23, 2009 at 1:27 AM, Radia Perlman wrote: > I don't think this matters much either way. On the one hand, the only > real excuse for R1 having multiple nicknames is because someone thought R1 > was well positioned, in terms of being central to the topology, or whatever, > to > be a root, and want to be able to use multiple shortest-path trees from R1 > as > the distribution trees. ?So that would argue that a single priority would > be enough -- if one of R1's nicknames is not going to be one of the selected > tree roots, then R1 shouldn't be using that extra nickname. In other words, > if R1 is configured to take 6 nicknames, it's because the network > administrator > wants R1 to be the root of 6 trees. That's a great argument assuming everything is going as the network administrator intended. But lets say something goes slightly awry and an RBridge gets attached or reset that has a low number of trees it can calculate, say six. It seems better to make it possible for the administrator to set things up so that those will be at six different RBridges rather than having your hypothetical R1 being the root for all 6 allowed trees. > On the other hand, it's really just a matter of how one organizes the > information > in the LSP, right? Whether the priority is associated with the nickname or > the RBridge? I don't think it complicates processing, or adds > a lot of overhead to an LSP. So I don't think it's harmful to make the > priority be per nickname. It might > be nice to add words to the > spec to discourage people from thinking that lots of RBridges ought to get > lots of nicknames. Yes, Bernard Aboba's remarks also encourage the addition of such words. > And am I forgetting some other reason for an RBridge having multiple > nicknames? I don't think so. > Radia Thanks, Donald > Donald Eastlake wrote: >> >> Currently, an RBridge has a single priority for its nickname or >> nicknames to be distribution tree root(s). This made lots of sense >> when each RBridge could only have one nickname. >> >> Now that selected RBridges can be configured to have more than one >> nickname, I think it would make sense to have this priority be per >> nickname. That way you can get the current effect by configuring them >> all for a particular RBridge to be the same priority, if that is what >> you want. But you can also configure to different priorities the >> different nicknames (or nickname slots if you want to look at it that >> way) for an RBridge that has multiple nicknames. That way, for >> example, if the number of distribution trees is constrained, you can >> be sure they are spread across multiple RBridges rather than tending >> to have all or nothing of the potentially trees rooted at each RBridge >> with multiple nicknames. >> >> Also, it would probably be a good idea to say that a nickname >> configured with zero, the lowest priority, is saying that it does not >> want to be a tree root and that it won't be one (except that if all >> RBridge nicknames in a campus are configured with zero tree priority, >> the highest priority based on tie-breakers is still a root so you will >> always have at least one distribution tree). >> >> Thanks, >> Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com From Radia.Perlman at Sun.COM Wed Sep 23 17:15:02 2009 From: Radia.Perlman at Sun.COM (Radia Perlman) Date: Wed, 23 Sep 2009 17:15:02 -0700 Subject: [rbridge] Granularity of Tree Root Priority In-Reply-To: <1028365c0909230830q1405798fybcc28210e0d8e5b8@mail.gmail.com> References: <1028365c0909222115ied514e4ld018b358315492cb@mail.gmail.com> <4AB9B1B1.9080900@sun.com> <1028365c0909230830q1405798fybcc28210e0d8e5b8@mail.gmail.com> Message-ID: <4ABABA06.3010408@sun.com> Donald, yeah, I agree you should make the priority per nickname. There's no downside to your proposed change, it's a really minor change, and I could imagine a case where there are two well-situated root candidates; R1 and R2, and you want to have as many trees as possible, half rooted at R1, and half rooted at R2, so that ingress guys near R1 will choose to path split among the R1-rooted trees, and ingress guys near R2 will choose to path-split among the R2-rooted trees. So, yeah, by giving each of R1 and R2, say, 5 nicknames, with interleaving priorities, you'll get half the trees rooted at R1 and half at R2. Radia Donald Eastlake wrote: > Hi Radia, > > On Wed, Sep 23, 2009 at 1:27 AM, Radia Perlman wrote: > >> I don't think this matters much either way. On the one hand, the only >> real excuse for R1 having multiple nicknames is because someone thought R1 >> was well positioned, in terms of being central to the topology, or whatever, >> to >> be a root, and want to be able to use multiple shortest-path trees from R1 >> as >> the distribution trees. So that would argue that a single priority would >> be enough -- if one of R1's nicknames is not going to be one of the selected >> tree roots, then R1 shouldn't be using that extra nickname. In other words, >> if R1 is configured to take 6 nicknames, it's because the network >> administrator >> wants R1 to be the root of 6 trees. >> > > That's a great argument assuming everything is going as the network > administrator intended. But lets say something goes slightly awry and > an RBridge gets attached or reset that has a low number of trees it > can calculate, say six. It seems better to make it possible for the > administrator to set things up so that those will be at six different > RBridges rather than having your hypothetical R1 being the root for > all 6 allowed trees. > > >> On the other hand, it's really just a matter of how one organizes the >> information >> in the LSP, right? Whether the priority is associated with the nickname or >> the RBridge? I don't think it complicates processing, or adds >> a lot of overhead to an LSP. So I don't think it's harmful to make the >> priority be per nickname. It might >> be nice to add words to the >> spec to discourage people from thinking that lots of RBridges ought to get >> lots of nicknames. >> > > Yes, Bernard Aboba's remarks also encourage the addition of such words. > > >> And am I forgetting some other reason for an RBridge having multiple >> nicknames? >> > > I don't think so. > > >> Radia >> > > Thanks, > Donald > > >> Donald Eastlake wrote: >> >>> Currently, an RBridge has a single priority for its nickname or >>> nicknames to be distribution tree root(s). This made lots of sense >>> when each RBridge could only have one nickname. >>> >>> Now that selected RBridges can be configured to have more than one >>> nickname, I think it would make sense to have this priority be per >>> nickname. That way you can get the current effect by configuring them >>> all for a particular RBridge to be the same priority, if that is what >>> you want. But you can also configure to different priorities the >>> different nicknames (or nickname slots if you want to look at it that >>> way) for an RBridge that has multiple nicknames. That way, for >>> example, if the number of distribution trees is constrained, you can >>> be sure they are spread across multiple RBridges rather than tending >>> to have all or nothing of the potentially trees rooted at each RBridge >>> with multiple nicknames. >>> >>> Also, it would probably be a good idea to say that a nickname >>> configured with zero, the lowest priority, is saying that it does not >>> want to be a tree root and that it won't be one (except that if all >>> RBridge nicknames in a campus are configured with zero tree priority, >>> the highest priority based on tie-breakers is still a root so you will >>> always have at least one distribution tree). >>> >>> Thanks, >>> Donald >>> > > ============================= > Donald E. Eastlake 3rd +1-508-634-2066 (home) > 155 Beaver Street > Milford, MA 01757 USA > d3e3e3 at gmail.com > From d3e3e3 at gmail.com Tue Sep 29 13:30:09 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Tue, 29 Sep 2009 16:30:09 -0400 Subject: [rbridge] Hiroshima IETF Meeting Message-ID: <1028365c0909291330r6efff01fr1bf0ccfa0157ddc5@mail.gmail.com> Hi, It does not currently appear that there are any major decisions that would require a TRILL working group meeting in Hiroshima. It also seems likely that attendance will be limited at that meeting. So, unless there is a strong working group feeling the other way, Erik and I do not plan to have a TRILL WG meeting at the next IETF meeting. Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com From d3e3e3 at gmail.com Tue Sep 29 20:52:35 2009 From: d3e3e3 at gmail.com (Donald Eastlake) Date: Tue, 29 Sep 2009 23:52:35 -0400 Subject: [rbridge] Some corner cases related to ambiguous destinations Message-ID: <1028365c0909292052w7689b2cfta2a02011bbfc51a1@mail.gmail.com> A couple of corner cases have been noticed related to ambiguous destinations. In particular multiple RBridges could, as a transient condition, have the same nickname. In addition, it is possible, through manual configuration, layer 2 registration protocols, and/or ESADI to end up with the same destination MAC address in the same VLAN on two or more links. The text as the bottom of this message is suggested as a new subsection of the base protocol draft to take care of this where the ambiguous destinations are at different RBridges. It is also possible for duplicate MAC addresses in the same VLAN to be on links for which a single RBridge is the appointed forwarder. To take care of this, a few words can be added to section 4.6.2.4 saying that if the egress RBridge believes that the destination MAC address in the right VLAN is on two or more links for which it is appointed forwarder with the same confidence, it sends the frame to all those links. Thanks, Donald ============================= Donald E. Eastlake 3rd +1-508-634-2066 (home) 155 Beaver Street Milford, MA 01757 USA d3e3e3 at gmail.com 4.2.6 Shortest Path Computation, Forwarding, and Ambiguous Destinations This section describes the logical result desired. Alternative implementation methods may be used as long as they producing the same forwarding behavior. When building a forwarding table, an RBridge R1 calculates shortest paths from itself as described in [RFC1195] Appendix C.1. Nicknames are added into the shortest path calculation as a final step, just as with an endnode. If multiple RBridges , say Ra and Rb, claim the same nickname, this is a transitory condition and one of Ra or Rb will defer and choose a new nickname. However, R1 simply adds that nickname as if it were attached to both Ra and Rb, and uses the standard shortest path calculation to choose the next hop An ingress RBridge RB2 maps a native frame's known unicast destination MAC address and VLAN into an egress RBridge nickname. If RB2 learns addresses only from the observation of received and decapsulated frames, then such MAC addresses cannot be duplicated within a VLAN in RB2 tables because more recent learned information, if of a higher or equal confidence, overwrites previous information. However, duplicates of the same MAC within a VLAN can appear in ESADI data and between ESADI data and addresses learned from the observation of received and decapulsated frames, entered by manual configuration, or learned through layer 2 registration protocols. If duplicate MAC addresses occur within a VLAN, RB2 sends frames to the MAC with the highest confidence. If confidences are also tied between the duplicates, for consistency it is sugested that RB2 direct all such frames (or all such frames in the same ECMP flow) toward the same egress RBridge; however, the use of other policies will not cause a problem since transit RBridges do not examine the Inner.MacDA for known unicast frames.