US20140241345A1 - Source routing with fabric switches in an ethernet fabric network - Google Patents
Source routing with fabric switches in an ethernet fabric network Download PDFInfo
- Publication number
- US20140241345A1 US20140241345A1 US13/781,561 US201313781561A US2014241345A1 US 20140241345 A1 US20140241345 A1 US 20140241345A1 US 201313781561 A US201313781561 A US 201313781561A US 2014241345 A1 US2014241345 A1 US 2014241345A1
- Authority
- US
- United States
- Prior art keywords
- source
- fabric
- routing
- path
- routing information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L45/00—Routing or path finding of packets in data switching networks
- H04L45/34—Source routing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/25—Routing or path finding in a switch fabric
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L45/00—Routing or path finding of packets in data switching networks
- H04L45/54—Organization of routing tables
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L45/00—Routing or path finding of packets in data switching networks
- H04L45/72—Routing based on the source address
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L45/00—Routing or path finding of packets in data switching networks
- H04L45/74—Address processing for routing
- H04L45/745—Address table lookup; Address filtering
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/12—Avoiding congestion; Recovering from congestion
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/35—Switches specially adapted for specific applications
- H04L49/351—Switches specially adapted for specific applications for local area network [LAN], e.g. Ethernet switches
Definitions
- the present invention relates to data center infrastructure, and more particularly, this invention relates to reducing the overhead associated with using look-up tables in fabric switches to reduce latency.
- a switching processor such as a switching application specific integrated circuit (ASIC) may be used to choose a port to send received network packets.
- a switching processor such as a switching application specific integrated circuit (ASIC)
- ASIC switching application specific integrated circuit
- a look-up table is utilized to choose which port to send a received packet based on a destination address designated in a header of the received packet.
- these look-up tables may encompass vast amounts of data, which causes latency in using the look-up table to determine an egress port to forward packets to. Accordingly, it would be beneficial to have a method to reduce the overhead associated with using look-up tables in fabric switches in order to reduce fabric latency.
- a system for source routing packets includes a network fabric having a plurality of fabric switches interconnected in the network fabric and a switch controller having logic adapted to configure the network fabric, determine one or more paths through the network fabric between any two hosts connected thereto, and create a source-routing table to store the one or more paths through the network fabric between any two hosts connected thereto.
- a computer program product for source routing packets includes a computer readable storage medium having program code embodied therewith, the program code readable/executable by a switch controller to: configure a network fabric having a plurality of fabric switches interconnected in the network fabric, determine one or more paths through the network fabric between any two hosts connected thereto, and create a source-routing table to store the one or more paths through the network fabric between any two hosts connected thereto.
- a method for source routing packets includes receiving or creating a packet using a network interface card (NIC) of a host connected to a network fabric having a plurality of fabric switches interconnected therein, determining a path through the network fabric by consulting a source-routing table stored to the host, storing source-routing information to a packet header for the packet, the source-routing information including the path, and sending the packet to a first device or hop indicated by the path in the source-routing information.
- NIC network interface card
- a method for source routing packets includes receiving a packet, receiving source-routing information with a fabric switch interconnected to other fabric switches in a network fabric, the source-routing information being sent from a switch controller, storing the source-routing information to a source-routing table that indicates a sequence of devices or hops between the fabric switch and each known destination address in the network fabric, determining a next device or hop in a path through the network fabric by consulting the source-routing table, storing a portion of the source-routing information to a packet header for the packet, the portion of the source-routing information including at least a portion of the path, and sending the packet to the next device or hop indicated by the at least the portion of the path in the portion of the source-routing information.
- FIG. 1 illustrates a network architecture, in accordance with one embodiment.
- FIG. 2 shows a representative hardware environment that may be associated with the servers and/or clients of FIG. 1 , in accordance with one embodiment.
- FIG. 3 shows a system for source routing packets, according to one embodiment.
- FIG. 4 shows an exemplary path through a network fabric, according to one embodiment.
- FIG. 5A shows an exemplary frame format for a packet having source-routing information, according to one embodiment.
- FIG. 5B is an exemplary tag protocol identifier, according to one embodiment.
- FIG. 6 is a flowchart of a method, according to one embodiment.
- FIG. 7 is a flowchart of a method, according to one embodiment.
- a system for source routing packets includes a network fabric having a plurality of fabric switches interconnected in the network fabric and a switch controller having logic adapted to configure the network fabric, determine one or more paths through the network fabric between any two hosts connected thereto, and create a source-routing table to store the one or more paths through the network fabric between any two hosts connected thereto.
- a computer program product for source routing packets includes a computer readable storage medium having program code embodied therewith, the program code readable/executable by a switch controller to: configure a network fabric having a plurality of fabric switches interconnected in the network fabric, determine one or more paths through the network fabric between any two hosts connected thereto, and create a source-routing table to store the one or more paths through the network fabric between any two hosts connected thereto.
- a method for source routing packets includes receiving or creating a packet using a network interface card (NIC) of a host connected to a network fabric having a plurality of fabric switches interconnected therein, determining a path through the network fabric by consulting a source-routing table stored to the host, storing source-routing information to a packet header for the packet, the source-routing information including the path, and sending the packet to a first device or hop indicated by the path in the source-routing information.
- NIC network interface card
- a method for source routing packets includes receiving a packet, receiving source-routing information with a fabric switch interconnected to other fabric switches in a network fabric, the source-routing information being sent from a switch controller, storing the source-routing information to a source-routing table that indicates a sequence of devices or hops between the fabric switch and each known destination address in the network fabric, determining a next device or hop in a path through the network fabric by consulting the source-routing table, storing a portion of the source-routing information to a packet header for the packet, the portion of the source-routing information including at least a portion of the path, and sending the packet to the next device or hop indicated by the at least the portion of the path in the portion of the source-routing information.
- a switch controller such as a controller operating OpenFlow software (an OpenFlow Controller) or a switch controller that operates according to software-defined network (SDN) standards
- a plurality of switches in a network fabric which are capable of communicating with the switch controller may be instructed of desirable paths with which to forward received packets in order to best utilize the network fabric.
- intelligence or functionality may be built into the switch controller to determine paths through the network fabric and to deliver these desired paths to individual switches in the network fabric that are compliant with whatever software the switch controller utilizes.
- the switches when the switch controller operates according to OpenFlow and/or SDN standards, the switches may be OpenFlow and/or SDN compliant in order to utilize the source routing techniques described herein.
- aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as “logic,” a “circuit,” “module,” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
- the computer readable medium may be a computer readable signal medium or a non-transitory computer readable storage medium.
- a non-transitory computer readable storage medium may be, for example, but not limited to, a system, apparatus, device, or any suitable combination of the foregoing which may rely on any suitable technology types, such as electronic, magnetic, optical, electromagnetic, infrared, semiconductor, etc.
- non-transitory computer readable storage medium More specific examples (a non-exhaustive list) of the non-transitory computer readable storage medium include the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a portable compact disc read-only memory (CD-ROM), a Blu-ray disc read-only memory (BD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
- a non-transitory computer readable storage medium may be any tangible medium that is capable of containing, or storing a program or application for use by or in connection with an instruction execution system, apparatus, or device.
- a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof.
- a computer readable signal medium may be any computer readable medium that is not a non-transitory computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device, such as an electrical connection having one or more wires, an optical fibre, etc.
- Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fibre cable, RF, etc., or any suitable combination of the foregoing.
- Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++, or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages.
- the program code may execute entirely on a user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
- the remote computer or server may be connected to the user's computer through any type of network, including a local area network (LAN), storage area network (SAN), and/or a wide area network (WAN), or the connection may be made to an external computer, for example through the Internet using an Internet Service Provider (ISP).
- LAN local area network
- SAN storage area network
- WAN wide area network
- ISP Internet Service Provider
- These computer program instructions may also be stored in a computer readable medium that may direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
- the computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
- FIG. 1 illustrates a network architecture 100 , in accordance with one embodiment.
- a plurality of remote networks 102 are provided including a first remote network 104 and a second remote network 106 .
- a gateway 101 may be coupled between the remote networks 102 and a proximate network 108 .
- the networks 104 , 106 may each take any form including, but not limited to a LAN, a WAN such as the Internet, public switched telephone network (PSTN), internal telephone network, etc.
- PSTN public switched telephone network
- the gateway 101 serves as an entrance point from the remote networks 102 to the proximate network 108 .
- the gateway 101 may function as a router, which is capable of directing a given packet of data that arrives at the gateway 101 , and a switch, which furnishes the actual path in and out of the gateway 101 for a given packet.
- At least one data server 114 coupled to the proximate network 108 , and which is accessible from the remote networks 102 via the gateway 101 .
- the data server(s) 114 may include any type of computing device/groupware. Coupled to each data server 114 is a plurality of user devices 116 .
- Such user devices 116 may include a desktop computer, laptop computer, handheld computer, printer, and/or any other type of logic-containing device.
- a user device 111 may also be directly coupled to any of the networks, in some embodiments.
- a peripheral 120 or series of peripherals 120 may be coupled to one or more of the networks 104 , 106 , 108 .
- databases and/or additional components may be utilized with, or integrated into, any type of network element coupled to the networks 104 , 106 , 108 .
- a network element may refer to any component of a network.
- methods and systems described herein may be implemented with and/or on virtual systems and/or systems which emulate one or more other systems, such as a UNIX system which emulates an IBM z/OS environment, a UNIX system which virtually hosts a MICROSOFT WINDOWS environment, a MICROSOFT WINDOWS system which emulates an IBM z/OS environment, etc.
- This virtualization and/or emulation may be enhanced through the use of VMWARE software, in some embodiments.
- one or more networks 104 , 106 , 108 may represent a cluster of systems commonly referred to as a “cloud.”
- cloud computing shared resources, such as processing power, peripherals, software, data, servers, etc., are provided to any system in the cloud in an on-demand relationship, thereby allowing access and distribution of services across many computing systems.
- Cloud computing typically involves an Internet connection between the systems operating in the cloud, but other techniques of connecting the systems may also be used, as known in the art.
- FIG. 2 shows a representative hardware environment associated with a user device 116 and/or server 114 of FIG. 1 , in accordance with one embodiment.
- FIG. 2 illustrates a typical hardware configuration of a workstation having a central processing unit (CPU) 210 , such as a microprocessor, and a number of other units interconnected via one or more buses 212 which may be of different types, such as a local bus, a parallel bus, a serial bus, etc., according to several embodiments.
- CPU central processing unit
- the workstation shown in FIG. 2 includes a Random Access Memory (RAM) 214 , Read Only Memory (ROM) 216 , an I/O adapter 218 for connecting peripheral devices such as disk storage units 220 to the one or more buses 212 , a user interface adapter 222 for connecting a keyboard 224 , a mouse 226 , a speaker 228 , a microphone 232 , and/or other user interface devices such as a touch screen, a digital camera (not shown), etc., to the one or more buses 212 , communication adapter 234 for connecting the workstation to a communication network 235 (e.g., a data processing network) and a display adapter 236 for connecting the one or more buses 212 to a display device 238 .
- a communication network 235 e.g., a data processing network
- display adapter 236 for connecting the one or more buses 212 to a display device 238 .
- the workstation may have resident thereon an operating system such as the MICROSOFT WINDOWS Operating System (OS), a MAC OS, a UNIX OS, etc. It will be appreciated that a preferred embodiment may also be implemented on platforms and operating systems other than those mentioned.
- OS MICROSOFT WINDOWS Operating System
- MAC OS MAC OS
- UNIX OS UNIX OS
- a preferred embodiment may also be implemented on platforms and operating systems other than those mentioned.
- a preferred embodiment may be written using JAVA, XML, C, and/or C++ language, or other programming languages, along with an object oriented programming methodology.
- Object oriented programming (OOP) which has become increasingly used to develop complex applications, may be used.
- a system 300 is shown according to one embodiment, which has a plurality of fabric switches 304 interconnected in a network fabric 302 , each of the fabric switches 304 being connected to one another via connections 306 .
- Each fabric switch 304 is connected, directly or indirectly to a switch controller 308 (as denoted by dashed line connection 310 between the switch controller 308 and the network fabric 302 ).
- the switch controller 308 is capable of receiving information from each of the fabric switches 304 and is capable of sending information and/or commands to the fabric switches 304 .
- the switch controller 308 may operate according to OpenFlow and/or SDN standards, and each fabric switch 304 may be OpenFlow and/or SDN compliant. In other embodiments, the switch controller 308 may utilize a different application capable of controlling the fabric switches 304 as would be known by one of skill in the art, such as Beacon, Jaxon, NOX, POX, Maestro, etc.
- the network fabric 302 may be a physical and/or virtual network fabric (a network fabric which utilizes only physical devices, a network fabric which only utilizes virtual devices, and/or a network fabric which utilizes a combination of physical and virtual devices).
- each of the fabric switches 304 may be a physical switch, a virtual switch, or a combination thereof.
- the system 300 may further comprise one or more hosts 312 connected to the network fabric 302 via one or more fabric switches 304 via connections 314 .
- Any of the hosts 312 may be a physical host, a virtual host, or a combination thereof.
- the hosts may be any type of device capable of communicating with the network fabric 302 , such as another network, a server, a controller, a workstation, an end station, etc.
- Each host 312 may include an interface for communicating with the network fabric 302 and one or more fabric switches 304 therein.
- Each of the hosts 312 are unaware of the physical components of the network fabric 302 and instead view the network fabric 302 as a single entity to which a connection may be made, in one approach.
- each host 312 is actually connected to at least one physical fabric switch 304 within the network fabric 302 .
- the host 312 may be connected to multiple fabric switches 304 in the case of a Link Aggregation (LAG) connection.
- LAG Link Aggregation
- the switch controller 308 may comprise logic adapted to analyze and configure the network fabric 302 such that there is one or more non-looping paths through the network fabric 302 between any two hosts 312 or other end stations connected to the network fabric 302 .
- the logic may be able to determine multiple paths through the network fabric 302 , in order to provide redundancy, increased throughput, and decreased latency, among other advantages.
- Some factors include the number of layers in the fabric, L, the number of nodes per layer, N L , the switch controller's topology and connectivity graph (and whether the switch controller 308 is capable of globalizing the routing decisions), etc.
- the multipathing may take place in-order via Equal Cost Multi-Pathing (ECMP) and/or LAG hashing (and what type of hash used may be a consideration, such as an industry standard, a legacy system, etc.).
- ECMP Equal Cost Multi-Pathing
- LAG hashing and what type of hash used may be a consideration, such as an industry standard, a legacy system, etc.
- the multipathing may support high performance operation via adaptive routing.
- Converged Enhanced Ethernet may also be supported by the network fabric 302 , such as by using Priority Flow Control (PFC) and/or Enhanced Transmission Selection (ETS) along the complete path through the network fabric 302 in addition to Quantized Congestion Notification (QCN). Additionally, link congestion may trigger saturation tree with QCN.
- PFC Priority Flow Control
- ETS Enhanced Transmission Selection
- QCN Quantized Congestion Notification
- interface-based path representation where a single interface to a network may be used to gain perspective on the network from a point of view of that interface. This interface-based path representation may then be used to span the network fabric 302 , as shown in FIG. 3 .
- Host 1 is shown connected directly to fabric switch S1.
- the interface for Host 1 to the network fabric 302 may be a single physical port, a virtual port, a static LAG, a dynamic LAG, or any other suitable interface between Host 1 and fabric switch S1.
- a global forwarding table may be created, managed, updated, and utilized by the switch controller 308 to make routing decisions, for example, once a packet is received by fabric switch S1 from Host 1 all the way until the packet is received by host 2 via S3.
- the switch controller 308 may be consulted anytime a routing decision is to be made for a packet received by any of the fabric switches 304 in the network fabric 302 .
- each fabric switch 304 may have resident therein a source-routing table. In this case, the fabric switch 304 inserts the route information into each incoming packet that does not yet have source-routing information stored therein.
- One disadvantage of this approach is that a lot of redundancy in terms of routing information in the network is introduced, which makes routing updates cumbersome, since they must be done for each fabric switch 304 in the network fabric 302 .
- One advantage of this approach is that legacy (i.e., non-source routing capable) devices and components (e.g., network interface cards (NICs), legacy switches, etc.) may be attached to the network fabric 302 .
- legacy i.e., non-source routing capable
- components e.g., network interface cards (NICs), legacy switches, etc.
- FIG. 4 a portion 400 of the network fabric 302 is shown, with one exemplary path through the network fabric 302 shown in more detail.
- This path is between two hosts 312 , specifically Host 1 to Host 2, and includes three fabric switches 304 —S1, S2, and S3.
- Host 1 may be assumed to have an address of Addr1 and Host 2 an address of Addr2, while it may be assumed that fabric switch S1 is connected to Host 1 via port 4 (denoted as P4), and to fabric switch S2 via a LAG (denoted as L2).
- fabric switch S2 is connected to fabric switch S1 via LAG L2.
- fabric switch S2 is connected to fabric switch S3 via port 3 (denoted as P3), while fabric switch S3 is connected to fabric switch S2 via port 6 (denoted as P6) and to Host 2 via port 2 (denoted as P2), as shown in FIG. 4 .
- the path between Host 1 and Host 2 may be represented in each fabric switch 304 in a forwarding table, according to one embodiment, which may be stored locally to each fabric switch 304 , or globally by the switch controller 308 in another embodiment.
- each destination port may be a physical port, a virtual port, or a combination thereof.
- fabric switch S1 when a packet is received by fabric switch S1 from Host 1, and the packet is to be forwarded to Host 2, fabric switch S1 will follow a path from S1/L2/P5 to S2/P3 to S3/P2. This is because the LAG L2 is chosen and follows port 5 out of fabric switch 1 to fabric switch 2. Likewise, in the reverse direction, fabric switch 2 chooses port 4 in the LAG L2. Furthermore, this forwarding logic may take into account a hashing algorithm information exchange protocol, and in one approach, only the edge switches (switches S1 and S3 in this example) may maintain forwarding tables, and intermediate switches (switch S2 in this example) may simply follow the source route in the packet.
- the switch controller 308 may have certain physical topology available to construct the paths through the network fabric 302 . In that topology, the fabric switches 304 and physical connectivity therebetween are shown. In using the topology information, the switch controller 308 is adapted to determine the ARP entries associated with either its local ARP connectivity or general subnet distribution in the network fabric 302 . So combining this information, the switch controller 308 creates these source-routing tables and may offload them to any devices capable of source routing in the network fabric 302 or connected thereto.
- the switch controller 308 may offload the forwarding table information to just the fabric switches 304 , or to the fabric switches 304 and the end hosts 312 .
- each end host 312 may have a forwarding table that includes source-routing information for packets being sent to other end hosts connected to the network fabric 302 .
- a network interface card (NIC) of the host 312 or some other component or device may produce the source routing for each forwarded packet.
- each NIC has a source-routing table that indicates the sequence of turns or hops to each known destination, and inserts the route into a frame of the packet upon injection into the fabric network 302 .
- hardware support from the fabric switches 304 is beneficial in order for this approach to function properly, because when each fabric switch 304 has the capability to inspect the frame for the presence of a source route the various fabric switches 304 in the path may then take its respective routing decision based on that source-routed information.
- each fabric switch 304 may still have a traditional routing table to handle non-source-routed frames, regardless of whether the fabric switch 304 has the capability to handle source-routed frames.
- the fabric switch 304 may simply rely on a traditional routing table with which to determine a next hop and egress port.
- one or more devices within the path may lack the ability to handle the source-routed frames, but the packet may still be forwarded without problems until it reaches another fabric switch 304 or device in the path which is capable of handling a source-routed frame, where it will once again be handled according to the source routing.
- Each device in the network fabric may be a virtual device, a physical device, or a combination thereof.
- each egress port may be a physical port, a virtual port, or a combination thereof.
- each device in the network fabric 302 and connected thereto capable of source routing may know each of the other devices to which it is connected which are source-routing capable. Accordingly, each source routing capable device is able to determine when it is forwarding a packet to a device which is not capable of source routing. In this case, the source-routing information may be stripped from the packet, and may appear as a standard packet to the receiving device.
- the exemplary path through the network fabric 302 is again referenced.
- This path is between two hosts 312 , specifically Host 1 to Host 2, and includes three fabric switches 304 —S1, S2, and S3.
- Host 1 and Host 2 have the forwarding tables, and the switches are instructed to forward packets according to the source-routed information included therein.
- the path would be represented as follows in the forwarding table for each host 312 , where the destination port set is identified as [device]/[port].
- the source-routing information may be encapsulated in the packets in various different ways, and may depend on any protocols and/or network types that the packet adheres to.
- the source-routing information may be included in a header of a packet. One example of this is shown in FIG. 5A .
- the frame format 500 includes a destination media access control address (DMAC) 502 , a source MAC address (SMAC) 504 , a source routing tag (SR-Tag) 506 that includes the source-routing information, a service tag (S-Tag) 508 , a customer tag (C-Tag) 510 , an ethertype descriptor 512 , a payload 514 for the packet, and an optional frame check sequence (FCS) 516 .
- DMAC destination media access control address
- SMAC source MAC address
- SR-Tag source routing tag
- FCS optional frame check sequence
- the DMAC 502 , SMAC 504 , S-Tag 508 , C-Tag 510 , payload 514 and FCS 516 may behave and be utilized in the same manner as typical for any packet adhering to any of various IEEE standards; however, the ether type descriptor 512 may take into account the length of the SR-Tag 506 .
- the SR-Tag 506 when a switch is not source routing capable, then the SR-Tag 506 may be omitted from the packet in order for the switch to understand the information in the header of the packet. This may be performed by any device which forwards the packet to a non-source routing compliant device, such as a legacy switch. Then when the packet is received from this legacy device by another source routing compliant device, the SR-Tag 506 may be reinserted into the header and the source-routing information may be restored from this hop forward to the destination in one embodiment, or the entire source-routing information may be added to the SR-Tag 506 from the source to the destination, in an alternate embodiment.
- a non-source routing compliant device such as a legacy switch.
- the SR-Tag 506 may be reinserted into the header and the source-routing information may be restored from this hop forward to the destination in one embodiment, or the entire source-routing information may be added to the SR-Tag 506 from the source to the destination, in an alternate embodiment.
- the SR-Tag 506 may include source-routing information, enforcement options, and hop count information.
- a Tag Protocol Identifier such as the TPID 520 shown in FIG. 5B according to one embodiment, may be used to denote the SR-Tag 506 .
- the TPID 520 to denote a SR-Tag may have the code 0 ⁇ D2D2, but is not so constrained, as any available string may be used to denote an SR-Tag.
- the SR-Tag 506 may be formatted to include a series of strings, each string having a predetermined length. In this example, the strings are 16 bits long, but any length may be used, such as 8 bits, 24 bits, 32 bits, etc.
- the first string may be designated for Enforcement Options 522 and the Hop Count 524 , with each string including half the string length (8 bits each) or some other division.
- the Enforcement Options 522 may be used to indicate any enforcement criteria for a particular packet. For example, if the switch has a forwarding table which is inconsistent with a next hop stipulated in the source-routing information, then the switch may be directed to overwrite its own forwarding table with the source-routing information or the source-routing information may be rewritten based on the switch's local forwarding table. This decision may be indicated in the Enforcement Options 522 . These Enforcement Options 522 also may dictate whether the source-routing information is strictly followed or if it may be bypassed. Then, other traffic management options may be present, such as which of various available ports to choose to egress the packet (such as in a LAG or some other suitable arrangement). This is possible because it is a logical interface.
- a logical interface might come up with more than one physical port to choose from.
- the Enforcement Options 522 may indicate that a port with lowest latency should be chosen, or a port with the highest latency but more reliability, or some other traffic management decision that are understood by intermediate switches. Most of the instructions that may be stored in the Enforcement Options 522 may be related to reliability and/or traffic management. Some of these options may even allow filtering or not filtering based on the SR-Tags because the SR-Tags are a necessary component in order to filter, and basically the options might indicate that the SR-Tag is to be retained no matter what. In another approach, the switch may use the SR-Tag if it understands it, or it may discard the SR-Tag if it does not.
- the Hop Count 524 is used to denote which hop the packet is currently at. After the initial string having the Enforcement Options 522 and the Hop Count 524 , a series of Bridge IDs 526 related to Logical Port IDs 528 and Options 530 thereof may be listed, one for each hop, shown as hop 0, hop 1, hop 2, . . . , hop N. The number of Bridge IDs 526 and Logical Port IDs 528 and Options 530 may depend on the number of hops in the designated path, e.g., N.
- the Hop Count 524 indicates the current hop in the path where the packet is supposed to be, e.g., a number between 0 and N, either beginning at N or 0. If the packet is not at the indicated hop, then corrective action may be taken by the switch to correct any issues with the Hop Count 524 and/or designated path.
- Each set of Bridge ID 526 and Logical Port ID 528 and Options thereof may be 16 bits in length, with the Bridge IDs 526 being 16 bits, the Logical Port ID 528 being 12 bits, and the Options 530 being 4 bits. Of course, any other length may be used for these fields, as would be understood by one of skill in the art.
- FIG. 6 a flowchart of a method 600 for source routing packets is shown, according to one embodiment.
- the method 600 may be performed in accordance with the present invention in any of the environments depicted in FIGS. 1-5B , among others, in various embodiments. Of course, more or less operations than those specifically described in FIG. 6 may be included in method 600 , as would be understood by one of skill in the art upon reading the present descriptions.
- each of the steps of the method 600 may be performed by any suitable component of the operating environment.
- the method 600 may be partially or entirely performed by a fabric switch, an end station, a processor (such as an ASIC, a switching ASIC, a CPU, etc.) embodied in a computer, a switch controller, a host connected to a network fabric having a plurality of fabric switches interconnected therein, etc.
- a fabric switch such as an ASIC, a switching ASIC, a CPU, etc.
- a processor such as an ASIC, a switching ASIC, a CPU, etc.
- method 600 may initiate with operation 602 , where a packet is received or created using a network interface card (NIC) of a host connected to a network fabric.
- NIC network interface card
- the NIC in this method is source routing capable.
- the network fabric includes a plurality of fabric switches interconnected therein, each fabric switch possibly being source routing capable.
- the switch controller may be adapted to operate according to OpenFlow standards, and the NIC or host may be OpenFlow compliant.
- source-routing table details and rules may be received from the OpenFlow Controller, as a way of programming which information and how the information is stored in the source-routing table.
- a path through the network fabric is determined by consulting a source-routing table stored to the host.
- the path may be chosen from many different available paths between the host and the detonation address of the packet.
- traffic may be load balanced between the destination address and the host by changing which path is selected for each new packet, stream of packets, flow, etc.
- source-routing information may be received from a switch controller, and the source-routing information may be stored to the source-routing table, thereby allowing the host to send packets to any known destination in the network fabric without the use of a traditional look-up table.
- source-routing information is stored to a packet header for the packet, the source-routing information comprising the path.
- the source-routing information may be stored in a SR-Tag in the packet header.
- the SR-Tag may comprise, as described in more detail previously, an enforcement options field, a hop count indicator field for indicating a current device or hop in the path, and the source-routing information for the path, comprising a bridge indicator associated with a logical port indicator and options thereof for each device or hop in the path.
- the packet is sent to a first device or hop indicated by the path in the source-routing information. This operation may be performed without the use of a look-up table.
- the first device or hop may be part of the path stored in the packet header which indicates the path through the network fabric.
- any or all operations of method 600 may be implemented in a system, a fabric switch, a device, a network, a host, a processor, and/or a computer program product.
- FIG. 7 another flowchart of method 700 for source routing packets is shown, according to one embodiment.
- the method 700 may be performed in accordance with the present invention in any of the environments depicted in FIGS. 1-5B , among others, in various embodiments. Of course, more or less operations than those specifically described in FIG. 7 may be included in method 700 , as would be understood by one of skill in the art upon reading the present descriptions.
- each of the steps of the method 700 may be performed by any suitable component of the operating environment.
- the method 700 may be partially or entirely performed by a fabric switch, an end station, a processor (such as an ASIC, a switching ASIC, a CPU, etc.) embodied in a computer, a switch controller, a host connected to a network fabric having a plurality of fabric switches interconnected therein, etc.
- a fabric switch such as an ASIC, a switching ASIC, a CPU, etc.
- a processor such as an ASIC, a switching ASIC, a CPU, etc.
- method 700 may initiate with operation 702 , where a packet is received, such as by a fabric switch in a network fabric comprising a plurality of interconnected fabric switches.
- a fabric switch in a network fabric comprising a plurality of interconnected fabric switches.
- one or more hosts may be connected to the network fabric.
- the fabric switch may be connected, directly or indirectly, to a switch controller for controlling certain functions thereof.
- the switch controller may be adapted to operate according to OpenFlow standards, and the fabric switch may be OpenFlow compliant.
- source-routing information may be received with a fabric switch, the source-routing information being sent from the switch controller.
- the source-routing information is stored to a source-routing table that indicates a sequence of devices or hops between the fabric switch and each known destination address in the network fabric.
- a next device or hop in a path through the network fabric is determined by consulting the source-routing table. This operation may be performed without the use of a look-up table, in one approach.
- a portion of the source-routing information is stored to a packet header for the packet, the portion of the source-routing information comprising at least a portion of the path.
- the portion of the source-routing information may be stored in a SR-Tag, the SR-Tag comprising an enforcement options field, a hop count indicator field for indicating a current device or hop in the path, and the portion of the source-routing information for the at least the portion of the path.
- the portion of the source-routing information comprising a bridge indicator associated with a logical port indicator and options thereof for each device or hop in the path.
- the packet is sent to the next device or hop indicated by the at least the portion of the path in the portion of the source-routing information.
- any or all operations of method 700 may be implemented in a system, a fabric switch, a device, a network, a host, a processor, and/or a computer program product.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Description
- The present invention relates to data center infrastructure, and more particularly, this invention relates to reducing the overhead associated with using look-up tables in fabric switches to reduce latency.
- A switching processor, such as a switching application specific integrated circuit (ASIC), may be used to choose a port to send received network packets. Typically, a look-up table is utilized to choose which port to send a received packet based on a destination address designated in a header of the received packet. However, as fabric networks grow larger, these look-up tables may encompass vast amounts of data, which causes latency in using the look-up table to determine an egress port to forward packets to. Accordingly, it would be beneficial to have a method to reduce the overhead associated with using look-up tables in fabric switches in order to reduce fabric latency.
- In one embodiment, a system for source routing packets includes a network fabric having a plurality of fabric switches interconnected in the network fabric and a switch controller having logic adapted to configure the network fabric, determine one or more paths through the network fabric between any two hosts connected thereto, and create a source-routing table to store the one or more paths through the network fabric between any two hosts connected thereto.
- According to another embodiment, a computer program product for source routing packets includes a computer readable storage medium having program code embodied therewith, the program code readable/executable by a switch controller to: configure a network fabric having a plurality of fabric switches interconnected in the network fabric, determine one or more paths through the network fabric between any two hosts connected thereto, and create a source-routing table to store the one or more paths through the network fabric between any two hosts connected thereto.
- In another embodiment, a method for source routing packets includes receiving or creating a packet using a network interface card (NIC) of a host connected to a network fabric having a plurality of fabric switches interconnected therein, determining a path through the network fabric by consulting a source-routing table stored to the host, storing source-routing information to a packet header for the packet, the source-routing information including the path, and sending the packet to a first device or hop indicated by the path in the source-routing information.
- In yet another embodiment, a method for source routing packets includes receiving a packet, receiving source-routing information with a fabric switch interconnected to other fabric switches in a network fabric, the source-routing information being sent from a switch controller, storing the source-routing information to a source-routing table that indicates a sequence of devices or hops between the fabric switch and each known destination address in the network fabric, determining a next device or hop in a path through the network fabric by consulting the source-routing table, storing a portion of the source-routing information to a packet header for the packet, the portion of the source-routing information including at least a portion of the path, and sending the packet to the next device or hop indicated by the at least the portion of the path in the portion of the source-routing information.
- Other aspects and embodiments of the present invention will become apparent from the following detailed description, which, when taken in conjunction with the drawings, illustrate by way of example the principles of the invention.
-
FIG. 1 illustrates a network architecture, in accordance with one embodiment. -
FIG. 2 shows a representative hardware environment that may be associated with the servers and/or clients ofFIG. 1 , in accordance with one embodiment. -
FIG. 3 shows a system for source routing packets, according to one embodiment. -
FIG. 4 shows an exemplary path through a network fabric, according to one embodiment. -
FIG. 5A shows an exemplary frame format for a packet having source-routing information, according to one embodiment. -
FIG. 5B is an exemplary tag protocol identifier, according to one embodiment. -
FIG. 6 is a flowchart of a method, according to one embodiment. -
FIG. 7 is a flowchart of a method, according to one embodiment. - The following description is made for the purpose of illustrating the general principles of the present invention and is not meant to limit the inventive concepts claimed herein. Further, particular features described herein can be used in combination with other described features in each of the various possible combinations and permutations.
- Unless otherwise specifically defined herein, all terms are to be given their broadest possible interpretation including meanings implied from the specification as well as meanings understood by those skilled in the art and/or as defined in dictionaries, treatises, etc.
- It must also be noted that, as used in the specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless otherwise specified.
- In one general embodiment, a system for source routing packets includes a network fabric having a plurality of fabric switches interconnected in the network fabric and a switch controller having logic adapted to configure the network fabric, determine one or more paths through the network fabric between any two hosts connected thereto, and create a source-routing table to store the one or more paths through the network fabric between any two hosts connected thereto.
- According to another general embodiment, a computer program product for source routing packets includes a computer readable storage medium having program code embodied therewith, the program code readable/executable by a switch controller to: configure a network fabric having a plurality of fabric switches interconnected in the network fabric, determine one or more paths through the network fabric between any two hosts connected thereto, and create a source-routing table to store the one or more paths through the network fabric between any two hosts connected thereto.
- In another general embodiment, a method for source routing packets includes receiving or creating a packet using a network interface card (NIC) of a host connected to a network fabric having a plurality of fabric switches interconnected therein, determining a path through the network fabric by consulting a source-routing table stored to the host, storing source-routing information to a packet header for the packet, the source-routing information including the path, and sending the packet to a first device or hop indicated by the path in the source-routing information.
- In yet another general embodiment, a method for source routing packets includes receiving a packet, receiving source-routing information with a fabric switch interconnected to other fabric switches in a network fabric, the source-routing information being sent from a switch controller, storing the source-routing information to a source-routing table that indicates a sequence of devices or hops between the fabric switch and each known destination address in the network fabric, determining a next device or hop in a path through the network fabric by consulting the source-routing table, storing a portion of the source-routing information to a packet header for the packet, the portion of the source-routing information including at least a portion of the path, and sending the packet to the next device or hop indicated by the at least the portion of the path in the portion of the source-routing information.
- By using a switch controller, such as a controller operating OpenFlow software (an OpenFlow Controller) or a switch controller that operates according to software-defined network (SDN) standards, a plurality of switches in a network fabric which are capable of communicating with the switch controller may be instructed of desirable paths with which to forward received packets in order to best utilize the network fabric. To accomplish this, intelligence or functionality may be built into the switch controller to determine paths through the network fabric and to deliver these desired paths to individual switches in the network fabric that are compliant with whatever software the switch controller utilizes. In addition, in one approach, when the switch controller operates according to OpenFlow and/or SDN standards, the switches may be OpenFlow and/or SDN compliant in order to utilize the source routing techniques described herein.
- As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as “logic,” a “circuit,” “module,” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
- Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a non-transitory computer readable storage medium. A non-transitory computer readable storage medium may be, for example, but not limited to, a system, apparatus, device, or any suitable combination of the foregoing which may rely on any suitable technology types, such as electronic, magnetic, optical, electromagnetic, infrared, semiconductor, etc. More specific examples (a non-exhaustive list) of the non-transitory computer readable storage medium include the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a portable compact disc read-only memory (CD-ROM), a Blu-ray disc read-only memory (BD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a non-transitory computer readable storage medium may be any tangible medium that is capable of containing, or storing a program or application for use by or in connection with an instruction execution system, apparatus, or device.
- A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a non-transitory computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device, such as an electrical connection having one or more wires, an optical fibre, etc.
- Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fibre cable, RF, etc., or any suitable combination of the foregoing.
- Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++, or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on a user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer or server may be connected to the user's computer through any type of network, including a local area network (LAN), storage area network (SAN), and/or a wide area network (WAN), or the connection may be made to an external computer, for example through the Internet using an Internet Service Provider (ISP).
- Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatuses (systems), and computer program products according to various embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, may be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
- These computer program instructions may also be stored in a computer readable medium that may direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
- The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
-
FIG. 1 illustrates anetwork architecture 100, in accordance with one embodiment. As shown inFIG. 1 , a plurality ofremote networks 102 are provided including a firstremote network 104 and a secondremote network 106. Agateway 101 may be coupled between theremote networks 102 and aproximate network 108. In the context of thepresent network architecture 100, thenetworks - In use, the
gateway 101 serves as an entrance point from theremote networks 102 to theproximate network 108. As such, thegateway 101 may function as a router, which is capable of directing a given packet of data that arrives at thegateway 101, and a switch, which furnishes the actual path in and out of thegateway 101 for a given packet. - Further included is at least one
data server 114 coupled to theproximate network 108, and which is accessible from theremote networks 102 via thegateway 101. It should be noted that the data server(s) 114 may include any type of computing device/groupware. Coupled to eachdata server 114 is a plurality ofuser devices 116.Such user devices 116 may include a desktop computer, laptop computer, handheld computer, printer, and/or any other type of logic-containing device. It should be noted that auser device 111 may also be directly coupled to any of the networks, in some embodiments. - A peripheral 120 or series of
peripherals 120, e.g., facsimile machines, printers, scanners, hard disk drives, networked and/or local storage units or systems, etc., may be coupled to one or more of thenetworks networks - According to some approaches, methods and systems described herein may be implemented with and/or on virtual systems and/or systems which emulate one or more other systems, such as a UNIX system which emulates an IBM z/OS environment, a UNIX system which virtually hosts a MICROSOFT WINDOWS environment, a MICROSOFT WINDOWS system which emulates an IBM z/OS environment, etc. This virtualization and/or emulation may be enhanced through the use of VMWARE software, in some embodiments.
- In more approaches, one or
more networks -
FIG. 2 shows a representative hardware environment associated with auser device 116 and/orserver 114 ofFIG. 1 , in accordance with one embodiment.FIG. 2 illustrates a typical hardware configuration of a workstation having a central processing unit (CPU) 210, such as a microprocessor, and a number of other units interconnected via one ormore buses 212 which may be of different types, such as a local bus, a parallel bus, a serial bus, etc., according to several embodiments. - The workstation shown in
FIG. 2 includes a Random Access Memory (RAM) 214, Read Only Memory (ROM) 216, an I/O adapter 218 for connecting peripheral devices such asdisk storage units 220 to the one ormore buses 212, auser interface adapter 222 for connecting akeyboard 224, amouse 226, aspeaker 228, amicrophone 232, and/or other user interface devices such as a touch screen, a digital camera (not shown), etc., to the one ormore buses 212,communication adapter 234 for connecting the workstation to a communication network 235 (e.g., a data processing network) and adisplay adapter 236 for connecting the one ormore buses 212 to adisplay device 238. - The workstation may have resident thereon an operating system such as the MICROSOFT WINDOWS Operating System (OS), a MAC OS, a UNIX OS, etc. It will be appreciated that a preferred embodiment may also be implemented on platforms and operating systems other than those mentioned. A preferred embodiment may be written using JAVA, XML, C, and/or C++ language, or other programming languages, along with an object oriented programming methodology. Object oriented programming (OOP), which has become increasingly used to develop complex applications, may be used.
- Now referring to
FIG. 3 , asystem 300 is shown according to one embodiment, which has a plurality offabric switches 304 interconnected in anetwork fabric 302, each of the fabric switches 304 being connected to one another viaconnections 306. Eachfabric switch 304 is connected, directly or indirectly to a switch controller 308 (as denoted by dashedline connection 310 between theswitch controller 308 and the network fabric 302). Theswitch controller 308 is capable of receiving information from each of the fabric switches 304 and is capable of sending information and/or commands to the fabric switches 304. - According to one embodiment, the
switch controller 308 may operate according to OpenFlow and/or SDN standards, and eachfabric switch 304 may be OpenFlow and/or SDN compliant. In other embodiments, theswitch controller 308 may utilize a different application capable of controlling the fabric switches 304 as would be known by one of skill in the art, such as Beacon, Jaxon, NOX, POX, Maestro, etc. - In addition, the
network fabric 302 may be a physical and/or virtual network fabric (a network fabric which utilizes only physical devices, a network fabric which only utilizes virtual devices, and/or a network fabric which utilizes a combination of physical and virtual devices). In addition, each of the fabric switches 304 may be a physical switch, a virtual switch, or a combination thereof. - The
system 300 may further comprise one ormore hosts 312 connected to thenetwork fabric 302 via one ormore fabric switches 304 viaconnections 314. Any of thehosts 312 may be a physical host, a virtual host, or a combination thereof. The hosts may be any type of device capable of communicating with thenetwork fabric 302, such as another network, a server, a controller, a workstation, an end station, etc. Eachhost 312 may include an interface for communicating with thenetwork fabric 302 and one ormore fabric switches 304 therein. Each of thehosts 312 are unaware of the physical components of thenetwork fabric 302 and instead view thenetwork fabric 302 as a single entity to which a connection may be made, in one approach. Of course, eachhost 312 is actually connected to at least onephysical fabric switch 304 within thenetwork fabric 302. Thehost 312 may be connected tomultiple fabric switches 304 in the case of a Link Aggregation (LAG) connection. - The
switch controller 308 may comprise logic adapted to analyze and configure thenetwork fabric 302 such that there is one or more non-looping paths through thenetwork fabric 302 between any twohosts 312 or other end stations connected to thenetwork fabric 302. Ideally, the logic may be able to determine multiple paths through thenetwork fabric 302, in order to provide redundancy, increased throughput, and decreased latency, among other advantages. - There are many factors to consider in determining paths through the
network fabric 302. Some factors include the number of layers in the fabric, L, the number of nodes per layer, NL, the switch controller's topology and connectivity graph (and whether theswitch controller 308 is capable of globalizing the routing decisions), etc. - Furthermore, in order for multipathing to take place in the
network fabric 302, the multipathing may take place in-order via Equal Cost Multi-Pathing (ECMP) and/or LAG hashing (and what type of hash used may be a consideration, such as an industry standard, a legacy system, etc.). In addition, the multipathing may support high performance operation via adaptive routing. - Converged Enhanced Ethernet (CEE) may also be supported by the
network fabric 302, such as by using Priority Flow Control (PFC) and/or Enhanced Transmission Selection (ETS) along the complete path through thenetwork fabric 302 in addition to Quantized Congestion Notification (QCN). Additionally, link congestion may trigger saturation tree with QCN. - In one embodiment, interface-based path representation, where a single interface to a network may be used to gain perspective on the network from a point of view of that interface. This interface-based path representation may then be used to span the
network fabric 302, as shown inFIG. 3 . For example,Host 1 is shown connected directly to fabric switch S1. In this example, the interface forHost 1 to thenetwork fabric 302 may be a single physical port, a virtual port, a static LAG, a dynamic LAG, or any other suitable interface betweenHost 1 and fabric switch S1. Also, in this example, a global forwarding table may be created, managed, updated, and utilized by theswitch controller 308 to make routing decisions, for example, once a packet is received by fabric switch S1 fromHost 1 all the way until the packet is received byhost 2 via S3. - In one embodiment, the
switch controller 308 may be consulted anytime a routing decision is to be made for a packet received by any of the fabric switches 304 in thenetwork fabric 302. - In another embodiment, each
fabric switch 304 may have resident therein a source-routing table. In this case, thefabric switch 304 inserts the route information into each incoming packet that does not yet have source-routing information stored therein. One disadvantage of this approach is that a lot of redundancy in terms of routing information in the network is introduced, which makes routing updates cumbersome, since they must be done for eachfabric switch 304 in thenetwork fabric 302. One advantage of this approach is that legacy (i.e., non-source routing capable) devices and components (e.g., network interface cards (NICs), legacy switches, etc.) may be attached to thenetwork fabric 302. - Now referring to
FIG. 4 , aportion 400 of thenetwork fabric 302 is shown, with one exemplary path through thenetwork fabric 302 shown in more detail. This path is between twohosts 312, specifically Host 1 to Host 2, and includes threefabric switches 304—S1, S2, and S3. For the sake of this description,Host 1 may be assumed to have an address of Addr1 andHost 2 an address of Addr2, while it may be assumed that fabric switch S1 is connected to Host 1 via port 4 (denoted as P4), and to fabric switch S2 via a LAG (denoted as L2). Likewise, fabric switch S2 is connected to fabric switch S1 via LAG L2. Furthermore, it may be assumed that fabric switch S2 is connected to fabric switch S3 via port 3 (denoted as P3), while fabric switch S3 is connected to fabric switch S2 via port 6 (denoted as P6) and to Host 2 via port 2 (denoted as P2), as shown inFIG. 4 . - The path between
Host 1 andHost 2 may be represented in eachfabric switch 304 in a forwarding table, according to one embodiment, which may be stored locally to eachfabric switch 304, or globally by theswitch controller 308 in another embodiment. - In this example, the path would be represented as follows in the forwarding table for each fabric switch, where the destination port set is identified as [device]/[port]. Furthermore, each destination port may be a physical port, a virtual port, or a combination thereof.
-
Forwarding Table S1 Destination Address Destination Port Set Host Addr1 S1/P4 Host Addr2 S1/L2/P5, S2/P3, S3/P2 -
Forwarding Table S2 Destination Address Destination Port Set Host Addr1 S2/L2/P4, S1/P4 Host Addr2 S2/P3, S3/P2 -
Forwarding Table S3 Destination Address Destination Port Set Host Addr1 S3/P6, S2/L2/P4, S1/P4 Host Addr2 S3/P2 - Therefore, when a packet is received by fabric switch S1 from
Host 1, and the packet is to be forwarded to Host 2, fabric switch S1 will follow a path from S1/L2/P5 to S2/P3 to S3/P2. This is because the LAG L2 is chosen and followsport 5 out offabric switch 1 tofabric switch 2. Likewise, in the reverse direction,fabric switch 2 choosesport 4 in the LAG L2. Furthermore, this forwarding logic may take into account a hashing algorithm information exchange protocol, and in one approach, only the edge switches (switches S1 and S3 in this example) may maintain forwarding tables, and intermediate switches (switch S2 in this example) may simply follow the source route in the packet. - In another embodiment, referring again to
FIG. 3 , theswitch controller 308 may have certain physical topology available to construct the paths through thenetwork fabric 302. In that topology, the fabric switches 304 and physical connectivity therebetween are shown. In using the topology information, theswitch controller 308 is adapted to determine the ARP entries associated with either its local ARP connectivity or general subnet distribution in thenetwork fabric 302. So combining this information, theswitch controller 308 creates these source-routing tables and may offload them to any devices capable of source routing in thenetwork fabric 302 or connected thereto. - The
switch controller 308 may offload the forwarding table information to just the fabric switches 304, or to the fabric switches 304 and the end hosts 312. In this embodiment, eachend host 312 may have a forwarding table that includes source-routing information for packets being sent to other end hosts connected to thenetwork fabric 302. - In this approach, a network interface card (NIC) of the
host 312 or some other component or device may produce the source routing for each forwarded packet. In this approach, each NIC has a source-routing table that indicates the sequence of turns or hops to each known destination, and inserts the route into a frame of the packet upon injection into thefabric network 302. Still, hardware support from the fabric switches 304 is beneficial in order for this approach to function properly, because when eachfabric switch 304 has the capability to inspect the frame for the presence of a source route thevarious fabric switches 304 in the path may then take its respective routing decision based on that source-routed information. Otherwise, when afabric switch 304 which lacks source-routing capability encounters a packet, it will only be able to send the packet along according to some other information, without the benefit of the source-routing information which indicates a chosen route through the network. In addition, eachfabric switch 304 may still have a traditional routing table to handle non-source-routed frames, regardless of whether thefabric switch 304 has the capability to handle source-routed frames. - In the case where a switch lacks the ability to handle source-routed frames, the
fabric switch 304 may simply rely on a traditional routing table with which to determine a next hop and egress port. In this case, one or more devices within the path may lack the ability to handle the source-routed frames, but the packet may still be forwarded without problems until it reaches anotherfabric switch 304 or device in the path which is capable of handling a source-routed frame, where it will once again be handled according to the source routing. Each device in the network fabric may be a virtual device, a physical device, or a combination thereof. Furthermore, each egress port may be a physical port, a virtual port, or a combination thereof. - In any embodiment described herein, each device in the
network fabric 302 and connected thereto capable of source routing may know each of the other devices to which it is connected which are source-routing capable. Accordingly, each source routing capable device is able to determine when it is forwarding a packet to a device which is not capable of source routing. In this case, the source-routing information may be stripped from the packet, and may appear as a standard packet to the receiving device. - Referring again to
FIG. 4 , the exemplary path through thenetwork fabric 302 is again referenced. This path is between twohosts 312, specifically Host 1 to Host 2, and includes threefabric switches 304—S1, S2, and S3. In this approach, however,Host 1 andHost 2 have the forwarding tables, and the switches are instructed to forward packets according to the source-routed information included therein. - In this approach, the path would be represented as follows in the forwarding table for each
host 312, where the destination port set is identified as [device]/[port]. -
Forwarding Table Host 1Destination Address Destination Port Set Host Addr1 Internal Host Addr2 S1/L2/P5, S2/P3, S1/P2 -
Forwarding Table Host 2Destination Address Destination Port Set Host Addr1 S3/P6, S2/L2/P4, S1/P4 Host Addr2 Internal - The source-routing information may be encapsulated in the packets in various different ways, and may depend on any protocols and/or network types that the packet adheres to. In one embodiment, the source-routing information may be included in a header of a packet. One example of this is shown in
FIG. 5A . - Referring to
FIG. 5A , aframe format 500 for an exemplary packet is shown according to one embodiment. Theframe format 500 includes a destination media access control address (DMAC) 502, a source MAC address (SMAC) 504, a source routing tag (SR-Tag) 506 that includes the source-routing information, a service tag (S-Tag) 508, a customer tag (C-Tag) 510, anethertype descriptor 512, apayload 514 for the packet, and an optional frame check sequence (FCS) 516. - In one approach, the
DMAC 502,SMAC 504, S-Tag 508, C-Tag 510,payload 514 andFCS 516 may behave and be utilized in the same manner as typical for any packet adhering to any of various IEEE standards; however, theether type descriptor 512 may take into account the length of the SR-Tag 506. - Regarding the SR-
Tag 506, when a switch is not source routing capable, then the SR-Tag 506 may be omitted from the packet in order for the switch to understand the information in the header of the packet. This may be performed by any device which forwards the packet to a non-source routing compliant device, such as a legacy switch. Then when the packet is received from this legacy device by another source routing compliant device, the SR-Tag 506 may be reinserted into the header and the source-routing information may be restored from this hop forward to the destination in one embodiment, or the entire source-routing information may be added to the SR-Tag 506 from the source to the destination, in an alternate embodiment. - The SR-
Tag 506, in some embodiments, may include source-routing information, enforcement options, and hop count information. - A Tag Protocol Identifier (TPID), such as the
TPID 520 shown inFIG. 5B according to one embodiment, may be used to denote the SR-Tag 506. TheTPID 520 to denote a SR-Tag may have thecode 0×D2D2, but is not so constrained, as any available string may be used to denote an SR-Tag. The SR-Tag 506 may be formatted to include a series of strings, each string having a predetermined length. In this example, the strings are 16 bits long, but any length may be used, such as 8 bits, 24 bits, 32 bits, etc. The first string may be designated forEnforcement Options 522 and theHop Count 524, with each string including half the string length (8 bits each) or some other division. - The
Enforcement Options 522 may be used to indicate any enforcement criteria for a particular packet. For example, if the switch has a forwarding table which is inconsistent with a next hop stipulated in the source-routing information, then the switch may be directed to overwrite its own forwarding table with the source-routing information or the source-routing information may be rewritten based on the switch's local forwarding table. This decision may be indicated in theEnforcement Options 522. TheseEnforcement Options 522 also may dictate whether the source-routing information is strictly followed or if it may be bypassed. Then, other traffic management options may be present, such as which of various available ports to choose to egress the packet (such as in a LAG or some other suitable arrangement). This is possible because it is a logical interface. A logical interface might come up with more than one physical port to choose from. TheEnforcement Options 522 may indicate that a port with lowest latency should be chosen, or a port with the highest latency but more reliability, or some other traffic management decision that are understood by intermediate switches. Most of the instructions that may be stored in theEnforcement Options 522 may be related to reliability and/or traffic management. Some of these options may even allow filtering or not filtering based on the SR-Tags because the SR-Tags are a necessary component in order to filter, and basically the options might indicate that the SR-Tag is to be retained no matter what. In another approach, the switch may use the SR-Tag if it understands it, or it may discard the SR-Tag if it does not. - The
Hop Count 524 is used to denote which hop the packet is currently at. After the initial string having theEnforcement Options 522 and theHop Count 524, a series ofBridge IDs 526 related toLogical Port IDs 528 andOptions 530 thereof may be listed, one for each hop, shown ashop 0,hop 1,hop 2, . . . , hop N. The number ofBridge IDs 526 andLogical Port IDs 528 andOptions 530 may depend on the number of hops in the designated path, e.g., N.The Hop Count 524 indicates the current hop in the path where the packet is supposed to be, e.g., a number between 0 and N, either beginning at N or 0. If the packet is not at the indicated hop, then corrective action may be taken by the switch to correct any issues with theHop Count 524 and/or designated path. - Each set of
Bridge ID 526 andLogical Port ID 528 and Options thereof may be 16 bits in length, with theBridge IDs 526 being 16 bits, theLogical Port ID 528 being 12 bits, and theOptions 530 being 4 bits. Of course, any other length may be used for these fields, as would be understood by one of skill in the art. - Now referring to
FIG. 6 , a flowchart of amethod 600 for source routing packets is shown, according to one embodiment. Themethod 600 may be performed in accordance with the present invention in any of the environments depicted inFIGS. 1-5B , among others, in various embodiments. Of course, more or less operations than those specifically described inFIG. 6 may be included inmethod 600, as would be understood by one of skill in the art upon reading the present descriptions. - Each of the steps of the
method 600 may be performed by any suitable component of the operating environment. For example, in one embodiment, themethod 600 may be partially or entirely performed by a fabric switch, an end station, a processor (such as an ASIC, a switching ASIC, a CPU, etc.) embodied in a computer, a switch controller, a host connected to a network fabric having a plurality of fabric switches interconnected therein, etc. - As shown in
FIG. 6 ,method 600 may initiate withoperation 602, where a packet is received or created using a network interface card (NIC) of a host connected to a network fabric. The NIC in this method is source routing capable. The network fabric includes a plurality of fabric switches interconnected therein, each fabric switch possibly being source routing capable. - According to one embodiment, the switch controller may be adapted to operate according to OpenFlow standards, and the NIC or host may be OpenFlow compliant. In this approach, source-routing table details and rules may be received from the OpenFlow Controller, as a way of programming which information and how the information is stored in the source-routing table.
- In
operation 604, a path through the network fabric is determined by consulting a source-routing table stored to the host. The path may be chosen from many different available paths between the host and the detonation address of the packet. In one embodiment, traffic may be load balanced between the destination address and the host by changing which path is selected for each new packet, stream of packets, flow, etc. - In a further embodiment, source-routing information may be received from a switch controller, and the source-routing information may be stored to the source-routing table, thereby allowing the host to send packets to any known destination in the network fabric without the use of a traditional look-up table.
- In
operation 606, source-routing information is stored to a packet header for the packet, the source-routing information comprising the path. - In a further embodiment, the source-routing information may be stored in a SR-Tag in the packet header. The SR-Tag may comprise, as described in more detail previously, an enforcement options field, a hop count indicator field for indicating a current device or hop in the path, and the source-routing information for the path, comprising a bridge indicator associated with a logical port indicator and options thereof for each device or hop in the path.
- In
operation 608, the packet is sent to a first device or hop indicated by the path in the source-routing information. This operation may be performed without the use of a look-up table. The first device or hop may be part of the path stored in the packet header which indicates the path through the network fabric. - In more embodiments, referring again to
FIG. 6 , any or all operations ofmethod 600 may be implemented in a system, a fabric switch, a device, a network, a host, a processor, and/or a computer program product. - Now referring to
FIG. 7 , another flowchart ofmethod 700 for source routing packets is shown, according to one embodiment. Themethod 700 may be performed in accordance with the present invention in any of the environments depicted inFIGS. 1-5B , among others, in various embodiments. Of course, more or less operations than those specifically described inFIG. 7 may be included inmethod 700, as would be understood by one of skill in the art upon reading the present descriptions. - Each of the steps of the
method 700 may be performed by any suitable component of the operating environment. For example, in one embodiment, themethod 700 may be partially or entirely performed by a fabric switch, an end station, a processor (such as an ASIC, a switching ASIC, a CPU, etc.) embodied in a computer, a switch controller, a host connected to a network fabric having a plurality of fabric switches interconnected therein, etc. - As shown in
FIG. 7 ,method 700 may initiate withoperation 702, where a packet is received, such as by a fabric switch in a network fabric comprising a plurality of interconnected fabric switches. In addition, one or more hosts may be connected to the network fabric. The fabric switch may be connected, directly or indirectly, to a switch controller for controlling certain functions thereof. - According to one embodiment, the switch controller may be adapted to operate according to OpenFlow standards, and the fabric switch may be OpenFlow compliant.
- In
operation 704, source-routing information may be received with a fabric switch, the source-routing information being sent from the switch controller. - In
operation 706, the source-routing information is stored to a source-routing table that indicates a sequence of devices or hops between the fabric switch and each known destination address in the network fabric. - In
operation 708, a next device or hop in a path through the network fabric is determined by consulting the source-routing table. This operation may be performed without the use of a look-up table, in one approach. - In
operation 710, a portion of the source-routing information is stored to a packet header for the packet, the portion of the source-routing information comprising at least a portion of the path. - In one embodiment, the portion of the source-routing information may be stored in a SR-Tag, the SR-Tag comprising an enforcement options field, a hop count indicator field for indicating a current device or hop in the path, and the portion of the source-routing information for the at least the portion of the path. The portion of the source-routing information comprising a bridge indicator associated with a logical port indicator and options thereof for each device or hop in the path.
- In
operation 712, the packet is sent to the next device or hop indicated by the at least the portion of the path in the portion of the source-routing information. - In more embodiments, referring again to
FIG. 7 , any or all operations ofmethod 700 may be implemented in a system, a fabric switch, a device, a network, a host, a processor, and/or a computer program product. - While various embodiments have been described above, it should be understood that they have been presented by way of example only, and not limitation. Thus, the breadth and scope of an embodiment of the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.
Claims (23)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/781,561 US9270618B2 (en) | 2013-02-28 | 2013-02-28 | Source routing with fabric switches in an ethernet fabric network |
GB1516051.8A GB2525823B (en) | 2013-02-28 | 2014-01-30 | Source routing with fabric switches in an ethernet fabric network |
PCT/IB2014/058659 WO2014132149A1 (en) | 2013-02-28 | 2014-01-30 | Source routing with fabric switches in an ethernet fabric network |
US14/987,719 US9680746B2 (en) | 2013-02-28 | 2016-01-04 | Source routing with fabric switches in an ethernet fabric network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/781,561 US9270618B2 (en) | 2013-02-28 | 2013-02-28 | Source routing with fabric switches in an ethernet fabric network |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/987,719 Continuation US9680746B2 (en) | 2013-02-28 | 2016-01-04 | Source routing with fabric switches in an ethernet fabric network |
Publications (2)
Publication Number | Publication Date |
---|---|
US20140241345A1 true US20140241345A1 (en) | 2014-08-28 |
US9270618B2 US9270618B2 (en) | 2016-02-23 |
Family
ID=51388086
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/781,561 Active 2033-12-07 US9270618B2 (en) | 2013-02-28 | 2013-02-28 | Source routing with fabric switches in an ethernet fabric network |
US14/987,719 Active US9680746B2 (en) | 2013-02-28 | 2016-01-04 | Source routing with fabric switches in an ethernet fabric network |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/987,719 Active US9680746B2 (en) | 2013-02-28 | 2016-01-04 | Source routing with fabric switches in an ethernet fabric network |
Country Status (3)
Country | Link |
---|---|
US (2) | US9270618B2 (en) |
GB (1) | GB2525823B (en) |
WO (1) | WO2014132149A1 (en) |
Cited By (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140269325A1 (en) * | 2013-03-15 | 2014-09-18 | International Business Machines Corporation | Bypassing congestion points in a converged enhanced ethernet fabric |
US20150055654A1 (en) * | 2013-08-23 | 2015-02-26 | Futurewei Technologies, Inc. | Segmented Source Routing in a Network |
US20150264116A1 (en) * | 2014-03-14 | 2015-09-17 | Ira Weiny | Scalable Address Resolution |
US20150281049A1 (en) * | 2014-03-31 | 2015-10-01 | Vmware, Inc. | Fast lookup and update of current hop limit |
US20150326439A1 (en) * | 2014-05-09 | 2015-11-12 | Silicon Image, Inc. | Stream creation with limited topology information |
US9219691B2 (en) | 2013-03-15 | 2015-12-22 | International Business Machines Corporation | Source-driven switch probing with feedback request |
US9401857B2 (en) | 2013-03-15 | 2016-07-26 | International Business Machines Corporation | Coherent load monitoring of physical and virtual networks with synchronous status acquisition |
US20160241460A1 (en) * | 2015-02-17 | 2016-08-18 | Huawei Technologies Co., Ltd. | Service Chaining Using Source Routing |
US20160241459A1 (en) * | 2013-10-26 | 2016-08-18 | Huawei Technologies Co.,Ltd. | Method for acquiring, by sdn switch, exact flow entry, and sdn switch, controller, and system |
US20170026224A1 (en) * | 2015-07-22 | 2017-01-26 | Cisco Technology, Inc. | Resilient segment routing service hunting with tcp session stickiness |
WO2017014904A1 (en) * | 2015-07-20 | 2017-01-26 | Schweitzer Engineering Laboratories, Inc . | Configuration of a software defined network |
US20170048139A1 (en) * | 2015-08-14 | 2017-02-16 | Futurewei Technologies, Inc. | Interconnecting an Overlay Management Control Network (OMCN) to a Non-OMCN |
US9660836B2 (en) | 2014-05-06 | 2017-05-23 | Lattice Semiconductor Corporation | Network topology discovery |
WO2017088190A1 (en) * | 2015-11-27 | 2017-06-01 | 华为技术有限公司 | Method and apparatus for acquiring path information |
US9680746B2 (en) | 2013-02-28 | 2017-06-13 | International Business Machines Corporation | Source routing with fabric switches in an ethernet fabric network |
CN107409132A (en) * | 2015-03-10 | 2017-11-28 | 华为技术有限公司 | The software defined network control signaling for traffic engineering of polymorphic type transmission is enabled in the dataplane |
US9866483B2 (en) | 2015-07-20 | 2018-01-09 | Schweitzer Engineering Laboratories, Inc. | Routing of traffic in network through automatically generated and physically distinct communication paths |
US9900206B2 (en) | 2015-07-20 | 2018-02-20 | Schweitzer Engineering Laboratories, Inc. | Communication device with persistent configuration and verification |
US9954781B2 (en) | 2013-03-15 | 2018-04-24 | International Business Machines Corporation | Adaptive setting of the quantized congestion notification equilibrium setpoint in converged enhanced Ethernet networks |
US20180176152A1 (en) * | 2016-12-19 | 2018-06-21 | Futurewei Technologies, Inc. | Programmable and low latency switch fabric for scale-out router |
US10291514B2 (en) | 2015-04-17 | 2019-05-14 | Huawei Technologies Co., Ltd. | Software defined network (SDN) control signaling for traffic engineering to enable multi-type transport in a data plane |
US20190182143A1 (en) * | 2017-04-09 | 2019-06-13 | Barefoot Networks, Inc. | Source Routing Design with Simplified Forwarding Elements |
US10341311B2 (en) | 2015-07-20 | 2019-07-02 | Schweitzer Engineering Laboratories, Inc. | Communication device for implementing selective encryption in a software defined network |
US10659314B2 (en) | 2015-07-20 | 2020-05-19 | Schweitzer Engineering Laboratories, Inc. | Communication host profiles |
US10785189B2 (en) | 2018-03-01 | 2020-09-22 | Schweitzer Engineering Laboratories, Inc. | Selective port mirroring and in-band transport of network communications for inspection |
US10863558B2 (en) | 2016-03-30 | 2020-12-08 | Schweitzer Engineering Laboratories, Inc. | Communication device for implementing trusted relationships in a software defined network |
US10904207B2 (en) * | 2019-04-29 | 2021-01-26 | Cloudflare, Inc. | Intelligently routing a response packet along a same connection as a request packet |
US10979309B2 (en) | 2019-08-07 | 2021-04-13 | Schweitzer Engineering Laboratories, Inc. | Automated convergence of physical design and configuration of software defined network |
US11075908B2 (en) | 2019-05-17 | 2021-07-27 | Schweitzer Engineering Laboratories, Inc. | Authentication in a software defined network |
CN113489613A (en) * | 2021-07-16 | 2021-10-08 | 锐捷网络股份有限公司 | Message forwarding method and device |
US11165685B2 (en) | 2019-12-20 | 2021-11-02 | Schweitzer Engineering Laboratories, Inc. | Multipoint redundant network device path planning for programmable networks |
US11228521B2 (en) | 2019-11-04 | 2022-01-18 | Schweitzer Engineering Laboratories, Inc. | Systems and method for detecting failover capability of a network device |
US11336564B1 (en) | 2021-09-01 | 2022-05-17 | Schweitzer Engineering Laboratories, Inc. | Detection of active hosts using parallel redundancy protocol in software defined networks |
US11418432B1 (en) | 2021-04-22 | 2022-08-16 | Schweitzer Engineering Laboratories, Inc. | Automated communication flow discovery and configuration in a software defined network |
US11750502B2 (en) | 2021-09-01 | 2023-09-05 | Schweitzer Engineering Laboratories, Inc. | Detection of in-band software defined network controllers using parallel redundancy protocol |
US11838174B2 (en) | 2022-02-24 | 2023-12-05 | Schweitzer Engineering Laboratories, Inc. | Multicast fast failover handling |
US11848860B2 (en) | 2022-02-24 | 2023-12-19 | Schweitzer Engineering Laboratories, Inc. | Multicast fast failover turnaround overlap handling |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9654395B2 (en) * | 2014-03-31 | 2017-05-16 | Kulcloud | SDN-based service chaining system |
US9876714B2 (en) | 2014-11-14 | 2018-01-23 | Nicira, Inc. | Stateful services on stateless clustered edge |
US9866473B2 (en) | 2014-11-14 | 2018-01-09 | Nicira, Inc. | Stateful services on stateless clustered edge |
US10044617B2 (en) | 2014-11-14 | 2018-08-07 | Nicira, Inc. | Stateful services on stateless clustered edge |
US11533255B2 (en) * | 2014-11-14 | 2022-12-20 | Nicira, Inc. | Stateful services on stateless clustered edge |
US11296984B2 (en) | 2017-07-31 | 2022-04-05 | Nicira, Inc. | Use of hypervisor for active-active stateful network service cluster |
US11570092B2 (en) | 2017-07-31 | 2023-01-31 | Nicira, Inc. | Methods for active-active stateful network service cluster |
US10951584B2 (en) | 2017-07-31 | 2021-03-16 | Nicira, Inc. | Methods for active-active stateful network service cluster |
US11153122B2 (en) | 2018-02-19 | 2021-10-19 | Nicira, Inc. | Providing stateful services deployed in redundant gateways connected to asymmetric network |
US11799761B2 (en) | 2022-01-07 | 2023-10-24 | Vmware, Inc. | Scaling edge services with minimal disruption |
US11962564B2 (en) | 2022-02-15 | 2024-04-16 | VMware LLC | Anycast address for network address translation at edge |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030033427A1 (en) * | 2001-05-10 | 2003-02-13 | Brahmaroutu Surender V. | Method for determining multiple paths between ports in a switched fabric |
US20120250682A1 (en) * | 2011-03-30 | 2012-10-04 | Amazon Technologies, Inc. | Frameworks and interfaces for offload device-based packet processing |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030016668A1 (en) | 2001-07-17 | 2003-01-23 | Antonio Mugica | Method of device-to-device communications in hybrid distributed device control networks |
US8111618B2 (en) | 2006-01-27 | 2012-02-07 | Alcatel Lucent | End-to-end service quality using source-routed probes |
US8279870B2 (en) | 2007-08-01 | 2012-10-02 | Silver Spring Networks, Inc. | Method and system of routing in a utility smart-grid network |
US8539098B2 (en) | 2007-10-17 | 2013-09-17 | Dispersive Networks, Inc. | Multiplexed client server (MCS) communications and systems |
CN101471841A (en) | 2007-12-29 | 2009-07-01 | 华为技术有限公司 | Method and apparatus for maintaining routing table |
US8599863B2 (en) | 2009-10-30 | 2013-12-03 | Calxeda, Inc. | System and method for using a multi-protocol fabric module across a distributed server interconnect fabric |
CA2784456A1 (en) | 2010-03-26 | 2011-09-29 | Peter Ashwood Smith | Distributed failure recovery in a routed ethernet network |
US8897134B2 (en) | 2010-06-25 | 2014-11-25 | Telefonaktiebolaget L M Ericsson (Publ) | Notifying a controller of a change to a packet forwarding configuration of a network element over a communication channel |
US20120099591A1 (en) | 2010-10-26 | 2012-04-26 | Dell Products, Lp | System and Method for Scalable Flow Aware Network Architecture for Openflow Based Network Virtualization |
ES2607106T3 (en) | 2011-04-04 | 2017-03-29 | Nec Corporation | Network system, switch and terminal detection method connected |
US9270618B2 (en) | 2013-02-28 | 2016-02-23 | International Business Machines Corporation | Source routing with fabric switches in an ethernet fabric network |
-
2013
- 2013-02-28 US US13/781,561 patent/US9270618B2/en active Active
-
2014
- 2014-01-30 WO PCT/IB2014/058659 patent/WO2014132149A1/en active Application Filing
- 2014-01-30 GB GB1516051.8A patent/GB2525823B/en not_active Expired - Fee Related
-
2016
- 2016-01-04 US US14/987,719 patent/US9680746B2/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030033427A1 (en) * | 2001-05-10 | 2003-02-13 | Brahmaroutu Surender V. | Method for determining multiple paths between ports in a switched fabric |
US20120250682A1 (en) * | 2011-03-30 | 2012-10-04 | Amazon Technologies, Inc. | Frameworks and interfaces for offload device-based packet processing |
Cited By (72)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9680746B2 (en) | 2013-02-28 | 2017-06-13 | International Business Machines Corporation | Source routing with fabric switches in an ethernet fabric network |
US9219689B2 (en) | 2013-03-15 | 2015-12-22 | International Business Machines Corporation | Source-driven switch probing with feedback request |
US20150078170A1 (en) * | 2013-03-15 | 2015-03-19 | International Business Machines Corporation | Bypassing congestion points in a converged enhanced ethernet fabric |
US9998377B2 (en) | 2013-03-15 | 2018-06-12 | International Business Machines Corporation | Adaptive setting of the quantized congestion notification equilibrium setpoint in converged enhanced ethernet networks |
US9954781B2 (en) | 2013-03-15 | 2018-04-24 | International Business Machines Corporation | Adaptive setting of the quantized congestion notification equilibrium setpoint in converged enhanced Ethernet networks |
US9401857B2 (en) | 2013-03-15 | 2016-07-26 | International Business Machines Corporation | Coherent load monitoring of physical and virtual networks with synchronous status acquisition |
US9197563B2 (en) * | 2013-03-15 | 2015-11-24 | International Business Machines Corporation | Bypassing congestion points in a converged enhanced ethernet fabric |
US9219691B2 (en) | 2013-03-15 | 2015-12-22 | International Business Machines Corporation | Source-driven switch probing with feedback request |
US20140269325A1 (en) * | 2013-03-15 | 2014-09-18 | International Business Machines Corporation | Bypassing congestion points in a converged enhanced ethernet fabric |
US9253096B2 (en) * | 2013-03-15 | 2016-02-02 | International Business Machines Corporation | Bypassing congestion points in a converged enhanced ethernet fabric |
US9325609B2 (en) * | 2013-08-23 | 2016-04-26 | Futurewei Technologies, Inc. | Segmented source routing in a network |
CN105850082A (en) * | 2013-08-23 | 2016-08-10 | 华为技术有限公司 | Segmented source routing in a network |
US20150055654A1 (en) * | 2013-08-23 | 2015-02-26 | Futurewei Technologies, Inc. | Segmented Source Routing in a Network |
US9742656B2 (en) * | 2013-10-26 | 2017-08-22 | Huawei Technologies Co., Ltd. | Method for acquiring, by SDN switch, exact flow entry, and SDN switch, controller, and system |
US10367718B2 (en) * | 2013-10-26 | 2019-07-30 | Huawei Technologies Co., Ltd. | Method for acquiring, by SDN switch, exact flow entry, and SDN switch, controller, and system |
US20160241459A1 (en) * | 2013-10-26 | 2016-08-18 | Huawei Technologies Co.,Ltd. | Method for acquiring, by sdn switch, exact flow entry, and sdn switch, controller, and system |
US20150264116A1 (en) * | 2014-03-14 | 2015-09-17 | Ira Weiny | Scalable Address Resolution |
US9667528B2 (en) * | 2014-03-31 | 2017-05-30 | Vmware, Inc. | Fast lookup and update of current hop limit |
US10187294B2 (en) * | 2014-03-31 | 2019-01-22 | Vmware, Inc. | Fast lookup and update of current hop limit |
US10841204B2 (en) | 2014-03-31 | 2020-11-17 | Vmware, Inc. | Fast lookup and update of current hop limit |
US20150281049A1 (en) * | 2014-03-31 | 2015-10-01 | Vmware, Inc. | Fast lookup and update of current hop limit |
US9660836B2 (en) | 2014-05-06 | 2017-05-23 | Lattice Semiconductor Corporation | Network topology discovery |
US20150326439A1 (en) * | 2014-05-09 | 2015-11-12 | Silicon Image, Inc. | Stream creation with limited topology information |
US9590825B2 (en) | 2014-05-09 | 2017-03-07 | Lattice Semiconductor Corporation | Stream creation with limited topology information |
US9686101B2 (en) * | 2014-05-09 | 2017-06-20 | Lattice Semiconductor Corporation | Stream creation with limited topology information |
US10079722B2 (en) | 2014-05-09 | 2018-09-18 | Lattice Semiconductor Corporation | Stream creation with limited topology information |
US9736063B2 (en) * | 2015-02-17 | 2017-08-15 | Huawei Technologies Co., Ltd. | Service chaining using source routing |
WO2016131401A1 (en) | 2015-02-17 | 2016-08-25 | Huawei Technologies Co., Ltd. | Service chaining using source routing |
US10361947B2 (en) * | 2015-02-17 | 2019-07-23 | Huawei Technologies Co., Ltd. | Service chaining using source routing |
EP3248363A4 (en) * | 2015-02-17 | 2018-02-21 | Huawei Technologies Co., Ltd. | Service chaining using source routing |
US20160241460A1 (en) * | 2015-02-17 | 2016-08-18 | Huawei Technologies Co., Ltd. | Service Chaining Using Source Routing |
CN107409132A (en) * | 2015-03-10 | 2017-11-28 | 华为技术有限公司 | The software defined network control signaling for traffic engineering of polymorphic type transmission is enabled in the dataplane |
EP3257228A4 (en) * | 2015-03-10 | 2018-03-14 | Huawei Technologies Co., Ltd. | Software defined network (sdn) control signaling for traffic engineering to enable multi-type transport in data plane |
US10291514B2 (en) | 2015-04-17 | 2019-05-14 | Huawei Technologies Co., Ltd. | Software defined network (SDN) control signaling for traffic engineering to enable multi-type transport in a data plane |
US20170026243A1 (en) * | 2015-07-20 | 2017-01-26 | Schweitzer Engineering Laboratories, Inc. | Configuration of a software defined network |
WO2017014904A1 (en) * | 2015-07-20 | 2017-01-26 | Schweitzer Engineering Laboratories, Inc . | Configuration of a software defined network |
US9923779B2 (en) * | 2015-07-20 | 2018-03-20 | Schweitzer Engineering Laboratories, Inc. | Configuration of a software defined network |
CN107851109A (en) * | 2015-07-20 | 2018-03-27 | 施瓦哲工程实验有限公司 | The configuration of software defined network |
US10659314B2 (en) | 2015-07-20 | 2020-05-19 | Schweitzer Engineering Laboratories, Inc. | Communication host profiles |
US9900206B2 (en) | 2015-07-20 | 2018-02-20 | Schweitzer Engineering Laboratories, Inc. | Communication device with persistent configuration and verification |
US9866483B2 (en) | 2015-07-20 | 2018-01-09 | Schweitzer Engineering Laboratories, Inc. | Routing of traffic in network through automatically generated and physically distinct communication paths |
US10341311B2 (en) | 2015-07-20 | 2019-07-02 | Schweitzer Engineering Laboratories, Inc. | Communication device for implementing selective encryption in a software defined network |
US11277341B2 (en) | 2015-07-22 | 2022-03-15 | Cisco Technology, Inc. | Resilient segment routing service hunting with TCP session stickiness |
US10505846B2 (en) * | 2015-07-22 | 2019-12-10 | Cisco Technology, Inc. | Resilient segment routing service hunting with TCP session stickiness |
US20170026224A1 (en) * | 2015-07-22 | 2017-01-26 | Cisco Technology, Inc. | Resilient segment routing service hunting with tcp session stickiness |
US20170048139A1 (en) * | 2015-08-14 | 2017-02-16 | Futurewei Technologies, Inc. | Interconnecting an Overlay Management Control Network (OMCN) to a Non-OMCN |
WO2017088190A1 (en) * | 2015-11-27 | 2017-06-01 | 华为技术有限公司 | Method and apparatus for acquiring path information |
US10554536B2 (en) | 2015-11-27 | 2020-02-04 | Huawei Technologies Co., Ltd. | Method for obtaining path information and apparatus |
US10863558B2 (en) | 2016-03-30 | 2020-12-08 | Schweitzer Engineering Laboratories, Inc. | Communication device for implementing trusted relationships in a software defined network |
US20180176152A1 (en) * | 2016-12-19 | 2018-06-21 | Futurewei Technologies, Inc. | Programmable and low latency switch fabric for scale-out router |
US10447585B2 (en) * | 2016-12-19 | 2019-10-15 | Futurewei Technologies, Inc. | Programmable and low latency switch fabric for scale-out router |
WO2018113606A1 (en) | 2016-12-19 | 2018-06-28 | Huawei Technologies Co., Ltd. | Programmable and Low Latency Switch Fabric for Scale-Out Router |
EP3545656B1 (en) * | 2016-12-19 | 2023-06-21 | Huawei Technologies Co., Ltd. | Programmable and low latency switch fabric for scale-out router |
US10700959B2 (en) * | 2017-04-09 | 2020-06-30 | Barefoot Networks, Inc. | Source routing design with simplified forwarding elements |
US10826815B2 (en) | 2017-04-09 | 2020-11-03 | Barefoot Networks, Inc. | Verification of access control list rules provided with a message |
US10764170B2 (en) | 2017-04-09 | 2020-09-01 | Barefoot Networks, Inc. | Generation of path failure message at forwarding element based on message path |
US10757005B2 (en) | 2017-04-09 | 2020-08-25 | Barefoot Networks, Inc. | Execution of packet-specified actions at forwarding element |
US20190182143A1 (en) * | 2017-04-09 | 2019-06-13 | Barefoot Networks, Inc. | Source Routing Design with Simplified Forwarding Elements |
US10785189B2 (en) | 2018-03-01 | 2020-09-22 | Schweitzer Engineering Laboratories, Inc. | Selective port mirroring and in-band transport of network communications for inspection |
US11489810B2 (en) | 2019-04-29 | 2022-11-01 | Cloudflare, Inc. | Intelligently routing a response packet along a same connection as a request packet |
US11799827B2 (en) | 2019-04-29 | 2023-10-24 | Cloudflare, Inc. | Intelligently routing a response packet along a same connection as a request packet |
US10904207B2 (en) * | 2019-04-29 | 2021-01-26 | Cloudflare, Inc. | Intelligently routing a response packet along a same connection as a request packet |
US11075908B2 (en) | 2019-05-17 | 2021-07-27 | Schweitzer Engineering Laboratories, Inc. | Authentication in a software defined network |
US10979309B2 (en) | 2019-08-07 | 2021-04-13 | Schweitzer Engineering Laboratories, Inc. | Automated convergence of physical design and configuration of software defined network |
US11228521B2 (en) | 2019-11-04 | 2022-01-18 | Schweitzer Engineering Laboratories, Inc. | Systems and method for detecting failover capability of a network device |
US11165685B2 (en) | 2019-12-20 | 2021-11-02 | Schweitzer Engineering Laboratories, Inc. | Multipoint redundant network device path planning for programmable networks |
US11418432B1 (en) | 2021-04-22 | 2022-08-16 | Schweitzer Engineering Laboratories, Inc. | Automated communication flow discovery and configuration in a software defined network |
CN113489613A (en) * | 2021-07-16 | 2021-10-08 | 锐捷网络股份有限公司 | Message forwarding method and device |
US11336564B1 (en) | 2021-09-01 | 2022-05-17 | Schweitzer Engineering Laboratories, Inc. | Detection of active hosts using parallel redundancy protocol in software defined networks |
US11750502B2 (en) | 2021-09-01 | 2023-09-05 | Schweitzer Engineering Laboratories, Inc. | Detection of in-band software defined network controllers using parallel redundancy protocol |
US11838174B2 (en) | 2022-02-24 | 2023-12-05 | Schweitzer Engineering Laboratories, Inc. | Multicast fast failover handling |
US11848860B2 (en) | 2022-02-24 | 2023-12-19 | Schweitzer Engineering Laboratories, Inc. | Multicast fast failover turnaround overlap handling |
Also Published As
Publication number | Publication date |
---|---|
US9680746B2 (en) | 2017-06-13 |
US9270618B2 (en) | 2016-02-23 |
GB2525823B (en) | 2018-02-21 |
GB201516051D0 (en) | 2015-10-28 |
US20160119236A1 (en) | 2016-04-28 |
GB2525823A (en) | 2015-11-04 |
WO2014132149A1 (en) | 2014-09-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9680746B2 (en) | Source routing with fabric switches in an ethernet fabric network | |
US9699105B2 (en) | Self-routing multicast in a software defined network fabric | |
US8804572B2 (en) | Distributed switch systems in a trill network | |
US10484518B2 (en) | Dynamic port type detection | |
US10225179B2 (en) | Virtual port channel bounce in overlay network | |
US9258185B2 (en) | Fibre channel over Ethernet support in a trill network environment | |
US9363189B2 (en) | Credit based flow control in lossless ethernet networks | |
US10164883B2 (en) | System and method for flow management in software-defined networks | |
US9843504B2 (en) | Extending OpenFlow to support packet encapsulation for transport over software-defined networks | |
JP5497244B2 (en) | Method, master switch, switching network, program, apparatus, and system for implementing flow control in a switching network | |
CN105099848B (en) | Network extension group | |
US10693790B1 (en) | Load balancing for multipath group routed flows by re-routing the congested route | |
US20130318243A1 (en) | Integrated heterogeneous software-defined network | |
EP3095216B1 (en) | Single hop overlay architecture for line rate performance in campus networks | |
US8953607B2 (en) | Internet group membership protocol group membership synchronization in virtual link aggregation | |
US8675669B2 (en) | Policy homomorphic network extension | |
US9036646B2 (en) | Distributed routing mechanisms for a virtual switch enabled by a trill-based fabric |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW Y Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:DECUSATIS, CASIMER M.;GUSAT, MIRCEA;KAMBLE, KESHAV G.;AND OTHERS;SIGNING DATES FROM 20130304 TO 20130321;REEL/FRAME:030080/0389 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
AS | Assignment |
Owner name: KYNDRYL, INC., NEW YORK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTERNATIONAL BUSINESS MACHINES CORPORATION;REEL/FRAME:057885/0644 Effective date: 20210930 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 8 |