Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
STRICTLY NONBLOCKING MULTICAST MULTI-STAGE NETWORKS
Document Type and Number:
WIPO Patent Application WO/2006/033651
Kind Code:
A1
Abstract:
A three-stage network (Fig. 1A) is operated in strictly nonblocking manner in accordance with the invention includes an input stage (101) having r1 switches (IS) and n1 inlet links (IL) for each of r1 switches, an output stage (120) having r2 switches (OS) and n2 outlet links (OL) for each of r2 switches. The network also has a middle stage (130) of m switches (MS), and each middle switch has at least one link connected to each input switch for a total of at least r1 first internal links and at least one link connected to each output switch for a total of at least r2 second internal links, where m ≥ 2 * n1 + n2 - 1. In one embodiment, each multicast connection is set up through such a three-stage network by use of at most two switches in the middle stage.

Inventors:
KONDA VENKAT (US)
Application Number:
PCT/US2003/027972
Publication Date:
March 30, 2006
Filing Date:
September 06, 2003
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
TEAK TECHNOLOGIES INC (US)
KONDA VENKAT (US)
International Classes:
H04Q1/00; (IPC1-7): H04Q1/00
Foreign References:
US5801641A1998-09-01
US5544160A1996-08-06
US5276425A1994-01-04
US5023864A1991-06-11
Other References:
See also references of EP 1668923A4
Download PDF:
Claims:
M-0002 PCTCLAIMS
1. What is claimed is: A network having a plurality of multicast connections, said network comprising: an input stage comprising rλ input switches, and W1 inlet links for each of said rx 5 input switches; an output stage comprising r2 output switches, and n2 outlet links for each of said r2 output switches; and a middle stage comprising m middle switches, and each middle switch comprising at least one link (hereinafter "first internal link") connected to each input 10 switch for a total of at least r, first internal links, each middle switch further comprising at least one link (hereinafter "second internal link") connected to each output switch for a total of at least r2 second internal links; wherein each multicast connection from an inlet link passes through at most two middle switches, and said multicast connection further passes to a plurality of outlet links , 15 from said at most two middle switches.
2. The network of claim 1 , wherein m ≥ 2* nι +n2 l .
3. The network of claim 2, further is always capable of setting up said multicast connection by never changing path of an existing multicast connection, and the network is hereinafter "strictly 20 nonblocking network".
4. The network of claim 1 further comprising a controller coupled to each of said input, output and middle stages to set up said multicast connection.
5. The network of claim 2 wherein said rt input switches and r2 output switches are the same number of switches. MOOO2 PCT .
6. The network of claim 2 wherein said H1 inlet links and n2 outlet links are the same number of links and n} = n2 = n, then m > 3 * n 1.
7. The strictly nonblocking network of claim 3, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more strictly nonblocking networks.
8. The network of claim 1, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more networks.
9. A method for setting up one or more multicast connections in a network having an input stage having K1 *rx inlet links and r, input switches, an output stage having n2 *r2 outlet links and r2 output switches, and a middle stage having m middle switches, where each middle switch is connected to each of said rx input switches through T1 first internal links and each middle switch further comprising at least one link connected to at most d sajd output switches for a total of at least d second internal links, wherein 1 < d < r2 , said method comprising: receiving a multicast connection at said input stage; fanning out said multicast connection in said input stage into at most two middle switches to set up said multicast connection to a plurality of output switches among said r2 output switches, wherein said plurality of output switches are specified as destinations of said multicast connection, wherein first internal links from said input switch to said at most two middle switches and second internal links to said destinations from said at most two middles switches are available.
10. The method of claim 9 wherein said act of fanning out is performed without changing any existing connection to pass through another middle switch.
11. The method of claim 9 wherein said act of fanning out is performed recursively. M0002PCT .
12. A method for setting up one or more multicast connections in a network having an input stage having W1 * rx inlet links and rx input switches, an output stage having n2 * r2 outlet links and r2 output switches, and a middle stage having m middle switches, where each middle switch is connected to each of said rx input switches through r, first internal links and each middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein 1 < d ≤ r2 , said method comprising: checking if at least a first subset of destination output switches of said multicast connection have available second internal links to a first middle switch; and checking if a second middle switch has available second internal links to a second subset of destination output switches of said multicast connection. wherein each destination output switch of said multicast connection is one of said first subset of destination output switches and said second subset of destination output switches.
13. The method of claim 12 further comprising: checking if the input switch of said multicast connection has an available first internal link to said first middle switch and to said second middle switch.
14. The method of claim 12 further comprising: prior to said checkings, checking if all the destination output switches of said multicast connection are available at said first middle switch.
15. The method of claim 12 further comprising: repeating said checkings of available second internal links to another second subset of destination output switches for each middle switch other than said first and said second middle switches. wherein each destination output Switch of said multicast connection is one of said first subset of destination output switches and said second subset of destination output switches.
16. The method of claim 12 further comprising: M0O02 PCT repeating said checkings of available second internal links to another first subset of destination output switches with each middle stage switch other than said first middle stage switch.
17. The method of claim 12 further comprising: repeating said checkings of available first internal link to each middle stage switch other than said first middle switch and said second middle switch.
18. The method of claim 12 further comprising: setting up each of said multicast connection from its said input switch to its said output switches through not more than two middle switches, selected by said checkings, by fanning out said multicast connection in its said input switch into not more than said two middle stage switches.
19. A method of claim 12 wherein any of said acts of checking and setting up are performed recursively.
20. A method of setting up a multicast connection through a threestage network, said method comprising: fanning out only one or two times in an initial stage.
21. The method of claim 20 further comprising: fanning out any number of times in each of the remaining stages^ wherein said threestage network includes said remaining stages and said initial stage.
22. The method of claim 20 further comprising: repeating said acts of fanning out with a plurality of portions of each of said stages.
23. The method of claim 20 further comprising: recursively performing said act of fanning out.
24. The method of claim 20 wherein: M0002 PCT a remaining stage immediately following said initial stage comprises internal links that are at least two times the total number of inlet links of said initial stage.
25. The method of claim 20 wherein: said initial stage comprises a plurality of first switches, and a plurality of inlet links connected to each said first switch; and a remaining stage immediately following said initial stage comprises a plurality of second switches, that are at least double the number of inlet links of each first switch and each second switch comprises a plurality of internal links at least equal in number to the number of first switches in said initial stage.
26. A network comprising: an input stage comprising N1 or M1 * η inlet links and r, input switches and W1 inlet links for each of said r, input switches, and N1 = nx * rx , said nx inlet links for receiving multicast connections; an output stage comprising N2 or n2 * r2 outlet links and r2 output switches and n2 outlet links for each of said r2 output switches, and N2 = n2 *r2, said n2 outlet links for transmitting said received connections; and a middle stage having m middle switches, and each middle switch comprising at least one link connected to each input switch for a total of at least rx first internal links and each middle switch further comprising at least one link connected to at most d output switches for a total of at least d second internal links, wherein 1 < d < r2 , said initial stage having multicast connections with a fanout of one or two.
27. The network of claim 26 further comprising: said multicast connections having a fanout of one or more in said middle stage.
28. The network of claim 26 further comprising: said multicast connections having a fanout of one or more in said output stage.
29. A network having a plurality of multicast connections, said network comprising: M0002 PCT an input stage comprising ^1 input switches and W1 inlet links for each of said η input switches, and N1 = nl *rl ; an output stage comprising r2 output switches and n2 outlet links for each of said r2 output switches, and N2 = Tt2 * r2 ; and a middle stage comprising m middle switches, and each middle switch comprising at least one link connected to each input switch for a total of at least r, first internal links; each middle switch further comprising at least one link connected to each output switch for a total of at least r2 second internal links, wherein each multicast connection from an inlet link passes through at most three middles switches, and said multicast connection further passes to a plurality of outlet links from said at most three middle switches.
30. The network of claim 29, wherein w > 3 * «j + «2 l,.
31. The network of claim 3O, further is always capable of setting up said multicast connection by never changing path of an existing multicast connection, and the network is hereinafter "strictly nonblocking network".
32. The network of claim 29 comprising a controller in communication with said input, output and middle stages to set up said multicast connection.
33. The network of claim 3O wherein said rλ input switches and r2 output switches are the same number of switches.
34. The network of claim 30 wherein said W1 inlet links and n2 outlet links are the same number of links and H1 = n2 = κ, then m ≥ 4*nl .
35. The strictly nonblocking network of claim 31 , M0002 PCT wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more strictly nonblocking networks.
36. The network of claim 29, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more networks.
37. A method for setting up one or more multicast connections in a network having an input stage having H1 *rx inlet links and T1 input switches, an output stage having n2 *r2 outlet links and r2 output switches, and a middle stage having m middle switches, where each middle switch is connected to each, of said rλ input switches through ^1 first internal links and each middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein \ < d ≤ r2 , said method comprising : receiving a multicast connection at said input stage; fanning out said multicast connection in said input stage into at most three middle switches to set up said multicast connection to a plurality of output switches among said r2 output switches of said multicast connection, wherein said plurality of output switches are specified as destinations of said multicast connection, wherein first internal links from said input switch to said at most three middle switches and second internal links to said destinations from said at most three middle switches are available.
38. A method of claim 37 wherein said act of fanning out is performed without changing any existing connection to pass through another middle switch.
39. A method of claim 37 wherein said act of fanning out is performed recursively.
40. A method for setting up one or more multicast connections in a network having an input stage having nx * rx inlet links and rx input switches, an output stage having n2 * r2 outlet links and r2 output switches, and a middle stage having m middle switches, where each middle switch is connected to eact* of said T1 input switches through rλ first M0002 PCT internal links and each middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein 1 < d ≤ r2 , said method comprising : checking if all the destination output switches of said multicast connection have available second internal links from at most three middle switches.
41. The method of claim 40 further comprising: checking if the input switch of said multicast connection has available first internal links to at most said three middle switches.
42. The method of claim 40 further comprising; repeating said checkings of available second internal links to all said destination output switches for all the other combinations of at most three middle switches.
43. The method of claim 40 further comprising: repeating said checkings of available first internal links for all the other combinations of at most three middle switches.
44. The method of claim 40 further comprising: setting up each of said connection from its said input switch to its said output switches through at most said three middle switches, selected by said checkings, by fanning out said multicast connection in its said input switch into at most said three middle stage switches;.
45. The method of claim 40 wherein any of said acts of checking and setting up are performed recursively.
46. A method of setting up a multicast connection through a threestage network, said method comprising: fanning out at most three times in an initial stage.
47. The method of claim 46 further comprising: fanning out any number of times in each of the remaining stages, M00Q2 PCT wherein said threestage network includes said remaining stages and said initial stage.
48. The method of claim 46 further comprising: repeating said acts of fanning out with a plurality of portions of each said stages.
49. The method of claim 46 further comprising: recursively performing said act of fanning out.
50. The method of claim 46 wherein: a remaining stage immediately following said initial stage comprises internal links that are at least three times the total number of inlet links of said initial stage.
51. The method of claim 46 wherein: said initial stage comprises a plurality of first switches, and plurality of inlet links connected to each said first switch; and a remaining stage immediately following said initial stage comprises a plurality of second switches, that are at least three times the number of inlet links of each first switch and each second switch comprises a plurality of first internal links at least equal in number to the number of first switches in said initial stage.
52. A network comprising: an input stage comprising N1 or «, *rλ inlet links and rx input switches and n^ inlet links for each of said rx input switches, and N1 = nγ * rγ, said W1 inlet links for receiving connection connections; an output stage comprising N2 or «2 * rt outlet links and r2 output switches and »2 outlet links for each of said r2 output switches, and N2 = n2 *r2 , said n2 outlet links for transmitting said received connections; and a middle stage having m middle switches, and each middle switch has at least one link connected to each input switch for a total of at least T1 first internal links and each middle switch further comprising at least one link connected to at most d output switches for a total of at least d second internal links, wherein 1 < d < r2 , M0002 PCT said initial stage having multicast connections with a fanout of at most three.
53. The network of claim 52 further comprising: said multicast connections having a fanout of one or more in said middle stage.
54. The network of claim 52 further comprising: said multicast connections having a fanout of one or more in said output stage.
55. A network having a plurality of multicast connections, said network comprising: an input stage comprising r, input switches and n, inlet links for each of said r, input switches, and N1 = «, * r, ; an output stage comprising r2 output switches and n2 outlet links for each of said r2 output switches, and N2 = n2 * r2 ; and a middle stage comprising m middle switches, and each middle switch comprising at least one link connected to each input switch for a total of at least rλ first internal links; each middle switch further comprising at least one link connected to each output switch for a total of at least r2 second internal links, for x ≥ 1 , wherein each multicast connection from an inlet link passes through at most x middles switches, and said multicast connection further passes to a plurality of outlet links from said at most x middle switches.
56. The network of claim 55, wherein m > x * nx + n2 1 , for x ≥ 2.
57. The network of claim 56, further is always capable of setting up said connection by never changing path of a previously set up multicast connection, and the network is hereinafter "strictly nonblocking network".
58. The network of claim 55 comprising a controller in communication with said input, output and middle stages to set up said multicast connection. M0002 PCT .
59. The network of claim 56 wherein said rx input switches and r2 output switches are the same number of switches.
60. The network of claim 56 wherein said W1 inlet links and n2 outlet links are the same number of links and Tt1 =H2 =U, then m > (JC + 1)* n 1.
61. The strictly nonblocking network of claim 57, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more strictly nonblocking networks.
62. The network of claim 55, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more networks.
63. A method for setting up one or more multicast connections in a network having an input stage having nλ * rx inlet links and rx input switches, an output stage having n2 * r2 outlet links and r2 output switches, and a middle stage having m middle switches, where each middle switch is connected to each of said T1 input switches through r, first internal links and each middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein 1 < d ≤ r2 , for x ≥ 2, said method comprising: receiving a multicast connection at said input stage; fanning out said multicast connection in said input stage into at most x middle switches to set up said multicast connection to a plurality of output switches among said r2 output switches, wherein said plurality of output switches are specified as destinations of said multicast connection, wherein first internal links from said input switch to said at most x middle switches and second internal links to said destinations from said at most x middles switches are available.
64. The method of claim 63 wherein said act of fanning out is performed without changing any existing connection to pass through another middle switch. M0002 PCT .
65. The method of claim 63 wherein said act of fanning out is performed recursively.
66. A method for setting up one or more multicast connections in a network having an input stage having nx * T1 inlet links and η input switches, an output stage having n2 * r2 outlet links and r2 output switches, and a middle stage having m middle switches, where each middle switch is connected to each of said rx input switches through rx first internal links and each of said r2 said output switches through r2 second internal links, for x ≥ 2, said method comprising: checking if all the destination output switches of said multicast connection have available second internal links from at most x middle switches.
67. The method of claim 66 further comprising: checking if the input switch of said multicast connection has an available first internal links to said at most JC middle switches.
68. The method of claim 66 further comprising: repeating said checkings of available second internal links to all said destination output switches for all the other combinations of at most x middle switches.
69. The method of claim 66 further comprising: repeating said checkings of available first internal links for all the other combinations of at most x middle switches.
70. The method of claim 66 further comprising: setting up each of said connection from its said input switch to its said output switches through at most x said middle switches, selected by said checkings, by fanning out said multicast connection in its said input switch into at most said JC middle stage switches.
71. The method of claim 66 wherein any of said acts of checking and setting up are performed recursively. M0002 PCT .
72. A method of setting up a multicast connection through a threestage network, for x ≥ 2 , said method comprising: fanning out at most x times in an initial stage.
73. The method of claim 72 further comprising: fanning out any number of times in each of the remaining stages, wherein said threestage network includes said remaining stages and said initial stage.
74. The method of claim 72 further comprising: repeating said acts of fanning out with a plurality of portions of each of said stages.
75. The method of claim 72 further comprising: recursively performing said act of fanning out.
76. The method of claim 72 wherein: a remaining stage immediately following said initial stage comprises internal links that are at least x times the total number of inlet links of said initial stage.
77. The method of claim 72 wherein: said initial stage comprises a plurality of first switches, and plurality of inlet links connected to each said first switch; and a remaining stage immediately following said initial stage comprises a plurality of second switches that are at least x times the number of inlet links of each first switch and each second switch comprises a plurality of first internal links at least equal in number to the number of first switches in said initial stage.
78. A network comprising: an input stage comprising Ni or nx * rx inlet links and rλ input switches and M1 inlet links for each of said rx input switches, and N1 = nx *rx, said nx inlet links for receiving connection connections; MQ002 PCT an output stage comprising N2 or n2 * r2 outlet links and r2 output switches and «2 outlet links for each of said r2 output switches, and N2 = n2 *r2, said n2 outlet links for transmitting said received connections; and a middle stage having m middle switches, and each middle switch has at least one link connected to each input switch for a total of at least rx first internal links and each middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein 1 < d < r2 , said initial stage having multicast connections with a fanput of at most x , for x ≥ 2.
79. The network of claim 78 further comprising: said multicast connections having a fanout of one or more in said middle stage.
80. The network of claim 78 further comprising: said multicast connections having a fanout of one or more in said output stage.
81. A network having a plurality of multicast connections, said network comprising: an input stage comprising T1 input switches and nlw inlet links in input switch w, for each of said rt input switches such that w e [l,rj and W1 = MAX(r^lw}; an output stage comprising r2 output switches and n2v outlet links in output switch v , for each of said r2 output switches such that v e [l, r2 ] and n2 = MAX{n2v ) ; and a middle stage comprising m middle switches, and each middle switch comprising at least one link connected to each input switch for a total of at least T1 first internal links; each middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein \ < d ≤ r2 , p p wherein m ≥ ∑ (xt * at +Ti1 I) , where ∑ at = O1 + W2 and X1 , X2,...,, xp ≥ 1 ; M0002 PCT wherein, for 1 ≤ i ≤ p, multicast connections from α, inlet links of each input switch pass through at most x, middles switches.
82. The network of claim 81, where xux2,...,xp ≥ 2, further is capable of setting up said connection by never changing path of a previously set up multicast connection, and the network is hereinafter "strictly nonblocking network".
83. The network of claim 81 comprising a controller in communication with said input, output and middle stages to set up said multicast connection.
84. The network of claim 81 wherein said /*, input switches and r2 output switches are the same number of switches.
85. The network of claim 81 wherein said n, inlet links and n2 outlet links are the same number of links and nr = n2 = n.
86. The strictly nonblocking network of claim 82, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more strictly nonblocking networks.
87. The network of claim 81 , wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more networks.
88. A network having a plurality of multicast connections, said network comprising: an input stage comprising T1 input switches and nϊw inlet links in input switch w , for each of said rx input switches such that w e [l,rj and nλ = MdX(nlw); an output stage comprising r2 output switches and n2v outlet links in output switch v, for each of said r2 output switches such that v e [l,r2] and n2 = MAX(n2v); and M0002 PCT a middle stage comprising m middle switches, and each middle switch comprising at least one link connected to each input switch for a total of at least rx first internal links; each middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein 1 < d ≤ r2 , wherein each multicast connection from an inlet link passes through at most one or two middles switches, and said multicast connection further passes a plurality of outlet links from said at most two middle switches.
89. The network of claim 88, wherein m ≥ 2 * nx + n2 1 ,.
90. The network of claim 89, further is always capable of setting up said connection by never changing path of a previously set up multicast connection, and the network is hereinafter "strictly nonblocking network".
91. The network of claim 88 comprising a controller in communication with said input, output and middle stages to set up said multicast connection.
92. The network of claim 89 wherein said rx input switches and r2 output switches are the same number of switches.
93. The network of claim 89 wherein said nx inlet links and n2 outlet links are the same number of links and nx ~ n2 = w, then m ≥ 3 *nl .
94. The strictly nonblocking network of claim 90, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more strictly nonblocking networks.
95. The network of claim 88, M0002 PCT wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more networks.
96. A network, having a plurality of multicast connections, said network comprising: an input stage comprising rt input switches and nlw inlet links in input switch w, for each of said rλ input switches such that w e [l,r,] and w, = MA∑(nλJ); an output stage comprising r2 output switches and n2v outlet links in output switch v, for each of said r2 output switches such that v 6 [l,r2] and n2 = MAX(n2v); and a middle stage comprising m middle switches, and each middle switch comprising at least one link connected to each input switch for a total of at least rγ first internal links; eacli middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein \ < d ≤ r2 , wherein each multicast connection from an inlet link passes through at most three middles switches, and said multicast connection further passes a plurality of outlet links from said at most three middle switches.
97. The network of claim 96, wherein m ≥ 3 * n, + n2 1 ,.
98. The network of claim 97, further is always capable of setting up said connection by never changing path of a previously set up multicast connection, and the network is hereinafter "strictly nonblocking network".
99. The network of claim 96 comprising a controller in communication with said input, output and middle stages to set up said multicast connection.
100. The network of claim 97 wherein said T1 input switches and r2 output switches are the same number of switches. MOOO2 PCT .
101. The network of claim 97 wherein said «, inlet links and n2 outlet links are the same number of links and H1 = H1 = U, then m ≥ 4 * n 1.
102. The strictly nonblocking network of claim 98, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more strictly nonblocking networks.
103. The network of claim 96 , wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more networks.
104. A network having a plurality of multicast connections, said network comprising: an input stage comprising rx input switches and nlv inlet links in input switch w, for each of said rx input switches such that w e [1,T1] and H1 = MAX{nlw); an output stage comprising r2 output switches and n2v outlet links in output switch v, for each of said rz output switches such that v e [l,r2] and n2 = MAX(n2r); and a middle stage comprising m middle switches, and each middle switch comprising at least one link connected to each input switch for a total of at least rt first internal links; each middle switch further comprising at least one link connected to at most d output switches for a total of at least d second internal links, wherein 1 < d < r2 , wherein each multicast connection from an inlet link passes through at most x middles switches, and said multicast connection further passes a plurality of outlet links from said at most x middle switches.
105. The network of claim 104, wherein m ≥ x * nx + n2 1.
106. The network of claim 105, M0002 PCT further is always capable of setting up said connection by never changing path of a previously set up multicast connection, and the network is hereinafter "strictly nonblocking network".
107. The network of claim 104 comprising a controller in communication with said input, output and middle stages to set up said multicast connection.
108. The network of claim 105 wherein said rλ input switches and r2 output switches are the same number of switches.
109. The network of claim 105 wherein said nx inlet links and n2 outlet links are the same number of links and nx = n2 = n, then m ≥ (x + 1)* n .
110. The strictly nonblocking network of claim 106, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more strictly nonblocking networks.
111. The network of claim 104, wherein each of said input switches, or each of said output switches, or each of said middle switches further recursively comprise one or more networks.
112. A network comprising a plurality of input subnetworks, a plurality of middle subnetworks, and a plurality of output subnetworks, wherein at least one of said input subnetworks, said middle subnetworks and said output subnetworks recursively comprise: an input stage comprising rx input switches and nu, inlet links in input switch w, for each of said rx input switches such that w e [l,rj and Tt1 MA∑(nlw); an output stage comprising r2 output switches and n2v outlet links in output switch v, for each of said r2 output switches such that v e [l,r2] and n2 = MAX{n2v); and a middle stage, said middle stage comprising m middle switches, and each middle switch comprising at least one link (hereinafter "first internal link") connected to each MOOO2 PCT input switch for a total of at least rλ first internal links, each middle switch further comprising at least one link (hereinafter "second internal link") connected to at most d said output switches for a total of at least d second internal links, wherein 1 < d < r2 , and for x ≤ 2 ; wherein each multicast connection from an inlet link passes through at most x middle switches, and said multicast connection further passes to a plurality of outlet links from said at most x middle switches.
Description:
M-0002 PCT

STRICTLY NONBLOCKBVG MULTICAST MULTI-STAGE NETWORKS

CROSS REFERENCE TO CD-ROM APPENDIX

Appendix A includes software written in the C programming language for a prototype of a scheduling method to set up connections through a three-stage network. The C code is compilable by Visual C++ compiler, version 6.0 available from Microsoft Corporation, to form an executable file for use in an IBM compatible personal computer. Appendix A also includes documentation in a readme file for the C code and also instructions on how to compile and execute the C code.

cddir

Volume in drive D is 010925_1627 Volume Serial Number is 08C3-4D4A

Directory of D:\

09/25/01 04:27p <DIR>

09/25/01 04:27p <DIR>

09/25/01 04:27p <DIR> M-1222-1

3 File(s) 0 bytes

Directory of D:\M-1222 ~1

09/25/01 04:27p <DIR>

09/25/01 04:27p <DIR>

09/21/01 11:42a 57,057 ODTl.RTF

09/21/01 11:36a 1,600 README.TXT

09/21/01 11:42a 30,378 SNB.C

5 File (s) 89,035 bytes

Total Files Listed:

8 File (s) 89,035 bytes

0 bytes free

A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or patent disclosure, as it appears in the

M-0002 PCT

U.S. Patent and Trademark Office patent files or records, but otherwise reserves all copyrights whatsoever.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is Continuation In Part PCT Application to and incorporates by reference in its entirety the related U.S. Patent Application Serial No. 09/967,106 entitled "STRICTLY NON-BLOCKING MULTICAST MULTI-STAGE NETWORKS" by Venkat Konda assigned to the same assignee as the current application, and filed on 27, September 2001. This application is related to and incorporates by reference in its entirety the related U.S. Patent Application Serial No. 09/967,815 entitled "REARRANGEABLY NON-BLOCKING MULTICAST MULTI-STAGE NETWORKS" by Venkat Konda assigned to the same assignee as the current application, filed on 27, September 2001, and its Continuation In Part PCT Application Docket No. M-0001 filed concurrently.

This application is related to and incorporates by reference in its entirety the related U.S. Provisional Patent Application Docket No. M-0003 entitled "STRICTLY NON-BLOCKING MULTICAST LINEAR-TIME MULTI-STAGE NETWORKS" and U.S. Provisional Patent Application Docket No. M-0004 entitled "STRICTLY NON- BLOCKING MULTICAST MULTI-SPLIT LINEAR-TIME MULTI-STAGE NETWORKS" by Venkat Konda assigned to the same assignee as the current application and filed concurrently.

BACKGROUND OF INVENTION

As is well known in the art, a Clos switching network is a network of switches configured as a multi-stage network so that fewer switching points are necessary to implement connections between its inlet links (also called "inputs") and outlet links (also called "outputs") than would be required by a single stage (e.g. crossbar) switch having the same number of inputs and outputs. Clos networks are very popularly used in digital crossconnects, switch fabrics and parallel computer systems. However Clos networks may block some of the connection requests.

There are generally three types of nonblocking networks: strictly nonblocking; wide sense nonblocking; and rearrangeably nonblocking (See V.E. Benes, "Mathematical

M-OOO 2 PCT

Theory of Connecting Networks and Telephone Traffic" Academic Press, 1965 that is incorporated by reference, as background). In a rearrangeably nonblocking network, a connection path is guaranteed as a result of the network's ability to rearrange prior connections as new incoming calls are received. In strictly nonblocking network, for any connection request from an inlet link to some set of outlet links, it is always possible to provide a connection path through the network to satisfy the request without disturbing other existing connections, and if more than one such path is available, any path can be selected without being concerned about realization of future potential connection requests. In wide-sense nonblocking networks, it is also always possible to provide a connection path through the network to satisfy the request without disturbing other existing connections, but in this case the path used to satisfy the connection request must be carefully selected so as to maintain the nonblocking connecting capability for future potential connection requests.

U.S. Patent 5,451,936 entitled "Non-blocking Broadcast Network" granted to Yang et al. is incorporated by reference herein as background of the invention. This patent describes a number of well known nonblocking multi-stage switching network designs in the background section at column 1, line 22 to column 3, 59.

An article by Y. Yang, and G.M., Masson entitled, "Non-blocking Broadcast Switching Networks" IEEE Transactions on Computers, Vol. 40, No. 9, September 1991 that is incorporated by reference as background indicates that if the number of switches in the middle stage, m, of a three-stage network satisfies the relation m ≥ min((» - ϊ)(x + r Vx )) where 1 < x ≤ min(« - 1, r) , the resulting network is nonblocking for multicast assignments. In the relation, r is the number of switches in the input stage, and n is the number of inlet links in each input switch. Kim and Du (See D.S. Kim, and D. Du, "Performanpe of Split Routing Algorithm for three-stage multicast networks", IEEE/ACM Transactions on Networking, Vol. 8, No. 4, August 2000 incorporated herein by reference} studied the blocking probability for multicast connections for different scheduling algorithms.

M-0002 PCT

SUMMARY OF INVENTION

A three-stage network is operated in strictly nonblocking manner in accordance with the invention includes an input stage having T 1 switches and n x inlet links for each of r, switches, an output stage having r 2 switches and M 2 outlet links for each of r 2 switches. The network also has a middle stage of m switches, and each middle switch has at least one link connected to each input switch for a total of at least r x first internal links and at least one link connected to each output switch for a total of at least r 2 second internal links, where m ≥ 2*n ι +n 2 -l . In one embodiment, each multicast connection is set up through such a three-stage network by use of at most two switches in the middle stage. When the number of inlet links in each input switch n x is equal to the number of outlet links in each output switch n 2 , and H 1 = n 2 = n, a three-stage network is operated in strictly nonblocking manner in accordance with the invention if m ≥3*n~l . Also in accordance with the invention, a three-stage network having more middle switches than 2 * W 1 + n 2 — 1 is operated in strictly nonblocking manner even if some multicast connections are set up by using more than two middle switches as long as each connection has available links into at least two middle switches and there are always at least W 1 - I unused links from each input switch to middle switches, after each connection is set up.

BRIEF DESCRIPTION OF DRAWINGS

FIG. IA is a diagram of an exemplary three-stage symmetrical network with exemplary multicast connections in accordance with the invention; and FIG. IB is high- level flowchart of a scheduling method according to the invention, used to set up the multicast connections in the network 100 of FIG. IA.

FIG. 2A is a diagram of a general symmetrical three-stage strictly nonblocking network with n inlet links in each of r input stage switches, n outlet links in each of r output stage switches, and AM = 3 * « -1 middle stage switches that are used with the method of FIG. IB in one embodiment; and FIG, 2B is a diagram of a general non-

M-0002 PCT

symmetrical three-stage strictly nonblocking network with «, inlet links in each of η input stage switches, n 2 outlet links in each of r 2 output stage switches, and m = 2*n i +n 2 -\ middle stage switches that are used with the method of FIG. IB in one embodiment;

FIG. 3 A is intermediate level flowchart of one implementation of the method 140 of FIG. IB; FIG. 3B shows an exemplary F(8,3,9) network with certain existing multicast connections; FIG. 3C shows the network of FIG. 3B after a new connection is set up by selecting one middle switch in the network, using the method of FIG. 3 A in one implementation; and FIG. 3D shows the network of FIG. 3C after another new connection is set up by selecting two middle switches in the network, using the method of FIG. 3A in one implementation.

FIG. 4A is another intermediate level flowchart of one implementation of the act 142 of FIG. 3A. FIG. 4B is low-level flowchart of one variant of act 142 of the method of FIG. 4A; and FIG. 4C illustrates, in a flowchart, pseudo code for one example of scheduling method of FIG. 4B. FIG. 4D implements, in one embodiment, the data structures used to store and retrieve data from memory of a controller that implements the method of FIG. 4C.

FIG. 5A is a diagram of an exemplary three-stage network where the middle stage switches are each three-stage networks; FIG. 5B is high-level flowchart, in one embodiment, of a recursively scheduling method in a recursively large multi-stage network such as the network in FIG. 5A.

FIG. 6A is a diagram of a general symmetrical three-stage network with n inlet links in each of r input stage switches and m = 4 * n - 1 middle stage switches; and FIG. 6B is high-level flowchart, in one embodiment, of a scheduling method used to set up multicast connections the network of FIG. 6 A, according to the invention.

FIG. 7A is a diagram of a general symmetrical three-stage network with n inlet links in each of r input stage switches and m = x*n middle stage switches for x>2 and FIG. 7B is high-level flowchart, in one embodiment, of a scheduling method used to set up multicast connections in the network of FIG. 7A, according to the invention.

M-0002 PCT

FIG. 8A is a diagram of an exemplary F(6,3,4) three-stage network, with m = 3 * n - 1 middle stage switches implemented in space-space-space configuration, with certain existing multicast connections setup using the method 140 of FIG. 3 A; FIG. 8B is the first time step of the TST implementation of the network in FIG. 8A; FIG. 8C is the second time step of the TST implementation of the network in FIG. 8A.

DETAILED DESCRIPTION OF THE INVENTION

The present invention is concerned with the design and operation of multi-stage switching networks for broadcast, unicast and multicast connections. When a transmitting device simultaneously sends information to more than one receiving device, the one-to-many connection required between the transmitting device and the receiving devices is called a multicast connection. A set of multicast connections is referred to as a multicast assignment. When a transmitting device sends information to one receiving device, the one-to-one connection required between the transmitting device and the receiving device is called unicast connection. When a transmitting device simultaneously sends information to all the available receiving devices, the one-to-all connection required between the transmitting device and trie receiving devices is called a broadcast connection.

In general, a multicast connection is meant to be one-to-many connection, which includes unicast and broadcast connections. A multicast assignment in a switching network is nonblocking if any of the available inlet links can always be connected to any of the available outlet links. In certain multi-stage networks of the type described herejn, any connection request of arbitrary fan-out, i.e. from an inlet link to an outlet link or to a set of outlet links of the network, can be satisfied without blocking with never needing to rearrange any of the previous connection requests. Depending on the number of switches in a middle stage of such a network, such connection requests may be satisfied without blocking if necessary by rearranging some of the previous connection requests as described in detail in U.S. Patent Application Serial No. 09/967,815 that is incorporated by reference above.

M-0002 PCT

Referring to FIG. IA, an exemplary symmetrical three-stage Clos network of sixteen switches for satisfying communication requests, such as setting up a telephone call or a data call, between an input stage 110 and output stage 120 via a middle stage 130 is shown where input stage 110 consists of four, three by eight switches IS1-IS4 and output stage 120 consists of four, eight by three switches OS1-OS4, and middle stage 130 consists of eight, four by four switches MS1-MS8. Such a network can be operated in strictly non-blocking manner, because the number of switches in the middle stage 130 (i.e. eight switches) is equal to 3*n-l, where the n is the number of links (i.e. three inlet links) of each of the switches in the input stage 110 and output stage 120. The specific method used in implementing the strictly non-blocking connectivity can be any of a number of different methods that will be apparent to a skilled person in view of the disclosure. One such method is described below in reference to FIG. IB.

hi one embodiment of this network each of the input switches IS1-IS4 and output switches OS1-OS4 are crossbar switches. When the number of stages of the network is one, the switching network is called single-stage switching network, crossbar switching network or more simply crossbar switch. A (N * M) crossbar switching network with Ν inlet links and M outlet links is composed of NM cross points. As the values of Ν and M get larger, the cost of making such a crossbar switching network becomes prohibitively expensive. In another embodiment of the network in FIG. 1 A each of the input switches IS1-IS4 and output switches OS1-OS4 are shared memory switches.

The number of switches of input stage 110 and of output stage 120 can be denoted in general with the variable r for each stage. The number of middle switches is denoted by m . The size of each input switch IS1-IS4 can be denoted in general with the notation n * m and of each output switch OS1-OS4 can be denoted in general with the notation m * n . Likewise, the size of each middle switch MSl-MSS can be denoted as r*r . A switch as used herein can be either a crossbar switch, or a network of switches each of which in turn may be a crossbar switch or a network of switches. A three-stage network can be represented with the notation V(m,n,r), where n represents the number of inlet links to each input switch (for example the links BL1-IL3 for the input switch ISl) and m represents the number of middle switches MS1-MS8. Although it is not necessary that there be the same number of inlet links D-1-IL12 as there are outlet links OL1-OL12, in a

M-0002PCT

symmetrical network they are the same. Each of the m middle switches MSl -MS8 are connected to each of the r input switches through r links (hereinafter "first internal" links, for example the links FL1-FL4 connected to the middle switch MSl from each of the input switch IS1-IS4), and connected to each of the output switches through r second internal links (hereinafter "second internal" links, for example the links SLl -SL4 connected from the middle switch MSl to each of the output switch OS1-OS4-).

Each of the first internal links FL1-FL32 and second internal links SL1-SL32 are either available for use by a new connection or not available if currently used l>y an existing connection. The input switches IS1-IS4 are also referred to as the network input ports. The input stage 110 is often referred to as the first stage. The output switches OS1-OS4 are also referred to as the network output ports. The output stage 120 is often referred to as the last stage. In a three-stage network, the second stage 130 is referred to as the middle stage. The middle stage switches MS1-MS8 are referred to as middle switches or middle ports.

In one embodiment, the network also includes a controller coupled with each of the input stage 110, output stage 120 and middle stage 130 to form connections between an inlet link IL1-IL12 and an arbitrary number of outlet links OL1-OL12. In this embodiment the controller maintains in memory a pair of lists of available destinations for the connection through a pair of middle switches (e.g. MSl and MS2 in FIG IA) to implement a fan-out of two. In a similar manner a set of n lists are maintained in an embodiment of the controller that uses a fan-out of n.

FIG. IB shows a high-level flowchart of a scheduling method 140, in one embodiment executed by the controller of FlG. IA. According to this embodiment, a multicast connection request is received in act 141. Then a connection to satisfy the request is set up in act 142 by fanning out into at most two switches in middle stage 130 from its input switch.

In the example illustrated in FIG. IA, a fan-out of four is possible to satisfy a multicast connection request if input switch is IS2, but only two middle stage switches will be, used in accordance with this method. Similarly, although a fan-out of three is possible for a multicast connection request if the input switch is ISl, again only a fan-out

M-0Q02 PCT

of two is used. The specific middle switches that are chosen when selecting a fan-out of two is irrelevant to the method of FIG. IB so long as at most two middle switches are selected to ensure that the connection request is satisfied, i.e. the destination switches identified by the connection request can be reached from the middle switches that are part of the selected fan-out. In essence, limiting the fan-out from input switch to no more than two middle switches permits the network 100 to be operated in strictly nonblocking manner in accordance with the invention.

After act 142, the control is returned to act 141 so that acts 141 and 142 are executed in a loop for each multicast connection request. According to one embodiment as shown further below it is not necessary to have more than 3 * n - 1 middle stage switches in network 100 of the FIG. IA, where the number of inlet links IL1-IL3 equals the number of outlet links OL1-OL3, both represented by the variable n for the network to be a strictly nonblocking symmetrical switching network, when the scheduling method of FIG. IB is used.

The connection request of the type described above in reference to method 140 of

FIG. IB can be unicast connection request, a multicast connection request or a broadcast connection request, depending on the example. In case of a unicast connection request, a fan-out of one is used, i.e. a single middle stage switch is used to satisfy the request. Moreover, although in the above-described embodiment a limit of two has been placed on the fan-out into the middle stage switches, the limit can be greater depending on the number of middle stage switches in a network, as discussed below in reference to FIG. 2 A (while maintaining the strictly nonblocking nature of operation of the network). Moreover, in method 140 described above in reference to FIG. IB any arbitrary fan-out may be used between each middle stage switch and the output stage switches, and also any arbitrary fan-out may be used within each output stage switch, to satisfy the connection request. Moreover, although method 140 of FIG. IB has been illustrated with examples in a sixteen switch network 100 of FIG. IA, the method 140 can be used with any general network, of the type illustrated in FIGs. 2A and 2B.

Network of FIG. IA is an example of general symmetrical three-stage network shown in FIG. 2A. The general symmetrical three-stage network can be operated in strictly nonblocking manner if m ≥ 3 * n - 1 (and in the example of FIG. 2 A,

M-0002 PCT

m = 3*H~l), wherein has n inlet links for each of r input switches ISl-ISr (for example the links ILl 1 -IL In to the input switch ISl) and n outlet links for each of r output switches OSl-OSr (for example OLl 1-OLln to the output switch OSl). Each of the m switches MSl-MSm are connected to each of the input switches through r first internal links (for example the links FLl 1 -FLrI connected to the middle switch MSl from each of the input switch ISl-ISr), and connected to each of the output switches through r second internal links (for example the links SLl 1-SLr 1 connected from the middle switch MSl to each of the output switch OSl-OSr). In such a general symmetrical network no more than 3*«-l middle stage switches MSl-MS(3n-l) are necessary for the network to be strictly nonblocking, when using a scheduling method of the type illustrated in FIG. IB. Although FIG. 2A shows an equal number of first internal links and second internal links, as is the case for a symmetrical three-stage network, the present invention, however, applies even to non-symmetrical networks of the type illustrated in FIG 2B (described next).

In general, an (N 1 * N 2 ) asymmetric network of three stages can be operated in strictly nonblocking manner if m ≥ 2 * Ti 1 + τι 2 - 1 (and in the example of FIG. 2B 5 m = 2 * «, + R 2 - 1 ), wherein network (FIG. 2B) has r x (n x * nϊ) switches ISl-ISri in the first stage, m (τ\ * r 2 ) switches MSl-MSm in the middle stage, and T 2 (m*n 2 ) switches OSl-OSr 2 in the last stage where N 1 = Tt 1 * T 1 is the total number of inlet links and N 2 = n 2 * r 2 is the total number of outlet links of the network. Each of the m switches MSl-MS(2*n!+n 2 -l) are connected to each of the input switches through r x first internaf links (for example the links FLl 1-FLrj 1 connected to the middle switch MSl from each of the input switch ISl-ISr 1 ), and connected to each of the output switches through r 2 second internal links (for example the links SLl 1-SLr 2 I connected from the middle switch MSl to each of the output switch OSl-OSr 2 ). Such a multi-stage switching network is denoted as a V(nι, Ti 1 , ^n 2 , r 2 ) network. For the special symmetrical case where Ti 1 = Ti 2 = Ti and T 1 = r 2 = r , the three-stage network is denoted as a V{m,n,r) network. In general, the set of inlet links is denoted as (1,2,...,T 1 W 1 ) and the set of output switches are denoted as O = (l,2,...,r 2 }. In an asymmetrical three-stage network, as shown in FIG. 2B with H 1 inlet links for each of T 1 input switches, τι 2 outlet links for

M-OOO 2 PCT

each of r 2 output switches, no more than 2 * «, + n 2 - 1 middle stage switches are necessary for the network to be strictly nonblocking, again when using the scheduling method of FIG. IB. The network has all connections set up such that each connection passes through at most two middle switches to be connected to all destination outlet links.

Every switch in the multi-stage networks discussed herein has multicast capability. In a Vim, H 1 , r x , n 2 , r 2 ) network, if a network inlet link is to be connected to more than one outlet link on the same output switch, then it is only necessary for the corresponding input switch to have one path to that output switch. This follows because that path can be multicast within the output switch to as many outlet links as necessary. Multicast assignments can therefore be described in terms of connections between input switches and output switches. An existing connection or a new connection from an input switch to f output switches is said to have fan-out / ' . If all multicast assignments of a first type, wherein any inlet link of an input switch is to be connected in an output switch to at most one outlet link are realizable, then multicast assignments of a second type, wherein any inlet link of each input switch is to be connected to more than one outlet link in the same output switch, can also be realized. For this reason, the following discussion is limited to general multicast connections of the first type (with fan-out r\ 1 ≤ r'≤ r 2 ) although the same discussion is applicable to the second type.

To characterize a multicast assignment, for each inlet link / e (1,2,...,/-,B 1 ), let I j = O, where O c {l,2,...,r 2 }, denote the subset of output switches to which inlet link i is to be connected in the multicast assignment. For example, the network of Fig. IA shows an exemplary three-stage network, namely F(8,3,4) , with the following multicast assignment I 1 = {l,2}, I 2 = {l,3,4}, I 6 = {3} and all other /, = φ for j = [1-12]. It should be noted that the connection I 1 fans out in the first stage switch ISl into the middle stage switches MSl and MS2, and fans out in middle switches MSl and MS2 only once into output switches OSl and OS2 respectively. The connection I 1 also fans out in the last stage switch OSl once into outlet link OLl and in the last stage switch OS2 into the outlet links OL4, OL5 and OL6. The connection I 2 fans out once in the input switch ISl into middle switch MS4 and fans out in the middle stage switch MS4 into the last stage switches OSl, OS3 and OS4. The connection I 2 fans out once in the output switches

M-0002 PCT

OSl, OS3, and OS4 into outlet links OL2, OL7, and OL12 respectively. The connection I 6 fans out once in the input switch into middle switch MS3, fans out in the middle switch MS3 once into output switch OS3, fans out once in the output switch into outlet link OL9. In accordance with the invention, each connection can fan out in the first stage switch into at most two middle stage switches, and in the middle switches and last stage switches it can fan out any arbitrary number of times as required by the connection request.

Two multicast connection requests /,. = O 1 and 1 } - O } for / ≠ j are said to be compatible, if and only if O 1 Pl O } , = φ . It means when the requests /, and I } are compatible, and if the inlet links / and j do not belong to the same input switch, they can be set up through the same middle switch.

FIG. 3 A is intermediate level flowchart of one implementation of the method of FIG. IB. In the following "destination switch" or "destination" refers to any switch in the output stage 120 that is identified in a connection request. According to this implementation, a connection request is received in act 141. Then the method 140 checks in act 142 if the connection can be set up through only one middle switch and if act 142A finds a middle switch which has second internal links to all the destinations available then the connection is set up in act 142C and the control returns to act 141. If act 142 A results in "no", the control goes to act 142B where the method 140 finds two middle switches through which the connection can be set up. Then the control goes to act 142C, where act 142C sets up the connection through the two middle switches. Therefore no more than two middle switches are used when attempting to satisfy the connection request. When the connection is set up in 142C, control returns to act 141 so that acts 141 and 142 are executed in a loop, for each connection request all the middle switches until the connection is set up.

M-0Q02 PCT

TABLE l A Multicast assignment in a F(8,3,9) Network

Requests for r = 1 Requests for r = 2 Requests for r

/, = {1, 2,3}, I 4 = {1,4,7}, / 7 = {1, 5,9},

/ 2 = {4,5,6}, I 5 = {2, 5,8}, / 8 = {2, 6,7},

Z 3 = {7,8,9}, Z 6 = {3, 6,9}, I 9 = {3, 4,8}

Table 1 above shows a multicast assignment in F(8,3,9) network. This network has a total of twenty-seven inlet links and twenty-seven outlet links. The multicast assignment in Table 1 shows nine multicast connections, three each from the first three input switches. Each of the nine connections has a fan-out of three. For example, the connection request I 1 has the destinations as the output switches OSl, OS2, and OS3 (referred to as 1, 2, 3 in Table 1). Request I 1 only shows the output switches and does not show which outlet links are the destinations. However it can be observed that each output switch is used only three times in the multicast assignment of Table 1, using all the three outlet links in each output switch. For example, output switch 1 is used in requests Ii , I 4 , I 7 , so that all three outlet links of output switch 1 are in use, and a specific identification of each outlet link is irrelevant. And so when all the nine connections are set up all the twenty-seven outlet links will be in use.

FIG. 3B shows an initial state of the F(8,3,9) network in which the connections Ii — 1 7 of Table 1 are previously set up. [For the sake of simplicity, FIG. 3B, FIG. 3C, and FIG. 3D do not show in the diagrams the first internal links and second internal links connected to the middle switches MS7 and MS8.] The connections I 1 , h, h, h, h, h, and I 7 pass through the middle switches MSl 5 MS2, MS3, MS4, MS5, MS6, and MS7 respectively. Each of these connections \s fanning out only once in the input switch and fanning out three times in each middle switch. Connection I 1 from input switch ISl fans out once into middle switch MSl, and from middle switch MSl thrice into output switches OSl, OS2, and OS3. Connection I 2 from input switch ISl fans out once into middle switch MS2, and from middle switch MS2 thrice into output switches OS4, OS5, and OS6. Connection I 3 from input switch ISl fans out once into middle switch MS3,

M-0002 PCT

and from middle switch MS3 thrice into output switches OS7, OS8, and OS9. Connection I 4 from input switch IS2 fans out once into middle switch MS4, and from middle switch MS4 thrice into output switches OSl, OS4, and OS7. Connection I 5 from input switch IS2 fans out once into middle switch MS5, and from middle switch MS5 thrice into output switches OS2, OS5, and OS8. Connection I 6 from input switch IS2 fans out once into middle switch MS6, and from middle switch MS6 thrice into output switches OS3, OS6, and OS9. Connection I 7 from input switch IS3 fans out once into middle switch MS7, and from middle switch MS7 thrice into output switches OSl, OS5, and OS9.

Method 140 of FIG. 3 A next sets up a connection I 8 from input switch IS3 to output switches OS2, OS6 and OS7 as follows. FIG. 3C shows the state of the network of FIG. 3B after the connection h of Table 1 is set up. In act 142A the scheduling method of FIG. 3 A finds that only the middle switch MS8 is available to set up the connection Ig (because all other middle switches MS1-MS7 have unavailable second internal links to at least one destination switch), and sets up the connection in act 142C through switch MS8. Therefore, Connection Ig from input switch IS3 fans out only once into middle switch MS8, and from middle switch MS8 three times into output switches OS2, OS6, and OS7to be connected to all the destinations.

Method 140 next sets up a connection I 9 from input switch IS3 to output switches OS3, OS4 and OS8 as follows. FIG. 3D shows the state of the network of FIG. 3C after the connection I9 of Table 1 is set up. The scheduling method of FIG. 3A could not find a single middle switch that has links to all required destinations available to set up the connection. However in act 142B, it finds two middle switches MSl and MS2 to together have links to all required destinations available for the connection and accordingly the connection I 9 is set up in act 142C. And so connection I9 fans out twice in the first switch IS3 into the middle switches MSl and MS2. Also in the middle switch MSl it fans out twice into output switches OS4 and OS8, and in the middle switch MS2 it fans out once into output switch OS3 to be connected to all the required destinations.

Act 142 of FIG. 3 A is implemented in one embodiment by acts 242A-242D illustrated in FIG. 4A. Specifically, in this embodiment, act 142A is implemented by acts 242A, 242C, and 242D wherein a loop is formed to check if a middle switch has an

M-0002 PCT

available link to the input switch, and also has available links to all the required destination switches. In this implementation, the same loop is also used with an additional act 242B to implement act 142B of FIG. 3 A. Use of the same loop as illustrated in FIG. 4 A provides efficiency by eliminating repetition of the same acts, namely acts 242 A, 242C, and 242D that would otherwise have been repeated if act 142B is performed independent of act 142A (FIG. 3A). In act 242B, the method of FIG. 4A checks if another middle switch has available links to destinations that could not be reached by use of the middle switch in act 242A (described above). As illustrated in FIG. 4B, act 242B is reached when the decision in act 242A is "no". In one specific example, acts 242A-242B of FIG. 4C are implemented by use of the information developed in act 242A, for an efficient implementation as discussed next.

FIG. 4B is a low-level flowchart of one variant of act 142 of FIG. 4A. The control to act 142 comes from act 141 after a connection request is received. In act 142Al, an index variable i is set to a first middle switch 1 among the group of middle switches that form stage 130 (FIG. 2B) to initialize an outer loop (formed of acts of

142A2, 142A3, 242B, 242C and 242D) of a doubly nested loop. Act 142A2 checks if the input switch of the connection has an available link to the middle switch i. If not control goes to act 242C. Else if there is an available link to middle switch i, the control goes to act 142A3. Act 142A3 checks if middle switch i has available links to all the destination switches of the multicast connection request. If so the control goes to act 142Cl and the connection is set up through middle switch i. And all the used links from middle switch i tα destination output switches are marked as unavailable for future requests. Also the method returns "SUCCESS". Act 242C checks if middle switch i is the last middle switch, but act 242C never results in "yes" which means it always finds at most two middle switches to set up the connection. If act 242C results in "no", the control goes to act 242D from act 242C where i is set to the next middle switch. And the outer loops next iteration starts.

If act 142A3 results in "no" the control goes to act 142B. In act 142Bl another index variable j is set to middle switch 1 to initialize an inner loop (formed of acts 142B2, 142B3, 142B4 and 142B5) of the doubly nested loop. Then the control goes to act

142B2, where the method 140 checks if middle switch j is equal to middle switch i. If middle switch j is equal to middle switch i, the control goes to act 142B4. Else if middle

M-0002 PCT

switch j is not equal to middle switch i, the control goes to act 142B3 where the method 140 checks if for all the destinations that have unavailable links from middle switch i have available links from middle switch j. If act 142B3 results in "yes", the connection is set up through middle switch i and middle switch j, in act 142C2. Also all the links used in act 142C2 from middle switch i and middle switch j to destination output switches for setting up the connection are marked as unavailable for future requests and the method returns "SUCCESS". If act 142B3 results in "no", the control goes to act 142B4. In act 142B4, the method 140 checks if middle switch j is last middle switch, and if so the control goes to act 142A4, if not the control goes to act 142B5 where middle switch j is set to the next middle switch. From act 142B5 the control transfers to act 142B2. And thus acts 142B2, 142B3, 142B4 and 142B5 form the inner loop stepping through all the middle switches until two middle switches are found to set up the connection. In a three- stage network of FIG.2B with n λ inlet links for each of ^ 1 input switches, n 2 outlet links for each of r 2 output switches, no more than 2 * W 1 + n 2 ^l middle stage switches are necessary for the network to be strictly nonblocking and hence no more than

2 * »i + « 2 - 1 middle stage switches are necessary for the method of FIG. 4A to always find one or two niiddle switches to set up the connection.

FIG.4C illustrates, in a flowchart, a computer implementation of one example of the scheduling method of FIG. 4B. The flowchart FIG. 4C is similar to the flowchart of FIG. 4B excepting for three differences. In FIG. 4C the check for setting up the connection through one middle switch also efficiently implements the half of the check for setting up the connection through two middle switches. The second difference is the loop control code. In the flowchart of FIG. 4B the loop exit test is performed at the end of the inner and outer loops whereas in the flowchart of FIG. 4C the loop exit test is performed at the beginning of the inner loop and outer loops.

And the following method illustrates the psuedo code for one implementation of the scheduling method of FIG. 4C to always set up a new multicast connection request through the network of FIG. 2B, when there are at least 2 ' * n } + n 2 -1 middle switches in the network as discussed above.

M-0002 PCT

Pseudo code of the scheduling method:

Step 1 : c = current connection request;

Step 2: for i = mid_switch_l to mid_switch_m do {

Step 3: if (c has no available link to i) continue; Step 4: O 1 = Set of all destination switches of c having available links from i ;

Step 5: O k — Set of all destination switches of c having no available links from i ;

Step 6: if ( O 1 — All the required destination switches of c) {

Set up c through i ;

Mark all the used paths to and from I as unavailable; return ("SUCCESS");

}

Step 7: for j = mid_switch_l to mid_switch_m do {

Step 8: if (ϊ = j) { continue; Step 9: } else {

O j = Set of all destination switches of c having available links from j ;

Set up c through i and j;

Mark all the used paths to and from i and j as unavailable; return ("SUCCESS");

} } } } Step 11 : return("Never Happens");

Step 1 above labels the current connection request as "c". Step 2 starts an outer loop of a doubly nested loop and steps through all the middle switches. If the input switch of c has no available link to the middle switch i, next middle switch is selected to be i in the Step 3. Steps 4 and 5 determine the set of destination switches of c having and not having available links from middle switch i, respectively. In Step 6 if middle switch i have available links to all the destination switches of connection request c, connection request c is set up through middle switch i. And all the used links of middle switch i to output switches are marked as unavailable for future requests. Also the method returns "SUCCESS". Step 7 starts the inner loop to step through all the middle switches to search for the second middle switch, and if middle switch i is same as the middle switch j,

M-0002 PCT

Step 8 continues to select the next middle switch to be j. Step 9 determines the set of all destination switches having available links from middle switch j. And in Step 10, if all the links that are unavailable from middle switch i are available from middle switch j, connection request c is set up through middle switch i and middle switch j. All the used links from middle switch i and middle switch j to output switches are marked as unavailable and the method returns "SUCCESS". These steps are repeated for all the pairs of middle switches. One or two middle switches can always be found through which c can be set up, and so the control will never reach Step 11. It is easy to observe that the number of steps performed by the scheduling method is proportional to m 2 , where m is the number of middle switches in the network and hence the scheduling method is of time complexity Oψi 2 ).

Table 2 shows how the steps 1-11 of trie above pseudo code implement the flowchart of the method illustrated in FIG. 4C, in one particular implementation.

FIG. 4D illustrates, in one embodiment, the data structures used to store and retrieve data from memory of a controller that implements the method of FIG. 4C. In this embodiment, the fan-out of at most two in the input switch of each connection is implemented by use of two data structures (such as arrays or linked lists) to indicate the

M-OOO 2 PCT

destinations that can be reached from each of two middle switches. Specifically as illustrated in FIG. 4D, two arrays 530 and 550 are determined for each of the two middle switches MSi and MSj that are checked for possible use in setting up the connection, for example in act 142 of the scheduling method 140 of FIG. IB. Arrays 530 and 550 are determined as follows. Each connection request 510 is specified by an array 520 of destination switch identifiers (and also an inlet link of an input switch identifier). Another array 560 of middle switches contains m elements one each for all the middle switches of the network. Each element of array 560 has a pointer to one ofm arrays, 570- 1 to 570-m, containing a bit that indicates availability status (hereinafter availability status bit) for each output switch OSl-OSr as shown in FIG. 4D. If second internal link to an output switch is available from a middle switch, the corresponding bit in the availability status array is set to 'A' (to denote available, i.e. unused link) as shown in FIG. 4D. Otherwise the corresponding bit is set to 'U' (to denote unavailable, i.e. used link).

For each connection 510 each pair of middle switches MSi, and MSj are checked to see if all the destinations of connection 510 are reachable from the pair. Specifically this condition is checked by using the availability status arrays 570-i, 570-j of two middle switches MSi and MSj, to determine the available destinations of the connection 510 from MSi and MSj in the respective arrays 530 and 550. In one implementation, each destination is checked if it is available from any one of the middle switches MSi and MSj, and if both the middle switches MSi and MSj do not have availability for a particular destination, this particular pair of middle switches MSi and MSj cannot be used to set up the connection. However if middle switches MSi and MSj are determined, to have unavailability of a particular destination, a different pair of middle switches are checked for example the middle switches MSi and MSk. In this implementation, middle switches MSi and MSk are checked for the availability of all the destinations of the connection 510 in the same manner as middle switches MSi and MSj. Therefore in this implementation, there is no need to use an additional array 540 of unavailable destinations from middle switch MSi (as discussed next).

An alternative implementation saves (see act 305 of FIG. 4C) an array 540 (see

FIG. 4D) of unavailable destinations from middle switch MSi, at the time middle switch MSi is first paired with a middle switch, (e.g. MSj) other than itself when attempting to

M-0002 PCT

satisfy the connection request 510. Such saving of array 540 eliminates the need for each destination of the connection request 510 to be checked for middle switch MSi, when middle switch MSi is paired with another middle switch (e.g. MSk). If the array 540 of unavailable destinations from MSi is saved once, only these destinations (in array 540) need to be checked for availability in middle switch MSk, which improves the speed of the computation. The embodiment of FIG. 4D can be implemented to set up connections in a controller 580 and memory 500 (described above in reference to FIG. IA, FIG. 2A, and FIG. 2B etc.).

In rearrangeably nonblocking networks, the switch hardware cost is reduced at the expense of increasing the time required to set up connection a connection. The set up time is increased in a rearrangeably nonblocking network because existing connections that are disrupted to implement rearrangement need to be themselves set up, in addition to> the new connection. For this reason, it is desirable to minimize or even eliminate the need for rearrangements to existing connections when setting up a new connection. When the need for rearrangement is eliminated, that network is either wide-sense nonblocking or strictly nonblocking, depending on the number of middle switches and the scheduling method. Embodiments of rearrangeably nonblocking networks using 2 *« or more middle switches are described in the related U.S. Patent application Serial No. 09/967,815 that is incorporated by reference above.

In strictly nonblocking multicast networks, for any request to form a multicast connection from an inlet link to some set of outlet links, it is always possible to find a path through the network to satisfy the request without disturbing any existing multicast connections, and if more than one such path is available, any of them can be selected without being concerned about realization of future potential multicast connection requests. In wide-sense nonblocking multicast networks, it is again always possible to provide a connection path through the network to satisfy the request without disturbing other existing multicast connections, but in this case the path used to satisfy the connection request must be selected to maintain nonblocking connecting capability for future multicast connection requests. In strictly nonblocking networks and in wide-sense nonblocking networks, the switch hardware cost is increased but the time required to set up connections is reduced compared to rearrangeably nonblocking networks. The

M-0002 PCT

foregoing discussion relates to embodiments of strictly nonblocking networks where the connection set up time is reduced.

To provide the proof for the current invention, first the proof for the rearrangeably nonblocking behavior of symmetric networks V(m,n,r) of the invention is presented. (Proof for rearrangeably nonblocking networks using 2 * n or more middle switches is described in the related U.S. Patent application, Serial No. 09/967,815 that is incorporated by reference above.) Later it will be extended for asymmetric networks V(m, W 1 , r, , « 2 , r 2 ) . When m >= 2 * n , the V(m, n, r) Clos network is operated in rearrangeably nonblocking manner for multicast connections if the following scheduling criterion is met: Every connection request is fanned out at most twice in the input switch; Alternatively every connection request is set up through at most two middle switches.

Since when m >= 2 * n — 1 , the V(m, n, r) network is strictly nonblocking for unicast assignments, it means for unicast assignments, applicant notes that there always exists an available link through at least one middle switch from any arbitrary input switch to any arbitrary output switch. Alternatively, if there exists available links from an arbitrary input switch to a number of middle switches at least one of these middle switches has an available link to any arbitrary output switch. It also means when m >= 2 * n - 1 , from any arbitrary input switch if there exists available links to a number of middle switches, all output switches have available links from at least one of those middle switches.

To prove that the network is rearrangeably nonblocking for multicast assignments, applicant notes that it is necessary and sufficient to prove the following two conditions: 1) There are enough middle switches to fan out each connection at most twice in the input switch; 2) From an arbitrary input switch, there always exist at least two middle switches with available links between these two middle switches and the input switch such that there are available links to all the destination output switches, of any connection request (e.g. AU output switches in case of a broadcast connection request), from these two middle switches.

To prove the condition 1, applicant observes that there are enough middle switches if each connection is fanned out at most twice since m >= 2 * n . Moreover

M-0002 PCT

applicant provides proof for the condition 2 by contradiction as follows. In the worst- case scenario, suppose all the r output switches have (n - 1) outlet links already connected. Now suppose from the given input switch all the output switches have to be reached for the nth outlet link.

Suppose there are not at least two middle switches available through which there are available links from the given input switch to all the output switches. If it happens, then each of the middle switches will have — + 1 second internal links already in use.

\2 J i.e., total second internal links used in all the middle switches is given by,

= n* r + 2* n

Which is not possible because the maximum possible second internal links in use is n * r .

So there always exist at least two middle switches through which there are paths from any given input switch to all the output switches. Since the number of middle switches m - 2*n is sufficient to set up the multicast connections, the. V(tn, n, r) Clos network can be operated in rearrangeably nonblocking manner. Hence, if m >= 2 * n , the V(m,n,r) Clos network can be operated in rearrangeably nonblocking manner for multicast connections of any arbitrary fan-out.

Now the proof that the V(m,n,r) network is strictly nonblocking for multicast assignments when m = 3*n-l is presented. Compared to strictly nonblocking unicast algorithm, in the above provided proof for rearrangeably nonblocking where m >- 2* n applicant notes that for realizing a multicast connection from each inlet link in an input switch, one additional potential first internal link is taken away from the rest of the inlet links from the same input switch and hence with an additional n - 1 middle switches, one each for the first n - 1 inlet links, the V(m, n, r) network is strictly nonblocking for multicast assignments.

M-00Q2 PCT

To extend the proof (described above), applicant now shows that V(m, W 1 , r λ , w 2 , r 2 ) network can be operated in rearrangeably nonblocking manner for multicast connections when m >= n l + n 2 , by considering the two cases W 1 < n 2 and W 1 > n 2 .

1) «, < « 2 : In this case, the number of middle switches necessary is 2 * W 1 which is

< («, + « 2 ) . To prove the sufficiency, even though there are a total of n 2 * r 2 outlet links in the network, in the worst-case scenario only W 1 * r 2 second internal links will be needed. This is because, even if all w 2 * r 2 outlet links are destinations of the connections, using the fan-out capability in the output switches the rearrangeably nonblocking behavior can be realized. And so 2 * W 1 which is < (W 1 + n 2 ) middle switches is sufficient.

2) W 1 > w 2 : In this case, since there are a total of w 2 * r 2 outlet links in the network, only a maximum of w 2 * r 2 second internal links will be used even if all the w 2 * r 2 outlet links are destinations of the network connections. When the number of middle switches is W 1 + w 2 the total second internal links in the network is given by r 2 * (« ! + w 2 ) which is more than the required number, according to the rearrangeability proof for V(m, w, r) as shown earlier, which is r 2 * (2 * w 2 ) . Also from any input switch only a maximum of n 2 out of W 1 available inlet links can each have fan-out of r 2 . And so only a maximum of w 2 connections from any input switch need to be fanned out into two. And so W 1 + w 2 middle switches are sufficient.

The extension of the proof when m >= 2 * W 1 + w 2 - 1 that V{m, n x ,r x ,n 2 ,r 2 ) network is operated in strictly nonblocking manner, is similar to that of V(m,n,r) network.

M-0Q02 PCT

Table 3

A multicast assignment in a FCl 4,5,25) Network

Requests for r = 1 Requests for r = 2 Requests for r = 3

/,={1,2,3,4,5}, / 6 ={1,6,11,16,21}, /„={1,7,13,19,25},

Z 2 ={6,7,8,9,10}, I 7 ={2,7,12,17,22}, Z 12 ={2,8,14,20,21},

Z 3 ={11,12,13,14,15}, / 8 ={3,8,13,18,23}, Z 13 ={3,9,15,16,22},

I 4 ={16,17,18,19,20}, / 9 ={4,9,14,19,24}, Z 14 ={4,10,11,17,23},

I 5 ={21,22,23,24,25}, / 10 ={5,10,15,20,25}, Z 15 ={5,6,12,18,24},

Requests for r = 4 Requests for r — 5

I 16 ={1,8,15,17,24}, / 21 ={1,9,12,20,23},

I 17 ={2,9,11,18,25}, / 22 ={2,10,13,16,24},

I ιs ={3,10,12,19,21}, / 23 ={3,6,14,17,25},

/ 19 ={4,6,13,20,22}, Z 24 ={4,7,15,18,21},

I 20 ={5,7,14,16,23}, / 25 ={5,8,11,19,22}

Table 3 shows an exemplary multicast assignment in a V(10,5,25) network. Each request has a fan-out of five. All the outlet links are connected in this multicast assignment since each output switch is used exactly five times in the requests corresponding to five outlet links of each output switch. In one implementation, Table 4 shows by using only #» = 3*H-1 = 14 middle switches, the multicast assignment can be set up to operate the network in strictly nonblocking manner.

M-0002 PCT

Each row in Table 4 represents an input switch and each column represents a middle switch. And each element in the table represents the list of output switches set up through the corresponding middle switch for a connection originating from the corresponding input switch. The correspondence between different connections from the same row of Table 4 and hence from the same input switch can be obtained from the multicast assignment of the Table 3.

Referring to FIG. 5A a five stage strictly nonblocking network is shown according to an embodiment of the present invention that uses recursion as follows. The five stage network comprises input stage 110 and output stage 120, with inlet links IL1-IL12 and outlet links OL1-OL12 respectively, where input stage 110 consist of six, two by five switches IS1-IS6, and output stage 120 consist of six, five by two switches OS1-OS6. However, unlike the single switches of middle stage 130 of the three^stage network of FIG. IA, the middle stage 130 of FIG. 5A consists of five, six by six three-stage

M-0002 PCT

subnetworks MS1-MS5 (wherein the term "subnetwork" has the same meaning as the term "network"). Each of the five middle switches MSl -MS 5 are connected to each of the input switches through six first internal links (for example the links FL1-FL6 connected to the middle switch MSl from each of the input switch IS1-IS6), and connected to each of the output switches through six second internal links (for example the links SL1-SL6 connected from the middle switch MSl to each of the output switch OS1-OS6). In one embodiment, the network also includes a controller coupled with the input stage 110, output stage 120 and middle stage subnetworks 130 to form connections between an inlet links ILlτDL12 and an arbitrary number of outlet links OL1-OL12.

Each of middle switches MS1-MS5 is a ^(5,2,3) three-stage subnetwork. For example, the three-stage subnetwork MSl comprises input stage of three, two by five switches MIS1-MIS3 with inlet links FL1-FL6, and an output stage of three, five by two switches MOS1-MOS3 with outlet links SL1-SL6. The middle stage of MSl consists of five, three by three switches MMS1-MMS5. Each of the middle switches MMS1-MMS5 are connected to each of the input switches MIS1-MIS3 through three first internal links (for example the links MFL1-MFL3 connected to the middle switch MMSl from each of the input switch MIS1-MIS3), and connected to each of the output switches MOSl- MOS3 through three second internal links (for example the links MSL1-MSL3 connected from the middle switch MMSl to each of the output switch MOS1-MOS3). In similar fashion the number of stages can increase to 7, 9, etc.

As with the three-stage network, the network of FIG. 5 A has the property of being operable in strictly nonblocking manner as described herein with no more than 3 * n - 1 middle stage three-stage networks. In the network of FIG. 5 A the middle stage requires no more than 3 * n -1 three-stage subnetworks. Thus in FIG, 5 A where n equals 2, middle stage 130 has five middle stage three-stage subnetworks MS1-MS5. Furthermore, according to the present invention, each of the middle stage subnetworks MS1-MS5 require no more than 2* A 1 +k 2 -1 middle switches MMS1-MMS5, where k } is the number of inlet links for each middle input switch MISl -MS3 and k 2 is the number of outlet links for each middle output switch MOS1-MOS3.

In general, according to certain embodiments, one or more of the switches, in any of the first, middle and last stages can be recursively replaced by a three-stage

M-OOO 2 PCT

subnetwork with no more than 2 * n x + n 2 - 1 middle stage switches where n, is the number of inlet links to the first stage switch in the subnetwork and n 2 is the number of outlet links to the last stage switch of the subnetwork for strictly nonblocking operation, for multicast connections of arbitrary fan-out. Note that because the term "subnetwork" has the same meaning as "network", the just described replacement can be repeated recursively, as often as desired, depending on the embodiment. Also each subnetwork may have a separate controller and memory to schedule the multicast connections of corresponding network.

It should be understood that the methods, discussed so far, are applicable to k- stage networks for k>3 by recursively using the design criteria developed on any of the switches in the network. The presentation of the methods in terms of three-stage networks is only for notational convenience. That is, these methods can be generalized by recursively replacing each of a subset of switches (at least 1) in the network with a smaller three-stage network, which has the same number of total inlet links and total outlet links as the switch being replaced. For instance, in a three-stage network, one or more switches in either the input, middle or output stages can be replaced with a three- stage network to expand the network. If, for example, a five-stage network is desired, then all middle switches (or all input switches or all output switches) are replaced with a three-stage network

In accordance with the invention, in any of the recursive three-stage networks each connection can fan out in the first stage switch into at most two middle stage subnetworks, and in the middle switches and last stage switches it can fan out any arbitrary number of times as required by the connection request. For example as shown in the network of FIG. 5 A, connection Ii fans out in the first stage switch ISl once into middle stage subnetwork MSl. In middle Stage subnetwork MSl it fans out four times into output switches OSl, OS2, OS3 and OS5. In output switches OSl and OS3 it fans out twice. Specifically from output switch OSl into outlet links OLl, OL2, and from output switch OS3 into outlet links OL5, OL6. In output switches OS2 and OS5 it fans out once into outlet links OS4 and OS9 respectively. However in the three-stage network MSl 3 it can fan out at most twice in the first stage, for example connection I 1 fans out twice in the input switch MISl into middle switches MMS2 and MMS3 of the three-stage

M-Q002 PCT

subnetwork MSl. Similarly a connection can fan out arbitrary number of times in the middle and last stages of any three-stage subnetwork. For example connection I 1 fans out twice in middle switch MMS2 into output switches MOSl and MOS2 of three-stage subnetwork MSl. In the output switch MOSl of three-stage subnetwork MSl it fans out twice into output switches OSl and OS2. And in the output switch MOS2 of three-stage subnetwork MSl it fans out once into output switch OS3. Also the connection I 1 fans out in middle switch MMS3 once into output switch MOS2 of the three-stage subnetwork MSl and from there once into output switch OS5.

The connection I 3 fans out once into three-stage subnetwork MS3 where it is fanned out three times into output switches OS2, OS4, and OS6. In output switches OS2, OS4, and OS6 it fans out once into outlet links OL3, OL8, and OL 12 respectively. The connection I 3 fans out once in the input switch MIS7 of three-stage subnetwork MS3 into middle switch MMS 12 of three-stage subnetwork MS3 where it fans out three times into output switches MOST, MOS8, and MOS9 of the three-stage subnetwork MS3. In each of the three output switches MOS7, MOS8 and MOS9 of the three-stage subnetwork MS3 it fans out once output switches OS2, OS4, and OS6 respectively.

FIG. 5B shows a high-level flowchart of a scheduling method, in one embodiment executed by the controller of FIG. 5A. The method of FIG. 5B is used only for networks that have three stages each of which may be in turn composed of three-stage subnetworks, in a recursive manner as described above in reference to FIG. 5A.

According to this embodiment, a multicast connection request is received in act 250 (FIG. 5B). Then a connection to satisfy the request is set up in act 260 by fanning out into at most two middle stage subnetworks from its input switch. Then the control goes to act 270. Act 270 recursively goes through each subnetwork contained in the network. For each subnetwork found in act 270 the control goes to act 280 and each subnetwork is treated as a network and the scheduling is performed, similarly. Once all the recursive subnetworks are scheduled the control transfers from act 270 to act 250 so that each multicast connection will be scheduled in the same manner in a loop.

A direct extension of the foregoing discussion is that when the number of middle switches is increased, the above ^ described methods can be changed to improve speed. For example when m = 4*n-l, each multicast connection can be fanned out into at most

M-0002 PCT

three middle switches and the V(m,n,r) network can be operated in strictly nonblocking manner. Similarly, when m = 3*n i +n 2 —l 9 the V(m, n x , r x , n 2 , r 2 ) network is operated in strictly nonblocking manner if each multicast connection is fanned out into at most three middle switches. FIG. 6A shows a general symmetrical multi-stage network with ?« = 4 * » — 1 middle switches. Excepting for the middle switches to be m = 4 * n - 1 , the description of FIG. 6 A is similar to FIG. 2A. FIG. 6B shows the scheduling method by fanning out into at most three middle switches. Excepting for the additional act 142D of testing for three middle switches and setting up a connection through three middle switches in act 142C, the description of the method of FIG. 6B is similar to the method of FIG. 3 A.

In general when m ~ (x + l)*n-l and x ≥ 2 each multicast connection can be fanned out into at most x middle switches and the V(m, n, r) network is operated in strictly nonblocking manner. Similarly, when m - x*n x +n 2 —\, the V(m, n x ,r x ,n 2 ,r 2 ) network is operated in strictly nonblocking manner if each multicast connection is fanned out into at most x middle switches. FIG. TA. shows a general symmetrical multi-stage network with τn = (x+l)*n~l middle switches. Excepting for the middle switches to be m = (x + 1)* n - 1 1 the description of FIG. 7A is similar to FIG.2A. FIG. 7B shows the scheduling method by fanning out into at most x middle switches. Excepting for the additional act 142X of testing for x middle switches and setting up a connection through x middle switches in act 142C, the description of the method of FIG. 7B is similar to the method of FIG.3A.

In an alternative embodiment, when m ≥ X 1 * a x + χ 2 * α 2 + ... + x p * a p + n x - 1 , where α, +a 2 +... + a p = n x +n 2 , the network is operated in strictly nonblocking manner as described herein, when multicast connections are set up such that connections from a t inlet links of each input switch pass through at most x t middle switches for 1 < / < p .

A V{m,n x ,r λ ,n 2 ,r j ) network can be further generalized, in an embodiment, by having an input stage comprising T 1 input switches and n ϊw inlet links in input switch w , for each of said r x input switches such that ΛV e [l-rj and n x = MAX(n Xw ); an output stage

M-0002 PCT

comprising r 2 output switches and w 2v outlet links in output switch v, for each of said r 2 output switches such that v e [l,r 2 ] and w 2 = MAX{n 2v ); and a middle stage comprising m middle switches, and each middle switch comprising at least one link connected to each input switch for a total of at least r, first internal links; each middle switch further comprising at least one link connected to at most d said output switches for a total of at least d second internal links, wherein 1 < d < r 2 , and applicant notes that such an embodiment can also be operated in strictly nonblocking manner, according to the current invention, for setting up multicast connections by fanning otit not more than twice in the input switch, when m >= 2 * W 1 + n 2 - 1.

The V(m, W j , η , W 2 , r 2 ) network embodiments described so far, in the current invention, are implemented in space-space-space, also known as SSS, configuration. In this configuration all the input switches, output switches and middle switches are implemented as separate switches, for example in one embodiment as crossbar switches. The three-stage networks V(m, n v ry,n 2 ,r 2 ) can also be implemented in a time-space- time, also known as TST, configuration. In TST configuration, in the first stage and the last stage all the input switches and all the output switches are implemented as separate switches. However the middle stage, in accordance with the current invention, uses

number of switches where m >— 2 * n, + «, — 1 , with each middle switch

MW(Tt 1 , W 2 ) having r λ first internal links connected to all input switches and also having r 2 second internal links connected to all output switches. The TST configuration implements the switching mechanism, in accordance with the current invention, in MIN(Ti 1 , n 2 ) steps in a circular fashion. So in TST configuration, the middle stage physically implements only

steps, to switch packets or timeslots from input ports to the output ports.

The three-stage networks V(m, W 1 , r t , n 2 , r 2 ) implemented in TST configuration play a key role in communication switching systems. In one embodiment a crossconnect in a TDM based switching system such as SONET/SDH, each communication link is

M-OOO 2 PCT

time-division multiplexed - as an example an OC- 12 SONET link consists of 336 VTl .5 channels time-division multiplexed. In another embodiment a switch fabric in packet based switching system such as IP, each communication link is statistically time division multiplexed. When a V(m, H 1 ^n 2 S 1 ) network is switching TDM or packet based links, each of the r x input switches receive time division multiplexed signals - for example if each input switch is receiving an OC- 12 SONET stream and if the switching granularity is VTl.5 then M 1 (= 336) inlet links with each inlet link receiving a different VT1.5 channel. A crossconnect, using a V{m,n l ,r l ,n 2 ,r 2 ) network, to switch would implement a TST configuration, so that switching is also performed in time division multiplexied fashion just the same way communication in the links is performed in time division multiplexed fashion.

For example, the network of FIG. 8A shows an exemplary three-stage network, namely V (5,2,A) in space-space-space configuration, with the following multicast assignment I 1 = {l}, I 2 = {1,2,3,4} , and I 4 = {3,4}. According to the current invention, the multicast assignment is setup by fanning out each connection not more than twice in the first stage. The connection / ] fans out in the first stage switch ISl into the middle stage switch MSl, and fans out in middle switch MSl into output switch OSl. The connection I 1 also fans out in the last stage switch OSl into the outlet link OL2. Tlie connection I 2 fans out in the first stage switch ISl into the middle stage switches MS3 and MS4. The connection I 2 fans out in middle switch MS3 into output switches OSl, OS3, and OS4. The connection I 2 also fans out in the last stage switches OSl, OS3, and OS4 into the outlet links OLl, OL5 and OL8 respectively. The connection I 2 fans out in the middle switch MS4 once into output switch OS2. The connection I 2 fans out in the output switch OS2 into outlet links 0L3 and OL4.

The connection I 4 fans out once in the input switch IS2 into middle switches

MS2 and MS5. The connection I 4 fans out in the middle stage switch MS2 into the last stage switch OS3. The connection I 4 fans out once in the output switch OS3 into outlet link OL6; The connection I 4 also fans out in the middle switch MS5 into the last stage switch OS4. The connection I 4 output switch OS4 into outlet link OL7.

M-OQO 2 PCT

FIG. 8B and FIG. 8C illustrate the implementation of the TST configuration of the F(5,2,4) network of FIG. 8A. According to the current invention, in TST configuration also the multicast assignment is setup by fanning out each connection not more than twice in the first stage, in exactly the same as the scheduling method is performed in SSS configuration. Since in the network of FIG. 8A, » = 2 , the TST configuration of the network of FIG. 8 A has n = 2 different time steps; and since — = 3 , the middle stage

I n I in the TST configuration implements only 3 middle switches each with 4 first internal links and 4 second internal links as shown in FIG. 8B and FIG. 8C. In the first time step, as shown in FIG. 8B the three middle switches act as MSl, MS2, and MS3 of the network of FIG. 8 A. Similarly in the second time step, as shown in FIG. 8C the first two middle switches act as MS4 and MS5 of the network of FIG. 8A, the third middle switch is not necessary in this time step.

In the first time step, FIG. 8B implements the switching functionality of middle switches MSl, MS2, and MS3, and since in the network of FIG. 8A, connections I 1 , I 4 , and I 1 are fanned out through middle switches MSl, MS2, and MS3 to the output switches OSl, OS3, and {OS1, OS3, OS4} respectively; and so connections I 1 , I 4 , and I 2 are fanned out to destination outlet links OL2, OL6 and {OL1, OL5 ? OL8} respectively, just exactly the same way they are routed in the network of FIG. 8 A in all the three stages. Similarly in the second time step, FIG. 8C implements the switching functionality of middle switches MS4 and MS5, and since in the network of FIG. 8 A, connections I 2 and I 4 are fanned out through middle switches MS4 and MS5 to the output switches OS2 and OS4 respectively^ and so connections I 2 and I 4 are fanned out to destination outlet links {OL3, OL4} and OL7 respectively, just exactly the same way they are routed in the network of FIG. 8A in all the three stages; In this time step middle switch MS6 is not needed.

In digital cross connects, optical cross connects, and packet or cell switch fabrics since the inlet links and outlet links are used time-division multiplexed fashion, the switching network such as the F(w,» j ,r,,« 2 ,r 2 ) network implemented in TST configuration will save cost, power and space compared to a space-space-space

M-0002 PCT

configuration. In accordance with the invention, the V {m,n x ,r λ ,n 2 ,r 2 ) network implemented in TST configuration, using the same scheduling method as in SSS configuration i.e., with each connection fanning out in the first stage switch into only one middle stage switch, and in the middle switches and last stage switches it can fan out any arbitrary number of times as required by the connection request, is operable in strictly m nonblocking manner with number of middle switches is equal to where

MM{n x ,n 2 ) m >= 2*n l +n 2 -l .

Numerous modifications and adaptations of the embodiments, implementations, and examples described herein will be apparent to the skilled artisan in view of the disclosure.

For example, in one embodiment a method of the type described above is modified as follows when the number of output switches r 2 is less than or equal to four. Specifically, a three-stage network is operated in strictly nonblocking manner when the multicast connection is fanned out only once in the input stage, with m number of middle stage switches where

m ≥ LV*- ~ j* is > 1 and odd, or when 2 »

m ≥ * s > 2 and even, and

m ≥ H 1 + R 2 - 1 when * • So when r 2 is less than or equal to eight a three-stage network is operated in strictly nonblocking manner for m ≤ 2 * n .

For example, in another embodiment, a method of the type described above is modified to set up a multirate multi-stage network as follows. Specifically, a multirate connection can be specified as a type of multicast connection. In a multicast connection, an inlet link transmits to multiple outlet links, whereas in a multirate connection multiple inlet links transmit to a single outlet link when the rate of data transfer of all the paths in use meet the requirements of multirate connection request. In such a case a multirate connection can be set up (in a method that works backwards from the output stage to the input stage), with fan-in (instead of fan-out) of not more than two in the output stage and

M-OOO 2 PGT

arbitrary fan-in in the input stages and middle stages. And a three-stage multirate network is operated in rearrangeably nonblocking manner with the exact same requirements on the number of middle stage switches as described above for certain embodiments.

Numerous such modifications and adaptations are encompassed by the attached claims.