Title:
METHOD FOR CONTROLLING BUFFERING OF PACKETS IN A COMMUNICATION NETWORK
Document Type and Number:
WIPO Patent Application WO/2015/090719
Kind Code:
A1
Abstract:
The present invention concerns a method for controlling buffering of packets (1) in a communication network. The method comprises storing packets (1) of one or more packet flows in a queue (2) of a buffer (3), setting a threshold (4) on the queue (2), and determining whether the queue (2) exceeds the threshold (4). If the queue (2) exceeds the threshold (4), a congestion notification is provided on a first packet (1a) of the queue (2) causing the exceeding of the threshold (4) and a timeout interval (5) is started. Until expiry of the timeout interval (5), any threshold violation is ignored by not providing a congestion notification on further packets (1b) of the queue (2) causing the exceeding of the threshold (4) during the timeout interval (5). The present invention further concerns a network element (8) for a communication network, the network element (8) being adapted to store packets (1) of one or more packet flows in a queue (2) of a buffer (3), to set a threshold (4) on the queue (2), to determine whether the queue (2) exceeds the threshold (4) and, if the queue (2) exceeds the threshold (4), to provide a congestion notification on a first packet (1a) of the queue (2) causing the exceeding of the threshold (4), to start a timeout interval (5) and to ignore any threshold violation until expiry of the timeout interval (5) by not providing a congestion notification on further packets (1b) of the queue (2) causing the exceeding of the threshold (4) during the timeout interval (5). The present invention further concerns a computer program product adapted to perform the above method when executed on a control unit (9).

Inventors:
LAUTENSCHLAEGER WOLFRAM (DE)
Application Number:
PCT/EP2014/073945
Publication Date:
June 25, 2015
Filing Date:
November 06, 2014
Assignee:
ALCATEL LUCENT (FR)
International Classes:
H04L47/32
Foreign References:
US 5914936 A (1999-06-22)
EP 2040423 A1 (2009-03-25)
Other References:
None
Attorney, Agent or Firm:
WETZEL, Emmanuelle (Stuttgart, DE)
Claims:

1. A method for controlling buffering of packets (1) in a communication network, the method comprising:

- storing packets (1) of one or more packet flows in a queue (2) of a buffer (3);

- setting a threshold (4) on the queue (2);

- determining whether the queue (2) exceeds the threshold (4);

- if the queue (2) exceeds the threshold (4), providing a congestion notification on a first packet (1a) of the queue (2) causing the exceeding of the threshold (4) and starting a timeout interval (5); and

- ignoring any threshold violation until expiry of the timeout interval (5) by not providing a congestion notification on further packets (1b) of the queue (2) causing the exceeding of the threshold (4) during the timeout interval (5).

2. The method according to claim 1, wherein the timeout interval (5) is set in the range of 1 to 3 round trip times.

3. The method according to claim 1, wherein the timeout interval (5) is dynamically adapted, based on the time that the queue (2) is above the threshold (4) and the time that the queue (2) is below the threshold (4), but not empty.

4. The method according to claim 3, wherein the timeout interval (5) is determined as a function of

    theta := cumulative time {above - below} threshold
    k := max(1, gamma * theta)
    interval := initial_interval / k

wherein k is a factor, which is a maximum of 1 and a product of a scaling parameter gamma and a parameter theta, wherein theta is a function of a cumulative time of a time that the queue (2) is above the threshold (4) and a time that the queue (2) is below the threshold (4), but not empty.

5. The method according to claim 1, wherein the method further comprises:

- starting the timeout interval (5), if the queue (2) is determined to be empty.

6. The method according to claim 1, wherein the threshold (4) is dynamically adapted so that the queue (2) drains to zero at least once in a predetermined time frame.

7. The method according to claim 1, wherein the step of setting a threshold (4) comprises:

- setting the threshold (4) on queue size, wherein the queue size is defined by a length or a volume of the queue (2).

8. The method according to claim 7, wherein the step of determining whether the queue (2) exceeds the threshold (4) is executed at enqueuing (6) of the packets (1) of the one or more packet flows in the queue (2).

9. The method according to claim 7, wherein the threshold (4) is set below a maximum capacity (10) of the buffer (3), in particular at less than 60% of the buffer size, preferably at less than 50% of the buffer size.

10. The method according to claim 1, wherein the step of setting a threshold (4) comprises:

- setting the threshold (4) on queuing delay.

11. The method according to claim 10, wherein the step of determining whether the queue (2) exceeds the threshold (4) is executed at dequeuing (7) of the packets (1) of the one or more packet flows out of the queue (2).

12. The method according to claim 10, wherein the threshold (4) is set below a maximum predetermined delay (11) assigned to the one or more packet flows.

13. The method according to claim 1, wherein the step of providing a congestion notification on a first packet (1a) comprises:

- dropping the first packet (1a) and/or marking the first packet (1a).

14. A network element (8) for a communication network, the network element (8) being adapted to store packets (1) of one or more packet flows in a queue (2) of a buffer (3), to set a threshold (4) on the queue (2), to determine whether the queue (2) exceeds the threshold (4) and, if the queue (2) exceeds the threshold (4), to provide a congestion notification on a first packet (1a) of the queue (2) causing the exceeding of the threshold (4) and to start a timeout interval (5) and to ignore any threshold violation until expiry of the timeout interval (5) by not providing a congestion notification on further packets (1b) of the queue (2) causing the exceeding of the threshold (4) during the timeout interval (5).

15. A computer program product adapted to perform the method of claim 1, when executed on a control unit (9).

Description:
Method for controlling buffering of packets in a communication network

Background

TCP traffic, which constitutes the majority of Internet traffic, requires large buffers to efficiently fill the transmission capacities. The Bandwidth * Delay product rule for buffer dimensioning has been dominant for decades. However, large buffers in combination with TCP traffic cause large queuing delays and jitter, which is bad for real-time applications. On the other hand, devices with smaller buffers, e.g. some Ethernet switches, do not reach the capacity limit at large round trip times.

Conventionally, a buffer drops the packets at the tail of the queue which it cannot buffer any more, because it is full. This is also termed "tail drop", as the packets are dropped at the tail of the queue at the moment of the decision whether to put and keep packets in the queue of the buffer or to drop them at the tail of the queue. If the transmission capacity is constantly exceeded, any buffer overflows, regardless of its size. In other words, buffers are there to cope with short-term overloads by keeping the excess load in a queue for subsequent release when the overload is over. As a consequence, buffers should be empty most of the time. If, however, a buffer is large but full all the time, it cannot absorb short-term overloads anymore and, even worse, adds unneeded queuing delay, which might be called "bufferbloat".

If a network becomes congested, the buffers of the network are permanently full. As it is rather a question of chance from which packet flows the packets are buffered or dropped, tail drop is regarded as unfair. In case of dropped packets, the senders of the packet flows, which are informed by the receivers of the traffic flows that packets are missing, reduce the data rate of the packet flows by sending fewer packets per second. In case packets of a plurality of traffic or packet flows are dropped by the buffer, correspondingly a plurality of senders will reduce the data rate of the plurality of traffic flows, which might lead to underutilization of the transmission capacity. If after some time all senders restart sending packets with increased data rate, this might again lead to congestion. This common reaction of reducing and increasing the data rate by many senders of packet flows, with the consequence of either underutilization or congestion of the transmission links and thus of the network, is also called "global synchronization". Global synchronization should be avoided.

Currently, some Active Queue Management (AQM) schemes such as RED (Random Early Detection) and CoDel (Controlled Delay) are implemented. RED determines the average queue size of the queue of packets stored in the buffer. Based on statistically determined probabilities, packets are dropped or stored in the buffer. The probability of dropping packets increases as the queue gets filled. As dropped packets cause the receiver of a packet flow in which packets are missing to inform the sender of the packet flow of the dropped or missed packets, the dropping of the packets by the buffer might be understood as an implicit congestion notification of the buffer. That is, by dropping packets, the buffer implicitly notifies that it is congested. As an alternative to this implicit congestion notification by dropping packets, packets are stored in the buffer (as long as the maximum capacity is not exceeded) and marked with an explicit congestion notification (ECN) to notify that the buffer has reached a predetermined limit. This explicit congestion notification will also cause the receiver of the packet flow comprising marked packets to inform the sender of the packet flow to reduce the data rate of the packet flow. The drawback of RED is that it is hard to deploy, because the implementation needs many parameters to be set dependent on the network conditions, which might be unclear and hardly possible to determine from the perspective of the operator of the buffer.
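For illustration, the drop decision commonly used in RED-style schemes can be sketched as follows (a sketch of the prior-art scheme only; min_th, max_th and max_p stand for the operator-set parameters mentioned above and are assumptions of this illustration):

    # Sketch of a common RED-style drop probability (prior art, illustrative).
    # min_th, max_th, max_p: operator-set parameters referred to above.
    def red_drop_probability(avg_queue_size, min_th, max_th, max_p):
        if avg_queue_size < min_th:
            return 0.0      # below the lower threshold: never drop
        if avg_queue_size >= max_th:
            return 1.0      # above the upper threshold: always drop
        # drop probability rises linearly between the two thresholds
        return max_p * (avg_queue_size - min_th) / (max_th - min_th)

The need to choose min_th, max_th and max_p appropriately for the prevailing network conditions is exactly the deployment difficulty described above.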

In contrast to RED, CoDel does not need parameters to be set as in RED. CoDel determines the queuing delay at each hop of the network over an interval which is initially set to 100 milliseconds. When a packet is put out of the buffer for forwarding, the queuing delay of this packet is determined and used to decide whether the packet is dropped or forwarded, causing the interval to be shortened or reset to the initial value, respectively. However, the results of CoDel in avoiding the bufferbloat problem and congestion of the network have been found to be not really better than plain tail drop and worse than well-tuned RED.
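The dequeue-time decision of CoDel can be sketched roughly as follows (a strongly simplified illustration of the published idea, not the exact CoDel state machine; the state object and the packet timestamp are assumptions of this sketch):

    import math, time

    # Rough sketch of a CoDel-like dequeue decision (prior art, simplified).
    # state.count starts at 0, state.first_above_time at None.
    def codel_like_dequeue(state, packet, target=0.005, interval=0.100):
        sojourn = time.monotonic() - packet.enqueue_time
        if sojourn < target:
            state.first_above_time = None    # delay acceptable again: reset
            return packet                    # forward the packet
        if state.first_above_time is None:   # delay first seen above target
            state.first_above_time = time.monotonic() + interval
            return packet
        if time.monotonic() >= state.first_above_time:
            state.count += 1                 # drop and shorten the next interval
            state.first_above_time = (time.monotonic()
                                      + interval / math.sqrt(state.count))
            return None                      # packet dropped
        return packet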

Alternatively, for large traffic aggregates with more than 500 TCP flows and a large spread of round trip times, for example at Internet backbones, the Bandwidth * Delay product rule has been reduced by a 1/sqrt(N) factor (N = the number of TCP flows, sqrt = square root) while implementing plain tail drop.
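Expressed as a small calculation (an illustrative sketch; bandwidth in bits per second and round trip time in seconds are assumed):

    import math

    # Buffer dimensioning rules mentioned above (illustrative sketch).
    def buffer_size_bits(bandwidth_bps, rtt_s, n_flows=1):
        bdp = bandwidth_bps * rtt_s          # Bandwidth * Delay product rule
        return bdp / math.sqrt(n_flows)      # reduced rule for large aggregates

For example, a 10 Gbit/s link with a 100 ms round trip time and N = 2500 flows would, under the reduced rule, need about 1 Gbit / 50 = 20 Mbit of buffer instead of the full Bandwidth * Delay product of 1 Gbit.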

Nevertheless, this is not applicable for N < 100 and also not for uniform RTT (round trip time) distributions like, e.g., on interconnection links between large data centers. The round trip time indicates how long it takes between the sending of a signal or a packet and the receiving of an acknowledgment from the receiver of the signal at the sender, by which acknowledgment the receiver indicates whether it has received the signal or not, or that it has received a packet flow but some packets are missing. More generally, the round trip time could be explained as the time that a signal takes from a start point of the way to the end point plus the time that a corresponding signal takes from the end point to the start point. The round trip time might be about 100 - 300 milliseconds at Internet scales.

Summary

The present invention aims to provide an improved solution to the bufferbloat problem and to provide improved congestion avoidance, in particular to avoid global synchronization.

This objective of the invention is achieved by a method for controlling buffering of packets in a communication network. The method comprises storing packets of one or more packet flows in a queue of a buffer, setting a threshold on the queue and determining whether the queue exceeds the threshold. If the queue exceeds the threshold, a congestion notification is provided on a first packet of the queue which causes the exceeding of the threshold and a timeout interval is started. Until expiry of the timeout interval, any threshold violation is ignored. This means that even if it is determined that further packets of the queue cause the queue to exceed the threshold, a congestion notification on these "exceeding" or "excess causing" packets is not provided. Thus, the threshold violation of the queue caused by these packets is ignored by not providing a congestion notification on further packets of the queue causing the exceeding of the threshold during the timeout interval. In this context, if it is said that packets exceed the threshold or the term exceeding packet(s) is used, it is more precisely meant that the queue is caused by these packet(s) to exceed the threshold.

The objective of the present invention is further achieved by a network element for a communication network. The network element is adapted to store packets of one or more packet flows in a queue of a buffer, to set a threshold on the queue, to determine whether the queue exceeds the threshold and, if the queue exceeds the threshold, to provide a congestion notification on a first packet of the queue causing the exceeding of the threshold and to start a timeout interval. Furthermore, the network element is adapted to ignore any threshold violation by further packets until expiry of the timeout interval. This means the network element is adapted not to provide a congestion notification on further packets of the queue causing the exceeding of the threshold during the timeout interval.
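In code, the described control loop might be sketched as follows (a minimal illustration, assuming a FIFO queue with a size-based threshold and a monotonic clock; all names and parameters are chosen here for illustration only):

    import time

    class ThresholdTimeoutBuffer:
        """Minimal sketch of the described scheme: notify congestion only on
        the first packet causing the queue to exceed the threshold, then
        ignore threshold violations until the timeout interval expires."""

        def __init__(self, threshold, timeout_interval):
            self.threshold = threshold                # threshold (4) on queue size
            self.timeout_interval = timeout_interval  # timeout interval (5), seconds
            self.timeout_expiry = 0.0                 # no timeout running initially
            self.queue = []                           # queue (2) of the buffer (3)

        def enqueue(self, packet):
            self.queue.append(packet)
            now = time.monotonic()
            if len(self.queue) > self.threshold and now >= self.timeout_expiry:
                self.notify_congestion(packet)        # first exceeding packet (1a)
                self.timeout_expiry = now + self.timeout_interval
            # further exceeding packets (1b) are ignored while the timeout runs

        def notify_congestion(self, packet):
            packet.marked = True                      # mark and/or drop, see below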

Furthermore, the objective of the present invention is achieved by a computer program product adapted to perform the above cited method for controlling buffering of packets in a communication network, when executed on a control unit. The control unit may be implemented as a single unit, a stand-alone device, or within a database, integrated in a computer and/or within a computer network.

The features of embodiments both of the independent claims and of the dependent claims of the present invention might be implemented in hardware, software or a combination thereof.

In particular, the network element and/or the control unit may be implemented through the use of hardware, software and/or hardware capable of executing software in association with appropriate software.

More specifically, the network element and/or the control unit can be comprised or implemented by circuit-based processes, including possible implementation as a single integrated circuit, such as an ASIC (= Application Specific Integrated Circuit) or an FPGA (= Field Programmable Gate Array), a multi-chip module, a single card, or a multi-card circuit pack. The functions of the network element and/or control unit may be implemented as processing blocks in a software program. Such software may be employed in a digital signal processor, micro-controller, or general-purpose computer implemented as a single device or integrated in a computer network.

The network element and/or the control unit may comprise program code embodied in tangible media, such as magnetic recording media, optical recording media, solid state memory, floppy diskettes, CD-ROMs, hard drives, or any other machine-readable storage medium, wherein, when the program code is loaded into and executed in the network element and/or the control unit, the network element and/or the control unit become an apparatus used for practicing the invention.

Further advantages of the present invention are achieved by features of preferred embodiments. The features of preferred embodiments might be implemented in any suitable manner, in particular in combination with the features of further preferred embodiments, thus leading to again further preferred embodiments. The features disclosed within one preferred embodiment might also be implemented without one or more features of the same preferred embodiment in case the one or more features of the preferred embodiment are not mandatory for implementing the preferred embodiment. Thus, one or more features of one or more preferred embodiments might be implemented in any suitable manner resulting in further preferred embodiments.

In a preferred embodiment, the timeout interval is set in the range of 1 to 3 round trip times. The round trip time might be defined as the time it takes for a signal, e.g. a packet of a packet flow, to go from one point of a communication path, for example from the sender of the signal or packet, to another point of the communication path, for example to the receiver of the signal or packet, plus the time it takes for the signal or packet or a corresponding signal, for example a response or acknowledgment message, to go the same way back. During the timeout interval, the violation of the threshold is ignored. If the queue exceeds the threshold and there is no timeout interval running, a congestion notification is provided. This congestion notification finally gets to the sender of the packet flow and causes the sender to reduce the data rate of the packet flow. The reduction of the data rate will take effect only at the moment when the packet flow with the reduced data rate arrives at the buffer. It is one objective of the present invention to avoid global synchronization. Thus, the method is implemented so as not to reduce the data rate of all packet flows or of many packet flows which are processed by the buffer, even in case many packets of these packet flows cause the queue to exceed the threshold. Actually, after a first packet of a (first) packet flow is determined to exceed the threshold, or in other words to cause the queue to exceed the threshold, only on this packet a congestion notification is provided. Because of the congestion notification on this packet, the sender of the packet flow to which this first packet belongs will reduce the data rate of said packet flow. Preferably, the congestion notification provided by the buffer will be determined by the receiver of the packet flow. The receiver of the packet flow will consequently inform the sender of the packet flow to reduce the data rate of the packet flow. However, it will take the time that the congestion notification needs to be received by the sender, plus the time that packets with the reduced data rate need to get to the buffer, until the reduction of the data rate of the packet flow takes effect. During this time, preferably no further packet flows should be caused to reduce their data rates, or more precisely no senders of further packet flows should be caused to reduce the data rate of these further packet flows, even in case the packets of the packet flows exceed the threshold. It is desired to wait and see whether, for example, the reduction of the data rate of the first packet flow is enough to avoid buffer overflow, or at least whether the number of packets exceeding the threshold is reduced. That is, because the data rate of the first packet flow is reduced, the buffer will have to process fewer packets in total, so that further reduction of data rates of one or more further packet flows might not be necessary any more. However, as described, it will take at least one round trip time for the reduction of the data rate of the first packet flow to take effect. Therefore, the timeout interval might be set at one round trip time (RTT).

However, the timeout interval might be set at higher values, for example at two round trip times. This is because the round trip time is typically unknown to the queue manager and therefore a reasonable (worst case) guess is applied. Furthermore, in practice sub-RTT burstiness of flows might typically cause another RTT of delay until the effective rate reduction takes effect. This is because the packets are often processed, sent or buffered bulk-wise. Therefore, the receiver of a packet flow, or the sender of a packet flow which receives an acknowledgment message, might need more time to determine that a packet is missing or that a congestion notification is received. Therefore, the timeout interval might be set at two RTT. Furthermore, it might be preferable that the timeout interval is set at three RTT. By a longer timeout interval, the violation of the threshold is ignored for a correspondingly longer time period, with the result that the reduction of the data rate of more packet flows is avoided, which despite exceeding the threshold are sent with their initial, un-reduced data rate. By this the transmission capacity is better used, because otherwise too many data flows would have reduced their data rate, leading to an empty buffer and, in consequence, an idle transmission link.

In a preferred embodiment, the timeout interval is dynamically adapted, based in particular on the time that the queue is above the threshold and the time that the queue is below the threshold. During the timeout interval, packets exceeding the threshold are not provided with a congestion notification. A congestion notification causes the sender of the corresponding packet flow to reduce the data rate of the packet flow. The reduction of the data rate of the packet flow takes effect only at least one round trip time after the congestion notification is provided. In normal operation, after the timeout the queue size should be below the threshold, because the one flow has reduced its data rate. However, there might be operational conditions where this single reduction is not sufficient to drop below the threshold. Or there might be operational conditions where, after the drop below the threshold, the subsequent data rate increase is so fast that at timeout expiration the threshold is already violated for the next time. Both cases are characterized by the queue being most of the time above the threshold. Both cases can be avoided by an adaptation towards shorter timeouts. Briefly, therefore, it might be preferable to adapt the timeout interval dynamically. In principle, the longer the timeout interval is, the more often a violation of the threshold is ignored, thus the packet flows will increase their data rate. The shorter the timeout interval is, the more often packets exceeding the threshold will be provided with a congestion notification, thus leading the sender of the packet flow to reduce the data rate of the respective packet flow. In the first case, due to too many packets in the buffer and no congestion notification during the timeout interval, there might be too many packets, possibly causing the queue to reach the maximum capacity of the buffer and leading to tail drop of packets which cannot be buffered any more. In the second case, there might be too few packets in the sense that the buffer capacity is not fully used. Therefore, the timeout interval is preferably adjusted according to the actual traffic characteristics.
Particularly, if the number of TCP flows is large, the timeout interval should be smaller than the initial timeout interval setting.

In this context, it should be noted that the number of TCP flows might be defined as large if there are more flows than the typical congestion window size of one of the flows. Each sender of a packet flow has typically determined a congestion window size for each packet flow. The congestion window size is an amount of packets of the packet flow which the sender will send, even if the sender does not yet have any acknowledgement of whether the packets have been received or not. In other words, a sender of data packets sends packets and waits for an acknowledgement, e.g. of successful reception of the packets or of missed packets, from the receiver of the packets. As long as the sender has not yet received any acknowledgement for a particular packet, this packet is called a packet in flight. The sender does not send any further packets of a packet flow if the amount of packets in flight has reached a predetermined value which is determined by the congestion window size. Only if the sender receives an acknowledgment for a packet (which is then no packet in flight any more), the sender will send another packet, which is a packet in flight until an acknowledgment for this packet is received. The amount of packets in flight is determined by the congestion window size of the packet flow. Accordingly, in preferred embodiments, the number of TCP flows in the network is defined as large if the number of TCP flows is more than the typical congestion window size of one of the flows.
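This window-limited sending can be stated compactly (an illustrative sketch; the names are assumptions of this illustration):

    # Illustrative window-limited sender: at most cwnd packets in flight.
    def may_send_next_packet(packets_in_flight, cwnd):
        return packets_in_flight < cwnd

    # "Large" number of TCP flows in the sense used above.
    def number_of_flows_is_large(num_flows, typical_cwnd):
        return num_flows > typical_cwnd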

In a preferred embodiment, the timeout interval is determined as a function of the value of the initial interval, i.e. the interval as set at the beginning of implementing the method of controlling the buffering of the packets, the time that the queue is above the threshold, the time that the non-empty queue is below the threshold and a scaling parameter gamma. The cumulative time of the time that the queue is above the threshold and the time that the queue is below the threshold, but not empty, is calculated; that is, the time that the queue is above the threshold is reduced by the time that the queue is below the threshold, but not considering the time that the queue is empty. The time that the queue is empty is not considered, because the time of an empty queue is not relevant for the setting or adapting of the timeout interval: the queue being empty because of too small a traffic offer is not relevant for setting or adapting the timeout interval. The difference is the cumulative time, which is defined as the variable theta. The variable theta is multiplied by a predetermined scaling parameter gamma that controls the sensitivity of adaptation. If the product of gamma and theta is greater than one, then the variable k is defined as the value of the product of gamma and theta. If the product of gamma and theta is smaller than one, then the variable k is defined with the value one. This means k is at least one. The adapted interval is determined by dividing the initial interval by the variable k. If k is one, the adapted interval is the same as the initial interval. If k is greater than one, the adapted interval is reduced in comparison with the initial interval. Preferably, the timeout interval (= interval in the formula) is determined by the formula:

    theta := cumulative time {above - below} threshold
    k := max(1, gamma * theta)
    interval := initial_interval / k
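A direct transcription of this formula might look as follows (a sketch; seconds are assumed as the unit for all times):

    # Sketch of the dynamic timeout adaptation described above.
    def adapt_timeout_interval(initial_interval, time_above,
                               time_below_nonempty, gamma):
        theta = time_above - time_below_nonempty  # cumulative {above - below} time
        k = max(1.0, gamma * theta)               # k is at least one
        return initial_interval / k               # shorter timeout if mostly above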

During the timeout interval, a violation of the threshold by the queue is ignored. If it is determined that the queue exceeds the threshold before the timeout interval has expired, no further congestion notification is provided. This means the one or more packets of the queue which are above the threshold and thus violating the threshold are not provided with a congestion notification. In a preferred embodiment, even the step of determining a threshold violation within the timeout period is not implemented; that is, it is not even determined at all during the timeout period whether the queue exceeds the threshold.

Preferably, if after expiry of the timeout interval it is determined that the queue still exceeds the threshold, a congestion notification is provided on the first packet causing, after expiry of the timeout interval, the queue to exceed the threshold.

Preferably, if after expiry of the timeout interval the queue does not exceed the threshold, no congestion notification is provided. However, if the queue then again exceeds the threshold and there is no timeout interval running, a congestion notification is provided on the first packet of the queue causing the queue to exceed the threshold.

In a preferred embodiment, the timeout interval is started or restarted if the queue is determined to be empty. That is, the timeout interval is not only started if a congestion notification is provided on a packet, but also if the queue is found to be empty. It might happen that the packets of the queue which have been stored in the buffer are all forwarded, that is, taken out of the buffer or out of the queue. At this moment, the queue is empty, because there are no packets in the buffer. Accordingly, the sender(s) of the packet flows might increase the data rates very much, because the sender(s) of the packet flows are not aware of any bottleneck within the network, as the packets are forwarded and no congestion notification is provided to the sender(s). Accordingly, due to the strong increase of data rate, a burst of packet flows might occur. It might be preferable to prevent large packet bursts from immediately triggering a congestion notification in case a first packet exceeds the threshold. Therefore, the timeout interval, during which any threshold violation is ignored, is started or restarted in case the queue is found to be empty.
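In terms of the earlier sketch, this empty-queue rule could be added at dequeuing, for example:

    # Extension of the earlier ThresholdTimeoutBuffer sketch: restart the
    # timeout interval when the queue drains empty.
    def dequeue(self):
        packet = self.queue.pop(0)
        if not self.queue:    # queue empty: ignore violations for a while
            self.timeout_expiry = time.monotonic() + self.timeout_interval
        return packet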

In preferred embodiments, the threshold is dynamically adapted. Without adapting the threshold, the queue could oscillate around the threshold far away from zero, the queue thus being constantly far from empty, which means an unnecessary queuing delay. In a preferred embodiment, the threshold can be decreased if the queue never drains empty over a predetermined time frame, e.g. 1 to 10 seconds. In a preferred embodiment, the threshold can be decreased if the queue hits the buffer size (tail drop) during the timeout interval. Preferably, the threshold can be piecemeal increased if neither of both conditions occurs. Preferably, the threshold can be piecemeal increased if the queue drained empty over a predetermined time frame, e.g. 1 to 10 seconds. Preferably, the threshold can be piecemeal increased if the queue did not hit the buffer size during the timeout interval. Preferably, the threshold can be piecemeal increased if the queue drained empty over a predetermined time frame, e.g. 1 to 10 seconds, and the queue did not hit the buffer size during the timeout interval, wherein preferably the predetermined time frame and the timeout interval might overlap at least partially.

In a preferred embodiment, the threshold is dynamically adapted so that the queue drains to zero at least once in a predetermined time frame. It might be preferable that the queue is empty at least once in a predetermined time frame, e.g. 1 to 10 seconds. In case, within the predetermined time period, the queue actually drains to zero at least one time, i.e. is empty at least one time, the threshold might be kept as it is. In case, within the predetermined time period, the queue does not drain to zero at least one time, the threshold is reduced, i.e. set at a lower level. In case, during the predetermined time frame, the queue drains to zero for example several times, in particular for a predetermined amount of times, e.g. four times, the threshold is preferably set to a higher value, which might be the value before the threshold had been reduced or another value higher than the reduced value. Similarly, in case, during the predetermined time frame, the queue keeps on being empty for a predetermined time period, e.g. 50 milliseconds, the threshold is set to a higher value, which might be the value before the threshold had been reduced or another value higher than the reduced value.
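One possible transcription of these adaptation rules (a sketch; the step size and the two boolean decision inputs are assumptions of this illustration):

    # Sketch of the dynamic threshold adaptation described above.
    def adapt_threshold(threshold, drained_empty, hit_buffer_limit, step=0.1):
        # decrease if the queue never drained empty over the time frame,
        # or if it hit the buffer size (tail drop) during the timeout interval
        if not drained_empty or hit_buffer_limit:
            return threshold * (1.0 - step)
        # otherwise increase piecemeal
        return threshold * (1.0 + step)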

In a preferred embodiment, the setting of the threshold is based on queue size. In particular, the queue size might be defined by a length of the queue or a volume of the queue. In this case, the queue exceeds the threshold if the amount of packets in the queue is greater than the amount of packets which is determined as the threshold of the queue. For example, the threshold is set at a value of 100 000 packets. Then, the queue exceeds the threshold if the queue has more than 100 000 packets. In other words, the 100 001st packet is the first packet of the queue which causes the queue to exceed the threshold. It might also be said that the 100 001st packet is the first packet exceeding the threshold.

For exemplary reasons, in the above described embodiments, this 100 001st packet is the packet on which the congestion notification is provided, as this packet is the first packet of the queue exceeding the threshold.

In a preferred embodiment, it is determined at enqueuing of the packets of the one or more packet flows in the queue whether the queue exceeds the threshold. Packets which arrive at the buffer are typically processed by the buffer by storing the packets in the queue in the buffer, which is termed enqueuing (of the packets in the queue, thus in the buffer). As mentioned above, in case it is determined that the queue exceeds the threshold due to the packets exceeding the threshold, a congestion notification is provided on the first packet of the queue which actually exceeds the threshold.

The congestion notification might be provided by dropping the packet. In this case, the first packet of the queue exceeding the threshold is dropped. If the packet is dropped, the packet is not kept in the buffer and thus is not forwarded out of the buffer to the receiver of the corresponding packet flow. The receiver of said packet flow determines that the packet flow misses the packet which was dropped at the buffer. Accordingly, the receiver of the packet flow sends a message to the sender of the packet flow informing the sender of the missing packet. The sender, however, knows that it has sent the packet on its way to the receiver. Consequently, the sender concludes that the packet which has not reached the receiver must have been lost on the way to the receiver, in particular that the packet might have been dropped by a buffer. The sender knows that a buffer in general drops packet(s) of a packet flow in case the buffer cannot process, i.e. buffer and forward, all packets arriving at the buffer, in particular because the data rates of one or more of the packet flows to which the packet(s) belong are too high and thus there are too many packets at the buffer. Consequently, the sender of the packet flow with the missing packet reduces the data rate of the packet flow to which the missing packet belongs. Conclusively, by dropping a packet of a packet flow the buffer notifies the receiver of a congestion of the buffer.

Therefore, the dropping of a packet is a congestion notification provided on the dropped packet. The congestion notification on a packet by dropping the packet might be termed an "implicit" congestion notification. The receiver of the packet flow to which the dropped packet belonged is notified implicitly of the congestion, because the receiver will determine that a packet is missing in the packet flow, namely the dropped packet. Then, the receiver communicates a corresponding message about the missing packet, and therefore potentially about the congestion of the buffer, to the sender of the packet flow to which the dropped packet belongs.

In a preferred embodiment, a congestion notification by the buffer is provided by marking the first packet of the queue exceeding the threshold. The marking of the first packet might be provided by an entry in the header fields of the packet indicating the congestion of the buffer, i.e. by setting a value in a packet header field. The marking of a packet to notify the receiver of the packet of the congestion of the buffer is termed explicit congestion notification. The receiver of the explicit congestion notification, i.e. of the packet comprising the indicated congestion notification, sends a corresponding message to the sender of the packet flow to which the marked packet belongs, informing the sender about the congestion notification of the buffer.

In a preferred embodiment, a first packet exceeding the threshold might be marked and subsequently dropped. In this case, the marking of the packet might be regarded as an indication that the packet is to be dropped, preferably if dropping is needed.
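The notify_congestion step of the earlier sketch could then distinguish the two variants, for example (a sketch; the ECN-related packet attributes are assumptions of this illustration, not a concrete protocol API):

    # Sketch of the two congestion notification variants described above.
    def notify_congestion(self, packet, prefer_marking=True):
        if prefer_marking and packet.ecn_capable:
            packet.congestion_experienced = True   # explicit notification (mark)
        else:
            self.queue.remove(packet)              # implicit notification (drop)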

In a preferred embodiment, the threshold is set below a maximum capacity of the buffer, in particular at less than 60% of the buffer size, preferably at less than 50% of the buffer size. The lower the threshold is set, the more likely and/or the more frequently, in principle, a packet causes the queue to exceed the threshold. If the threshold is set to a low value, the queue will often exceed this low value, resulting in a packet being dropped or marked. The packet which is dropped is the first packet which causes the queue to exceed the threshold. The queue comprises the packets of packet flows arriving at the buffer. The buffer processes the packets by putting the packets in the queue (enqueuing), storing the packets and putting the packets out of the queue (dequeuing). Because of the arriving packets, the queue might be rising, as more packets arrive at the buffer than are put out of the buffer. As the packets fill up the buffer, the queue of the packets in the buffer gets longer and longer.

In a preferred embodiment, the threshold is set on the size of the queue, in particular on the length of the queue. Preferably, the size of the queue might be determined in terms of packets. Preferably, the size of the queue might be determined in terms of bytes. As the packets arrive at the buffer, the queue length increases if more packets arrive at the buffer than can be processed by the buffer, in particular by dequeuing the packets out of the queue and forwarding the packets. As more packets arrive at the buffer for enqueuing than packets are put out of the buffer at dequeuing, at a certain moment the length of the queue will exceed the threshold. In other words, in principle each newly arriving packet makes the queue longer, and each packet being put out of the buffer at dequeuing makes the queue shorter. As more packets arrive at the buffer than packets leave the buffer, the queue length increases and there is a first packet which can be determined as causing the queue to exceed the threshold. In other words, one of the packets arriving at the buffer and put in the queue will be the first packet by which the length of the queue exceeds the threshold, because in this example the length of the queue is more than the length determined by the threshold. The length of the queue might be indicated by the number of packets which are in the queue. If the number of packets exceeds the number of packets determined by the threshold, the queue exceeds the threshold. The first packet is the packet of the queue by which the number of packets in the queue becomes higher than the number of packets determined by the threshold. According to the implementation, the first packet is the packet of the queue by which the length of the queue becomes longer than the length of the queue determined by the threshold. Further according to the implementation, the first packet is the packet of the queue by which the volume of the queue becomes larger than the volume of the queue determined by the threshold.

In a preferred embodiment, the threshold is set on queuing delay. The packets of the one or more packet flows are stored in the buffer until they are put out of the buffer for forwarding. The time that a packet has been stored in the buffer until being put out of the buffer, that is until dequeuing, is termed queuing delay. According to the implementation, it is determined at every dequeuing of a packet how long the packet has been stored in the buffer, this means how long the queuing delay of the packet is. Although this packet could be regarded as ready for forwarding to the receiver, because the packet has just been put out of the buffer, this means the buffering of the packet has been successfully processed, nevertheless this packet might be dropped or at least marked with a congestion notification. That is, at every dequeuing of each packet, the queuing delay of the packet is determined. It might be determined that the queuing delay of the packet is too long. In particular, it might be determined that the queuing delay exceeds a predetermined value indicated by a threshold which is in this case set on the queuing delay. In other words, it might be determined that the queue exceeds the threshold which is set on queuing delay. Again in other words, a packet might be determined of which the queuing delay exceeds the threshold set on queuing delay. Accordingly, a congestion notification is provided on a packet. The congestion notification is provided on the first packet of the queue which exceeds the threshold set on queuing delay.

In summary, for each packet stored in the buffer the queuing delay, which is the time period that the packet is stored in the buffer, is determined. If the packets cannot be forwarded within the time of queuing delay determined by the threshold, in particular if the capacity of the network for forwarding the packets out of the buffer is not sufficient, the queuing delay of the packets increases more and more. Therefore, at a certain moment or from a certain moment on, the packets are stored longer than the time which is determined by the threshold. At a certain moment the queuing delay is longer than the time period determined by the threshold. The first packet for which it is determined that the queuing delay is longer than the time period of the queuing delay set by the threshold is the packet on which a congestion notification is provided. The congestion notification on the first packet causing the queue to exceed the threshold might be provided by dropping the packet or marking the packet or marking and dropping the packet. Analogously to the above described embodiments, the congestion notification by dropping and/or marking the packet causes the sender of the packet flow to which the dropped and/or marked packet belongs to reduce the data rate of that packet flow.
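The delay-based variant could be sketched as follows (illustrative; a monotonic enqueue timestamp on each packet and a self.delay_threshold attribute are assumptions of this sketch):

    # Sketch of the delay-based variant: the threshold is set on queuing
    # delay and checked at dequeuing instead of at enqueuing.
    def dequeue_with_delay_check(self):
        packet = self.queue.pop(0)
        now = time.monotonic()
        queuing_delay = now - packet.enqueue_time   # sojourn time in the buffer
        if queuing_delay > self.delay_threshold and now >= self.timeout_expiry:
            self.notify_congestion(packet)          # drop and/or mark
            self.timeout_expiry = now + self.timeout_interval
        return packet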

Preferably, the threshold is set below a maximum predetermined delay assigned to the one or more data flows (or packet flows). In a preferred embodiment, to the one or more data flows, more precisely to the packets of one or more of the one or more data flows, a maximum delay might be predetermined. For example, a data flow might carry a particular application, which might in particular be a real-time application such as IP telephony.

Accordingly, the buffer delay of the packets of said application should be rather low. Therefore, a threshold is set on the delay of the packets of said data flow at a rather low value. The threshold is in particular set below a maximum predetermined delay assigned to the data flow. As the threshold is set below the maximum predetermined delay, a packet of the data flow which causes the queue to exceed the threshold is dropped and/or marked, so that consequently the sender of the packet flow is caused to reduce the data rate of the packet flow or, alternatively, to look for another path through the communication network for transporting the packet flow. The threshold is set below a maximum predetermined delay assigned to the data flow in particular so that the data rate of the packet flow might be reduced or an alternative path found for the data flow before the quality of the data transport becomes too poor. In other words, if the notification message were provided only at the moment at which the maximum predetermined delay is reached, the quality of the data transport of the packet flow might already be very bad due to the large buffer delay. As the threshold is set below the maximum predetermined delay, a notification message upon excess of the threshold is already provided at a moment at which the quality of transport might still be acceptable. The method does not wait until the maximum predetermined delay of the packet is reached before providing a notification message, because the notification message is already provided at the moment of and due to the excess of the threshold by the queue, below the maximum predetermined delay. In other words, the data rate of the packet flow is reduced or an alternative path for the packet flow is searched and possibly chosen while the quality of transport of the packet flow is still acceptable, or at least better than if the congestion notification were provided only in case the maximum predetermined delay had been reached or exceeded by the queue.

Brief description of the figures

The features and advantages of the present invention will be more readily understood by studying the following preferred embodiments with reference to the following figures of which

Fig. 1 depicts a schematic diagram of steps of a method for controlling buffering of packets in a communication network,

Fig. 2 depicts a schematic diagram of steps of a method for controlling buffering of packets in a communication network,

Fig. 3 depicts a schematic diagram of steps of a method for controlling buffering of packets in a communication network,

Fig. 4 depicts a schematic diagram of steps of a method for controlling buffering of packets in a communication network,

Fig. 5 depicts a schematic diagram of steps of a method for controlling buffering of packets in a communication network.

Detailed description

The following preferred embodiments are for illustrative reasons only; the scope of the invention is defined by the claims. In particular, the features of preferred embodiments might be implemented within a preferred embodiment and/or in combination with features of other preferred embodiments. Thus, the features of preferred embodiments might be implemented in any suitable manner resulting in further preferred embodiments.

In more detail, it will be readily understood that the components or features of the present invention, as generally described and illustrated in the figures herein, may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of a method and apparatus as represented in the attached figures is not intended to limit the scope of the invention as claimed, but is merely representative of selected embodiments of the invention. The features, structures, or characteristics of the invention described throughout this specification may be combined in any suitable manner in one or more embodiments. For example, the usage of the phrases "example embodiments", "some embodiments", or other similar language throughout this specification refers to the fact that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the present invention. Thus, appearances of the phrases "example embodiments", "in some embodiments", "in other embodiments", or other similar language throughout this specification do not necessarily all refer to the same group of embodiments, and the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.

Furthermore, as a general remark, it is to be noted that while the method of controlling buffering is described for the buffering of packets, the method is in further embodiments also applicable to frames, segments etc., more generally on every level of the OSI reference model.

Figure 1 depicts a buffer 3 for storing packets 1 in a queue 2. The packets 1 comprise also the packets which are specifically indicated as packet 1a and packets 1b. The packets 1 are put into the queue 2 of the buffer 3. The putting of the packets 1 in the queue 2 of the buffer 3 is called enqueuing and is indicated with reference sign 6. As the packets 1 are typically stored in the buffer 3 until the further forwarding of the packets 1 is possible due to the bandwidth of the forwarding elements, finally the packets 1 are put out of the queue 2, which is called dequeuing and indicated with reference sign 7.

Furthermore, the buffer 3 is part of a network element 8 which comprises also a control unit 9 which is adapted to control the buffering of the packets 1 in the queue 2 of the buffer 3. The control unit 9 might be integrated in the network element 8 or might be a separate device connected to the network element 8.

Typically, for storing packets in a buffer such as buffer 3, the packets, e.g. packets 1, are put into the queue, such as queue 2, of the buffer 3 until the queue 2 reaches the maximum buffer capacity so that no more packets can be stored. The maximum buffer capacity could be termed buffer limit or buffer size limit or buffer size. Without the implementation of the teaching of the present application, all packets 1 are stored until the maximum capacity of the buffer 3 is reached, and then further packets 1 which cannot be stored any more are all dropped. Each of the dropped packets 1 will cause a reduction of the data rate of the packet flow to which the respective packet 1 belongs. That is, as the receiver of a packet flow to which a dropped packet 1 belongs determines that the packet flow does not comprise the packet 1, the receiver informs the sender of the packet flow of the missed packet and the sender reduces the data rate of the packet flow. This means that if many packets 1 are dropped, because many packets 1 cannot be stored any more in the buffer 3, because the buffer 3 is simply full as the queue 2 has reached and still is at the buffer size limit, typically many packet flows will be caused to reduce their data rate, namely each packet flow of which at least one packet 1 is dropped will reduce its data rate. Therefore, as soon as the reduction of the data rate takes effect, the queue 2 will decrease very much, thus emptying the buffer and leaving the transmission capacity unused.

Therefore, according to the present invention, a threshold 4 is set on the queue 2 such that a reduction of the data rate of only one packet flow is achieved before the maximum buffer capacity is reached, so that further packets 1 of the same or of further packet flows can be stored in the buffer 3. As only one packet 1 is dropped or marked, thereby providing a congestion notification finally to the sender of the data flow, only the data rate of one packet flow, namely the packet flow of which the packet 1 is dropped (or marked), is reduced. Therefore, the queue 2 does not decrease so much and the transmission capacity is better used.

What has been described above as a summary will now be described in more detail with reference to figures 1 to 5. Figure 1, which has already been described in part, and figure 2 relate to the teaching of the present invention, whereas figure 3 depicts what might happen if packets are stored in a buffer without implementing the teaching of the present invention. Figures 4 and 5 relate to further preferred embodiments of the present invention which will be described later below. The packets 1 are stored in the queue 2 in the buffer 3 and a threshold 4 is set on the queue 2. The threshold 4 is shown in figure 1 and also in figure 2. Figure 3 does not show the threshold 4, because in the implementation illustrated in figure 3, without the teaching of the present invention, there is no threshold 4.

In figure 1 the packets 1 are stored in the queue 2 of the buffer 3. It can be appreciated in figure 1 that the threshold 4 is set on queue size. The threshold 4 might be defined as a certain number of packets 1. As soon as the number of the packets 1 stored in the buffer exceeds the number defined as the threshold 4, the queue 2 of the packets 1 exceeds the threshold 4. Figure 1 shows for illustrative reasons three packets 1 which exceed the threshold 4. These packets 1 are specifically indicated as one packet 1a and two packets 1b.

Typically, there are many more packets 1b, but there is only one packet 1a. The packet 1a is the first packet by which the number of the packets 1 stored in the queue 2 becomes bigger than the number of packets 1 which is defined as the threshold 4. It can be appreciated that the packets 1b are stored in the queue 2 in the buffer 3, although the packets 1b exceed the threshold. This is because, after a congestion notification is provided on the packet 1a, which is the first packet exceeding the threshold, a timeout interval 5 is started. During the timeout interval 5, that is until expiry of the timeout interval 5, any threshold violation is ignored. Therefore, on packets 1b, which arrive after packet 1a, on which a congestion notification is provided causing a timeout interval 5 to be started, no congestion notification is provided, although the packets 1b exceed the threshold 4. Figures 1 and 2 depict for illustrative reasons two packets 1b exceeding the threshold 4. The packets 1b arrive after packet 1a. On packet 1a a congestion notification is provided, the timeout interval 5 is started and has not yet expired. Therefore, on packets 1b no congestion notification is provided. The timeout interval 5, during which no congestion notification is provided on packets 1b though the packets 1b exceed the threshold 4, is symbolized by a clock in figure 1 and by an indicated time span in figure 2.

In the following, it is described what happens to packets 1 arriving at the buffer 3, in particular packet 1a and packets 1b. The packet 1a is used to provide a congestion notification. The congestion notification on the packet 1a is provided because packet 1a is the first packet of the queue 2 by which the queue 2 exceeds the threshold 4. According to the implementation described in the following, the congestion notification on the packet 1a is provided by marking the packet 1a or dropping the packet 1a. In the embodiment in which the packet 1a is marked, the packet 1a is kept in the queue 2 in buffer 3. In the embodiment in which the packet 1a is dropped, the packet 1a is thrown out of the queue 2. In preferred embodiments, the dropping of the packet 1a is provided after the packet 1a has already been stored in the buffer 3, but then it is thrown out of the buffer 3. In other preferred embodiments, the dropping of the packet 1a is provided even before the packet 1a has been stored in the buffer 3. In either case, the dropped packet 1a is not kept in the buffer 3 and, because it is thrown out, cannot be forwarded out of the buffer 3 to the receiver.

Consequently, in case the congestion notification is provided on the first packet 1a by dropping the packet 1a, the congestion notification is done when packets 1 including the packet 1a arrive at the buffer and a decision is made whether the queue 2 in which the packets 1 are stored is below the threshold 4 or whether the storing of the packets 1 in the queue 2 causes the queue 2 to exceed the threshold 4. Again, if there are too many packets 1 arriving at the buffer 3, the size of the queue 2 will increase above the threshold 4 and there will be one packet which is the first packet by which the queue 2 actually exceeds the threshold 4, and then there will typically again be further packets 1b by which the queue still exceeds the threshold 4. The first packet causing the queue 2 to exceed the threshold 4 is the packet 1a. Further packets 1b arriving after the first packet 1a might also exceed the threshold 4, or in other words keep on causing the queue 2 to be above the threshold 4. However, according to the teaching of the present invention, only on the first packet 1a a congestion notification is provided, while the further packets 1b, although exceeding the threshold 4, are not provided with a congestion notification. It might be noted that in preferred embodiments, after the first packet 1a, further packets 1 arrive at the buffer 3 which are below the threshold 4, so that these packets 1 are therefore not considered to be dropped, and then further packets 1b arrive which, although they are above the threshold 4, are not dropped, because the timeout interval 5 is running. The further packets 1b are stored in the buffer 3 and the further packets 1b are not dropped and not marked.

Figure 2 shows the queue 2 which increases as new packets 1 arrive at the buffer 3. The increase of the queue 2 is depicted in figure 2 as an increasing line 2a of the queue 2. The queue 2 increases in queue size over time. The axis of abscissae (x-coordinate) in figure 2 is the time coordinate, the axis of ordinates (y-coordinate) in figure 2 is the size of the queue 2. The packets 1 arriving at the buffer 3 cause the queue 2 to increase, because more packets 1 arrive at enqueuing (see ref. sign 6 in fig. 1) at the buffer 3 than packets are put out of the buffer 3 at dequeuing (see ref. sign 7 in fig. 1) for forwarding the packets 1. As the number of the packets 1 in the queue 2 becomes higher than the number of packets defined by the threshold 4, the queue 2 exceeds the threshold 4. The first packet which exceeds the threshold 4, or in other words which causes the queue 2 to exceed the threshold 4, is termed packet 1a. On this packet 1a, a congestion notification is provided. In a preferred embodiment, packet 1a is marked. In a preferred embodiment, packet 1a is dropped. In another preferred embodiment, packet 1a might be marked to indicate that the packet 1a should be dropped and then dropped accordingly. In either case, the congestion notification causes the sender of the packet flow to decrease the data rate of the packet flow to which the marked and/or dropped packet 1a belongs. The congestion notification causing the decrease of the data rate of the packet flow needs at least one round trip time to take effect. The congestion notification will arrive at the receiver of the packet flow, which becomes aware of the dropped and thus missing packet 1a, or of the marked packet 1a, in the packet flow. The receiver sends a corresponding message to the sender of the packet flow, which reduces the data rate of the packet flow to which packet 1a belongs. The packets 1 of the packet flow with the reduced data rate need the time it takes from the sender of the packets to the buffer 3, so that the packets 1 with reduced data rate will arrive after at least one round trip time (from the buffer 3 to the receiver, from the receiver to the sender, from the sender to the buffer 3) for the reduction to take effect in respect of the data rate and thus in respect of the number of packets 1 arriving at the buffer 3. Therefore, the queue 2 might decrease only at least one round trip time after the congestion notification on the first packet 1a has been provided. The decrease of the number of packets 1 in the queue 2 is symbolized by the decrease 2b of the queue 2 in figure 2.

The congestion notification is provided only on the first packet 1a which exceeds the threshold 4. At the moment that the congestion notification is provided on packet 1a, or a logical moment thereafter (for example one clock cycle of the central processing unit later, but in any case before further packets 1, 1b are processed), a timeout interval 5 is started. Further packets 1b arriving after packet 1a are not provided with a congestion notification, because the congestion notification on packet 1a causes the timeout interval 5 to be started, so that any threshold violation is ignored until expiry of the timeout interval 5. In particular, these packets 1b are not marked or dropped; they are kept in the buffer 3 without being marked. As they are not dropped, the queue 2 still increases after the congestion notification, which has been provided only on packet 1a.
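For illustration only, this enqueue-side behaviour might be sketched as follows in Python; the class name, the packet-count units and the monotonic clock are assumptions of the sketch, not part of the described method:

    import time
    from collections import deque

    class TimeoutDropBuffer:
        """Sketch: a congestion notification (here: a drop) is provided only
        on the first packet 1a that pushes the queue above the threshold 4;
        any further threshold violation is ignored until the timeout
        interval 5 has expired."""

        def __init__(self, threshold, interval, capacity):
            self.queue = deque()
            self.threshold = threshold        # threshold 4, in packets
            self.interval = interval          # timeout interval 5, in seconds
            self.capacity = capacity          # maximum buffer size 10
            self.next_possible_drop = 0.0     # first violation may drop at once

        def enqueue(self, packet):
            now = time.monotonic()
            if len(self.queue) >= self.capacity:
                return False                  # buffer full: drop is unavoidable
            if len(self.queue) >= self.threshold and now > self.next_possible_drop:
                self.next_possible_drop = now + self.interval
                return False                  # drop of packet 1a, timeout started
            self.queue.append(packet)         # further packets 1b stored unmarked
            return True

A dropped packet here corresponds to the implicit congestion notification described above; for explicit notification, the drop would be replaced by marking the packet and storing it nevertheless.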

The increase of the queue 2 is in particular possible because the threshold 4 is set below the maximum buffer capacity, i.e. below the (maximum) buffer size 10. Therefore, the buffer 3 is still able to store more packets 1 after the threshold 4 has been exceeded. If the queue 2 reached the maximum buffer size 10, all packets 1 exceeding the maximum buffer size 10 would have to be dropped, thus causing respective congestion notifications to the senders to reduce the data rate of the packet flows.

In the described example, the maximum buffer size 10 has not been reached. The packet flows to which the further packets 1b belong are not reduced in data rate, as no congestion notification is provided on the packets 1b. Therefore, the queue 2 keeps on increasing for as long as the congestion notification on packet 1a takes to take effect (at least one round trip time), and only then decreases because of the reduced data rate of the packet flow to which packet 1a belongs. The decrease of the queue 2 is depicted in figure 2 by the decreasing line 2b. After some time, the queue 2 might again increase, as indicated by the increasing line 2c. This might be because typically the senders of all packet flows will continuously try to increase the data rate of their packet flows and enhance throughput through the network. At a certain moment, this increase will cause the number of the packets 1 in the buffer 3 to grow again, as the reduction of the data rate of the packet flow to which packet 1a belongs is outweighed by the increase of the data rate of all packet flows. Furthermore, after a certain time, the sender of the packet flow to which the packet 1a belongs, and which has reduced its data rate, will again increase the data rate of the packet flow as long as it receives no further congestion notification.

The teaching of the present application, which is effective in figures 1 and 2, might become clearer when comparing figure 2 with figure 3. In figure 3, buffering of packets in a buffer 3 is implemented in a conventional manner without the teaching of the present invention. Figure 3 depicts, similarly to figure 2, a diagram with a time coordinate as x-coordinate and the queue size as y-coordinate; thus figure 3, like figure 2, depicts the queue size over time. In figure 3, there is no threshold 4 below the maximum size of the buffer 3. Therefore, the buffering is limited only by the maximum size or capacity of the buffer 3, indicated with reference sign 10. The queue 2 increases up to the maximum capacity 10 of the buffer 3; the increase is depicted as increasing line 2a. As the queue 2 reaches the maximum capacity 10 of the buffer 3, the buffer 3 cannot store more packets 1, because the buffer 3 is full. Therefore, the packets 1 (more specifically 1a, 1b) which exceed the maximum capacity 10 of the buffer 3 cannot be stored. Not only the first packet 1a exceeding the maximum capacity 10 of the buffer 3 is dropped, but also further packets 1b which arrive after packet 1a and which also exceed the maximum capacity 10 of the buffer 3 are dropped. They must be dropped, because the buffer 3 is full. The dropping of the packets 1a and 1b will cause the receivers of the packet flows to which the packets 1a and 1b belong to send one or more messages to the senders of those packet flows. Typically, the packets 1a and 1b do not belong to the same packet flow, but to different packet flows. Furthermore, many packets 1b will be dropped (in addition to packet 1a), because many packets 1b will typically keep on exceeding the maximum capacity 10 of the buffer 3. Therefore, a plurality of packet flows will be caused to reduce their data rate, or in other words, the senders of a plurality of packet flows will be caused to reduce the data rate of those packet flows. However, it takes at least one round trip time for the data rate reduction to take effect. During that time, all senders will continue to send packets and even increase the sending rates of the packet flows, as long as they have not yet received a congestion notification. This typically causes N/2 further packet losses (N = number of TCP flows); for example, with N = 100 TCP flows sharing the buffer 3, typically about 50 flows lose packets and reduce their data rates at the same time. Consequently, after the time it takes until the dropping of the packets 1 takes effect (the dropping is an implicit congestion notification), the data rate of a plurality of packet flows whose packets arrive at the buffer 3 is reduced, typically of N/2 packet flows. Therefore, the queue size will decrease rapidly. The decrease of the queue size, indicated with reference sign 2b, is typically much stronger in the implementation depicted in figure 3 than in the implementation depicted in figure 2: as a plurality of packet flows are reduced in data rate in figure 3, in comparison with only one packet flow reduced in data rate in figure 2, the decrease of the queue 2 is sharper in figure 3 than in figure 2, and the queue 2 will also decrease further, down to a lower level, in figure 3 than in figure 2 (see reference sign 2b in figures 2 and 3).
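For contrast, the conventional behaviour of figure 3 corresponds to a plain tail-drop queue; a minimal sketch, using the same illustrative Python conventions as above:

    from collections import deque

    class TailDropBuffer:
        """Sketch of conventional buffering (figure 3): packets are dropped
        only when the buffer is completely full, so within one round trip
        time many flows lose packets and reduce their data rates at once
        (global synchronization)."""

        def __init__(self, capacity):
            self.queue = deque()
            self.capacity = capacity    # maximum buffer size 10

        def enqueue(self, packet):
            if len(self.queue) >= self.capacity:
                return False            # every overflowing packet 1a, 1b is dropped
            self.queue.append(packet)
            return True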
In the embodiment in figure 3, the reduction of the data rate of a plurality of packet flows might cause the queue 2 to be reduced to zero and to stay at zero for a long time (ref. sign 2c in figure 3). That is, the queue 2 is emptied, as fewer packets 1 arrive at the buffer 3 than can potentially be forwarded by dequeuing out of the queue 2. However, if the queue 2 is empty, in particular for a long time such as in figure 3, the transmission capacity 7 is not used efficiently. Typically, the buffer 3 aims to store packets 1 until they can be forwarded. It is therefore not appropriate to reduce the data rate of the packet flows arriving at the buffer 3 so much that the buffer 3 runs empty and stays empty for a certain (too long) time, because as long as the buffer 3 is empty, the network capacity is left unused (see ref. sign 7).

Furthermore, because of the reduced data rate, the packets of the packet flows with a reduced data rate are transmitted with reduced speed, so that the network capacity in regard of bandwidth of the forwarding elements and connections is not fully used, and the quality of the data which are transported as packet flows with reduced data rate might deteriorate. Therefore, a reduction of the data rate of many packet flows, as is the consequence of the implementation in figure 3, is not desirable. From this it becomes clear that it is better to provide only one congestion notification at the moment that a first packet exceeds a threshold 4 which is below the maximum capacity of the buffer 3, than to wait until the queue 2 reaches the maximum capacity 10 of the buffer 3 and then provide a plurality of congestion notifications on a respective plurality of packets, which is technically necessary because the packets must be dropped as the buffer 3 is full.

In a preferred embodiment, the threshold 4 is set below a maximum capacity 10 of the buffer 3, in particular at less than 60% of the buffer size, preferably at less than 50% of the buffer size. The lower the threshold 4 is set below the maximum capacity 10 of the buffer 3, the better global synchronization is avoided.

The above embodiments relate to a method of providing a congestion notification on a packet 1a exceeding a threshold 4 which is set on the size of the queue 2. The size might be defined by the length or volume of the queue, in either case by a number of packets or bytes in the queue 2. In another preferred embodiment (see figure 4), the threshold 4 is set on queuing delay (symbolized by an hourglass in fig. 4, ref. sign 4). At dequeuing 7 of packets 1 out of the queue 2 for forwarding the packets 1 from the buffer 3, it is determined for each packet 1 taken out of the buffer 3 how long that packet 1 has been stored in the buffer 3. To underline that, in the embodiment depicted in figure 4, the step of determining whether one or more packets 1 of the queue 2 exceed the threshold 4 on queuing delay is implemented at dequeuing 7, that is, when the packets 1 are put out of the buffer 3, the packets 1a, 1b are depicted in figure 4 as already being outside the buffer 3 and forwarded to further network elements (ref. sign 7). More precisely, figure 4 depicts the moment just after the queuing delay of the last packet 1b (after the first packet 1a and the following packet 1) has been determined, when this packet 1b has just been put out of the buffer 3. The first packet 1a of the queue 2 which is determined to exceed the threshold 4 on queuing delay is provided with a congestion notification by marking the packet 1a. Because the packet 1a has been stored in the buffer 3 for a time which exceeds a predetermined time defined as a threshold 4 on queuing delay, and because the packet 1a is the first packet at the front of the queue 2 for which the exceeding of the threshold 4 is determined at dequeuing 7, this first packet 1a, and only this first packet 1a, is provided with a congestion notification. In the embodiment depicted in figure 4 for illustrative reasons, this first packet 1a with an excessive queuing delay is marked and forwarded to further network elements. The receiver of the packet flow to which this first packet 1a belongs is informed of the congestion notification, because this first packet 1a is marked (explicit congestion notification). In further embodiments, not depicted in figure 4, this first packet 1a is dropped and missing in the packet flow (implicit congestion notification). In either case, the receiver of the packet flow to which the first packet 1a belongs informs the sender of that packet flow accordingly, so that the sender will reduce the data rate of the packet flow to which this first packet 1a belongs.
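This dequeuing variant might be sketched as follows in Python; timestamping each packet at enqueuing in order to measure its queuing delay, and the tuple return value, are assumptions of the sketch:

    import time
    from collections import deque

    class DelayThresholdBuffer:
        """Sketch: each packet is timestamped at enqueuing; at dequeuing its
        queuing delay is compared against the threshold 4, and only the
        first violating packet 1a is marked until the timeout interval 5
        expires."""

        def __init__(self, delay_threshold, interval):
            self.queue = deque()
            self.delay_threshold = delay_threshold  # threshold 4, in seconds
            self.interval = interval                # timeout interval 5
            self.next_possible_mark = 0.0

        def enqueue(self, packet):
            self.queue.append((time.monotonic(), packet))

        def dequeue(self):
            enqueued_at, packet = self.queue.popleft()  # assumes a non-empty queue
            now = time.monotonic()
            marked = False
            if now - enqueued_at > self.delay_threshold and now > self.next_possible_mark:
                self.next_possible_mark = now + self.interval
                marked = True       # explicit congestion notification on packet 1a only
            return packet, marked   # the packet is forwarded in either case

For implicit congestion notification, the marking step would instead drop the dequeued packet rather than forward it.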

Figure 5 illustrates the increase and decrease of the queue 2 in terms of packets which exceed the threshold 4 on queuing delay. According to this embodiment, the x-coordinate is a time coordinate of an ongoing time corresponding to packets which are dequeued packet-by-packet out of the buffer 3. For example, 100 packets might be dequeued in 1 millisecond. Therefore, one point on the x-coordinate indicates a moment at which one packet is dequeued, and a time period on the x-coordinate corresponds to a certain amount of packets dequeued out of the buffer 3. The amount of packets 1 dequeued out of the buffer 3 depends on the rate at which packets 1 can be put out of the buffer 3 for forwarding. Furthermore, the more packets 1 are stored in the buffer 3, the more the queuing delay typically increases. The y-coordinate in figure 5 refers to how long a dequeued packet 1 has been stored in the buffer 3 before dequeuing, thus to the queuing delay of a stored packet. The threshold 4 indicates a threshold 4 on queuing delay. The threshold 4 might be set below a maximum buffer delay in case the buffer 3 has such a technically determined maximum buffer delay caused by the technical constraints of the buffer. In the example, the buffer 3 has no technically determined maximum buffer delay. The indicated line 11 refers to a maximum delay determined according to one or more of the packet flows and will be described later in detail. Further, it is noted that the packets 1 are necessarily dropped if the queue 2 reaches the maximum buffer capacity in terms of queue size. However, the maximum buffer capacity on queue size is not indicated in figure 5, as the y-coordinate indicates a queuing delay and not a queue size; that is, the dimension of the y-coordinate is a time dimension on queuing delay. In this context, it should be noted that the queuing delay and the queue size are nevertheless typically dependent on each other: the longer the queue, the longer the queuing delay will typically be, as the more packets are stored in the buffer, the longer a packet typically must wait until it is forwarded.

If at dequeuing 7 of the packets 1 out of the buffer 3 it is determined that the queuing delay of one or more packets 1 exceeds the threshold 4 on queuing delay, only the first packet 1a by which the excess of queuing delay is determined is provided with a congestion notification, and a timeout period 5 is started. The congestion notification on the first packet 1a can be provided by dropping the packet 1a and/or marking the packet 1a. In case further packets 1b being dequeued after the first packet 1a also exceed the threshold 4 on queuing delay, these further packets 1b are not provided with a congestion notification until the timeout period 5 has expired. Therefore, only the data rate of the packet flow to which the first packet 1a exceeding the threshold 4 on queuing delay belongs is reduced, so that the queue 2 will still increase (fig. 5, ref. sign 2a); and as only a certain amount of packets 1 can be dequeued as a function of time, the delay of the packets 1 in the queue 2 will increase after the congestion notification is provided. After the congestion notification has been provided, and as soon as the reduction of the data rate takes effect, the queue 2 decreases in regard of queuing delay (fig. 5, ref. sign 2b), which is also dependent on the amount of packets in the buffer 3. This means that the number of packets 1 stored in the buffer 3 will decrease and, contemporaneously, the queuing delay of the packets 1 stored in the buffer 3 will decrease accordingly, because the fewer packets 1 are stored in the buffer 3, the smaller the queuing delay of the stored packets 1 typically is. After some time, the senders of the packet flows will again increase the data rate of the packet flows, and thus more packets 1 will arrive at the buffer 3, so that the queuing delay increases again (fig. 5, ref. sign 2c).

In a preferred embodiment, the threshold 4 is set below a maximum predetermined delay 11 assigned to the one or more data flows. This maximum delay is not determined by the maximum buffer capacity; rather, the maximum predetermined delay 11 defined in this embodiment is defined according to the packet flows or data flows to which the stored packets 1 belong. For example, the maximum predetermined delay 11 might result from the maximum delay which the application could possibly accept before the quality of the application which is transported by the packet flow, to which a first packet 1a exceeding the threshold 4 belongs, becomes too poor. According to this embodiment, the maximum predetermined delay 11 might be the maximum delay of a particular packet flow, more precisely of the packet flow to which a packet 1 exceeding the threshold 4 belongs. As there might be a plurality of applications of a respective plurality of packet flows, there might be a plurality of maximum predetermined delays 11, one for each application or for each packet flow carrying a particular application. The threshold 4 is set below the maximum predetermined delay 11. Actually, even if a packet flow might technically still be acceptable in regard of quality for the application as long as the maximum predetermined delay 11 is not exceeded, it might be preferable to provide a congestion notification already on a first packet 1a which exceeds the threshold 4 determined on queuing delay below the maximum predetermined delay 11, thus before the maximum predetermined delay 11 is exceeded. As a congestion notification is provided as soon as a first packet 1a exceeds the threshold 4, the data rate of only one packet flow is reduced, while if one waited until the maximum predetermined delay 11 were exceeded by the queue 2, possibly a plurality of senders of a respective plurality of packet flows would receive a congestion notification, as a plurality of packets 1 might exceed the maximum predetermined delays 11 of the respective packet flows. Even if the maximum predetermined delay 11 might vary for each application, a plurality of packet flows might exceed their respective maximum predetermined delay 11 and thus be provided with a congestion notification resulting in a reduction of the data rate, if there were no threshold 4.

Because of the implementation with a threshold 4, which causes only one packet flow to be reduced in data rate and which further causes a start of a timeout period 5 during which any further threshold violation is ignored, so as to await whether the reduction of the data rate of the one packet flow, to which the first packet 1a exceeding the threshold 4 belongs, is sufficient, the buffer capacity and the network capacity are better used, because this helps to avoid global synchronization, in which too many packet flows are reduced in data rate at the same moment.

In a preferred embodiment, the timeout interval 5 is set in the range of 1 to 3 round trip times. The timeout interval 5 might be set at about 100, 200, 300, 400 or 500 milliseconds.

In a preferred embodiment, the threshold 4 is dynamically adapted so that the queue 2 drains to zero at least once in a predetermined time frame. This is done in particular because, at large numbers of packet flows N, together with the adaptation towards a smaller interval, the initial threshold, which is set in particular at the beginning of the implementation of the method of controlling the buffering of the packet flows, might be too large. The queue 2 then oscillates around the threshold 4, far away from zero, which means an unnecessary queuing delay. In that case, the threshold 4 is preferably adapted so that the queue 2 drains to zero at least once in a predetermined time frame, for example within 1, 2 or 3 seconds (see the sketch after this paragraph). In a preferred embodiment, the timeout interval 5 is dynamically adapted, based on the time that the queue 2 is above the threshold 4 and the time that the queue 2 is below the threshold 4. Preferably, the time that the queue is empty is not taken into consideration, e.g. the time during which the queue is empty because no packets arrive at the buffer. In other words, the timeout interval 5 is preferably dynamically adapted based on the time that the queue 2 is above the threshold 4 and the time that the non-empty queue 2 is below the threshold 4.
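The description gives no formula for this threshold adaptation; the following is a minimal sketch, assuming the queue size is sampled at every enqueuing and assuming, purely for illustration, that the threshold is halved whenever the queue has not drained within the predetermined time frame (the interval adaptation itself is formalized below):

    import time

    class DrainingThreshold:
        """Sketch: lower the threshold 4 if the queue 2 has not drained to
        zero within the predetermined time frame (e.g. 1, 2 or 3 seconds).
        The halving step is an illustrative assumption."""

        def __init__(self, threshold, time_frame, minimum=1):
            self.threshold = threshold      # threshold 4, in packets
            self.time_frame = time_frame    # predetermined time frame, seconds
            self.minimum = minimum          # do not adapt below this value
            self.last_drain = time.monotonic()

        def on_sample(self, queue_size):
            now = time.monotonic()
            if queue_size == 0:
                self.last_drain = now       # queue has drained to zero
            elif now - self.last_drain > self.time_frame:
                # queue oscillates above zero: the threshold is too large
                self.threshold = max(self.minimum, self.threshold // 2)
                self.last_drain = now
            return self.threshold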

In a preferred embodiment, the timeout interval 5 is determined as a function of the time that the queue 2 is above the threshold 4 and the time that the queue 2 is below the threshold 4 (wherein preferably the time that the queue is empty is not considered), of a scaling parameter gamma, and of the initial interval, which is the value of the timeout interval as initially set, according to the formula:

    at any packet enqueuing do:
        theta := cumulative time {above - below} threshold
        k := max(1, gamma * theta)
        interval := initial_interval / k

This applies in the embodiment in which the threshold 4 is defined on queue size and the step of determining whether the queue 2 exceeds the threshold 4 is performed at enqueuing 6 of the packets 1 in the buffer 3. In the above formula, k is a factor which is the maximum of 1 and of the product of the scaling parameter gamma and a parameter theta, wherein theta is the cumulative difference between the time that the queue 2 is above the threshold 4 and the time that the queue 2 is below the threshold 4, wherein preferably the time that the queue 2 is empty is not considered.
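A minimal Python sketch of this adaptation, assuming the formula is evaluated on every enqueued packet and that elapsed time is measured with a monotonic clock; it applies analogously at dequeuing in the embodiment below:

    import time

    class AdaptiveInterval:
        """Sketch: theta accumulates the time above the threshold 4 and
        subtracts the time below it (empty-queue time excluded); the
        timeout interval 5 is the initial interval divided by
        k = max(1, gamma * theta)."""

        def __init__(self, initial_interval, gamma):
            self.initial_interval = initial_interval
            self.gamma = gamma
            self.theta = 0.0
            self.last_update = time.monotonic()

        def on_packet(self, queue_size, threshold):
            now = time.monotonic()
            elapsed = now - self.last_update
            self.last_update = now
            if queue_size > threshold:
                self.theta += elapsed       # time above the threshold
            elif queue_size > 0:
                self.theta -= elapsed       # time below the threshold, queue not empty
            k = max(1.0, self.gamma * self.theta)
            return self.initial_interval / k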

In a preferred embodiment, the timeout interval 5 is determined as a function of the time that the queue 2 is above the threshold 4 and the time that the queue 2 is below the threshold 4 (wherein preferably the time that the queue 2 is empty is not considered), of a scaling parameter gamma, and of the initial interval, which is the value of the timeout interval as initially set, according to the formula:

    at any packet dequeuing do:
        theta := cumulative time {above - below} threshold
        k := max(1, gamma * theta)
        interval := initial_interval / k

This applies in the embodiment in which the threshold 4 is defined on queuing delay and the step of determining whether the queue 2 exceeds the threshold 4 is performed at dequeuing 7 of the packets 1 out of the buffer 3. In the above formula, k is a factor which is the maximum of 1 and of the product of the scaling parameter gamma and a parameter theta, wherein theta is the cumulative difference between the time that the queue 2 is above the threshold 4 and the time that the queue 2 is below the threshold 4, wherein preferably the time that the queue 2 is empty is not considered.

In a preferred embodiment, the timeout interval 5 is also started if the queue 2 is determined to be empty. In this embodiment, the timeout interval 5 is started not only if a congestion notification is provided on a first packet 1a exceeding the threshold 4, but also in case the queue 2 is determined to be empty.

In a preferred embodiment, the step of determining at enqueuing 6 whether a packet 1 is to be dropped for implicit congestion notification is implemented according to the formula:

    at any packet enqueuing do:
        now := actual time
        if queue_size > threshold && now > next_possible_drop:
            drop this packet
            next_possible_drop := now + interval

In a preferred embodiment, the step of determining at enqueuing 6 whether a packet 1 is to be marked for explicit congestion notification is implemented according to the formula:

    at any packet enqueuing do:
        now := actual time
        if queue_size > threshold && now > next_possible_mark:
            mark this packet
            next_possible_mark := now + interval

In a preferred embodiment, the step of determining at dequeuing 7 whether a packet 1 is to be dropped for implicit congestion notification is implemented according to the formula:

    at any packet dequeuing do:
        now := actual time
        if queuing_delay > threshold && now > next_possible_drop:
            drop this packet
            next_possible_drop := now + interval

In a preferred embodiment, the step of determining at dequeuing 7 whether a packet 1 is to be marked for explicit congestion notification is implemented according to the formula:

    at any packet dequeuing do:
        now := actual time
        if queuing_delay > threshold && now > next_possible_mark:
            mark this packet
            next_possible_mark := now + interval

In the above formulas, the parameter interval refers to the timeout interval 5.

In a preferred embodiment, the step of starting the timeout interval 5 in case the queue 2 is empty is implemented according to the formula:

    if the queue is empty:
        next_possible_drop := now + interval

In the above formula, the parameter interval refers to the timeout interval 5. The formula might be adapted to the embodiment of marking a packet by replacing the feature next_possible_drop by the feature next_possible_mark.

The present invention has the primary effect of a fairly even distribution of rate reductions over time, thereby avoiding global synchronization.

Furthermore, and in addition to the advantages described throughout the specification, namely in particular achieving an improved solution to avoid global synchronization, the present invention, from an engineering perspective, reduces the buffer size requirements on packet line cards, towards single-chip router designs with on-chip buffer only, that is, a buffer integrated on the chip (no separate element), thus being smaller and cheaper.

From an operating perspective, it improves the capacity utilization and reduces queuing delay and jitter for better real-time performance, without the need for traffic segregation.