Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
A METHOD FOR ESTIMATING FLOW SIZE DISTRIBUTIONS
Document Type and Number:
WIPO Patent Application WO/2017/030515
Kind Code:
A1
Abstract:
The invention relates to a method for estimating flow size distributions which is performed by a telecommunication system enabling applicability of the dynamic customized pricing techn ques by providing accurate estimation of the service usage rates via flow size distributions (FSD) of the end users and by providing a correlation between the estimated FSDs and a particular ujtili8ty function representing the pricing method of the data services in telecommunication field.

Inventors:
KARATEPE ALPER (TR)
ZEYDAN ENGIN (TR)
KURT BARIS (TR)
KARABULUT KURT GÜNES (TR)
DEMIR MEHMET ÖZGÜN (TR)
CEMGIL TAYLAN (TR)
Application Number:
PCT/TR2016/050112
Publication Date:
February 23, 2017
Filing Date:
April 14, 2016
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
AVEA ILETISIM HIZMETLERI ANONIM SIRKETI ( TEKNOLOJI MERKEZI ) (TR)
International Classes:
G06Q10/04; G06F17/18
Foreign References:
EP2372953A22011-10-05
US20100157809A12010-06-24
US20050210533A12005-09-22
US20100034102A12010-02-11
EP1603274A12005-12-07
EP1603273A12005-12-07
US7957315B22011-06-07
US8064359B22011-11-22
US8335160B22012-12-18
US20070019548A12007-01-25
US7512980B22009-03-31
US20130036469A12013-02-07
US20130226669A12013-08-29
Attorney, Agent or Firm:
DESTEK PATENT, INC. (TR)
Download PDF:
Claims:
CLAIMS

1. A method for estimating flow size distributions which is performed by a telecommunication system enabling applicability of the dynamic customized pricing techniques by an accurate determination of the service usage rates via flow size distributions (FSD) of end users and correlation of the determined FSDs with a particular utility function representing the pricing method of data services in telecommunication field, characterized in that the telecommunication system performing said method comprises: - a Cellular Telecommunication Network Connection (1 ) provided by the operator;

- a Customizable Sampling Module (2) for obtaining a sample data value from the high speed data transmission;

a Flow Size Estimation and Estimation Module (3) used for estimation of the flow sizes belonging to the sampled data and for flow size estimations fulfilled;

- a Clustering Algorithm (4) which clusters the data sizes according to the day and time;

- a Usage Analysis Module (5) which analyze the information obtained by FSD and

clustering process and data usage of the subscribers;

a Pricing and Income Account Module (6) which calculates the income from the subscriber whose usage and pricing are determined within the given input parameters of the chosen pricing model;

- a Visualization Module (7) which visualize the result of the network usage analysis; and characterized in comprising the steps of: - sampling the packets from the Cellular Telecommunication Network Connection (1 ) according to the available sampling parameters by the Customizable Sampling Module

(2);

- transmitting the packet information obtained after sampling to the Flow Size Estimation and Estimation Module (3);

- updating the flow size estimation according to the new information; if necessary, changing the sampling parameter and sending it to the Customizable Sampling Module (2) by the Flow Size Estimation and Estimation Module (3);

- sending the estimated flow size to the Clustering Algorithm (4) in order to cluster according to the day and time; - analysis of the network usage by the Usage Analysis Module (5) by using the flow size estimation;

- providing an income account according to different pricing models by the Pricing and Income Account Module (6);

- visualization of the information on the number of the packets and the sizes thereof in the flow by the Visualization Module (7) .

Description:
A METHOD FOR ESTIMATING FLOW SIZE DISTRIBUTIONS TECHNICAL FIELD

The invention relates to a method for estimating flow size distributions which is developed for operators in telecommunication field; which facilitates estimating the usage patterns; which can characterize the statistical information about traffic patterns, network resource usage and user behaviors; which can enable detect traffic/user abnormality and improve network performance; which offers solutions for evaluation and development of the pricing policy of the network operator; and which is performed by a telecommunication system enabling applicability of the dynamic customized pricing techniques by providing accurate estimation of the service usage rates via flow size distributions (FSD) of end users and by providing a correlation between the estimated FSDs and a particular utility function representing the pricing method of the data services in telecommunication field.

STATE OF ART Most of the studies for estimating flow size distribution focus on deducing from sampled traffic data. However, if it is taken into account that the general data size is very large, some new studies take the data flow into account. In order to solve this problem, sampling techniques should be used which provide a balance between the practical application limits and estimation accuracy.

Currently, two main solutions are generally used in telecommunication field for tracking flows: sFlow : It is based on the uniform sampling technology which meets the basic requirements for a network tracking solution (a packet is a sample per second (s)).

- NetFlow : It is a protocol collecting IP traffic data which was developed by Cisco Systems.

Most of the routers and switches support this feature. Also, routers such as Juniper jFlow support NetFlow solution as they can receive NetFlow data. The drawbacks of the existing applications are listed as follows:

NetFlow and sFlow solutions may miss some tiny flows although they can identify estimation errors important only for FSD estimations. Tiny flows consist of few number of packets. When compared to large flows, it is more probable to miss these packets via uniform sampling.

- Another drawback of the existing approaches is that the time intervals in FSD variations cannot be tracked.

Existing FSD estimation algorithms do not constitute a utility function based on the methodology and pricing policies.

By using a given utility function, an integrated solution has not been found yet which enables the pricing policies to be estimated in a customized way.

The following patent applications can be given as examples for the known state of the art:

[1 ] EP 20050253187 (M.S. Kodialam and T.V. Lakshman. Per-flow traffic estimation, December 7 2005)

[2] EP 20050253184 (H. Fang, S.K. Muralidharan, and V.L. Tirunell. Accelerated per-flow traffic estimation, December 7 2005.)

[3] US 7,957,315 (N. Duffield, L.M. Breslau, C. Ee, A. Gerber, C. Lund, and S. Sen. System and method for sampling network traffic, June 7 201 1 .)

[4] US 8,064,359 (N. Duffield, L.M. Breslau, C. Ee, A. Gerber, C. Lund, and S. Sen. System and method for spatially consistent sampling of flow records at constrained, content-dependent rates, November 22 201 1 .)

[5] US 8,335,160 (A. Carvalho, Y. Teplitsky, S. Rahman, M. Tiwari, and V. Valluri. Flow sampling with top talkers, December 18 2012.)

[6] US 1 1 /258,444 (B.Krishnamurthy. Method and apparatus for data network sampling, January 25 2007)

[7] US 7,512,980 (J.A. Copeland and J. Jerrim. Packet sampling flow-based detection of network intru- sions, March 31 2009)

[8] US 13/197,402 (K.M. Worth. Detecting suspicious network activity using flow sampling, February 7 2013) [9] US 13/780,941 (M. Chiang, C. Jae-Wong, S. Ha, and S. Sen. System and methods for time dependent internet pricing, August 29 2013.)

In [1 ], a method of estimating per-flow traffic by determining the sampling interval based on a desired level of accuracy is described. In [2], a method of splitting the incoming traffic stream into a number of sub-streams by a hash function for flow display which requires high memory is described. In [3], a hash function is defined in order to calculate the number of the sampled flows and the sampling possibility is chosen according to the real flow sizes. In [4], it is extended as to include a flow based prioritization. A sampling technique which is not proper for source usage of the users is described in [5]. A sampling according to the signatures designed in accordance with a certain application such as intervention is described in [6].

In [7], a technique for analyzing the sampled packets in order to detect the intrusions by assigning a concern index value for each flow is described. In [8], the flows are sampled in order to evaluate the security of the network and in order to detect any kind of suspicious activity. Moreover, in [9], systems and methods for implementing time-dependent pricing in wireless/broadband access networks are disclosed.

Consequently, because of the mentioned drawbacks and the inadequacies of the existing solutions, an improvement or development in the methods for estimating flow size distribution which enable the pricing policies to be estimated in a customized way different from the known methods is necessary.

OBJECTIVES OF THE INVENTION

The objective of the invention is to improve the estimation methods of the flow sizes by providing a new method in order to estimate the above mentioned flow size distribution and in order to overcome the disadvantages in user-specific pricing.

The objective of the invention is to provide a method which provides an applicable solution having high accuracy rate and which combines data transmission with packet sampling.

The objective of the invention is to facilitate the estimation procedure of the patterns in network data traffic which is critical for an operator by being different from the existing applications. An objective of the invention is to provide information necessary for traffic planning, network management and efficient usage of available sources which are encountered in demand- oriented architecture structures.

An objective of the invention is to help the network operator by providing potential clues in order to characterize the statistical information on traffic patterns, network source usage and user behaviors; in order to detect the traffic/user abnormality; and in order to increase the network performance.

An objective of the invention is to provide an extensive point of view in order to evaluate and improve the pricing policy of the network operator thanks to the patterns.

An objective of the invention is to search, design and improve the tools which can determine the output of a particular pricing policy.

The objective of the invention is to search for the applicability of the dynamic customized pricing techniques in order to evaluate the general utility function for data services.

The objective of the invention is to provide a solution which comprises an accurate determination of the service usage rates via flow size distributions (FSDs) of the end users and correlation of the data services of the determined FSDs with a particular utility function representing the pricing method.

The invention is a method for estimating flow size distributions which is performed by a telecommunication system enabling applicability of the dynamic customized pricing techniques by an accurate determination of the service usage rates via FSDs of the end users and correlation of the determined FSDs with a particular utility function representing the pricing method of data services in telecommunication field.

In order to fulfill the above mentioned objects the method comprises the steps of: sampling the packets from the Cellular Telecommunication Network Connection (1 ) according to the available sampling parameters by the Customizable Sampling Module

(2);

- transmitting the packet information obtained after sampling to the Flow Size Estimation and Estimation Module (3);

updating the flow size estimation according to the new information; if necessary, changing the sampling parameter and sending it to the Customizable Sampling Module (2) by the Flow Size Estimation and Estimation Module (3);

- sending the estimated flow size to the Clustering Algorithm (4) in order to cluster according to the day and time;

- analysis of the network usage by the Usage Analysis Module (5) by using the flow size estimation;

- providing an income account according to different pricing models by the Pricing and Income Account Module (6);

visualization of the information on the number of the packets and the sizes thereof in the flow by the Visualization Module (7).

DRAWINGS WHICH HELP UNDERSTANDING OF THE INVENTION

The structural and characteristic features and all of the advantages of the present invention can be understood more clearly thanks to the drawings and the detailed description given below with reference to the these drawings and therefore, the assessment should be performed by taking these drawings and detailed description into account.

Figure 1 is a general view of method operation of the present invention and the communication of the elements performing this method in the system with each other.

DEFINITIONS OF THE REFERENCES

1. Cellular Telecommunication Network Connection

2. Customizable Sampling Module

3. Flow Size Estimation and Estimation Module

4. Clustering Algorithm (Clustering)

5. Usage Analysis Module 6. Pricing and Income Account Module

7. Visualization Module

The drawings do not need to be scaled and unnecessary details may be omitted which are not needed for the invention to be understood. Moreover, the elements which are substantially identical or which have at least substantially identical functions are denoted with the same number.

DETAILED DESCRIPTION OF THE INVENTION

The preferred process steps of the Flow Size Distribution Estimation method of the present invention are only described in order to clarify the subject matter in this description.

Most of the studies on estimation of flow size distribution focus on deduction from sampled traffic data. However, when it is considered that the data size is very large, some recent studies take data flow into consideration. Because, in order to process these estimation algorithms high speed connections, a high CPU and memory are necessary. The flow size distribution estimation method of the present invention provides an applicable solution with high accuracy rate and combines data transmission with traditional packet sampling. Therefore the invention provide searching new sampling methods of data flow.

High connection speed and minimum source usage is necessary in order to process the FSD estimation algorithms. For this reason, the stability of the connection speed is very important. Moreover, it is known that the most accurate solution for FSD estimates enable estimation of the whole flow population distribution at the measurement point. Online detection of the traffic flow size distribution is provided by obtaining an online algorithm via using probabilistic methods which are suitable for network-wide distribution.

Another drawback of the existing approaches is that time intervals in FSD variations cannot be tracked. The method of the present invention enables the models to be developed by including time notion and enables understanding how the flow sizes vary based on time. Therefore, tracking of the traffic flow size distributions varying based on time can be performed. The detected flow size distributions can be used for basic problems such as testing the pricing strategies.

The system performing the method of the present invention generally comprises:

- a Cellular Telecommunication Network Connection (1 ) provided by the operator;

- a Customizable Sampling Module (2) for obtaining a sample data value from the high speed data transmission;

a Flow Size Estimation and Estimation Module (3) used for estimation of the flow sizes belonging to the sampled data and for flow size estimations fulfilled;

- a Clustering Algorithm (4) which clusters the data sizes according to the day and time;

- a Usage Analysis Module (5) which analyzes the information obtained by FSD and

clustering process and data usage of the subscribers;

a Pricing and Income Account Module (6) which calculates the income from the subscriber whose usage and pricing are determined within the given input parameters of the chosen pricing model;

- a Visualization Module (7) which visualizes the result of the network usage analysis;

The Customizable Sampling Module (2) samples the packets from Cellular Telecommunication Network Connection (1 ) according to available sampling parameter. In this step, a part of the packets are processes for counting and some of them are ignored. The packet information obtained after sampling is transmitted to the Flow Size Estimation and Estimation Module (3). This module updates the flow size estimation according to the new information. At the same time, if necessary, it changes the sampling parameter and sends it to the Customizable Sampling Module (2). Estimated flow size is sent to the Clustering Algorithm (4) in order to be clustered based on day and time. The Usage Analysis Module (5) analyzes the network usage by using flow size estimation and the Pricing and Income Account Module (6) performs an income account according to different pricing models. All of the information and statistical results about the packet number and packet sizes in the flow are graphically visualized by the Visualization Module (7). The Visualization Module (7) can process real-time and offline data.