


Title:
ANOMALOUS BEHAVIOUR DETECTION SYSTEM
Document Type and Number:
WIPO Patent Application WO/2007/042837
Kind Code:
A1
Abstract:
A method and system for detecting anomalous behaviour is disclosed, comprising the steps of: generating a non-anomalous profile of a plurality of data records; calculating a first probability that one or more new data records belong to the non-anomalous profile; and calculating a likelihood value, based on the first probability, that the one or more new data records do not belong to the non-anomalous profile.

Inventors:
GIROLAMI MARK (GB)
DRUMMOND IAIN ROSS (GB)
HALL IAN D (GB)
Application Number:
PCT/GB2006/050299
Publication Date:
April 19, 2007
Filing Date:
September 21, 2006
Assignee:
MEMEX TECHNOLOGY LTD (GB)
GIROLAMI MARK (GB)
DRUMMOND IAIN ROSS (GB)
HALL IAN D (GB)
International Classes:
H04M15/00; H04M3/22
Foreign References:
GB2321362A1998-07-22
GB2303275A1997-02-12
US5790645A1998-08-04
US20040111305A12004-06-10
Attorney, Agent or Firm:
MURGITROYD & COMPANY (165-169 Scotland Street Glasgow, Strathclyde G5 8PL, GB)
Claims:

CLAIMS

1. A method for detecting anomalous behaviour comprising the steps of: generating a non-anomalous profile of a plurality of data records; calculating a first probability that one or more new data records belong to the non-anomalous profile; calculating a likelihood value, based on the first probability, that the one or more new data records do not belong to the non-anomalous profile.

2. A method as claimed in claim 1, further comprising the step of generating an anomalous profile of a plurality of anomalous data records and calculating a second probability that one or more new data records belong to an anomalous profile.

3. A method as claimed in claim 2, wherein the likelihood value is based on the second probability as well as the first probability.

4. A method as claimed in any of claims 1 to 3, further comprising the step of comparing the likelihood value to a predetermined threshold value, wherein the one or more new data records are classified as anomalous if the likelihood value is greater than the threshold value.

5. A method as claimed in any of claims 1 to 4, wherein the threshold value is calculated by simulating data records according to the non-anomalous profile and generating a simulated distribution, the threshold value being taken from the simulated distribution.

6. A method as claimed in any of claims 1 to 5, wherein the non-anomalous profile is a non-anomalous probability distribution of the plurality of data records corresponding to non-anomalous behaviour.

7. A method as claimed in claim 6, wherein the non-anomalous probability distribution is generated from a function of a series of feature probability distributions representing characterising features of the data records.

8. A method as claimed in claim 7, wherein the feature probability distributions are derived from a Dirichlet prior probability distribution.

9. A method as claimed in claim 7, wherein the feature probability distributions are derived from a Dirichlet prior probability distribution and the corresponding multinomial likelihood.

10. A method as claimed in any of claims 1 to 9, wherein the anomalous profile is an anomalous probability distribution of the plurality of anomalous data records corresponding to anomalous behaviour.

11. A method as claimed in claim 10, wherein the anomalous probability distribution is generated from a function of a series of feature probability distributions representing characterising features of the data records.

12. A method as claimed in claim 11, wherein the feature probability distributions are derived from a Dirichlet prior probability distribution.

13. A method as claimed in claim 11, wherein the feature probability distributions are derived from a Dirichlet prior probability distribution and the corresponding multinomial likelihood.

14. A method as claimed in any of claims 1 to 13, wherein each of a plurality of users has an associated plurality of data records and the likelihood value is calculated for each user.

15. A method as claimed in claim 14, wherein the threshold value is calculated for each user.

16. A method as claimed in any of claims 1 to 15, wherein, for a new user, the associated plurality of data records is taken from one or more other users and the non-anomalous profile is generated accordingly.

17. A method as claimed in any of claims 1 to 16, wherein the data records are call data records.

18. A method as claimed in claim 17, wherein the characterising features of the call data records are one or more of the following: day of call, time call initiated, destination of call and duration of call.

19. A method as claimed in any of claims 1 to 18, further comprising the step of generating an alarm to alert one or more operators when the one or more new data records have a likelihood value above the threshold value.

20. An anomalous behaviour detection system comprising: a plurality of data records; a non-anomalous profile generation means enabled to generate a non-anomalous profile from the plurality of data records; a probability calculation means enabled to calculate a first probability that one or more new data records belong to the non-anomalous profile; a likelihood calculation means enabled to calculate a likelihood value, based on the first probability, that the one or more new data records do not belong to the non-anomalous profile.

21. A system as claimed in claim 20, further comprising an anomalous profile generation means enabled to generate an anomalous profile from a plurality of anomalous data records and wherein the probability calculation means is further enabled to calculate a second probability that one or more new data records belong to an anomalous profile.

22. A system as claimed in claim 21, wherein the likelihood value is based on the second probability as well as the first probability.

23. A system as claimed in any of claims 20 to 22, further comprising a likelihood comparison means enabled to compare the likelihood value to a predetermined threshold value, wherein the one or more new data records are classified as anomalous if the likelihood value is greater than the threshold value.

24. A system as claimed in any of claims 20 to 23, further comprising a threshold calculation means enabled to calculate the threshold value by simulating data records according to the non-anomalous profile and generating a simulated distribution, the threshold value being taken from the simulated distribution.

25. A system as claimed in any of claims 20 to 24, wherein the non-anomalous profile is a non-anomalous probability distribution of the plurality of data records corresponding to non-anomalous behaviour.

26. A system as claimed in claim 25, wherein the non-anomalous probability distribution is generated from a function of a series of feature probability distributions representing characterising features of the data records.

27. A system as claimed in claim 26, wherein the feature probability distributions are derived from a Dirichlet prior probability distribution.

28. A system as claimed in claim 26, wherein the feature probability distributions are derived from a Dirichlet prior probability distribution and the corresponding multinomial likelihood.

29. A system as claimed in any of claims 20 to 28, wherein the anomalous profile is an anomalous probability distribution of the plurality of anomalous data records corresponding to anomalous behaviour.

30. A system as claimed in claim 29, wherein the anomalous probability distribution is generated from a function of a series of feature probability distributions representing characterising features of the data records.

31. A system as claimed in claim 30, wherein the feature probability distributions are derived from a Dirichlet prior probability distribution.

32. A system as claimed in claim 30, wherein the feature probability distributions are derived from a Dirichlet prior probability distribution and the corresponding multinomial likelihood.

33. A system as claimed in any of claims 20 to 32, wherein each of a plurality of users has an associated plurality of data records and the likelihood value is calculated for each user.

34. A system as claimed in claim 33, wherein the threshold value is calculated for each user.

35. A system as claimed in any of claims 20 to 34, wherein, for a new user, the associated plurality of data records is taken from one or more other users and the non-anomalous profile is generated accordingly.

36. A system as claimed in any of claims 20 to 35, wherein the data records are call data records.

37. A system as claimed in claim 36, wherein the characterising features of the call data records are one or more of the following: day of call, time call initiated, destination of call and duration of call.

38. A system as claimed in any of claims 20 to 37, wherein the system further comprises an alarm generation means enabled to alert one or more operators when the one or more new data records have a likelihood value above the threshold value.

39. A computer program product directly loadable into the internal memory of a digital computer comprising software code portions for performing the method of any of claims 1 to 19.

Description:

Anomalous behaviour detection system

The present invention relates to an anomalous behaviour detection system and particularly, but not exclusively, to an anomalous behaviour detection and scoring system for telephone calls.

Each year in the telecommunications sector, fraudulent transactions account for a substantial loss of annual revenue for telecom providers. The detection of such fraudulent activity is an arduous task and presents a significant challenge to researchers and practitioners alike. This is due to the nature of the telecommunications domain where a high volume of transactional call data is produced.

In fact, only a very small percentage of call transactions are actually fraudulent and to detect these in real time compounds the problem.

Various solutions have been proposed. For example, in "Detection of Fraud in Mobile Telecommunications" (Shawe-Taylor, J., Howker, K. and Burge, P., Information Security Technical Report 4(1): pp. 16-28, 1999) a system comprising rule-based and artificial neural network components is developed, whilst in "Signature Based Methods for Data Streams" (C. Cortes and D. Pregibon, Data Mining and Knowledge Discovery, 5, pp. 167-182, 2001) and "Detecting Fraud in the Real World" (M. Cahill, D. Lambert, J. Pinheiro, and D. Sun, Handbook of Massive Data Sets, pp. 911-929, 2002) signature-based methods are proposed.

Typically, these solutions have drawbacks which limit their suitability for many of the areas in which they might be employed.

Limitations of these and other similar solutions include:

• current systems have dependencies on additional data from other sources (e.g. billing systems), such links are expensive to engineer and maintain;

• solutions comprising Neural Networks are not necessarily sensitive to behaviour on which they have not been "trained": a new type of behaviour will often not be detected, or will simply be mislabelled by the system;

• the complexity of the existing underlying computation is generally such that it mandates the use of moderately, or extremely, expensive computer hardware to fulfil the objective of processing a day of observed behaviour within a day;

• when the modes of behaviour between the most divergent elements of the observed population are notionally close together (for example, for a set of telephone calls from a residential area), existing systems can oscillate chaotically between zero output and returning most of the input, therefore, existing systems often require constant "tuning" of internal parameters to limit this behaviour;

• presentation to an operator of the rationale for a "detected" condition is often very difficult (or indeed impossible) with current systems - neural networks in particular do not lend themselves to explaining how a result has actually been arrived at;

• existing systems are not easily able to model and refine hypothetical patterns of behaviour, which can then be immediately detected;

• many existing systems disregard entire classes of input on the grounds that they are too similar, too expensive to process, or simply do not conform to some other criterion that the system requires in order to enable processing, even though removing elements of observed behaviour from the process will often reduce the quality of the results; and

• the systems have limited or no ability to handle new users or those with small numbers of historically observed events.

According to a first aspect of the present invention there is provided a method for detecting anomalous behaviour comprising the steps of: generating a non-anomalous profile of a plurality of data records; calculating a first probability that one or more new data records belong to the non-anomalous profile; calculating a likelihood value, based on the first probability, that the one or more new data records do not belong to the non-anomalous profile.

Preferably, the method further comprises the step of generating an anomalous profile of a plurality of anomalous data records and calculating a second probability that one or more new data records belong to an anomalous profile.

Preferably, the likelihood value is based on the second probability as well as the first probability.

Preferably, the method further comprises the step of comparing the likelihood value to a predetermined threshold value, wherein the one or more new data records are classified as anomalous if the likelihood value is greater than the threshold value.

Preferably, the threshold value is calculated by simulating data records according to the non-anomalous profile and generating a simulated distribution, the threshold value being taken from the simulated distribution.

Preferably, the non-anomalous profile is a non-anomalous probability distribution of the plurality of data records corresponding to non-anomalous behaviour.

Preferably, the non-anomalous probability distribution is generated from a function of a series of feature probability distributions representing characterising features of the data records.

Preferably, the feature probability distributions are derived from a Dirichlet prior probability distribution.

Further preferably, the feature probability distributions are derived from a Dirichlet prior probability distribution and the corresponding multinomial likelihood.

Preferably, the anomalous profile is an anomalous probability distribution of the plurality of anomalous data records corresponding to anomalous behaviour.

Preferably, the anomalous probability distribution is generated from a function of a series of feature probability distributions representing characterising features of the data records.

Preferably, the feature probability distributions are derived from a Dirichlet prior probability distribution.

Further preferably, the feature probability distributions are derived from a Dirichlet prior probability distribution and the corresponding multinomial likelihood.

Preferably, each of a plurality of users has an associated plurality of data records and the likelihood value is calculated for each user.

Preferably, the threshold value is calculated for each user.

Preferably, for a new user, the associated plurality of data records is taken from one or more other users and the non-anomalous profile is generated accordingly.

Preferably, the data records are call data records.

Preferably, the characterising features of the call data records are one or more of the following: day of call, time call initiated, destination of call and duration of call.

Preferably, the method further comprises the step of generating an alarm to alert one or more operators when the one or more new data records have a likelihood value above the threshold value.

According to a second aspect of the present invention there is provided an anomalous behaviour detection system comprising: a plurality of data records; a non-anomalous profile generation means enabled to generate a non-anomalous profile from the plurality of data records; a probability calculation means enabled to calculate a first probability that one or more new data records belong to the non-anomalous profile; a likelihood calculation means enabled to calculate a likelihood value, based on the first probability, that the one or more new data records do not belong to the non-anomalous profile.

Preferably, the system further comprises an anomalous profile generation means enabled to generate an anomalous profile from a plurality of anomalous data records and wherein the probability calculation means is further enabled to calculate a second probability that one or more new data records belong to an anomalous profile.

Preferably, the likelihood value is based on the second probability as well as the first probability.

Preferably, the system further comprises a likelihood comparison means enabled to compare the likelihood value to a predetermined threshold value, wherein the one or more new data records are classified as anomalous if the likelihood value is greater than the threshold value.

Preferably, the system further comprises a threshold calculation means enabled to calculate the threshold value by simulating data records according to the non-anomalous profile and generating a simulated distribution, the threshold value being taken from the simulated distribution.

Preferably, the non-anomalous profile is a non-anomalous probability distribution of the plurality of data records corresponding to non-anomalous behaviour.

Preferably, the non-anomalous probability distribution is generated from a function of a series of feature probability distributions representing characterising features of the data records.

Preferably, the feature probability distributions are derived from a Dirichlet prior probability distribution.

Further preferably, the feature probability distributions are derived from a Dirichlet prior probability distribution and the corresponding multinomial likelihood.

Preferably, the anomalous profile is an anomalous probability distribution of the plurality of anomalous data records corresponding to anomalous behaviour.

Preferably, the anomalous probability distribution is generated from a function of a series of feature probability distributions representing characterising features of the data records.

Preferably, the feature probability distributions are derived from a Dirichlet prior probability distribution.

Further preferably, the feature probability distributions are derived from a Dirichlet prior probability distribution and the corresponding multinomial likelihood.

Preferably, each of a plurality of users has an associated plurality of data records and the likelihood value is calculated for each user.

Preferably, the threshold value is calculated for each user.

Preferably, for a new user, the associated plurality of data records is taken from one or more other users and the non-anomalous profile is generated accordingly.

Preferably, the data records are call data records.

Preferably, the characterising features of the call data records are one or more of the following: day of call, time call initiated, destination of call and duration of call.

Preferably, the system further comprises an alarm generation means enabled to alert one or more operators when the one or more new data records have a likelihood value above the threshold value.

According to a third aspect of the present invention there is provided a computer program product directly loadable into the internal memory of a digital computer comprising software code portions for performing the method of the first aspect of the present invention.

Embodiments of the present invention will now be described with reference to the accompanying drawings, in which:

Fig. 1 illustrates a flow diagram of an anomalous behaviour detection and scoring system according to the present invention.

The following description of the working of the present invention and the associated examples are made with reference to detecting anomalous behaviour in telephone usage.

It should be appreciated that the invention can be applied to detecting anomalous behaviour in other applications, and to any streams of data originating from multiple sources where the behavioural patterns may vary both between sources and, for each source, over time.

Examples of potential applications include, but are not restricted to:

• telephone call data from mobile networks;

• telecommunications and network infrastructure monitoring for failure and intrusion;

• software application usage;

• user internet browsing behaviour;

• transaction streams for any banking, credit card, debit card, loyalty/reward card or similar scheme;

• freight/package distribution tracking;

• asset tracking;

• insurance fraud;

• health-care fraud;

• internet transactions; and

• intrusion detection.

Additionally, the present invention provides the ability for a group of system owners to share profile information, anonymously or otherwise, between themselves to facilitate the identification of any users who migrate between systems.

With reference to Fig. 1, an anomalous behaviour detection and scoring system 10 is shown having a non-anomalous profile generator 12 and an anomalous profile generator 14.

The non-anomalous profile generator 12 has a non-anomalous Call Data Record database 16 for non-anomalous data records from user accounts. A profile generator 18 generates an account profile for each user corresponding to non-anomalous usage and stores the account profiles in a non-anomalous profile database 20.

The anomalous profile generator 14 has an anomalous Call Data Record database 22 for anomalous data records from identified fraudulent usage. An anomalous profile generator 24 generates a fraudulent profile corresponding to anomalous usage and stores the fraudulent profile in an anomalous profile database 26.

Account profile generation can be a computationally intensive exercise, dependent on the number of call data records, but need only be performed at the outset of deployment of the system.

A probability calculator 28 calculates a probability distribution function in respect of new data records 30 for a particular account profile and for the fraudulent profile.

A likelihood calculator 32 determines the likelihood that the new data records 30 belong to the particular account profile.

A threshold calculator 34 determines a particular threshold value for the account profile and the likelihood value is then compared to the threshold value in a comparator 36. If the likelihood value is greater than the threshold value then an alert 38 is generated to handle the anomalous data records accordingly.

A new data adder 40 may then add the identified anomalous data records to the call data records for the fraudulent profile.

If the likelihood value is less than the threshold value then the new data adder may add the new data records to the call data records for the account profile.

The account profile can then be updated at regular intervals based on the new usage, thereby increasing the accuracy of the account profile.
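The processing loop described above, from profile lookup through likelihood calculation to threshold comparison, might be sketched as follows. This is an illustrative toy rather than the patented implementation: the dictionary-based profile structure, the feature names and the add-one smoothing are all assumptions made for the example.

```python
import math

def log_prob(record, profile):
    """Log-probability of one call record under a count-based profile."""
    lp = 0.0
    for feature, value in record.items():
        counts = profile[feature]
        total = sum(counts.values())
        # Add-one smoothing so events unseen in the profile keep non-zero probability.
        lp += math.log((counts.get(value, 0) + 1) / (total + len(counts) + 1))
    return lp

def likelihood_value(records, account_profile, fraud_profile):
    """Positive when the records fit the fraudulent profile better than the account's."""
    lp_account = sum(log_prob(r, account_profile) for r in records)
    lp_fraud = sum(log_prob(r, fraud_profile) for r in records)
    return lp_fraud - lp_account

def detect(records, account_profile, fraud_profile, threshold):
    """True when the likelihood value exceeds the account's threshold (raise an alert)."""
    return likelihood_value(records, account_profile, fraud_profile) > threshold

# Illustrative profiles: counts of binned events observed for an account
# and for known fraudulent usage.
account_profile = {"duration": {"short": 9, "long": 1}, "time": {"day": 9, "night": 1}}
fraud_profile = {"duration": {"short": 1, "long": 9}, "time": {"day": 1, "night": 9}}
```

A long night-time call would then score against this account's history and could be routed to the anomalous call data records, while a short daytime call would be absorbed into the non-anomalous profile at the next update.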

Furthermore, an operator defined profile 42 may be added to either the anomalous or non-anomalous profile database 20, 26. This allows profiles to be added which are not necessarily drawn from data record history. For example, the profiles could define infrequently occurring but large risk fraud in call records.

The steps for generation of the account profile, fraudulent profile, probability, likelihood value and threshold value will now be described. It should be noted that the method employed in generating a decision on the call data records is particularly computationally efficient. For example, it is possible using the present invention to process a city's telephone calls, in this case approximately 4 million calls, in real-time on a relatively basic desktop personal computer (of 2005 standards).

The approach of the present invention is based on Bayesian networks. Bayesian networks are directed acyclic graphs in which nodes represent variables of interest and links represent causal influence amongst the variables. To each node in a Bayesian network there corresponds a conditional probability, the conditioning variables being the parents of the node. Bayesian networks allow information to be obtained about associations in the network and allow the reasons which caused a given result to be observed. This information is defined by the network topology, domain knowledge of the data and testing data. Depending on the direction of movement in the network it is possible to identify consequences or causes of events. Bayesian networks also have a high degree of flexibility and ability to adapt to changes in the environment. A network can be expanded to accommodate additional conditional variables and relations. If the incoming data shows that a variable is insignificant or that a relation between variables is slight, the network complexity can be reduced, which subsequently reduces the required amount of computation. At the same time, changes in network configuration do not affect the whole network, but are conducted locally only.

Given a customer's telephone service usage (non-anomalous) profile and a series of recent telephone calls attributed to the customer account, a test is required to assess whether the logged transaction data provides sufficiently strong evidence to accept that the calling activity genuinely originated from the account.

Having logged a series of telephone calls which have supposedly been made by a customer, the null hypothesis H_0 is that the calls have genuinely originated from the customer's account and are consistent with all previous patterns of calling behaviour from the account. The alternate hypothesis H_1 regarding the telephone calls is that they were not made by the owner of the account but in fact originate from another account, which we will denote as F.

From classical hypothesis testing there will be a rate of TYPE I errors made for any test procedure.

In this case, TYPE I errors (rejection of H_0 when it is true) correspond to genuine calls from an account being labelled as fraudulent or as not having originated from the customer. Given that the number of telephone calls being tested in a 24-hour period can be in the order of tens of millions, the TYPE I error rate has to be very carefully controlled and kept low to ensure that the number of calls exceeding the threshold is kept to a manageable level for the operators who may be required to process the calls which raise alarms.

On the other hand, the TYPE II error rate (acceptance of H_0 when it is false) indicates the number of deviant, and possibly fraudulent, telephone calls which are classified as normal. The TYPE II error rate also needs to be kept very small to ensure that the test is particularly sensitive to deviations from normal patterns of usage which may be highly indicative of fraudulent behaviour. The practical reality of such a fraud detection system is that the false rejection rate (TYPE I error rate) will have to be controlled.

Consider a number of independent and identically distributed random vectors C_1, C_2, ..., C_N denoting the representation of N logged telephone calls; these have a probability distribution P(C = c | a_m) under the null hypothesis, that is, they are generated from a customer account a_m.

Furthermore, there is a probability distribution, P(C = c | F), defining the distribution of telephone calls under the alternate hypothesis that another, possibly fraudulent, signature is responsible for the generation of the phone calls. The Neyman-Pearson lemma then states that the most powerful test (that which maximises the power of the test, 1 − β) for a fixed significance level α is obtained by using the likelihood ratio

\Lambda = \frac{P(C = c \mid a_m)}{P(C = c \mid F)}

as the test statistic.

The null hypothesis will be rejected when the value of the test statistic is smaller than a critical value c_crit such that Prob(Λ < c_crit | H_0 is true) = α.
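In practice c_crit can be estimated empirically, in the spirit of the threshold calculation by simulation described earlier: simulate calls under the null distribution, compute the test statistic for each simulated batch, and take the α-quantile of the resulting distribution. The following sketch assumes simple categorical call distributions and a Monte Carlo quantile; both are illustrative choices, not the patent's prescribed procedure.

```python
import math
import random

def log_likelihood(sample, dist):
    return sum(math.log(dist[x]) for x in sample)

def lr_statistic(sample, null_dist, alt_dist):
    # Log likelihood ratio: log P(sample | H0) - log P(sample | H1).
    return log_likelihood(sample, null_dist) - log_likelihood(sample, alt_dist)

def critical_value(null_dist, alt_dist, n_calls, alpha, n_sims=5000, seed=0):
    """alpha-quantile of the statistic under H0, i.e. Prob(stat < c_crit | H0) ~= alpha."""
    rng = random.Random(seed)
    outcomes = list(null_dist)
    weights = [null_dist[o] for o in outcomes]
    stats = sorted(
        lr_statistic(rng.choices(outcomes, weights=weights, k=n_calls),
                     null_dist, alt_dist)
        for _ in range(n_sims)
    )
    return stats[int(alpha * n_sims)]

# Illustrative call-type distributions under the two hypotheses.
null_dist = {"local": 0.8, "international": 0.2}
alt_dist = {"local": 0.2, "international": 0.8}
```

A batch of calls typical of the account then yields a statistic well above c_crit, while a batch typical of the alternate signature falls below it and is rejected.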

The definition of the probability distributions under both H_0 and H_1, and the definition of c_crit to set the level of significance of the test (i.e. the false rejection rate), will now follow.

Consider a population of customers, denoted by the set A, each of whom has an account with the telecom provider. The m-th customer makes a series of N_m telephone calls during a given period T, ..., T+τ, defined by C = [c_1, c_2, ..., c_{N_m}], where each c_n defines the counts of the number of times that each of the events which defines a telephone call has occurred.

The account for the m-th customer will be characterised by a consistent pattern of service usage over a given period of time 1, ..., T−1, from account initiation (time point 1) until the time points prior to the set of telephone calls initiated during period T, ..., T+τ. This will be reflected in a set of sufficient statistics, a_m, describing the number of times, for this account, that a particular event related to the initiation and completion of a particular telephone service has occurred. This set of sufficient statistics, a_m, will consist of, for example, the number of times a call is initiated in the morning between 6.00 am and midday, or the number of times that a call lasted longer than 15 minutes given that the call was international.

In this example, the set of sufficient statistics, a m , are the counts of the number of times that events are observed. For example, the number of times that a telephone number dialled falls into one of the predefined categories.

The sufficient statistics are generated through histogram binning. Histogram binning is the discretisation of continuous values, such as time, or the agglomeration of a large number of discrete values, such as actual telephone numbers, into a smaller number of values.

A "bin" is a single point representing a range of values which are covered by a discrete probability value. In this example, the corpus of calls has been mathematically decomposed to ascertain the boundary conditions most appropriate for the bins, and the dependencies which most affect the results. These results were used to define the sets of "bins" used. For example, there are relevant bins for day of the week, time of day, call type (essentially destination), call duration and also for some combinations of these. Histograms are the simplest way of visualising the values stored in the relevant bins.

In this example, four independent sets of events, or features, are used to define a simple account model. These are: Day of Week (W); Call Start Time (S); Call Destination (D); and Call Duration (L), with individual events denoted as w ∈ W, s ∈ S, d ∈ D and l ∈ L.
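Binning one call data record onto these four features might look as follows. The cut-points and labels here are invented for illustration; the patent derives its actual bin boundaries by mathematically decomposing the call corpus.

```python
from datetime import datetime

# Illustrative bins: (lower bound, label) pairs; a value falls in the last bin
# whose lower bound it meets or exceeds.
TIME_BINS = [(0, "night"), (6, "morning"), (12, "afternoon"), (18, "evening")]
DURATION_BINS = [(0, "short"), (300, "medium"), (900, "long")]  # seconds

def bin_value(value, bins):
    label = bins[0][1]
    for lower, name in bins:
        if value >= lower:
            label = name
    return label

def bin_call(start, duration_s, dialled_prefix):
    """Map one call data record onto discrete bins for the W, S, D, L features."""
    return {
        "day_of_week": start.strftime("%A"),                                       # W
        "start_time": bin_value(start.hour, TIME_BINS),                            # S
        "destination": "international" if dialled_prefix == "00" else "national",  # D
        "duration": bin_value(duration_s, DURATION_BINS),                          # L
    }
```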

It should be noted that the present invention is not in any way limited to the sets of events described above, nor does the assumption of statistical independence between events need to be made. These assumptions (independence between the four events defined) are used to illustrate the overall concept. It should be appreciated that other dependent or independent sets of events could be used for this example and would inevitably be used in other models using the present invention.

If the number of possible values for each event is defined as |W|, |S|, |D| and |L| (for example, with seven days of the week in which to make telephone calls, |W| = 7), then

a_m = [a_{m,w=1,\ldots,|W|},\; a_{m,s=1,\ldots,|S|},\; a_{m,d=1,\ldots,|D|},\; a_{m,l=1,\ldots,|L|}]^T \in \mathbb{N}^{|W|+|S|+|D|+|L|}

The definition of the count vectors for telephone calls made during the period T, ..., T+τ follows in a similar manner.
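The sufficient statistics a_m can then be accumulated as per-feature event counts over an account's binned call records; a minimal sketch with illustrative feature names:

```python
from collections import Counter

def sufficient_statistics(binned_records):
    """Per-feature event counts a_m accumulated over an account's binned records."""
    stats = {}
    for record in binned_records:
        for feature, value in record.items():
            stats.setdefault(feature, Counter())[value] += 1
    return stats

# Three binned calls for one hypothetical account.
calls = [
    {"day_of_week": "Monday", "start_time": "morning"},
    {"day_of_week": "Monday", "start_time": "evening"},
    {"day_of_week": "Friday", "start_time": "morning"},
]
a_m = sufficient_statistics(calls)
```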

It would be possible to consider further conditional events, such as Call Duration GIVEN Call Destination and Call Destination GIVEN Start Time, but for the purposes of this example we will assume that each event is independent.

For the purposes of this example, the assumption is made that, given the customer account, all call related events are independent of each other, i.e.

P(w, s, d, l | a_m) = P(w | a_m) · P(s | a_m) · P(d | a_m) · P(l | a_m)

The series of telephone calls, C, made during time period T, …, T+τ, are made with probability P(C | a_m); in other words, this series of calls was likely to have been made by the m-th customer account with probability P(C | a_m). For the simplest case, where it is assumed that each call is independent of all previous calls made from the customer account, then:

P(C | a_m) = ∏_(n=1)^N P(c_n | a_m)

Now each P(c_n | a_m) will be defined by the distribution over the available features, i.e. the day that the call was made (w), the time that the call was initiated (s), the destination of the call (d), and the duration of the call (l). Assuming conditional independence then:

P(c_n | a_m) = P(w_n | a_m) · P(s_n | a_m) · P(d_n | a_m) · P(l_n | a_m)

What is now required is a representation of each account in terms of a set of parameters which define each of the conditional probability distributions employed in each P(c_n | a_m). Assuming independence of the features, each account, m, is then defined by the following set of multinomial parameters:

θ_m = { θ_m^W, θ_m^S, θ_m^D, θ_m^L }

where the strictly positive parameters define the multinomial distributions such that:

Σ_(w=1)^|W| θ_m,w^W = 1,  Σ_(s=1)^|S| θ_m,s^S = 1,  Σ_(d=1)^|D| θ_m,d^D = 1,  Σ_(l=1)^|L| θ_m,l^L = 1

The distribution over each of the required multinomial parameters will be defined with a Dirichlet prior probability distribution, which is the conjugate of the multinomial likelihood, such that, for example, the Start Time parameters have a Dirichlet prior distribution defined as:

P(θ_m^S | μ_m^S) = [ Γ(Σ_s μ_m,s) / ∏_s Γ(μ_m,s) ] · ∏_s (θ_m,s)^(μ_m,s − 1)

where Γ(·) denotes the Gamma function. The corresponding multinomial likelihood for the start time is:

P(S | θ_m^S) = ∏_s (θ_m,s)^(n_m,s)

where n_m,s denotes the number of calls from account m initiated in start-time bin s, and N = Σ_s n_m,s. Then the marginal distribution for the account based on, for example, Start Time alone, is:

P(S | μ_m^S) = ∫ P(S | θ_m^S) · P(θ_m^S | μ_m^S) dθ_m^S

Integrating out these terms, the marginal is now dependent only on the Dirichlet parameters:

P(S | μ_m^S) = [ Γ(Σ_s μ_m,s) / Γ(Σ_s μ_m,s + N) ] · ∏_s [ Γ(μ_m,s + n_m,s) / Γ(μ_m,s) ]

as the multinomial parameters have been integrated out due to the conjugacy of the Multinomial-Dirichlet distributions.
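The Multinomial-Dirichlet marginal above can be evaluated in log space with the log-Gamma function. This is a minimal sketch, assuming `counts` holds the per-bin event counts n_m,s and `mu` the Dirichlet parameters μ_m,s:

```python
from math import lgamma, exp

def log_marginal(counts, mu):
    """Log of the Multinomial-Dirichlet marginal:
    log P(S|mu) = logGamma(sum mu) - logGamma(sum mu + N)
                + sum_s [logGamma(mu_s + n_s) - logGamma(mu_s)]"""
    N = sum(counts)
    out = lgamma(sum(mu)) - lgamma(sum(mu) + N)
    for n_s, mu_s in zip(counts, mu):
        out += lgamma(mu_s + n_s) - lgamma(mu_s)
    return out

# Sanity check: one call, two equally likely bins and a uniform prior
# mu = (1, 1) give a marginal probability of exactly 1/2.
p = exp(log_marginal([1, 0], [1.0, 1.0]))
```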

The parameters of the Dirichlet can be written as the product of a normalised measure (over, for example, S in the case of Start Time) and a positive real value, that is:

μ_m,s = ψ_m^S · π_s ,  where Σ_s π_s = 1 and ψ_m^S > 0

The values of the parameters of the Dirichlet prior probabilities have a direct effect on the predictive probability assigned to a series of calls given a particular account. For the case where the prior parameter values for all variables are set to the value of one, it is implicitly assumed that all parameter values are equally likely a priori, as then, given that the m-th account is new and has made no calls:

P(s | a_m) = 1 / |S|

implicitly assuming that all behaviours or modes of service usage are equally likely to emerge. The form of prior probability just discussed is particularly naive in that, given the existing population of customer accounts A, it ignores all the information available regarding specific characteristics of service usage from the population, or from market-segmented parts of the population.

Therefore, in the absence of account-specific information, that is, for a brand new account, our prior should be guided by the population average signature (or that of the market segment the new customer is attributed to), in which case π_s equals P(s | A), the probability of the s-th start-time event (for example, Morning) given the whole population of accounts A. The values of the scalar coefficients will be account-specific and can be identified via some form of grid search. Alternatively, to obtain the scalar values ψ_m^W, ψ_m^S, ψ_m^D and ψ_m^L, we can employ Empirical Bayes (Type II maximum likelihood) such that, for example:

ψ_m^S = argmax_(ψ > 0) log P(S | ψ · π)

Now denoting the log marginal likelihood for, for example, Start Time as:

L(ψ) = log Γ(ψ) − log Γ(ψ + N) + Σ_s [ log Γ(ψ π_s + n_m,s) − log Γ(ψ π_s) ]

then:

L′(ψ) = Ψ(ψ) − Ψ(ψ + N) + Σ_s π_s [ Ψ(ψ π_s + n_m,s) − Ψ(ψ π_s) ]

and

L″(ψ) = Ψ′(ψ) − Ψ′(ψ + N) + Σ_s π_s² [ Ψ′(ψ π_s + n_m,s) − Ψ′(ψ π_s) ]

where Ψ(·) denotes the digamma function, and these expressions can be employed in a Newton iteration:

ψ ← ψ − L′(ψ) / L″(ψ)

for each attribute of every account.
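A minimal sketch of the Empirical Bayes fit of ψ for one feature. To keep the example dependency-free, the closed-form digamma derivatives are replaced by finite differences; that substitution is an assumption of this sketch, not the method of the text:

```python
from math import lgamma

def log_lik(psi, pi, counts):
    """Log marginal likelihood of the event counts with Dirichlet
    parameters mu_s = psi * pi_s (pi is the population measure)."""
    N = sum(counts)
    val = lgamma(psi) - lgamma(psi + N)
    for p, n in zip(pi, counts):
        val += lgamma(psi * p + n) - lgamma(psi * p)
    return val

def newton_psi(pi, counts, psi=1.0, iters=25, h=1e-4):
    """Newton iteration on psi, using numerical first and second
    derivatives in place of the digamma expressions."""
    for _ in range(iters):
        f = lambda x: log_lik(x, pi, counts)
        g = (f(psi + h) - f(psi - h)) / (2 * h)              # L'(psi)
        g2 = (f(psi + h) - 2 * f(psi) + f(psi - h)) / h**2   # L''(psi)
        if abs(g2) < 1e-12:
            break
        psi = min(max(psi - g / g2, 1e-3), 1e3)  # keep psi positive, bounded
    return psi

psi_hat = newton_psi([0.5, 0.5], [8, 2])
```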

We now have to consider how to assign the required probabilities to a new sequence of calls originating from the accounts.

The posterior probability over the parameters follows, by conjugacy, as a Dirichlet whose parameters are the prior parameters incremented by the observed event counts:

P(θ_m^S | S, μ_m^S) = Dir(θ_m^S | μ_m,1 + n_m,1, …, μ_m,|S| + n_m,|S|)

Now, for N′ calls made during the new period T, …, T+τ, we require the following probability:

P(C′ | a_m) = P(W′ | a_m) · P(S′ | a_m) · P(D′ | a_m) · P(L′ | a_m)

where, for example, with n′_s denoting the number of new-period calls with start time s:

P(S′ | a_m) = [ Γ(Σ_s (μ_m,s + n_m,s)) / Γ(Σ_s (μ_m,s + n_m,s) + N′) ] · ∏_s [ Γ(μ_m,s + n_m,s + n′_s) / Γ(μ_m,s + n_m,s) ]

Now defining P(S′ | a_m) in this way, and similarly the other terms required, that is P(W′ | a_m), P(D′ | a_m) and P(L′ | a_m), yields the predictive likelihood of the series of calls originating from the specific account.

The non-anomalous profiles for each account comprise the sufficient statistics (the counts of each event) and the estimated values of the parameters of the Dirichlet priors, which require little storage overhead. The scoring of a series of calls then amounts simply to the iterated application of the Gamma function in each term of the predictive likelihood defined above.
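The scoring step can be sketched as below, assuming per-feature lists of prior parameters, historical counts and new-period counts; the function names are illustrative, not from the patent:

```python
from math import lgamma, exp

def log_predictive(new_counts, old_counts, mu):
    """Log predictive probability of one feature's new-period counts,
    using posterior Dirichlet parameters mu_s + n_s; every term is an
    application of the (log-)Gamma function."""
    post = [m + n for m, n in zip(mu, old_counts)]
    N = sum(new_counts)
    val = lgamma(sum(post)) - lgamma(sum(post) + N)
    for a, k in zip(post, new_counts):
        val += lgamma(a + k) - lgamma(a)
    return val

def score_calls(new, old, priors):
    """Total log score over independent features (day, start time,
    destination, duration)."""
    return sum(log_predictive(n, o, m) for n, o, m in zip(new, old, priors))

# One feature, uniform prior, no call history, one new call in the first
# of two bins: the predictive probability is exactly 1/2.
p = exp(log_predictive([1, 0], [0, 0], [1.0, 1.0]))
```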

Account-specific thresholds are also required to capitalise on the individual descriptive statistics. For a given level of test significance α, each account will require a corresponding threshold value λ_m such that:

P( log P(C | a_m) < λ_m ) = α

This has important practical consequences in that the False Rejection rate, that is, the rate at which calls which are actually genuine are rejected by the system as inconsistent with the current profile, will be controlled by this value.

To this end a form of Parametric Bootstrap is employed by using the above predictive distributions to repeatedly simulate series of calls from each account, compute their associated scores and then obtain the empirical distribution of the scores. These account specific empirical distributions can then be used to obtain the account specific threshold scores which will yield the required test significance levels (i.e. TYPE I error rates).
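A minimal sketch of the Parametric Bootstrap, assuming posterior Dirichlet parameters `post` for a single feature. Series of calls are simulated here via the Pólya-urn representation of the Dirichlet-multinomial, an equivalent sampling route chosen for simplicity:

```python
import random
from math import lgamma

def sample_counts(post, n_calls, rng):
    """Simulate one period of calls from the account's predictive
    distribution via the Polya-urn scheme of the Dirichlet-multinomial."""
    counts = [0] * len(post)
    for _ in range(n_calls):
        weights = [a + c for a, c in zip(post, counts)]
        r = rng.random() * sum(weights)
        for i, w in enumerate(weights):
            r -= w
            if r <= 0:
                counts[i] += 1
                break
        else:
            counts[-1] += 1  # guard against float rounding
    return counts

def log_score(counts, post):
    """Log predictive probability of a simulated series of calls."""
    n = sum(counts)
    val = lgamma(sum(post)) - lgamma(sum(post) + n)
    for a, k in zip(post, counts):
        val += lgamma(a + k) - lgamma(a)
    return val

def bootstrap_threshold(post, n_calls, alpha=0.05, sims=2000, seed=0):
    """Empirical alpha-quantile of the simulated scores: genuine call
    series fall below this threshold with probability ~alpha, giving
    the required Type I error rate."""
    rng = random.Random(seed)
    scores = sorted(log_score(sample_counts(post, n_calls, rng), post)
                    for _ in range(sims))
    return scores[int(alpha * sims)]

threshold = bootstrap_threshold([5.0, 3.0, 2.0], n_calls=10)
```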

As mentioned previously, the example above assumes that the sets of events used to describe the model are independent of each other. That is, for three independent variables x₁, x₂ and x₃, the probability is defined as:

p(x) = p(x₁) · p(x₂) · p(x₃)

The probability in the case of dependent variables should be calculated as a product of conditional probabilities. For example:

p(x) = p(x₁) · p(x₂ | x₁) · p(x₃ | x₁, x₂)

The present invention has a number of distinct advantages over prior solutions, including:

• There is no empirical reliance on external data other than call data records; that is, the system can work in isolation and no expensive integration with other systems is mandated. Given that many companies have a number of different billing systems for different services, this is not a trivial point;

• the system has the ability to identify classes of anomalous behaviour which have not been seen before, simply because they are not what the user normally does, as opposed to only being able to identify patterns the system has already seen;

• there is immediate result scoring against new, or entirely speculative, fraud patterns and no reliance on updates to the system to cope with the new fraud pattern;

• the system has the ability to process all data records for a customer or user, thereby building an accurate pattern of behaviour, rather than simply looking at high-value calls, allowing a level of discrimination not achievable when considering only a portion of a customer's behaviour;

• the system is also insensitive to the nature of calls being made, that is, it does not matter whether the calls were to or from a mobile cellular phone or an internet connection; all data is processed and no pre-filtering of call information is necessary;

• as more call data for a given account is processed, the accuracy of the system improves since the account signature will become more and more refined; and

• cost of ownership is reduced due to reduced hardware costs and real-time processing.

Improvements and modifications may be incorporated without departing from the scope of the present invention.