Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SYSTEM AND METHOD FOR DEMOGRAPHIC ANALYTICS BASED ON MULTIMODAL INFORMATION
Document Type and Number:
WIPO Patent Application WO/2012/143939
Kind Code:
A2
Abstract:
The system and method of the present invention are described for automatic detection of error in the entry of particular category of individuals, especially referring to gender and age classification either real time while creating a database of such information or on an existing database on the record of individuals by analyzing their biometric characteristics like speech, image or face and other related demographic information like name of the individual in order to accord each individual with a unique identification.

Inventors:
SINHA ANIRUDDHA (IN)
MISRA PRATEEP (IN)
BANERJEE SNEHASIS (IN)
PAL ARPAN (IN)
Application Number:
PCT/IN2012/000265
Publication Date:
October 26, 2012
Filing Date:
April 12, 2012
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
TATA CONSULTANCY SERVICES LTD (IN)
SINHA ANIRUDDHA (IN)
MISRA PRATEEP (IN)
BANERJEE SNEHASIS (IN)
PAL ARPAN (IN)
International Classes:
B32B33/00; C23F1/00
Foreign References:
US20100115114A12010-05-06
US20010001877A12001-05-24
US20090222255A12009-09-03
US20090228294A12009-09-10
US20080049983A12008-02-28
US20040184591A12004-09-23
Other References:
None
See also references of EP 2697064A4
Attorney, Agent or Firm:
GUPTA, Priyank (B-105 ICC Trade Towers,Senapati Bapat Road, Pune 6, IN)
Download PDF:
Claims:
WHAT IS CLAIMED IS:

1) An information processing method for gender verification involving automatic detection of error in gender category entry, in an information storage media wherein the said method is based on multimodal data analysis technique and the gender identification information is captured and interpreted in the processor implemented steps of:

receiving biometric based gender information from an individual by utilizing biometric matching modules and other gender identification information from multiple modalities for generating multimodal data; analyzing the received gender identification information by employing biometric matching modules and anthroponomastic analysis technique on the captured multimodal data for assigning a probable gender to the individual; assigning a confidence fuzzy score with values associated thereof to the analyzed gender identification information by means of a computer implemented verification report generating module; verifying the correctness of identification information based on the expression of the individual in an identifiable language and; transmitting an error reported by the verification module via an alert generation module, for correction of identification information, on occurrence of any such error.

2) An information processing method for gender verification, as claimed in 1 , wherein the gender identification information includes biometric information and name of the individuals. 3) An information processing method for gender verification as claimed i claim 1, wherein the biometric information comprises of one or more parameters including voice, facial patterns, respiration volume, skin thickness, biochemical features (e.g., blood biochemistry), fingerprints, palm prints, retinal identification, iris scan and the like.

4) An information processing method for gender verification as claimed in claim 1 , wherein the other gender identification information includes anthroponomastic analysis of name.

5) An information processing method for gender verification as claimed in claim 1 , wherein the probable gender is selected from male, female and neutral after biometric and anthroponomastic analysis.

6) An information processing method for gender verification as claimed in claim 1, wherein the probabilistic or fuzzy score is assigned to evaluate and estimate correctness of gender identification by assigning threshold qualifying score to each of identification information.

7) An information processing method for gender verification as claimed in claim 1 , wherein the method is configured to be implemented either real time while creating the information storage media or on an existing information storage media.

8) An information processing system for gender verification involving automatic detection of error in gender category entry, in an information storage media based on multimodal data analysis technique, wherein the said system comprises of : biometric matching modules adapted to assign probable gender to captured biometric information for generating identification information;

a transitory storage module capable of communicating with biometric matching modules for storing the gathered biometric and other related identification information;

a report generating module for generating fuzzy score for each of the captured biometric and other identification information;

a verification module to verify consistency in entry of categorical information associated with the gender information and;

an information storage media adapted to store validated and accurate information for assigning unique identification to individuals.

9) An information processing system for automatic detection of error, as claimed in claim 8, further comprising an alert generating module for reporting error in the gender entry of information.

Description:
"SYSTEM AND METHOD FOR DEMOGRAPHIC ANALYTICS BASED ON MULTIMODAL INFORMATION"

FIELD OF THE INVENTION

The present invention relates to the field of data processing, and more particularly, to the method and apparatus for gender verification of individuals based on multimodal data analysis approach.

BACKGROUND OF THE INVENTION:

In order to capture the record of individuals for according them a unique identification, all the necessary information needs to be gathered and managed in an appropriate database. This information includes their name, gender, age, marital status, any photograph and biometric characteristics like fingerprints, palm prints, retinal identification, iris scan, face recognition or speech samples. Such a valuable piece of information is stored at an appropriate database for further identification of individuals and their gender verification.

However, it has been observed that in many instances the gender or the age of individuals is wrongly entered in such databases when the record comparisons are made in real time. This in turn necessitates the requirement of strategy or methods for gender verification, their ethnicity and age estimation from the gathered demographic information. Automated verification of demographic information has numerous applications including passive surveillance such that each individual is correctly identified and his/her identity is stored in a database to be searched whenever the access is sought.

As a result, an active area of research and development is dedicated to improve biometric characteristic identification in recent years. For example, face detection has been a well researched field to detect the gender based on global features (shape, hair contour) and geometric features (eyebrow thickness, nose width etc.) but the accuracy drawn in such cases has been in the range of 85% to 92%.

Another popular approach to estimate gender and age based on formant/ pitch analysis is through the use of speech recognition technology. However, current speech recognition based identification typically exhibits high error rates; their accuracy reported as 98% for clean speech and 95% for noisy speech. Further, speech recognition systems work well under laboratory conditions, but intend to show a considerable decrease in recognition rates when used in a normal operating environment. This decrease in accuracy occurs for the most part because of the unpredictable and variable noise levels found in a normal operating setting, and the way individuals alter their speech patterns to compensate for this noise.

Incorporating name as one of the parameters for gender and/or age identification and verification, also poses multiple challenges based on individual geographical origin or location and hence prone to an error attack of approximately 5%.

There is thus a widely recognized need for, and it would be highly advantageous to have, a method and apparatus for automatically reporting error based on an individual's wrong classification with respect to a particular category, such as an age and/or gender- category.

This in turn triggers the need to develop a more mature and reliable system which reports gender verification and consistency of demographic data maintained at the appropriate database by way of extracting the intelligent information using the multiple data inputs instead of only relying on any of the biometric characteristic recognition techniques.

)

OBJECT OF THE INVENTION:

In accordance with the present invention there is provided a system and method to automatically detect error in a particular category of entry in an information storage media based on predetermined biometric characteristic of individuals and other related information.

Another object of the present invention is to detect error in the gender and age category of the entry in the database.

It is an object of the present invention to utilize name of the individual as the other related information for gender identification.

It is yet another object of the present invention to provide significantly high accuracy rates ranging above 90% for gender detection from facial recognition.

Yet another object of the present invention is to utilize multiple data inputs including the biometric characteristic information to verify the consistency of data made in database in gender or age category.

Another aspect of the present invention utilizes background color, face image features, speech and name as the recognition parameters for gender and age verification.

Still another object of the present invention is to analyze multimodal data for the verification of demographic information.

It is another object of the present invention to achieve maximum performance of the system by extracting intelligent information based on the multiple inputs and deciding on the correctness of those inputs.

Yet another object of the present invention is to generate automatic interactive alerts whenever an incorrect data entry of a particular category is reported.

BRIEF DESCRIPTION OF THE ACCOMPANYING DRAWINGS

The foregoing detailed description of preferred embodiments is better understood when read in conjunction with the appended drawings. For the purpose of illustrating the invention, there is shown in the drawings example constructions of the invention; however, the invention is not limited to the specific methods and system disclosed. In the drawings:

Fig. 1 highlights the well delineated architectural view of the constituting modules performing gender and age verification according to embodiment of the present invention.

Fig. 2 sets forth the flow diagram illustrating gender verification and checking data consistency according to one aspect of the present invention.

DETAILED DESCRIPTION OF THE INVENTION:

Some embodiments of this invention, illustrating all its features, will now be discussed in detail.

The words "comprising," "having," "containing," and "including," and other forms thereof, are intended to be equivalent in meaning and be open ended in that an item or items following any one of these words is not meant to be an exhaustive listing of such item or items, or meant to be limited to only the listed item or items.

It must also be noted that as used herein and in the appended claims, the singular forms "a," "an," and "the" include plural references unless the context clearly dictates otherwise. Although any systems and methods similar or equivalent to those described herein can be used in the practice or testing of embodiments of the present invention, the preferred, systems and methods are now described. The foregoing and other objects, features and advantages of the invention will be apparent from the following more particular descriptions of exemplary embodiments of the invention as illustrated in the accompanying drawings wherein like reference numbers generally represent like parts of exemplary embodiments of the invention. The preferred embodiment of the present invention as described below relate to a method and system which can be used for automatically detecting error in the entry of any categorical information of individuals gathered from a demographic survey. Such an automatic detection of error can be made real time while creating the database or on an existing database on the record of people collected. Specifically, the present invention can be used to detect error in the entry of gender and /or age information of individuals for generating their unique identification number by analyzing speech, image and name of individuals.

Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of the components set forth in the following description or illustrated in the drawings.

Fig 100 shows system architecture for demographic data collection eventually for gender verification. The system 100 gathers individual's information like name for anthroponomastic analysis, address, date of birth, gender and biometric characteristics of the individual like voice, facial patterns, respiration volume, skin thickness, biochemical features (e.g., blood biochemistry), fingerprints, palm prints, retinal identification, iris scan etc to be stored in a temporary storage module 101 constituting the system. The storage module is in communication with a plurality of biometric matching modules through a suitable networking module such as IP network. These biometric engines are capable of processing multimodal biometric data collected.

It will be recognized by those skilled in the art that a biometric matching engine may include any known technologies to detect 2D face, 3D face, hand geometry, single fingerprint, ten finger live scan, iris, palm, full hand, signature, ear, finger vein, retina, DNA, voice etc.

The data gathered temporarily in the storage module 101 is adapted to be processed for the verification of the demographic data. The process of verification can be initiated either at the time of collecting the data and storing it in the storage module or once the entries are made of the record of people in the module. The biometric matching modules in communication with the temporary storage module 101 within a communicating network process the gathered biometric information. Similarly the demographic data is also processed for further verification and subsequently a unique identification is accorded to that record in the maintained information storage media within the storage module 101. The unique identification specifies individual records that could contain face, fingerprint, iris, speech, face recognition enabling recordals. The system therefore, collects multiple data inputs comprising demographic data in module 102 which may include individuals name, sex, height, weight, hair color, eye color, etc. and biometric data in module 103.

The combination of such demographic and biometric data is processed to generate the combined demographic report in the report generating module 104 to verify the consistency of gender and age of a person along with interactivity with the person. This approach tends to reduce errors in the collection and maintenance of demographic records. Once the report gets generated, a multimodal data verification process is initiated in the verification module 105 to generate alerts for abnormalities in gender, age etc. while the valid and accurate data gets stored in the identification information storage media 106. The process of error alert generation is executed by alert generating module 107.

The data obtained from different equipment, commonly referred to as modalities-Ml , M2, M3, M4, M5, M6, M7, M8 like biometric matching module or equipment for capturing images or gestures etc is a multimodal fusion which is eventually analyzed for multimodal interpretation.

The gender verification can be based on combining some or all of the following multiple parameters: a. Analyzing background color - Male and female can be asked to stand in front of separate background color. For example red for female and blue for male. Next, by employing image processing technique using any one of the modalities, defined set of attributes, including any values or scores generated, gender verification can be done. b. Face image feature analysis - Again this can be done with the captured photo from a camera as a modality. Next, to improve the accuracy an approach to associate females with an ethnic identification, commonly understood by that geographically and traditionally common heritage can be made. c. Speech analysis - In this approach, the individual can be asked to tell their name and age for anthroponomastic analysis. The speech interpretation is done for its content to generate a set of data and/ or associated values that can be used to detect the gender. d. Name analysis - This involves anthroponomastic analysis of names to obtain the probability number for a gender.

Instead of only gender verification, a generic data verification approach can be executed which will analyze the consistency of the following: a. Gender from multiple sources

b. Age information and facial and speech features

c. Name and gender consistency

Finally, once the data recording process is over, the system would communicate a note of thanks in a locally identified language as a step indicating the end of verification. The content of thanks is based on gender, so that any discrepancy can be immediately put forward by the user. This will serve the purpose of social courtesy as well as overall verification. Fig. 2 is a procedural flow diagram illustrating gender verification and checking data consistency according to one of the preferred embodiments of the present invention. As discussed, the system 100 collects multiple data inputs from different modalities 101 & 102 to generate a multimodal data which gets stored at a transitory storage module 103. The different modalities interpret the multimodal data in order to generate a demographic report at report generator module 104. In one of the other embodiments, each of the multimodal interpretation in the set of multimodal interpretations is typically a unimodal interpretation; that is, each is an interpretation of one modality. However, in one of the other embodiments each multimodal interpretation can be generated by more than one modality.

The multi modal interpretations resulting from different modalities are substantially non- overlapping and essentially independent making them non ambiguous interpretations. These interpretations are attributed with one confidence fuzzy score with values associated with it in the verification module 105. These fuzzy scores are analyzed and further interpreted to take the decision on the consistency of the gender and age from multiple inputs and decide on the correctness of those inputs. The confidence or fuzzy score enables in retrieving correct gender related information. In case of confidence scores lower than the threshold score value, the different modules are reframed to support the gathered gender identification information with more relevant criteria's and parameters determining the gender of an individual. Thus the characterization of gender is invoked using such fuzzy scores and extracting relevant information from different modalities.

In order to verify for the correctness of data entered and to avoid any ambiguity in response generated, a general greeting in a locally identified language is made which will allow a verification of the data in a natural manner based on the expression of the person after this is heard. The error, if reported, gets notified by way of alert generation by module 107 while the correct entries get permanently stored in the information storage media 106. While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.