PARIKH SANJAY (IN)
NAIR RAJESH (US)
PARIKH SANJAY (IN)
US20080046292A1 | 2008-02-21 | |||
US20090319535A1 | 2009-12-24 | |||
US20050154614A1 | 2005-07-14 | |||
US20090299767A1 | 2009-12-03 |
V We Claim: 1. A method for creating multiple tagged clinical trial data, the method comprising: receiving clinical trial information from a plurality of sources; removing redundancies from the clinical trial information received from the plurality of sources to form collated clinical trial data; baseline tagging of the collated clinical data using non-indication parameters; creating a disease specific list of indication parameters, wherein indication parameters are classified into main indication parameters and sub indication parameters; advanced tagging of the collated clinical trial data using indication parameters; and creating multiple tagged clinical data using baseline tagging and advanced tagging. 2. The method of claim 1 wherein the step of removing redundancies involves removing redundancies based on at least a statistical keyword match done for same clinical trial information from a source and/or from at least two sources from the plurality of sources. 3. The method of claim 1 wherein the collated data comprises clustered clinical trial data after removing the redundancies. 4. The method of claim 1 further comprising mapping a new clinical trial information to an existing multiple tagged clinical data. 5. The method of claim 1 further comprising creating a new multiple tagged clinical data from a new clinical trial information. 6. The method of claim 1 further comprising creating an enhanced trial database of the multiple tagged clinical data. 7. A tool for analyzing clinical trial information using the method of claim 1-6. 8. A computer program product comprising: a computer useable medium having a computer readable code including instructions for: receiving clinical trial information from a plurality of sources; removing redundancies from the clinical trial information received from the plurality of sources to form collated clinical trial data; baseline tagging of the collated clinical data using non-indication parameters; creating a disease specific list of indication parameters, wherein indication parameters are classified into main indication parameters and sub indication parameters; advanced tagging of the collated clinical trial data using indication parameters; and creating multiple tagged clinical data using baseline tagging and advanced tagging. 9. The computer program product of claim 8 further comprising mapping a new clinical trial information to an existing multiple tagged clinical data. 10. The computer program product of claim 8 further comprising creating a new multiple tagged clinical data from a new clinical trial information. 11. The computer program product of claim 8 further comprising creating an enhanced trial database of the multiple tagged clinical data. |
TECHNICAL FIELD
[0001] The invention relates generally to clinical trial management and more specifically to a method for organizing clinical trial data for efficient retrieval and use. BACKGROUND
[0002] In the medical field, clinical trials are typically conducted to allow safety and efficacy data to be collected for drugs, diagnostics, devices, therapy protocols, and other health or disease management related aspects. There are details procedures that need to be followed by corporates, research or health organizations to plan and conduct the trials for any new and/or development phase drugs, diagnostics, devices, therapy protocols, etc. The trial planning involves selection of the sites or centres where the trial would be conducted, these could be single center in one country or multiple centers in different countries. Similarly, there is a choice of healthy volunteers and/or patients depending on the type of product for which clinical trial is being conducted. Besides these, there are elaborate lab procedures that need to be selected for the clinical trials.
[0003] Clinical trials thus involve efficient planning and huge costs for all of the above mentioned activities, and design of clinical trials is critical to ensure that one gets relevant results for the product being tested. Clinical trials are also usually required before the national regulatory authority approves marketing of the drug or device, or a new dose of the drug, for use on patients.
[0004] The information from the ongoing and completed clinical trials is therefore very valuable to all those who may be engaged in similar research efforts for effective new clinical trial design. Currently, the information pertaining to clinical trials is available from discrete information sources. An indicative list of such information sources includes public domain sources like the website www. Clinicaltrials.gov, World Health Organization's clinical trial registry, and country specific clinical trial registry like Indian clinical trial registry, Sri Lankan clinical trial registry etc.; a company specific clinical trial registry like Glaxo SmithKline clinical trial registry, Roche clinical trial registry, etc.; and literature resources like PubMed, conference abstracts, and the like. The clinical trial data currently available is huge and widely dispersed. [0005] There have been some inter- governmental efforts to provide a portal to access clinical trial information from select databases, for example the IFPMA Clinical Trial Portal that provides links to ClinicalStudyResults.org, ClinicalTrials.gov, Current Controlled Trials, Japan Pharmaceutical Information Center, and Pharmaceutical Industry Clinical Trials database. However, these efforts currently lack integration of all the different sources of information and the search features are limited.
[0006] Therefore there is a continuing need to address issues related to accessing clinical data information from all the different sources with ease and analyzing the data to find out the progress of any trial or results therefrom. [0007] Accordingly there is a need to have a single window platform that is able to access all the different information sources and provide usable information on time and with speed.
BRIEF DESCRIPTION
[0008] In one aspect, the invention provides a method for creating multiple tagged clinical trial data. The method comprises receiving clinical trial information from different sources, and removing redundancies from the clinical trial information received from the plurality of sources to form collated clinical trial data. The method further involves baseline tagging of the collated clinical data using non-indication parameters, creating a disease specific list of indication parameters, where indication parameters are classified into at least main indication parameters and sub indication parameters. The method further includes advanced tagging of the collated clinical trial data using indication parameters and creating multiple tagged clinical data using baseline tagging and advanced tagging. DRAWINGS
[0009] These and other features, aspects, and advantages of the present invention will become better understood when the following detailed description is read with reference to the accompanying drawings in which like characters represent like parts throughout the drawings, wherein:
[0010] FIG. 1 is a diagrammatic representation of the overall method for creating an enhanced trial database that includes multiple tagged data; and
[0011] FIG. 2 is a diagrammatic representation of different components to enable the method of FIG. 1.
DETAILED DESCRIPTION
[0012] As used herein and in the claims, the singular forms "a," "an," and
"the" include the plural reference unless the context clearly indicates otherwise.
[0013] The clinical trial, or simply trials herein, refers to a health intervention study and includes but is not limited to studies related to drugs, devices, dosages, therapy protocols, diagnostics.
[0014] As used herein the clinical trial data is data or information available at any time point after initiation of a clinical trial including clinical study design. As one of ordinary skill in the art will appreciate, different data will become available at different stages of clinical trials, all of which are meant to be included as clinical trial data. Thus, for example, a clinical study design alone may be clinical trial data, or in the middle of a clinical trial, data such as investigators, geography, experimental details, and the like will constitute clinical trial data, while at the completion of a clinical trial, data such as results, end points, and so on will also be included as part of clinical trial data. [0015] The clinical trial management as used herein refers to management of clinical trials. The management of clinical trial is achieved using the clinical trial data as defined herein.
[0016] The indication area as used herein refers to a condition which makes a particular treatment or procedure advisable.
[0017] The non-indication parameters as used herein refer to parameters, which are seen across the clinical trials irrespective of indication area the trial was conducted. Thus the non-indication parameters are independent of an indication area. The exemplary but non-limiting non-indication parameters include Trial Phase, Trial Status, Study design, Race, Gender, Age, Study sponsor, Investigator, Trial Site, Drug, Treatment duration, and Intervention type.
[0018] The indication parameters as used herein refer to parameters that are specific for a given indication area. The exemplary but non-limiting indication parameters include Patient segment, Inclusion criteria, Exclusion criteria, Endpoints - Efficacy & Safety, and Diagnostic and Laboratory parameters.
[0019] According to one aspect, a method for creating multiple tagged clinical trial data is provided and is shown generally as flowchart 10 in FIG. l. The method includes receiving clinical trial data and information from multiple sources as indicated at step 12 of the flowchart. Such sources include the website www. Clinicaltrials.gov, World Health Organization's clinical trial registry, and country specific clinical trial registry like Indian clinical trial registry, Sri Lankan clinical trial registry etc. ; company specific clinical trial registry like Glaxo SmithKline clinical trial registry, Roche clinical trial registry, etc.; literature resources like PubMed, conference abstracts, and the like. In the exemplary embodiment, the information is collected from such sources using crawlers that are usually written using a dynamic programming language, such as Perl programming language, and then stored in a database. [0020] The method further involves removing redundancies from the clinical trial information received from these sources to form collated clinical trial data as shown at step 14 of the flowchart. The removal of redundancies is based on at least a statistical keyword match done for same clinical trial information from a source and/or from at least two sources from the multiple sources to yield the collated data that is free of redundancies. Also, the collated data as described herein includes a clustered clinical trial data after removing the redundancies. For example, a given clinical trial can be represented in multiple sources, with the same title or a different title conveying the same meaning. For example, when three different sources for trial data such as the websites clinicaltrials.gov, WHO website for clinical trials and Indian clinical trial registry are searched for a trail ID NCT00455533, they all show only one trial information for measuring the efficacy of four drugs Cyclophosphamide, Doxorubicin, Ixabepilone, Paclitaxel in early breast cancer. It may be noted that some of the data fields are same but some are different in these three sources of trail data. If the Indian clinical trial record is compared with other sources, WHO and clinicaltrials.gov, they are not the same in first glance but comparing the secondary IDs and drugs used and using domain knowledge, it may be concluded that the same trial is being represented by the three different sources, and hence a uniform representation with all the information pertaining to this trial from these three different sources needs to be clustered after removing the redundancies. Thus with the clustered data, any given clinical trial gets analyzed in one step. In the above example, if clustering was not there, one would have to analyze all the above three clinical trials separately. Through this method step, any incremental data also gets associated with the trail, such as, but not limited to, site and investigators data for the given trial from different sources. In the above example, there was a lot of information about the investigators and sites used in India sourced from Indian clinical trial registry but the same data was not present in other two sources (clinicaltrials.gov and WHO). Thus clustering ensures that all the data for any given trial gets associated to provide a complete set of information for every trial. [0021] Baseline tagging of the collated clinical data is then done as shown at step 16 using non-indication parameters. A sample list of non-indication parameters is given in Table 1.
Trial Phase Trial Status Study Type
1. Phase I 1. Planned 1. Interventional Study
2. Phase I/II 2. Open 2. Observational Study
3. Phase II
3. Closed 3. Dose Optimization/Dose Consolidation Study
4. Phase II/III
4. Completed 4. Dose Titration Study
5. Phase III
5. Temporarily Closed 5. Investigator-Initiated Study
6. Phase
III/IV 6. Terminated 6. Extension Study
7. Phase IV 7. Pharmacoeconomics Study (HE&OR Study)
8. Pharmacogenomics/Pharmacogenetics Study
9. Pilot Trial
10. Pivotal Trial
11. Postmarketing Surveillance (PMS) Study
12. Proof-of-concept (POC) Study
13. Registry Study
TABLE 1 [0022] Further the method involves creating a disease specific list of indication parameters, wherein the indication parameters are classified into main indication parameters and sub indication parameters. The steps involved in creating a list of indication parameters in an exemplary embodiment involves, collating all the clinical trials in a given indication area and listing down all the data pertaining to given parameter. For example, for endpoints, all the endpoints that are used in all the clinical trials collated are listed. Next, filtering is done to remove the redundant indication data. Next, the data collected pertaining to given parameter, is divided into different level, for example, two levels, first level being termed as Main parameter (also referred to as parent parameter) and second level called being termed as Sub- parameter (also referred to as child parameter). A sample of the Chronic Obstructive Pulmonary Disorder (COPD) indication parameter is listed in the Table 2 below:
2. Healthy Female Volunteers Others 1. COPD with Insomnia
2. Cystic Fibrosis
3. Patients with Gastroduodenal ulcer
4. Idiopathic Pulmonary Fibrosis (IPF)
5. Unspecified Chronic Respiratory Disease
6. Cigarette Smokers
7. Active SELECT trial Participant
TABLE 2
[0023] Another exemplary list of inclusion parameters as used in the method of the invention is given in Table 3:
Patients on mechanical ventilation
TABLE 3
[0024] Similarly another list of exclusion parameters as used in the invention is given below in Table 4.
Cystic fibrosis
Giant bullous disease
Interstitial lung disease
Lung cancer
Pleural pathology
Pneumonia
Pneumothorax
Primary ciliary dyskinesia
Pulmonary edema
Pulmonary fibrosis
Pulmonary hypertension
Pulmonary thromboembolic disease
Sarcoidosis
Solitary nodule in the lung
Tuberculosis (known, active)
Tuberculosis sequalae
Unspecified chronic respiratory disease
Chest x-ray abnormality other than COPD
Pneumoconiosis
12 Patients with hematologic disorder
13 Bladder neck obstruction
14 Immune disorder
15 Neoplasm
Cancers
Cancers with specific exceptions
16 Infections
17 Ophthalmic disease
18 Neurological disease
19 Psychiatric disorder
Bipolar disease
Schizophrenia
Mental retardation
TABLE 4
[0025] An exemplary list of end-points as used in the method of the invention is given below in Table 5:
Inspiratory Vital Capacity (IVC)
TABLE 5
[0026] Another exemplary list of indication parameters showing diagnostic/lab parameter is given in Table 6 below:
Tidal Volume (VT)
TABLE 6 [0027] It will be appreciated by those skilled in the art that only exemplary lists are shown in above tables, and the lists include several other parameters needed for classification and tagging of the clinical trials. These aspects are shown in more detail in FIG. 2. [0028] The method then involves the step for advanced tagging of the collated clinical trial data at step 18, using indication parameters as described above. All the relevant trials are thus categorized, analyzed and indexed based on parameters that depend on a given indication area.
[0029] Then using the baseline tagging and advanced tagging, the method involves creating multiple tagged clinical data as shown at step 20.
[0030] The method as described herein further allows for dynamic updating of the trial data information. In this respect the method includes mapping a new clinical trial information to an existing multiple tagged clinical data or creating a new multiple tagged clinical data from the new clinical trial information, if it is not an update for any existing record but a new trial data.
[0031] The method further comprises creating an enhanced trial database of the multiple tagged clinical data as indicated at step 22.
[0032] Thus through the method as described herein an enhanced trial database is made available that contains organized clinical trial data in the form of multiple tagged data and is available for further use for example through a web- enabled tool for searching and analyzing the clinical trial data.
[0033] Referring now to FIG. 2, a diagrammatic representation 24 of different components used or created by the method of FIG. 1 is illustrated in more details. The different clinical trial resources are indicated generally by reference numeral 26 that are used in the method of FIG. 1. The clinical trial information from all such resources is checked for redundancies, trial updates and also for new trials on a continuous or periodic basis as shown generally by reference numeral 28. This cleaned up data is stored in a database 30 and is then used for doing baseline tagging 32 using different attributes as indicated by reference numeral 34. Then the baseline tagged data is further filtered as shown at 36 to provide advanced tagged data 42 with non-indication and indication parameters as shown at 38 and 40 respectively. This leads to creation of multiple tagged clinical data 44 that is stored as an enhanced trial database 46 that can be accessed for example by a client interface 48.
[0034] It would be appreciated by those skilled in the art that the method described herein provides a repository of global clinical trials, which are organized systematically in order to facilitate easy retrieval with enhanced and current clinical trial information. It is useful for all those who are involved in design, execution, or analysis of clinical trials.
[0035] It may be appreciated by one skilled in the art that the method and process steps and algorithms described herein can be executed by means of software running on a suitable processor, or by any suitable combination of hardware and software. When software is used, the software can be accessed by a processor using any suitable reader device which can read the medium on which the software is stored. The computer readable storage medium can include, for example, magnetic storage media such as magnetic disc or magnetic tape; optical storage media such as optical disc, optical tape, or machine readable bar code; solid state electronic storage devices such as random access memory (RAM) or read only memory (ROM); or any other physical device or medium employed to store a computer program. The software carries program code which, when read by the computer, causes the computer to execute any or all of the steps of the methods disclosed in this application. Similarly a communication link that may be an ordinary link or a dedicated communication link may be provided for accessing the enhanced trial database as described herein from a user's work station. [0036] While only certain features of the invention have been illustrated and described herein, many modifications and changes will occur to those skilled in the art. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the invention.