Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SINGLE-MOLECULE SEQUENCE AND HIGH SENSITIVITY METHYLATION ANALYSIS FOR TISSUE-SPECIFIC ANALYSIS
Document Type and Number:
WIPO Patent Application WO/2018/153588
Kind Code:
A1
Abstract:
The present invention relates to methods of determining one or more modification(s) of the nucleic acid sequence of at least one nucleic acid and at least one epigenetic alteration of said at least one nucleic acid in a sample of a subject, wherein the sample is derived from a body fluid of the subject, in order to link the one or more modification to a specific cell type.

Inventors:
DIETRICH CARSTEN (DE)
POSCH ANDREAS EMANUEL (AT)
Application Number:
PCT/EP2018/051502
Publication Date:
August 30, 2018
Filing Date:
January 23, 2018
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SIEMENS HEALTHCARE GMBH (DE)
International Classes:
C12Q1/68
Domestic Patent References:
WO2016127944A12016-08-18
WO2013148400A12013-10-03
WO2016094813A12016-06-16
Foreign References:
US20130189684A12013-07-25
US5795782A1998-08-18
Other References:
B. ZHANG ET AL: "Functional DNA methylation differences between tissues, cell types, and across individuals discovered using the M&M algorithm", GENOME RESEARCH, vol. 23, no. 9, 26 June 2013 (2013-06-26), US, pages 1522 - 1540, XP055457350, ISSN: 1088-9051, DOI: 10.1101/gr.156539.113
LU HENGYUN ET AL: "Oxford Nanopore MinION Sequencing and Genome Assembly", GENOMICS PROTEOMICS AND BIOINFORMATICS, BEIJING GENOMICS INSTITUTE, BEIJING, CN, vol. 14, no. 5, 17 September 2016 (2016-09-17), pages 265 - 279, XP029799460, ISSN: 1672-0229, DOI: 10.1016/J.GPB.2016.05.004
K. SUN ET AL.: "Plasma DNA tissue mapping by genome-wide methylation sequencing for noninvasive prenatal, cancer, and transplantation assessments", PNAS, vol. 112, no. 40, pages E5503 - E5512, XP055373988, DOI: doi:10.1073/pnas.1508736112
MEISSNER ET AL.: "Genome-scale DNA methylation maps of plu-ripotent and differentiated cells", NATURE, vol. 454, 2008, pages 766 - 770
LOKK ET AL.: "DNA methylome profiling of human tissues identifies global and tissue-specific methylation patterns", GENOME BIOLOGY, vol. 15, pages 3248
B.A. FLUSBERG ET AL.: "Direct detection of DNA methylation during single-molecule, real-time sequencing", NATURE METHODS, vol. 7, 2010, pages 461 - 465, XP009142171, DOI: doi:10.1038/nmeth.1459
JOHNJ. KASIANOWICZ ET AL.: "Characterization of individual polynucleotide molecules using a membrane channel", PNAS, vol. 93, no. 24, 1996, pages 13779 - 13773
SIMP-SONS ET AL., DETECTING DNA METHYLATION USING THE OXFORD NANOPORE TECHNOLOGIES MINION SEQUENCER
T. OHSHIRO ET AL.: "Single-Molecule Electrical Random Resequencing of DNA and RNA", SCIENTIFIC REPORTS, no. 501, 2012
LOKK ET AL.: "DNA methylome profiling of human tissues identifies global and tissue-specific methylation patterns", GENOME BIOLOGY, vol. 15, 2014, pages 3248
R. LEHMANN-WERMAN ET AL.: "Identification of tissue-specific cell death using methylation patterns of circulating DNA", PNAS, vol. 113, no. 13, pages E1826 - E1834, XP055436315, DOI: doi:10.1073/pnas.1519286113
SNYDER ET AL.: "Cell-free DNA Comprises an In Vivo Nucleosome Footprint that Informs Its Tissues-Of-Origin", CELL, vol. 164, 2016, pages 1 - 2,57-68
A.M. NEWMAN ET AL.: "Integrated digital error suppression for improved detection of circulating tumor DNA", NATURE BIOTECHNOLOGY, vol. 34, 2016, pages 547 - 555
Download PDF:
Claims:
Claims

1. A method of determining one or more modification ( s ) of the nucleic acid sequence of at least one nucleic acid and at least one epigenetic alteration of said at least one nucleic acid in a samp1e of a subject, wherein the samp1e is derived from a body fluid of the subject, comprising :

obtaining or providing a samp1e from the subj ect comprising at least one nucleic acid;

- sequencing the at least one nucleic acid from the sample to obtain the nucleic acid sequence of said at least one nu¬ cleic acid, and determining at least a part of the epigenetic profile of said at least one nucleic acid;

comparing said nucleic acid sequence to a first refer- ence database to determine one or more modification (s) there¬ of;

comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and

linking the one or more modification (s) of the at least one nucleic acid sequence to the specific cell type .

2. The method of claim 1 , wherein the comparing the nucleic acid sequence to a first reference database of the subj ect to analyze it for a modification and the comparing the epigenet¬ ic profile of the at least one nucleic acid to a second data¬ base to correlate the at least one nucleic acid to a specific cell type are carried out simultaneously .

3. The method of claim 1 or 2 , wherein the linking of the one or more modification (s) of the at least one nucleic acid sequence of the specific cell-type of the subj ect is used to determine a malfunction of the specific cell-type of the sub- j ect, particularly a malignant condition .

4. The method of any one of claims 1 to 3, wherein the sub¬ ject is a vertebrate, preferably a human or animal subject, particularly a human . 5. The method of any one of claims 1 to 4, wherein the at least part of the epigenetic profile of said at least one nu¬ cleic acid is further compared to a third reference database to determine a malignant epigenetic change . 6. The method of any one of claims 1 to 5, wherein the at least part of the epigenetic profile is a methylation pattern of the DNA sequence .

7. The method of any one of claims 1 to 6, wherein the se- quencing the at least one nucleic acid from the sample to ob¬ tain the nucleic acid sequence of said at least one nucleic acid, and determining at least a part of the epigenetic pro¬ file of said at least one nucleic acid is carried out using a method selected from nanopore sequencing, single-molecule based sequencing, tunneling based detection or combinations thereof .

8. A method of determining a malignant condition of a spe¬ cific cell type in a subj ect , comprising :

- obtaining or providing a samp1e from the subj ect comprising at least one nucleic acid;

sequencing the at least one nucleic acid from the sample to obtain the nucleic acid sequence of said at least one nu¬ cleic acid, and determining at least a part of the epigenetic profile of said at least one nucleic acid;

comparing said nucleic acid sequence to a first refer¬ ence database to determine one or more modification (s) there¬ of that are indicative of a malignant condition;

comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and linking the one or more modification ( s ) of the at least one nucleic acid sequence that are indicative of a malignant condition to the specific cell type . 9. The method of claim 8 , wherein the comparing the nucleic acid sequence to a first reference database of the subject to analyze it for a modification and the comparing the epigenet¬ ic profile of the at least one nucleic acid to a second data¬ base to correlate the at least one nucleic acid to a specific cell type are carried out simultaneously .

10. The method of claim 8 or 9, wherein the subject is a vertebrate, preferably a human or animal subj ect , particular¬ ly a human .

11. The method of any one of claims 8 to 10 , wherein the at least part of the epigenetic profile of said at least one nu¬ cleic acid is further compared to a third reference database to determine a malignant epigenetic change .

12. A method of linking one of more modification (s) of at least one nucleic acid sequence in a samp1e of a subj ect to a specific cell type of the subj ect, comprising :

obtaining or providing at least one nucleic acid se- quence of a nucleic acid and at least a part of the epigenet¬ ic profile of said at least one nucleic acid;

comparing said nucleic acid sequence to a first refer¬ ence database to determine one or more modification (s) there¬ of;

- comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and

linking the one or more modification (s) of the at least one nucleic acid sequence to the specific cell type .

13. The method of claim 12, wherein the comparing the nucle¬ ic acid sequence to a first reference database of the subject to analyze it for a modification and the comparing the epige¬ netic profile of the at least one nucleic acid to a second database to correlate the at least one nucleic acid to a spe¬ cific cell type are carried out simultaneously .

14. The method of claim 12 or 13, wherein the at least part of the epigenetic profile of said at least one nucleic acid is further compared to a third reference database to deter¬ mine a malignant epigenetic change .

15. Computer program product comprising computer executable instructions which, when executed, perform a method according to any one of claims 12 to 14.

Description:
Description

Single-molecule sequence and high sensitivity methylation analysis for tissue-specific analysis

The present invention relates to methods of determining one or more modification (s) of the nucleic acid sequence of at least one nucleic acid and at least one epigenetic alteration of said at least one nucleic acid in a samp1e of a subject, wherein the samp1e is derived from a body fluid of the sub- ject, in order to link the one or more modification to a specific cell type .

Liquid biopsy (sequencing of cell free DNA (cfDNA) ) is re ¬ cently thought to replace tissue biopsy by sampling tis ¬ sues/cells/molecules from a blood sample . This is especially useful when the target tissue is hard to reach/probe and would require large incisions that would leave a patient with large traumas and wounds .

In current non-targeted ( open question approach) liquid biop ¬ sy, sequencing allows detection of malignant nucleic acid, e.g. DNA, alterations indicative of disease . In this regard, liquid biopsy can be based on cell-free DNA/RNA, exosomes with DNA/RNA, or circulating tumor cells (DNA/RNA. However, the original tissue where analyzed DNA fragments stem from cannot be determined with this approach . Determination of the tissue of origin would be particularly useful , though, to ( i ) directly link somatic mutations to the tissue of origin (tumor / metastases ) and ( ii ) determine fi- brotic/necrotic tissue by increased cell free DNA levels in the blood . Finally, false positive/negative results might be the result of this workflow, since malignant DNA changes might be nonfunctional (hence not malignant) when occurring in non-relevant tissues, so that also in this regard the de ¬ termination of the tissue of origin is helpful .

Currently, liquid biopsy needs to be complemented by imaging based diagnostics to identify location of tumors , lesions and/or metastases . For example, Molecular Stethoscope links RNA molecules to tissue . The analysis of somatic DNA changes , however, is almost impossible . Furthermore, the cfRNA (cell free RNA) amount is much less (up to 3 orders of magnitude less ) than cfDNA.

Another approach that is taken is the use of epigenetic in ¬ formation from liquid biopsy for determining e.g. tumours , as disclosed e.g. by K. Sun et al . , "Plasma DNA tissue mapping by genome-wide methylation sequencing for noninvasive prena ¬ tal , cancer, and transplantation assessments", PNAS , vol . 112 no . 40 , E5503-E5512 , doi : 10.1073/pnas .1508736112, wherein the authors use methylation analysis after bisulfite conversion . Similarly, the company Nucleix, Israel, offers urine based liquid biopsy. A methylation pattern is therein used for identification whether bladder cancer is present or not .

The present inventors propose a liquid biopsy workflow that also considers epigenetic modifications (e.g. methylation) of the same molecules used for DNA sequencing and thereby achieves two improvements : First , malignant epigenetic chang ¬ es can be detected in liquid biopsy and the corresponding in ¬ formation used in addition to the information from sequenc- ing, as epigenetic changes can also be causal for diseases and cancer) . Second, since epigenetic alterations are among the most important alterations when it comes to cell differ ¬ entiation, many epigenetic modifications are indicative of a certain cell type . This offers to classify DNA fragments with regard to their original tissue type . Any other detected changes , be it DNA variations or epigenetic variations , can subsequently be analyzed in a tissue-specific manner - hence coming close to a regular biopsy - but without incisions .

In a first aspect the present invention relates to a method of determining one or more modification (s) of the nucleic ac ¬ id sequence of at least one nucleic acid and at least one ep ¬ igenetic alteration of said at least one nucleic acid in a sample of a subj ect , wherein the sample is derived from a body fluid of the subject, comprising :

obtaining or providing a samp1e from the subj ect comprising at least one nucleic acid;

sequencing the at least one nucleic acid from the sample to obtain the nucleic acid sequence of said at least one nu- cleic acid, and determining at least a part of the epigenetic profile of said at least one nucleic acid;

comparing said nucleic acid sequence to a first refer ¬ ence database to determine one or more modification (s) there ¬ of;

- comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and

linking the one or more modification (s) of the at least one nucleic acid sequence to the specific cell type .

A second aspect of the present invention is directed to a method of determining a malignant condition of a specific cell type in a subj ect, comprising :

- obtaining or providing a sample from the subj ect comprising at least one nucleic acid;

sequencing the at least one nucleic acid from the samp1e to obtain the nucleic acid sequence of said at least one nu ¬ cleic acid, and determining at least a part of the epigenetic profile of said at least one nucleic acid; comparing said nucleic acid sequence to a first refer ¬ ence database to determine one or more modification ( s ) there ¬ of that are indicative of a malignant condition;

comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and

linking the one or more modification (s) of the at least one nucleic acid sequence that are indicative of a malignant condition to the specific cell type .

Also disclosed is in a third aspect a method of linking one of more modification (s) of at least one nucleic acid sequence in a samp1e of a subject to a specific cell type of the sub- ject, comprising :

obtaining or providing at least one nucleic acid se ¬ quence of a nucleic acid and at least a part of the epigenet ¬ ic profile of said at least one nucleic acid;

comparing said nucleic acid sequence to a first refer- ence database to determine one or more modification (s) there ¬ of;

comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and

linking the one or more modification (s) of the at least one nucleic acid sequence to the specific cell type .

Further, the present invention also relates to a computer program product comprising computer executable instructions which, when executed, perform a method according to the third aspect .

Further aspects and embodiments of the invention are dis- closed in the dependent claims and can be taken from the fol ¬ lowing description, figures and examples , without being limited thereto . Figures

The enclosed drawings should illustrate embodiments of the present invention and convey a further understanding thereof . In connection with the description they serve as explanation of concepts and principles of the invention . Other embodi ¬ ments and many of the stated advantages can be derived in re ¬ lation to the drawings . The elements of the drawings are not necessarily to scale towards each other . Identical , functionally equivalent and acting equal features and components are denoted in the figures of the drawings with the same refer ¬ ence numbers , unless noted otherwise .

Fig . 1 shows schematically a method of determining simul ¬ taneously one or more modification (s) of nucleic acid sequences of circulating DNA and/or RNA mole ¬ cules and the methylation data thereof to link the modifications to specific cell types in a method of the present invention .

Fig . 2 shows schematically details of Fig . 1.

Fig . 3 gives reference to the features of Fig . 2 with re ¬ gard to actual results that can be obtained by the present method .

Fig . 4 gives a detail schematic view of the processes in

Fig . 3.

Detailed description of the invention

Definitions

Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs . In the context of the present invention a "sample" is a sam ¬ ple which comprises potentially at least one nucleic acid to be sequenced . Within the scope of the present invention, the sample is derived from a body fluid of a subj ect , i.e. not from matter that does not represent a body fluid, e.g. organ ¬ ic tissue . Body fluids are thereby liquids originating from inside the bodies of sub ects , particularly living sub ects . They include fluids that can be excreted or secreted from the body and/or that circulate in the body, and body water . They can be in the state of a liquid, emulsion or suspension . Examples of body fluids within the invention are blood, urine, saliva, sputum, plasma, serum. A samp1e derived from a body fluid can be the body fluid itself as well as a body fluid that is worked up, e.g. wherein specific parts are extracted, separated, etc . According to certain embodiments , the sample is a patient sample (clinical isolate) . Preferred samples are serum, plasma, and/or whole blood of a patient . According to certain embodiments , the subj ect in the present methods is a vertebrate, preferably a human or animal , more preferably a mammal and most preferred a human, respectively human patient . A vertebrate within the present invention refers to animals having a vertebrate , which includes mammals - including hu ¬ mans , birds , reptiles , amphibians and fishes . The present in ¬ vention thus is not only suitable for humans and the human medical field, but also for veterinary medicine .

The term "nucleic acid" refers to a polynucleotide molecule having a defined sequence . It comprises DNA molecules , RNA molecules , nucleotide analog molecules and combinations and derivatives thereof, such as DNA molecules or RNA molecules with incorporated nucleotide analogs or cDNA. It also com ¬ prises cell free (cf) DNA and RNA. The term "nucleic acid se- quence" relates to the sequence of nucleotides in the nucleic acid molecule.

The term "modification" of the nucleic acid sequence refers to any change in the nucleic acid sequence, i.e. in the nu ¬ cleotide sequence, occurring compared to a reference se ¬ quence, and does not refer to epigenetic changes , which are herein termed "epigenetic alteration" . A modification of the nucleic acid sequence comprises e.g. a mutation, i.e. a vari- ation in the sequence as compared to a reference sequence . Such a reference sequence can be a sequence determined in a predominant wild type organism or a reference organism, e.g. a defined and known reference sequence of an animal or human . A mutation is for example a deletion of one or multiple nu- cleotides , an insertion of one or multiple nucleotides , or substitution of one or multiple nucleotides , duplication of one or a sequence of multiple nucleotides , translocation of one or a sequence of multiple nucleotides , and a single nu ¬ cleotide polymorphism ( SNP) . A modification of the nucleic acid sequence can also refer to e.g. a deletion of a whole gene in a sequence, but can also refer to non-coding sequence in a nucleic acid .

Epigenetics relate to all factors that affect heritable traits that are not related to changes in the nucleic acid sequence, i.e. the nucleotide sequence . It relates to e.g. histones , methylation of nucleic acids (particularly DNA and/or RNA) , telomeres , prions , hydroxymethylation, oxidation of nucleotides , etc . , which can influence e.g. activity of a gene, respectively gene function, not related to the nucleic acid sequence itself .

An epigenetic profile relates to all factors of a nucleic ac ¬ id, respectively nucleic acid molecule, not associated with the nucleic acid sequence, particularly affecting downstream physiological processes such as e.g. transcription and thus may result in phenotypic changes . A part of an epigenetic profile relates to only a part of all epigenetic factors as ¬ sociated with a nucleic acid, e.g. the methylation infor ¬ mation of the nucleic acid or of a part thereof . An epigenetic alteration is a change in the epigenetic infor ¬ mation of a nucleic acid, i.e. an alteration in a nucleic ac ¬ id molecule that is not a modification of the nucleic acid sequence, of a cell of a specific cell type compared to one or more cells of other cell types , e.g. also as described in Meissner et al . , "Genome-scale DNA methylation maps of plu- ripotent and differentiated cells", Nature, 454 , 766-770, 2008, doi : 10.1038/nature07107, and Lokk et al . , "DNA methylo- me profiling of human tissues identifies global and tissue- specific methylation patterns", Genome Biology 15:3248.

Before the invention is described in exemplary detail , it is to be understood that this invention is not limited to the particular component parts of the process steps of the meth ¬ ods described herein as such methods may vary . It is also to be understood that the terminology used herein is for purpos ¬ es of describing particular embodiments only, and is not intended to be limiting . It must be noted that, as used in the specification and the appended claims , the singular forms "a", "an" and "the" include singular and/or plural referents unless the context clearly dictates otherwise . For example, the term "a" as used herein can be understood as one single entity or in the meaning of "one or more" entities . It is al ¬ so to be understood that plural forms include singular and/or plural referents unless the context clearly dictates other- wise . It is moreover to be understood that, in case parameter ranges are given which are delimited by numeric values , the ranges are deemed to include these limitation values .

A first aspect of the present invention relates to a method of determining one or more modification (s) of the nucleic ac ¬ id sequence of at least one nucleic acid and at least one ep ¬ igenetic alteration of said at least one nucleic acid in a sample of a subject, wherein the sample is derived from a body fluid of the subject, comprising :

obtaining or providing a sample from the subj ect comprising at least one nucleic acid;

- sequencing the at least one nucleic acid from the sample to obtain the nucleic acid sequence of said at least one nu ¬ cleic acid, and determining at least a part of the epigenetic profile of said at least one nucleic acid;

comparing said nucleic acid sequence to a first refer- ence database to determine one or more modification (s) there ¬ of;

comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and

linking the one or more modification (s) of the at least one nucleic acid sequence to the specific cell type .

In the present methods , the sample can be provided or ob- tained from the subj ect in any way, preferably non-invasive, and can be e.g. provided as an in vitro samp1e or prepared as in vitro samp1e .

The sequencing of the at least one nucleic acid from the sam- pie is not particularly limited and can be done by any suita ¬ ble sequencing method .

Further, also the determination of the at least part of the epigenetic profile of the nucleic acid is not particularly limited and can be done by any suitable method as well .

In this regard, it is only important that the determining of the nucleic acid sequence of the nucleic acid and the deter ¬ mining of the at least part of the epigenetic profile of the nucleic acid are carried out on the at least one nucleic ac ¬ id, i.e. the same molecule . According to certain embodiments, the sequencing of the at least one nucleic acid and the determination of the at least part of the epigenetic profile are carried out at the same time by a suitable method that can provide the nucleic acid sequence of a nucleic acid and at the same time obtain at least a part of the epigenetic data of the same nucleic acid, e.g. the information regarding methylation of the nucleic acid . According to certain embodiments , the sequencing of the at least one nucleic acid from the sample to obtain the nu- cleic acid sequence of said at least one nucleic acid, and determining at least a part of the epigenetic profile of said at least one nucleic acid is carried out using a method se ¬ lected from single-molecule based sequencing techniques , e.g. single-molecule real-time (SMRT) sequencing techniques such as PacBio on time-level with optical read-out, as e.g. de ¬ scribed by B . A. Flusberg et al . , "Direct detection of DNA methylation during single-molecule, real-time sequencing", Nature Methods 7, 461-465, 2010, doi : 10.1038/nmeth.1459, for example with nanopore sequencing, as e.g. described in US 5 , 795, 782 and e.g. provided by Oxford Nanopore Technologies , UK, with detection methods such as indirect detection via ion current changes , as e.g. described by B . A. Flusberg et al . , "Direct detection of DNA methylation during single-molecule, real-time sequencing", Nature Methods 7 , 461-465, 2010, doi:10.1038/nmeth .1459, John J . Kasianowicz et al . , "Charac ¬ terization of individual polynucleotide molecules using a membrane channel", PNAS 93 , 24 , 13779-13773, 1996, and Simp ¬ sons et al . , "Detecting DNA Methylation using the Oxford Nanopore Technologies MinlON sequencer",

https://doi.org/10.1101/047142, or tunneling current , as described by e.g. T . Ohshiro et al . , "Single-Molecule Electri ¬ cal Random Resequencing of DNA and RNA" , Scientific Reports 2 , Article number : 501 (2012) , doi : 10.1038 /srepO 0501. Howev ¬ er, other methods or combinations thereof are possible . Gen- erally all sequencing methods can be used that can measure base modifications via electrochemical , optical and/or other physico-chemical properties . In contrast to other sequencing methods like bisulfite sequencing, the above methods can pro ¬ vide information of the nucleic acid sequence and at least part of the epigenetic profile at the same time, so that a smaller amount of nucleic acid can be used and that also no artifacts are obtained due to a modification of the nucleic acid, as is e.g. the case during bisulfite sequencing . Due to this approach, also cost can be saved . This aspect is of tre ¬ mendous use in a method where the nucleic acid is coming from a samp1e derived from a body fluid of the subject, as in liq- uid biopsy .

It is not excluded in the present invention that the samp1e is worked up prior to sequencing the at least one nucleic ac ¬ id and/or determining at least a part of the epigenetic pro- file of said at least one nucleic acid, e.g. by isolating the at least one nucleic acid from the sample or at least concen ¬ trate it therein by a suitable method, e.g. by centrifuging, e.g. if the sample is a blood sample . However, it is also possible that sequencing and determining at least a part of the epigenetic profile of the nucleic acid is carried out us ¬ ing the sample without prior work-up, i. e . as is .

In the present methods , the first reference database is not particularly limited as long as modifications of a nucleic acid sequence can be determined . It can e.g. comprise nucleic acid sequence information of one or more healthy sub ect ( s ) of the same species as the subject of which the sample in the present method is obtained or provided, but can also alterna ¬ tively or in addition contain nucleic acid sequence infor- mation of one or more subj ect ( s ) of the same species for which one or more modification (s) is known and e.g. associat ¬ ed with a malignant condition and/or a disease . For example, the database can contain nucleic acid sequence information of a healthy subj ect, e.g. a human or animal , and nucleic acid sequence information of a subj ect with a malignant condition and/or a disease, e.g. cancer and/or a heart disease . It is also not excluded that the database contains nucleic acid se- quence information of the same subject from which the sample in the present method is obtained or provided, e.g. an older sample taken from the subject. The first reference database can contain the whole nucleic acid sequence information in regard to a subj ect or only a part thereof, e.g. the genome . Suitable databases include e.g. The Cancer Genome Atlas

(TCGA; https : / /cancergenome . nih . gov/ ) , Ensemble

(http : //www . ensembl .org) , OMIM - Online Mendelian Inheritance in Man (https : //www. omim. org/) , METHHC, COSMIC (Catalogue of Somatic Mutations in Cancer) , Cancer Gene Census (ongoing ef ¬ fort to catalogue those genes for which mutations have been causally implicated in cancer) , dbSNP (Database of short ge ¬ netic variations) , ESP (Exome Sequencing Proj ect) , 1000 Ge ¬ nomes / 10000 genomes (Deep catalogues of genetic variation) , dbNSFP (annotation database for non-synonymous SNPs ) , Cancer Moonshot .

Also the second reference database is not particularly lim ¬ ited as long as it contains epigenetic information that al - lows correlation of the nucleic acid to a specific cell type, e.g. a cell of the lung, heart, kidney, liver, intestine, specific muscles , prostate, pancreas , testicles , larynx, pharynx, etc . , i. e . any tissue . For example, according to certain embodiments even a link to cells of the blood can be made, e.g. red or white blood cells , so that also malignant conditions thereof can be detected, e.g. leukemia .

In this regard it is noted that epigenetic information can be easily linked to specific cell types , as e.g. disclosed in Lokk et al . , "DNA methylome profiling of human tissues identifies global and tissue-specific methylation patterns", Ge ¬ nome Biology 15 : 3248, 2014, http : //dx . doi . org//l 0.1186/gb- 2014-15-4-r54, R . Lehmann-Werman et al . , "Identification of tissue-specific cell death using methylation patterns of cir- culating DNA", PNAS, 113, 13. E1826-E1834,

doi : 10.1073/pnas .1519286113, or in Snyder et al . , "Cell-free DNA Comprises an In Vivo Nucleosome Footprint that Informs Its Tissues-Of-Origin", Cell 164, 1-2, 57-68, 2016,

http://dx.doi.Org/10.1016/j.cell.2015.ll.050. A suitable sec ¬ ond reference database can be e.g. a methylation database like, for example, MethyCancer, MethHC, MENT, MethylomeDB, NGSmethDB, DiseaseMeth, MethBase, TCGA. Like the first reference database the second reference database can contain in ¬ formation from healthy sub ects and/or subj ects negatively affected by a disease and/or malignant change . Also, again, older data of the same subject from which the sample in the present method is obtained or provided can be contained in the second reference database . It is not excluded in the p e ¬ sent invention that a reference data base can contain data that make it suitable as first reference database as well as second reference database .

According to certain embodiments , the method of the first as ¬ pect allows determining a malfunction of a specific cell-type in a samp1e of a subj ect , e.g. a malignant condition like a tumor .

According to certain embodiments , a multitude of nucleic ac ¬ ids is obtained or provided in the present methods . This way the prediction of a disease and/or malignant condition in a specific cell type can be enhanced .

The linking of the one or more modification (s) of the at least one nucleic acid sequence to the specific cell type can be done in any way and can e.g. be simply carried out by com ¬ bining the results of the comparison to the first and the second reference database . Also a link can be done to any further results of comparisons to further databases . In this way a direct correlation can be made between the tissue of origin of the at least one nucleic acid and an modification of the nucleic acid sequence thereof . In this way it is pos- sible to find out if a medical condition arising from the modification of the nucleic acid sequence actually is prob ¬ lematic with regard to the tissue of origin or not . For exam- pie, modifications of the nucleic acid sequence of a cell free nucleic acid coming from lung cells can be indicative of lung cancer, whereas the same changes in cells from calve muscles could be without any adverse effect on the health of the subject. Thus , the present method allows not only deter ¬ mination of the possibility of adverse effects on the subject due to modifications of the nucleic acid sequence in a samp1e derived from a body fluid, but also the correlation to a spe ¬ cific tissue to determine whether the effect actually is ad- verse in the specific tissue . This way the results of the present methods can be used to form a basis for deciding whether a biopsy in a specific tissue could be useful even when no other symptoms with regard to the tissue are ob ¬ served .

According to certain embodiments , the comparing the nucleic acid sequence to a first reference database of the subj ect to analyze it for a modification and the comparing the epigenet- ic profile of the at least one nucleic acid to a second data- base to correlate the at least one nucleic acid to a specific cell type are carried out simultaneously . This way it is pos ¬ sible to directly obtain results for deciding whether an acute action should be taken with regard to a medical condi ¬ tion, e.g. one affecting the heart or brain, etc .

According to certain embodiments , the linking of the one or more modification (s) of the at least one nucleic acid se ¬ quence of the specific cell-type of the subj ect is used to determine a malfunction of the specific cell-type of the sub- j ect, particularly a malignant condition . As already laid out above, it is possible to find out about malignant conditions with regard to a specific tissue, e.g. the liver, bladder, intestines , etc . , for which a usual confirmation can be at ¬ tributed with stress for the subj ect, e.g. due to invasive procedures . According to certain embodiments, the subject in the present methods is a vertebrate, preferably a human or animal sub- j ect , particularly a human . According to certain embodiments , the at least part of the epigenetic profile of said at least one nucleic acid is fur ¬ ther compared to a third reference database to determine a malignant epigenetic change or another health-adverse change in the subject. As described above, it is known already that also specific epigenetic alterations are associated with ma ¬ lignant conditions , e.g. tumor . For example, a mutant allele- frequency can be 0.1% to 10% for cancer and other diseases or even below 0.1%, as disclosed by A.M. Newman et al . , "Inte ¬ grated digital error suppression for improved detection of circulating tumor DNA" , Nature Biotechnology 34 , 547-555, 2016, doi:10.1038/nbt.3520.

In this regard the third reference database is also not par ¬ ticularly limited and can be different from the first and/or second reference database or the same . It can also be inte ¬ grated in the first and/or second reference database . Suita ¬ ble examples are also e.g. mentioned with regard to the first and/or second reference database . Suitable databases include e.g. The Cancer Genome Atlas (TCGA;

https : //cancergenome . nih . gov/ ) , or NIH Roadmap Epigenomics Mapping Consortium

(http://egg2.wustl.edu/roadmap/web____portal/), etc . , as e.g. described above . According to certain embodiments , the at least part of the epigenetic profile is a methylation pattern of the DNA se ¬ quence . Particularly the methylation pattern allows an easy link to the cell type . In a second aspect the present invention relates to a method of determining a malignant condition of a specific cell type in a subj ect, comprising : obtaining or providing a sample from the subject comprising at least one nucleic acid;

sequencing the at least one nucleic acid from the sample to obtain the nucleic acid sequence of said at least one nu- cleic acid, and determining at least a part of the epigenetic profile of said at least one nucleic acid;

comparing said nucleic acid sequence to a first refer ¬ ence database to determine one or more modification (s) there ¬ of that are indicative of a malignant condition;

- comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and

linking the one or more modification (s) of the at least one nucleic acid sequence that are indicative of a malignant condition to the specific cell type .

With regard to the method of the second aspect, the different steps identical to the ones in the first method can be car- ried out in the same way, i.e. the obtaining or providing a samp1e from the subject comprising at least one nucleic acid; the sequencing the at least one nucleic acid from the samp1e to obtain the nucleic acid sequence of said at least one nu ¬ cleic acid, and determining at least a part of the epigenetic profile of said at least one nucleic acid; the comparing said nucleic acid sequence to a first reference database to deter ¬ mine one or more modification (s) thereof that are indicative of a malignant condition; and the comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type . Thus , all embodiments mentioned with regard to the method of the first method also apply to the method of the second aspect . The linking of the one or more modification (s) of the at least one nucleic acid sequence that are indicative of a ma ¬ lignant condition to the specific cell type can be carried out in any suitable way and is not particularly limited . It can be e.g. just a combination of the data obtained when com ¬ paring the nucleic acid sequence to the first reference data base (first data) and the data obtained when comparing the at least part of the epigenetic profile of said at least one nu ¬ cleic acid to the second reference database (second data) , wherein the first data can e.g. indicate a possible malignant condition and the second data can indicate the cell type . With the method of the second aspect the malignant condition is directly linked to a specific cell type . This in return means that - in contrast to the method of the first aspect - a modification of the nucleic acid sequence that possibly can indicate a malignant change in one cell type but not the spe ¬ cific cell type determined when comparing the at least part of the epigenetic profile of said at least one nucleic acid to the second reference database is not determined in the method of the second aspect . This can be done e.g. by filter ¬ ing the corresponding result in the step of linking the one or more modification (s) of the at least one nucleic acid se- quence that are indicative of a malignant condition to the specific cell type, i.e. results indicative of a possible ma ¬ lignant condition, e.g. having a specific modification pattern, that are known to not lead to a malignant condition in the specific cell type determined .

According to certain embodiments , the comparing the nucleic acid sequence to a first reference database of the subject to analyze it for a modification and the comparing the epigenet ¬ ic profile of the at least one nucleic acid to a second data- base to correlate the at least one nucleic acid to a specific cell type are carried out simultaneously, as described corre ¬ spondingly in the method of the first aspect .

According to certain embodiments , the subj ect in the method of the second aspect is a vertebrate, preferably a human or animal subj ect, particularly a human . As in the method of the first aspect, the at least part of the epigenetic profile of said at least one nucleic acid can be further compared to a third reference database to deter ¬ mine a malignant epigenetic change in the method of the sec- ond aspect . The third reference database can be, as well as the first and second reference databases , the same as de ¬ scribed with regard to the method of the first aspect .

In a third aspect a method of linking one of more modifica- tion ( s ) of at least one nucleic acid sequence in a samp1e of a subject to a specific cell type of the subject is dis ¬ closed, comprising :

obtaining or providing at least one nucleic acid se ¬ quence of a nucleic acid and at least a part of the epigenet- ic profile of said at least one nucleic acid;

comparing said nucleic acid sequence to a first refer ¬ ence database to determine one or more modification (s) there ¬ of;

comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type ; and

linking the one or more modification (s) of the at least one nucleic acid sequence to the specific cell type .

In this regard the obtaining or providing at least one nucle ¬ ic acid sequence of a nucleic acid and at least a part of the epigenetic profile of said at least one nucleic acid are not particularly limited . For example , the nucleic acid sequence and the at least part of the epigenetic profile can be ob ¬ tained or provided as data output from a sequencing method, e.g. one as described with regard to the method of the first aspect . The data can be in any form as long as they can be used for the subsequent comparison steps , e.g. are machine- readable . The comparing said nucleic acid sequence to a first reference database to determine one or more modification ( s ) thereof, the comparing the at least part of the epigenetic profile of said at least one nucleic acid to a second reference database to correlate said at least one nucleic acid to a specific cell type, and the linking the one or more modification (s) of the at least one nucleic acid sequence to the specific cell type can be carried out in the same way as described with re ¬ gard to the first aspect of the invention .

According to certain embodiments , the comparing the nucleic acid sequence to a first reference database of the subject to analyze it for a modification and the comparing the epigenet ¬ ic profile of the at least one nucleic acid to a second data- base to correlate the at least one nucleic acid to a specific cell type are carried out simultaneously, as described with regard to the first aspect . Furthermore, the at least part of the epigenetic profile of said at least one nucleic acid can be further compared to a third reference database to deter- mine a malignant epigenetic change, also as described with regard to the first, respectively second, aspect .

In a further aspect the present invention relates to a com ¬ puter program product comprising computer executable instruc- tions which, when executed, perform a method according to the third aspect .

According to certain embodiments the computer program product is one on which program commands or program codes of a com- puter program for executing said method are stored . According to certain embodiments the computer program product is a storage medium. The computer program product of the present invention can be self-learning, e.g. incorporating data obtained in the method of the third aspect in the first, second and/or third reference database . A workflow as described with regard to the aspects of the present invention can be included and/or integrated in pre ¬ vention examinations for cancer and other diseases , particu ¬ larly when becoming more cost-effective. The tissue-specific alteration allows even to detect tissue-specific malignant DNA and methylation changes even when there is no detectable phenotype . This means that a cancer/disease can be detected in an early stage even when e.g. diagnostic imaging does not result in a positive result if the primary tumor is very small .

Fig . 1 shows schematically a workflow for an enhanced liquid biopsy 100 with regard to the methods of the present inven ¬ tion, with methylation as an example for an epigenetic alter- ation .

As shown in Fig . 1, nucleic acids 1 , e.g. circulating DNA and/or RNA molecules , are sequenced using a sequencing 2 method that also allows determination of the methylation sta- tus of the nucleic acids , e.g. using a DNA/RNA base and meth ¬ ylation caller . This way, nucleic acid (e.g. DNA and/or RNA) sequence reads 3 including methylation data are obtained, wherein the methylation M is indicated by a star in the figure . The obtained data are then compared in a comparison step 6 to a second reference database 4, e.g. a tissue-specific methylation database, and a first reference database 5 , e.g. a disease-specific nucleic acid change database, using a suitable comparison means , e.g. a computer program product, e.g. a sequence classifier . In this case the first reference database can also contain disease-specific methylation chang ¬ es , i.e. also function as a third reference database . As a result of the comparison the nucleic acids 1 , respectively the data obtained therefrom, can afterwards be divided with regard to the results obtained in inconspicuous nucleic acids 7 , and nucleic acids 8 containing modifications that can be indicative of a cancer 9, e.g. of the lung 9a, the intestine 9b or the liver 9c - as indicated by the difference in meth- ylation pattern, or a disease 10, e.g. dilated cardiomyopa ¬ thy, indicated by different methylation patterns 10a, 1 Ob and 10c with regard to nucleic acids originating from the heart . Fig . 2 shows a detail of figure 1 wherein the databases are omitted for clarity, only showing the actual steps of se ¬ quencing 2 and comparison 6 and the respective results ob ¬ tained from the nucleic acids 1 , as indicated already in Fig ¬ ure 1 , showing how the nucleic acids 1 are first "translated" in nucleic acid sequence reads 3 and then classified in in ¬ conspicuous nucleic acids 7 and nucleic acids containing mod ¬ ifications 8.

Fig . 3 gives reference to the features of Figure 2 with re- gard to actual results that can be obtained by the present method . As can be seen in the figure, data are obtained in the comparison 6 that can then be statistically analyzed us ¬ ing a suitable method to e.g. determine the fraction F of a specific tissue T in the nucleic acids 1 contained in a sam- pie, indicating that the amount of specific nucleic acids is increased . For example, in the figure, Tl can refer to leuko ¬ cytes , T2 can refer to the lung, T3 to the kidneys and 4 to the prostate, indicating an increased amount of free nucleic acids originating from the lung in the body fluid samp1e as compared to the kidneys and prostate . Taken together with the data for modification of the nucleic acid sequence, this can then e . g . indicate lung cancer if the nucleic acids originat ¬ ing from the lung also contain modifications indicative of cancer .

Fig . 4 gives a detail schematic view of the processes in Fig ¬ ure 3. It shows thereby that not only statistical data A, as indicated in Fig . 3 , are obtained for all nucleic acids in the comparison step 6, e.g. when identifying the tissue of origin, step 6a, but also that for each single nucleic ac ¬ id/fragment/read B the methylation information 6B, i. e . indi ¬ cating the lung and possibly a malignant change thereof, as well as the nucleic acid sequence information 6C, e.g. indi ¬ cating a TP53 mutation, is obtained .

The above embodiments can be combined arbitrarily, if appro- priate . Further possible embodiments and implementations of the invention comprise also combinations of features not e x ¬ plicitly mentioned in the foregoing or in the following with regard to the Examples of the invention . Particularly, a person skilled in the art will also add individual aspects as improvements or additions to the respective basic form of the invention .

Examples The present invention will now be described in detail with reference to several examples thereof . However, these exam ¬ ples are illustrative and do not limit the scope of the in ¬ vention . A blood sample from a human patient is provided for lung can ¬ cer screening . The screening test aims to determine altera ¬ tions associated with the nucleic acids , methylation profile, and to conclude the tissue of origin of the nucleic acids to strengthen cancer screening by multiple evidence levels .

After separation of the nucleic acids , two methods are em ¬ ployed for the scalpel-free biopsy, 1 ) bisulfite sequencing (comparative example) and 2 ) direct nanopore sequencing (ex ¬ ample) . 1 ) For bisulfite-sequencing, an estimate of sequencing depths is more cumbersome when the aim of the experiment is to analyze all three levels , nucleic acid (e.g. DNA) sequence mutation, epigenetic profile change and tissue of origin, since a new variable has to be considered : the reaction effi- ciency of bisulfite conversion . Incomplete deamination might occur at low mutant allele frequency and low cfDNA pools , as described also in e.g. R.P. Darst et al . , "Bisulfite Sequenc- ing of DNA, Curr . Protoc.Mol. Biol,

doi:10.1002/0471142727. mb0709s91. Second, the conversion of non-methylated C masks real cancer-derived C -> T mutations that might be indicative for cancer . Hence, for an accurate determination of mutation within a liquid biopsy the experiment might need to be done twice, with and without bisulfite conversions , thus also doubling the price . Also, the joint analysis of both data sets is not based on the same DNA frag ¬ ments , making it more challenging to analyze both .

2 ) The extracted nucleic acids are sequenced without prior bisulfite conversion with nanopore sequencing with indirect detection using ion current alteration, (e.g. using Oxford Nanopore MinlON . This method allows detecting epigenetic mod- ifications (e.g. methylated C) and a non-modified DNA se ¬ quence simultaneously . Hence, there is little bias introduced due to a conversion, and hence C -> T mutations are better to call . In contrast to the bisulfite sequencing, the method was less tedious and required less material .

Both methods detected an EGFR mutation indicative for lung cancer : EGFR c .2573T>G (L858R) . Also, an increased level of methylation of the DCC promoter was detected that gave an ad ¬ ditional level of confidence of the presence lung cancer . A third level of confidence was gathered when specific methyla ¬ tion profile of the TSLP was identified, which differently methylated human blood and human lung tissue .

With the present invention it is possible to integrate two till now independently measured entities , i.e. nucleic acid, e.g. DNA, sequence and epigenetic patterns . The analysis of these features originate thereby from the same mole ¬ cule/fragment/read, so that a direct link is possible between modifications of the nucleic acid sequence and the cell type . Linking nucleic acid, e.g. cfDNA, molecules and detected so ¬ matic mutations to the tissue of origin thereby facilitates a "liquid tissue biopsy" . Using a method that detects both, the nucleic acid sequence and an epigenetic pattern, leads to a more efficient, cost effective method with less bias and less chance of artefacts .