Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
NEIGHBOR INFLUENCE COMPENSATION
Document Type and Number:
WIPO Patent Application WO/2018/050854
Kind Code:
A1
Abstract:
The invention relates to a method of neighbor influence compensation between a plurality of objects in at least one digital image, wherein the at least one digital image contains image information about a plurality of objects. Each of the plurality of objects is configured to receive at least one molecule comprising genetic information, wherein the at least one molecule is configured to receive a fluorescent compound, and the at least one digital image is taken by an optical imaging system during emission of electromagnetic radiation of the fluorescent compounds received by the at least one molecules.

Inventors:
LOHEL MAIKO (DE)
ZERFASS THORSTEN (DE)
Application Number:
PCT/EP2017/073341
Publication Date:
March 22, 2018
Filing Date:
September 15, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
QIAGEN GMBH (DE)
International Classes:
G06T5/00
Domestic Patent References:
WO1996042066A11996-12-27
WO2009117119A12009-09-24
WO2008097455A12008-08-14
WO2014020137A12014-02-06
Foreign References:
US20060184038A12006-08-17
US20110007981A12011-01-13
US20100323350A12010-12-23
Other References:
P. SARDER ET AL: "Deconvolution methods for 3-D fluorescence microscopy images", IEEE SIGNAL PROCESSING MAGAZINE, vol. 23, no. 3, 1 May 2006 (2006-05-01), pages 32 - 45, XP055004885, ISSN: 1053-5888, DOI: 10.1109/MSP.2006.1628876
ALEX SMALL ET AL: "Fluorophore localization algorithms for super-resolution microscopy", NATURE METHODS, vol. 11, no. 3, 27 February 2014 (2014-02-27), GB, pages 267 - 279, XP055274731, ISSN: 1548-7091, DOI: 10.1038/nmeth.2844
VOELKERDING ET AL., CLINICAL CHEMISTRY, vol. 55, 2009, pages 4 641 - 658
METZKER, NATURE REVIEWS/ GENETICS, vol. 11, January 2010 (2010-01-01), pages 31 - 46
SARA GOODWIN ET AL., NATURE REVIEWS GENETICS, vol. 17, June 2016 (2016-06-01), pages 333 - 351
Attorney, Agent or Firm:
VOSSIUS & PARTNER (DE)
Download PDF:
Claims:
CLAIMS

Method of neighbor influence compensation between a plurality of objects in at least one digital image,

the at least one digital image containing image information about a plurality of objects,

each of the plurality of objects being configured to receive at least one molecule comprising genetic information,

the at least one molecule being configured to receive a fluorescent compound, the at least one digital image being taken by an optical imaging system during emission of electromagnetic radiation of the fluorescent compounds received by the at least one molecules,

wherein the method comprises the following steps, preferably in the following order:

a) providing a model describing the imaging of the objects by the optical system, which model defines equations, the equations relating determined intensities of the electromagnetic radiation having traversed the optical system with original intensities of the electromagnetic radiation;

b) determining intensity values of the plurality of objects on the basis of the at least one digital image; and

c) determining compensated intensity values of the plurality of objects on the basis of the determined intensity values and the model by solving the equations for the original intensities.

Method according to claim 1 ,

wherein the model includes a point spread function of the optical system.

Method according to claims 1 or 2,

wherein the digital image is partitioned into domains, each domain containing one object,

wherein the model describes the determined intensities in terms of original intensities and weights, wherein a separate weight is associated to each pair of objects and domains.

Method according to claim 3, further comprising the following step, preferably before step b):

d) determining the weights;

wherein, preferably, the method is carried out over a plurality of cycles and step d) is carried out prior to the plurality of cycles.

Method according to claim 3 or 4,

wherein the model is only described by weights, where the distance between the object and the domain associated to the weight is below a predetermined cut-off distance.

Method according to any of the preceding claims,

wherein, in step c), a solution to the equations is determined by determining an approximated solution with an iterative algorithm.

Method according to claim 6, comprising the following step, preferably being a sub-step of step c):

f) stopping the iterative algorithm of step c) if a predetermined maximum number of iterations is exceeded.

Method according to claim 7, further comprising the following step, preferably during step c):

g) determining an error of the compensated intensity values;

wherein the iterative algorithm of step c) is stopped if the predetermined maximum number of iterations is exceeded or if the determined error is below a given threshold.

Method according to any of the preceding claim,

wherein the model depends on at least one parameter;

wherein, preferably, the at least one parameter includes a radius of the plurality of objects and/or a blurriness of the at least one objects in the at least one digital image.

10. Method according to claim 9, the method further comprising the following step, preferably between steps a) and b):

e) determining the at least one parameter by processing the at least one digital image.

1 1. Method according to claim 10,

wherein the method is carried out for a plurality of cycles;

wherein step e) is carried out once for the plurality of cycles, preferably prior to the plurality of cycles.

12. Method according to any of the preceding claims,

wherein the method is carried out for a plurality of cycles;

wherein steps b) and c) are carried out in every cycle of the plurality of cycles.

13. Method according to any of the preceding claims,

wherein the at least one digital image comprises a plurality of digital images;

wherein each digital image of the plurality of digital images is taken with a different filter, the different filters preferably being different color filters;

wherein the steps b), c) and d) are carried out separately for each digital image of the plurality of digital images.

14. System for neighbor influence compensation between a plurality of objects in at least one digital image,

the at least one digital image containing image information about a plurality of objects,

each of the plurality of objects having received at least one molecule comprising genetic information,

the at least one molecule being configured to receive a fluorescent compound, the at least one digital image being taken by an optical imaging system during emission of electromagnetic radiation of the fluorescent compounds received by the at least one molecules,

the system comprising:

i) a memory unit containing a model describing the imaging of the objects by the optical system, which model defines equations, the equations relating determined intensities of the electromagnetic radiation having traversed the optical system with original intensities of the electromagnetic radiation;

ii) an intensity determination unit configured for determining intensity values from the digital image for the plurality of objects;

iii) a processing unit configured for detenmning compensated intensity values of the plurality of objects on the basis of the determined intensity values and the model by solving the equations for the original intensities.

15. Computer program product comprising one or more computer readable media having computer executable instructions for performing the steps of the method of one of claims 1 to 13.

Description:
Neighbor influence compensation

The present invention relates to systems and methods for neighbor influence compensation. More particularly, the present invention relates to neighbor influence compensation of electromagnetic radiation emitted by different fluorescent compounds in different color channels, preferably for DNA-sequencing.

Biotechnology, medicine and related technical fields are based on the analysis of molecules. Electronic devices can analyze molecules with high precision and specificity. Especially in the last years, automated electronic devices have been developed for analyzing large numbers of samples by routine methods. For example, modern DNA sequencing apparatus are used for routine analysis of large numbers of DNA probes. Protein samples can be analyzed by high throughput screening and related methods. Frequently, such electronic devices detect fluorescent signals emitted from the sample probes. This is possible when molecules, such as nucleic acids or proteins, have been labeled with fluorescent compounds such as dyes.

Commercially available sequencing apparatus are capable of sequencing large numbers of samples labeled with fluorescent dyes in parallel. Recently developed methods, referred to as "next-generation sequencing", NGS, have revolutionized sequencing. NGS allows the massively parallel sequencing of clonally amplified or single DNA molecules that are spatially separated in a flow cell or by generation of an oil-water emulsion. NGS allows thousands or even millions to billions of sequencing reactions to be performed simultaneously.

In NGS, sequencing is performed by repeated cycles of polymerase-mediated nucleotide extensions or, in one format, by iterative cycles of oligonucleotide ligation. As a massively parallel process, NGS generates hundreds of megabases to gigabases of nucleotide-sequence output in a single instrument run, depending on the platform. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods.

NGS platforms and common applications/fields for NGS technologies are e.g. reviewed in Voelkerding et al, Clinical Chemistry 55:4 641-658, 2009 and Metzker, Nature Reviews/ Genetics Volume 1 1 , January 2010, pages 31-46 and in Sara Goodwin et al.; Nature Reviews Genetics, Volume 17, June 2016, pages 333-351.

In NGS, various oligonucleotides of interest are covalently attached to a support. Subsequently, a nucleotide labeled with a fluorescent dye is attached to the growing oligonucleotide chain with DNA polymerase. When the four nucleotides are labeled with different fluorescent dyes, fluorescent signals emitted from a probe can be detected and the type of nucleotide attached to the oligonucleotide can be identified. After detection, the fluorescent dye is cleaved off and the next synthesis cycle is carried out, in which a new labeled nucleotide is attached to the growing chain. By carrying out multiple cycles, the sequence of a growing oligonucleotide chain can be determined in a stepwise manner. The working steps are carried out in an automated sequencing apparatus.

US 2010/0323350 Al and WO 2009/1 171 19 Al relate to methods and compositions for determining the identity of nucleic acids in nucleotide sequences using, for example, data obtained from sequencing by synthesis methods.

WO 2008/097455 Al relates to an imaging system for exciting and measuring fluorescence on or in samples comprising fluorescent materials, e.g. fluorescent labels, dyes or pigments, in particular to detect fluorescent labels on nucleic acid. Moreover, a device is disclosed being configured such that fluorescent labels in a plurality of different DNA templates are simultaneously detected.

WO 2014/020137 Al relates to a method for enriching target sequences from a sequencing library to provide a target enriched sequencing library, wherein the sequencing library is suitable for massive parallel sequencing and comprises a plurality of double-stranded nucleic acid molecules.

Fluorescent signals emitted from sample probes with labeled molecules are weak, but the signals have to be detected with high precision and specificity. Thus, precise optical equipment, especially cameras and scanning technology, is required for such processes. In addition, extensive evaluation of the digital images captured by an optical imaging system of the sequencing apparatus is necessary for obtaining a precise and reliable result of the sequencing, e.g. in FASTQ.

The invention is, inter alia, based on the recognition that objects observed with an optical system, e.g. a microscope, may be blurred, e.g., due to diffraction of light by the optical system. Thus, light emanating from two neighboring, yet physically separate objects, can considerably overlap in the detector plane, e.g., causing a mixture or combination of the signals of the neighboring objects. This overlap may be referred to as neighbor influence in the context of the invention.

Because of the superposition property, neighbor influence may mutually increase the detected light intensity of spatially close beads. This can deteriorate the quality of the decision whether a specific base is at a given position. Furthermore, it can lead to erroneous classification of originally dim beads as live beads and can therefore cause incorrect base calls. Furthermore, neighbor influence may decreases contrast between dim and live beads, potentially impeding determination of proper thresholds for their discrimination. Moreover, because of the overlapping or mixing of the signals of neighboring objects, the objects may appear as if multiple different DNA strands are attached to one object (polyclonality).

Further still, a bright object may dominate the actual signal due to neighboring influence such that the original sequence may be lost. Instead, an optical duplicate could be read-out, which could be statistically disadvantageous in latter processing steps.

In any case, such neighboring effects may lead to a loss of data.

It is therefore an object of the invention to provide a method and a system, which, among others, overcomes the above mentioned problems of the prior art. In particular, one object is to compensate neighbor influence.

This object is achieved with the features of the independent claims. The dependent claims relate to further aspects of the invention. The method according to the present invention is computer-implemented. However, it is understood by the skilled person that there are also other ways of implementing the method according to the present invention.

The invention relates to a method of neighbor influence compensation between a plurality of objects in at least one digital image, wherein the at least one digital image contains image information about a plurality of objects. Each of the plurality of objects is configured to receive at least one molecule comprising genetic information, wherein the at least one molecule is configured to receive a fluorescent compound. Furthermore, the at least one digital image is taken by an optical imaging system during emission of electromagnetic radiation of the fluorescent compounds received by the at least one molecules. The method comprises the following steps, preferably in the following order:

a) providing a model describing the imaging of the objects by the optical system, which model defines equations, the equations relating determined intensities of the electromagnetic radiation having traversed the optical system with original intensities of the electromagnetic radiation;

b) determining intensity values of the plurality of objects on the basis of the at least one digital image; and

c) determining compensated intensity values of the plurality of objects on the basis of the determined intensity values and the model by solving the equations for the original intensities.

In other words, the method according to the present invention may be adapted for determining compensated intensity values, which may correspond to the original intensity values of the electromagnetic radiation emitted by the fluorescent compounds, by finding solutions to equations defined by the model describing the imaging procedure of the objects in the optical system. Thus, the compensated values may correspond to the solutions for the original values of the equations. For example, the compensated values may be equal to the solutions for the original values. The compensated intensity values may also be denoted as reconstructed intensity values. In other words, the neighbor influence compensation algorithm may approximately restore the original fluorescence intensities underlying the imaging process to attenuate aforementioned consequences of neighbor influence. In this way, effects that are due to the light traversing the optical system causing a blurring of the objects, can be compensated.

The at least one object may be a bead. Such beads are commercially available, e.g., Dynabeads of Thermo Fisher. For example, the image may contain image information about 400000 or more beads.

The object may also be a live-bead having received at least one DNA/RNA strand. The DNA RNA strand may be a single stranded DNA/RNA fragment. However, the at least one object may also be a bead which does not have received a DNA/RNA strand. In other words, the neighbor influence effect may be independent from the presence of DNA/RNA, since each signal contribution may be relevant, for example, because the model used for neighbor influence compensation may only depend on the sum of the separate signal contributions.

The method of neighbor influence compensation described herein may also be carried out in headless sequencing methods. Thus, it may be understood that each of the plurality of objects includes at least one molecule.

Each DNA strand may be configured to receive molecules, preferably nucleotides. The nucleotides may comprise one cleavable fluorescent compound and a cleavable cap. Furthermore, the molecules may be a plurality of four different molecules each having a fluorescent compound specific for each of the four different molecules. Thus, each object may radiate electromagnetic radiation with a certain intensity. Furthermore, each DNA/RNA strand received by an object may have received such a molecule. The intensity of the electromagnetic radiation radiated by the objects may be referred to as original intensity. On the other hand, the intensities that are determined by the optical imaging system may be referred to as determined intensities or raw intensities.

The optical imaging system preferably comprises of an illumination device, e.g. an LED, to activate/stimulate the fluorescent compound(s) to excite electromagnetic radiation and a detection system, e.g. a CCD detector, to detect the emission spectra (electromagnetic radiation) of the fluorescent compound(s). The method may be carried out over a plurality of cycles and may be applied over a defined cycle-range. Each of the plurality of cycles may be characterized in that one of four molecules is attached to one of the DNA strands, wherein the cleavable cap ensures that only one molecule is attached to one DNA strand during one cycle. After each one of the cycles, the cleavable cap may be removed so that another molecule can be attached to the DNA strand in the next cycle. At the same time, the fluorescent compound may be removed from the at least one molecule.

For each cycle, a plurality, preferably four, digital images, i.e. pictures, may be taken, one for each of the four fluorescent compounds attached to the molecules. For example, each one of the plurality of digital images may be taken with a different color filter. Furthermore, each one of the four fluorescent compounds may have one of a main emission spectra in the red, blue, yellow or green spectral regime.

The model described herein may be understood to describe and/or define image formation of the plurality of objects by the optical system. In other words, the model may describe the influence of the optical system on the electromagnetic radiation emitted by the fluorescent compounds. Furthermore, the model may also include information about the objects emitting the electromagnetic radiation. In the context of the invention, electromagnetic radiation may be understood to be light with a wavelength between 380 nm and 780 nm, preferably between 445 nm and 720 nm. It may be understood that imaging the objects refers to imaging the electromagnetic radiation emitted by the fluorescent compounds attached to the objects.

For example, in the model, each object radiating the electromagnetic radiation may be described with a disk function describing a uniform disk with a given radius R. Thus, the disk function may depend on a parameter R describing the radius of the radiating object. Furthermore, the model may also include a function that describes the diffraction of the electromagnetic radiation traversing the optical system. The parameter of the disk function R may, e.g., be predetermined. For example, the parameter R may be provided prior to the execution of the method. Alternatively, the parameter R may also be determined during the method described herein.

Preferably, the model includes a point spread function of the optical system. The point spread function may, for example, be approximated by a two-dimensional Gaussian function depending on a parameter σ describing the blurring of the objects by the optical system. The parameter σ describing the blurriness may, e.g., be predetermined. For example, the parameter σ describing the blurriness may be provided prior to the operation of the method. Alternatively, the parameter σ describing the blurriness may also be determined during the method described herein.

Preferably, the digital image is partitioned into domains, each domain containing one object, wherein the model describes the determined intensities in terms of original intensities and weights, and wherein a separate weight is associated to each pair of objects and domains.

In other words, the model may be described in terms of weights wy, wherein the weight denotes the influence of the j-th bead onto the i-th domain In this context, the i-th domain may refer to an area of the detector, which detector determines the intensities for taking the digital image, where the i-th bead is measured. Thus, in a mathematical sense, Ω; may be understood to be the domain of definition of a function defining the weight.

Preferably, the method further comprises the following step, preferably before step b):

d) determining the weights;

Preferably, the method is carried out over a plurality of cycles and step d) is carried out prior to the plurality of cycles.

Since the weights may only have to be determined once, the computing power necessary for carrying out the method can be reduced.

For example, the weights may be determined by numerical integration. In order to determine the weight Wy, information about the domain Ω ί as well as about the position of the j-th bead may be provided.

Preferably, the model is only described by weights, where the distance between the object and the domain associated to the weight is below a predetermined cut-off distance.

In other words, the equations may be approximated by only considering weights Wy, where the distance between the objects i and j is below a given cut-off distance. Thus, it may be assumed that the raw or determined intensity of a first object is only substantially influenced by other second objects in the neighborhood of the first object. In this way, the complexity of the equations may be reduced such that solving the equations may be simplified and the computing power may be reduced.

Preferably, in step c), a solution to the equations is determined by determining an approximated solution with an iterative algorithm.

For example, in a first iteration, a first approximate solution for the original intensities may be found, wherein the first approximate solution only depends on weights describing the influence of the beads to their own measurements, i.e., wherein the first approximate solution only comprises weights Wjj. In a second iteration, said first approximate solution may be used for determining a second approximate solution where neighbor influence effects are included. In general, the (n-l)-th approximate solution may be used for determining the n-th approximate solution.

In this way, a solution to the family of inverse problems defined by the equations can be found.

Preferably, the method further comprises the following step, preferably being a sub-step of step c):

f) stopping the iterative algorithm of step c) if a predetermined maximum number of iterations is exceeded.

The maximum number of iterations may amount to 20 iterations, preferably to 15 iterations, more preferably to 10 iterations. It has been found that such a maximum number of iterations produces a good result.

Preferably, the method further comprises the following step, preferably during step c):

g) determining an error of the compensated intensity values;

wherein the iterative algorithm of step c) is stopped if the predetermined maximum number of iterations is exceeded or if the determined error is below a given threshold.

In this way, it can be ensured that the compensated intensity values are determined with a predetermined or desired accuracy.

Preferably, the model depends on at least one parameter, wherein, preferably, the at least one parameter includes a radius of the plurality of objects and/or a blurriness of the at least one objects in the at least one digital image.

For example, the radius of the plurality of objects may correspond to the parameter R of the disk function described in the context of the invention and the blurriness may correspond to the parameter σ of the Gaussian function of the point spread function.

Preferably, the method further comprises the following step, preferably between steps a) and b):

e) determining the at least one parameter by processing the at least one digital image.

Preferably, the method is carried out for a plurality of cycles, wherein step e) is carried out once for the plurality of cycles, preferably prior to the plurality of cycles.

Thus, in each cycle, intensities may be determined, wherein a new nucleotide is attached to each object. The parameters may be determined before the cyclic operation of steps b) and c) is carried out.

Preferably, the method is carried out for a plurality of cycles, wherein steps b) and c) are carried out in every cycle of the plurality of cycles.

Thus, in each cycle, intensities may be determined, wherein a new nucleotide is attached to each object. The weights may be determined before the intensities are determined with the optical system. As the weights may not vary in time, it may not be necessary to re-determine the weights during the cyclic operation of steps b) and c).

Preferably, the at least one digital image comprises a plurality of digital images, wherein each digital image of the plurality of digital images is taken with a different filter, the different filters preferably being different color filters. Furthermore, the steps b), c) and d) are carried out separately for each digital image of the plurality of digital images. The first image may be taken such that it contains image information in a first frequency range and the second image may be taken such that it contains image information in a second frequency range. For example, the first image may be taken with a first color filter and the second image may be taken with a second color filter.

Another aspect of the invention relates to a system for neighbor influence compensation between a plurality of objects in at least one digital image, wherein the at least one digital image contains image information about a plurality of objects. Furthermore, each of the plurality of objects has received at least one molecule comprising genetic information, wherein the at least one molecule is configured to receive a fluorescent compound. Still further, the at least one digital image is taken by an optical imaging system during emission of electromagnetic radiation of the fluorescent compounds received by the at least one molecules. The system comprises:

i) a memory unit containing a model describing the imaging of the objects by the optical system, which model defines equations, the equations relating determined intensities of the electromagnetic radiation having traversed the optical system with original intensities of the electromagnetic radiation;

ii) an intensity determination unit configured for determining intensity values from the digital image for the plurality of objects;

iii) a processing unit configured for determining compensated intensity values of the plurality of objects on the basis of the determined intensity values and the model by solving the equations for the original intensities.

In other words, the system may relate to an evaluation system, e.g. for DNA/RNA sequencing.

It is to be understood that the embodiments disclosed herein which relate to a method, also relate to a system comprising means adapted for carrying out the respective method steps. Therefore, the definitions, explanations, effects, and examples given with respect to the method also describe a corresponding system.

Another aspect of the invention relates to a computer program product comprising one or more computer readable media having computer executable instructions for performing the steps of the method described herein.

For example, the model describing the imaging of the objects, the equations defined by the model, and/or the algorithm for solving the equations may be described as following.

Light traversing the optical path from a point in the focal plane may become spread over the detector in a characteristic way. This characteristic spread may be denoted as the impulse response, or more specifically, the point spread function (PSF) of the imaging system. Because it may be assumed that the superposition principle holds for linear optical systems (such as light microscopes), image formation can be described by the convolution of the object space (given by its intensity distribution Q) with the PSF:

I(u, v) = jj Q{x, y) - PSF{u - x, v - y) dx dy (1 )

K 2

In general, the PSF may distribute the light over an area exceeding the physical dimensions of the object. Hence, light emanating from two neighboring, yet physically separate objects can considerably overlap in the detector plane. This overlap is referred to as neighbor influence.

Here, the object space may be composed of objects such as micro beads, i.e., non-overlapping spheres with similar radii. Furthermore, it may be assumed that a uniform disk is a sufficiently accurate model for the projection of the beads onto two-dimensional space constituting the focal plane. Hence, the disk function

D(f) = \ 1/nR2 ' »*ϊ < R (2) I 0, otherwise ' may model the projection of a bead with radius R, which is centered at the origin and normalized, i.e., f R2 D(f) dr - l.

Thus, the object space Q may formally be written as the sum of N disk-shaped bead- projections with intensities q t at positions pf.

Similarly, it may be assume that a 2-dimensional Gaussian function approximates the PSF sufficiently well and define

PSF ) = ^ e ~c ^ 2 (4) with c = ΐ/2σ 2 . Like the disk function, the integral of the Gaussian may also be normalized to 1. Substituting equations (3) and (4) into equation (1), may result in a model of the image:

Because integration is a linear operation, Equation (5) can be re-written to read

Thus, the image may be seen as the sum of N individual bead-images weighted by their corresponding intensities. Furthermore, the integral does only depend on the bead positions but is independent from the bead intensities. For an isolated bead at position p, the sum disappears from equation (6) and assuming unit intensity, equation (6) results in a simpler expression l f (f) = 7 jj D ^ - p) e -ciir-sii 2 d2 . (7)

R 2

To determine how much this isolated bead contributes to the intensity recorded on a certain area Ω of the detector, equation (7) may be integrated over the area Ω. Thus, the area Ω may be understood to be the domain of the integral in equation (7). This integral may be defined in terms of a weight function

/ η (Ω) = jj 1 η (ω) άω

Ω

= ^ jj jj D(r - p n ) e- c ^-^ z dr dZ (8)

Ω R 2 ue- 2 u2 (2c||io|| 2 ) du dcS,

The weight function may be seen as a model of the fraction of the intensity of the n-th bead which is recorded within the sample domain Ω.

To extract an estimate of the intensity of the z ' -th bead from the image, the intensity of a small domain Ω; containing the z ' -th bead may be measured. This measurement ; may be termed raw intensity. The contribution of the z ' -th bead to its own measurement may then be given by ί ί ί( Ω · Similarly, the contribution of the j ' -th bead to the same measurement may be qjfj{fi . Comparing equations (8) a (6), it may be found that the raw intensity of the z ' -th bead is determined by the sum

N

M^ ^ qjfjin - (9)

j

The weight function may only depend on the relative positioning of the beads, which may remain constant because the beads may be tethered to the flow cell. Hence, it may be sufficient to determine the weights

Wi] = (Ω.) (10) once and reuse them in every sequencing cycle.

Equations (9) and (10) describe an inhomogeneous linear system with coefficients ¼¾·. However, with N > 300000 beads, solving the linear system directly is prohibitively expensive. As the weight function may decay rapidly, such that influences from distant beads can possibly be neglected, the raw intensity of the z ' -th bead may be determined by considering only the beads in its neighborhood

Φί = {j ■■ \\Pi - Pj \\ < Λ ≠ j), (1 1) where d lxa is the cutoff distance. Consequently, the raw intensity from equation (9) may approximately be written as

M i = ∑ jwtj. (12) e0iU{i)

Equation (12) can be solved for the intensity of the z ' -th bead by

'∑je0i qjWij

(13)

From equation (13), an iterative scheme can be derived.

(fc - ; „ (14)

A modification of this scheme, which is possibly improved in every iteration, may be found by considering the upper bound for the error, which may be determined by the absolute value of the difference of the last sequence element and its predecessor:

ε £j (k) can potentially be large even if ε^ ' ^Ο. The arithmetic mean = (q k + q k ) may lie exactly in the middle of the interval bounded by q{ A and Thus, ql k may be a better estimate for the real q than the worse of the last two sequence elements, or mathematically

Thus, a faster and more robust approximation scheme for the intensities may read:

( k) _ M i -∑je<p i qf- 1) w i ]

M

</i 2

Some exemplary embodiments are now described with reference to the drawings. For explanation purpose, various specific details are set forth, without departing from the scope of the present invention as claimed.

Brief description of the figures

Fig. 1 illustrates a method according to an exemplary embodiment of the invention.

Fig. 2 illustrates a system according to an exemplary embodiment of the invention.

Fig. 3 illustrates the effect of neighbor influence and the definition of weights according to an exemplary embodiment of the invention.

Fig. 4A illustrates a synthetic image showing the raw intensities and an image showing a reconstruction of the original intensities by neighbor influence compensation according to an exemplary embodiment of the invention.

Fig. 4B illustrates overlays of the synthetic images and the relative errors of the reconstruction before and after neighbor influence compensation according to an exemplary embodiment of the invention.

Fig. 5 illustrates the images taken in different cycles according to an embodiment of the present invention.

Detailed description of exemplary embodiments

Fig. 1 illustrates a method of neighbor influence compensation between a plurality of objects in at least one digital image according to an exemplary embodiment of the invention. According to the method, the at least one digital image contains image information about a plurality of objects, wherein each of the plurality of objects is configured to receive at least one molecule comprising genetic information, and wherein the at least one molecule is configured to receive a fluorescent compound. Furthermore, the at least one digital image is taken by an optical imaging system during emission of electromagnetic radiation of the fluorescent compounds received by the at least one molecules. The method comprises the following steps, preferably in the following order:

S I : providing a model describing the imaging of the objects by the optical system, which model defines equations, the equations relating determined intensities of the electromagnetic radiation having traversed the optical system with original intensities of the electromagnetic radiation;

S2: determining intensity values of the plurality of objects on the basis of the at least one digital image; and

S3: determining compensated intensity values of the plurality of objects on the basis of the determined intensity values and the model by solving the equations for the original intensities.

According to exemplary embodiments, the model includes a point spread function describing the influence of the optical system onto the electromagnetic radiation traversing the optical system and/or a disk function describing the object radiating the electromagnetic radiation. According to an exemplary embodiment, the model and thus the equations depend on at least one parameter, preferably on a blurriness caused by the optical system and/or a radius of each of the plurality of objects. According to an exemplary embodiment, said parameter or parameters are determined by processing the at least one digital image, preferably at the beginning of the method.

According to an exemplary embodiment, the equations defined by the model include weights Wjj describing the influence of an object j on a domain i. Hereby, in theory, the domain may relate to an area of the detector of the optical system, where electromagnetic radiation of the i- th object is detected. According to an exemplary embodiment, the weights are determined numerically, preferably before the intensities are cyclically determined.

According to a further exemplary embodiment, only such weights are considered in the equations, wherein a distance between the object j and the domain i (i.e. the i-th object associated to the i-th domain) is below a cut-off distance.

According to a further exemplary embodiment, the equations are solved iteratively. In other words, first a 0-th order solution is determined which is used for determining the 1 -st order solution. Subsequently, the 1-st order solution is used for determining the 2-nd order solution. In general, the (n-l )-th order solution is used for determining the n-th order solution. According to exemplary embodiments, the iterative algorithm is stopped, if a predetermined maximum number of iterations is reached and/or if the error of the solution of the last iteration is below a given (e.g. predetermined) threshold.

Fig. 2 illustrates a system for neighbor influence compensation between a plurality of objects in at least one digital image according to an exemplary embodiment of the invention. The at least one digital image contains image information about a plurality of objects, wherein each of the plurality of objects has received at least one molecule comprising genetic information, and wherein the at least one molecule is configured to receive a fluorescent compound. Furthermore, the at least one digital image is taken by an optical imaging system during emission of electromagnetic radiation of the fluorescent compounds received by the at least one molecules.

The system comprises a memory unit 201 containing a model describing the imaging of the objects by the optical system, which model defines equations, the equations relating determined intensities of the electromagnetic radiation having traversed the optical system with original intensities of the electromagnetic radiation. In other words, a model describing the influence on light traversing the optical system Furthermore, the system includes an intensity determination unit 202 configured for determining intensity values from the digital image for the plurality of objects. Further still, the system includes a processing unit 203 configured for determining compensated intensity values of the plurality of objects on the basis of the determined intensity values and the model by solving the equations for the original intensities. The processing unit 203 may further be configured for carrying out other method steps described in the context of the invention.

Fig. 3 illustrates the effect of neighbor influence. Portion 301 of Fig. 3 exemplarily shows an image of the plurality of objects. The individual objects, i.e. beads, are strongly blurred. This is, inter alia, due to the blurring effect of the imaging system which causes that radiation of individual objects overlap at the detector. According to an exemplary embodiment, this effect is described by weights Wjj illustrated in portions 302 and 306 of Fig. 3. In portion 302, a i-th object 305 and a j-th object 303 are depicted. The i-th object 305 defines a domain Ω; 304, corresponding to an area of the detector, in which the i-th object 305 is detected. Due to the effects of the optical system, in the i-th domain 304 not only electromagnetic radiation of the i-th object is received and detected. On the contrary, due to the effect of the imaging system, also radiation of the j-th object 303 is detected in the i-th domain 304. This effect is described by the weight Wjj 306. Theoretically, there is a weight Wy for each pair of i and j. However, according to an exemplary embodiment of the invention it was realized that the equations can be simplified by assuming that the j-th object 303 only has an effect on domains within the neighborhood 307, i.e. such that the distance between the j-th and i-th object is below a cutoff distance.

Fig. 4A illustrates a synthetic image 401 showing raw intensities and an image 402 showing a reconstruction of the original intensities by neighbor influence compensation according to an exemplary embodiment of the invention. In the synthetic image 401 , the image of the individual objects is blurred and the intensities of the individual objects may overlap, in part due to neighbor influence. The raw intensities may be determined on the basis of an image corresponding to the synthetic image 401. For illustration purposes, raw intensities have been determined on the basis of the synthetic image 401. The so determined raw intensities may, however, be defective, e.g., due to neighbor influence as described herein.

The reconstructed image 402 is obtained by processing the raw intensities determined from the synthetic image 401 with a neighbor influence compensation method according to an exemplary embodiment. In other words, the reconstructed image 402 contains compensated intensities which closely correspond to the original intensities of the electromagnetic radiation emitted by the objects.

Fig. 4B illustrates overlays 403, 404 of the synthetic images and the relative error of the reconstruction before and after neighbor influence compensation according to an exemplary embodiment of the invention. Overlay 403 hereby shows the relatives errors of the reconstructed objects before/without neighbor influence compensation, which relative errors are indicated by the brightness of the reconstructed objects. Similarly, overlay 404 shows the relative errors of the reconstructed objects after neighbor influence compensation. As can be clearly seen, the relative errors of the reconstructed objects are significantly smaller after applying neighbor influence compensation. Thus, neighbor compensation may increase the accuracy of the determined positions and/or intensities of objects.

Fig. 5 illustrates the images 1 1 -14, 21 -24, 31 -34, 41 -44, 51 -54 taken in a plurality of cycles 10-50 of the method. This illustration should not be construed as limiting as of the amount of images taken in a corresponding cycle or the amounts of cycles. As can be seen in Fig. 6 in each of the cycles 10-50 four images 1 1 -14, 21 -24, 31-34, 41 -44, 51-54 are taken ,i.e. acquired, captured etc., in this example. In particular, each of the four images 11-14, 21-24, 31-34, 41-44, 51 -54 in one cycle 10-50 corresponds to one channel of the optical imaging system, i.e. red, green, yellow and blue. For example, every first image may be taken with a first color filter, every second image with a second color filter, every third image with a third color filter, and every fourth image with a fourth color filter. The different colors are emitted by fluorescent compounds carried by different molecules which are received by DNA strands attached to the objects (beads). More particular, each of the different fluorescent compounds represents one of a specific DNA base, i.e. thymine (T), adenine (A), cytosine (C), and guanine (G). For example, the fluorescent compounds are associated to the DNA bases as follows: T=green; A=yellow; C=blue; and G=red.

In each cycle 10-50 the first images 11 , 21 , 31 , 41 , 51 corresponds to one of the four channels T, A, C, G, e.g. G. The second images 12, 22, 32, 42, 52 then correspond to a second one of the remaining three channels T, A, C, e.g. C. The third images 13, 23, 33, 43, 53 then correspond to a third one of the remaining two channels T, A, e.g. A. The fourth images 14, 24, 34, 44, 54 then correspond to a fourth one of the remaining channel, e.g. T.

The forgoing method steps and the system of the exemplary embodiments have been described as relating to DNA/RNA sequencing. However, as it will be apparent to the person skilled in the art the present invention is not restricted to this technical field. It is clear that the solution of the present invention can be applied to numerous other technical fields, where fluorescent images comprising different types of objects are analyzed. That is, the objects do not have to be beads, but can also be any kind of fluorescent emitting objects. As the present invention may be embodied in several forms without departing from the scope or essential characteristics thereof, it should be understood that the above-described embodiments are not limited by any of the details of the foregoing descriptions, unless otherwise specified, but rather should be construed broadly within the scope as defined in the appended claims, and therefore all changes and modifications that fall within the present invention are therefore intended to be embraced by the appended claims.

Furthermore, in the claims the word "comprising" does not exclude other elements or steps, and the indefinite article "a" or "an" does not exclude a plurality. A single unit may fulfill the functions of several features recited in the claims. The terms "essentially", "about", "approximately" and the like in connection with an attribute or a value particularly also define exactly the attribute or exactly the value, respectively.