CHARGE DOMAIN MATHEMATICAL ENGINE AND METHOD

Title:

CHARGE DOMAIN MATHEMATICAL ENGINE AND METHOD

Document Type and Number:

WIPO Patent Application WO/2019/169396

Kind Code:

Abstract:

A multiplier has a pair of charge reservoirs. The pair of charge reservoirs are connected in series, A first charge movement device induces charge movement to or from the pair of charge reservoirs at a same rate. A second charge movement device induces charge movement to or from one of the pair of reservoirs, the rate of charge movement programmed to one of add or remove charges at a rate proportional to the first charge movement device. The first charge movement device loads a first charge into a first of the pair of charge reservoirs daring a first cycle, The first charge movement device and the second charge movement device remove charges at a proportional rate from the pair of charge reservoirs during a second cycle until the first of the pair of charge reservoirs is depleted of the first charge. The second charge reservoir thereafter holding the multiplied result.

Inventors:

SCHIE DAVID (US)
GAITUKEVICH SERGEY (US)
DRABOS PETER (US)
SIBRAI ANDREAS (US)
SIBRAI ERIK (US)

Application Number:

PCT/US2019/020577

Publication Date:

September 06, 2019

Filing Date:

March 04, 2019

Export Citation:

Click for automatic bibliography generation Help

Assignee:

SCHIE DAVID (US)
GAITUKEVICH SERGEY (US)
DRABOS PETER (US)
SIBRAI ANDREAS (US)
SIBRAI ERIK (US)

International Classes:

G06N3/063; G06N3/08; H03M1/10; H03M1/40

Foreign References:

US20140344200A1	2014-11-20
US20160182077A1	2016-06-23
US20140361356A1	2014-12-11
US20070127469A1	2007-06-07

Other References:

CALHOUN ET AL.: "Digital circuit design challenges and opportunities in the era of nanoscale CMOS", PROCEEDINGS OF THE IEEE., vol. 96, no. 2, February 2008 (2008-02-01), pages 343 - 365, XP011198910, ISSN: 0018-9219, DOI: 10.1109/JPROC.2007.911072

Attorney, Agent or Firm:

MOY, Jeffrey, D. (US)

Download PDF:

View/Download PDF PDF Help

Claims:

CLAIMS

What is claimed is:

1. A multiplier comprising:

a pair of charge reservoirs, wherein the pair of charge reservoirs are connected in series; a first charge movement device inducing charge movement to or from the pair of charge reservoirs at a same rate;

a second charge movement device inducing charge movement to or from one of the pair of reservoirs, the rate of charge movement programmed to one of add or remove charges at a rate proportional to the first charge movement device;

wherein a first charge is loaded into a first of the pair of charge reservoirs during a first cycle, the first charge movement device and the second charge movement device removing charges at a proportional rate from the pair of charge reservoirs during a second cycle until the first of the pair of charge reservoirs is depleted of the first charge leaving a charge representing the first charge multiplied by a ratio of charge movement rates on the second charge reservoir.

2. Hie multiplier of Claim 1 , further comprising a device to stop charge movement.

3. The multiplier of Claim 1, wherein the first charge movement device and the second charge movement device are based upon control of electric fields.

4. The multiplier of Claim 1, wherein the first charge movement device is a first current source coupled in series with the pair of charge reservoirs and the second charge movement device is a second current source coupled at a node where the pair of charge reservoirs meet.

5. The multiplier of Claim 2, wherein the device to stop charge movement is a transfer gate, the transfer gate lowers a barrier and then raises the barrier in conformance with an event while maintaining a field to control charge movement at other times.

6. The multiplier of Claim 1, wherein said pair of charge reservoirs are capacitors.

7. The multiplier of Claim 1 , wherein said pair of charge reservoirs are floating diffusions.

8. The multiplier of Claim 1 , wherein the first charge is introduced to the first of the pair of charge reservoirs by one of a switched capacitor circuit, an active pixel circuit; or a V/I to charge circuit.

9. A neural network comprising:

an analog multiplier comprising:

a pair of charge reservoirs,, wherein the pair of charge reservoirs are connected in series; a first charge movement device inducing charge movement to or from the pair of charge reservoirs at a same rate;

wherein a first charge into a first of the pair of charge reservoirs during a first cycle, the first charge movement device and the second charge movement device removing charges at a proportional rate from the pair of charge reservoirs during a second cycle until the first of the pair of charge reservoirs is depleted of the first charge; and the first of the pair of reservoirs in conformance

10. The neural network in accordance with Claim 9, wherein the analog multiplier comprises a device to stop charge movement.

11. The neural network in accordance with Claim 9, comprising an input photodiodc formed on a same integrated circuit as the neural network,

12. The neural network of Claim 9, wherein the input gathering device is a charge domain circuit and the input information is optical information.

13. The neural network of Claim 9, comprising at least one Charge Coupled Device (CCD) shift register coupled between the input gathering device and the first of the pair of charge reservoirs.

14. The neural network of Claim 9, comprising at least one Charge Coupled Device (CCD) shift register coupled to the input gathering device, the CCD shift register used as the first of the pair of charge reservoirs.

15. The neural network of Claim 13, wherein the CCD shift register is a two dimensional shift register.

16. Hie neural network of Claim 13, wherein the CCD shift register is a two dimensional coupled array capable of accepting information at cells around its periphery.

17. The neural network of Claim 13 , comprising a time weighted crossbar used to broadcast an input operand to the first of the pair of charge reservoirs, the at least one CCD shift register being used to couple a charge according to a systolic response.

18. The neural network of Claim 17, where a second operand is stored as charge in a second CCD shift register, wherein a plurality of CCD shift registers pass respective operands per a systolic algorithm.

19. An analog multiplier comprising:

an active pixel comprising at least one of a pinned photodiode or other photodetector, wherein input information to the active pixel is stored on a first input charge reservoir;

a second charge reservoir coupled to the first reservoir,

a transfer gate positioned between the first charge reservoir and the second charge reservoir, wherein a first rate of charge of movement may be controlled by a field coupled to the transfer gate; and

a second charge movement device coupled to the second charge reservoir, wherein a second rate of charge movement may be programmed in proportion to that of said first rate of charge movement;

wherein a first charge is loaded into the first charge reservoir only during a first cycle and the transfer gate and the second charge movement device move charge proportionally during a second cycle until the first charge reservoir is depleted to produce a charge multiplication on the second charge reservoir at the end of the second cycle.

20. The analog multiplier of Claim 19, comprising junction depleted transfer gates to minimize overlap capacitance.

21. A multiplier comprising:

a pair of charge reservoirs, wherein each of the pair of charge reservoirs are coupled to a gated charge movement device;

wherein the gated charge movement device is programmed so a rate of charge movement is proportional, the gated charge movement device stopping charge movement once one of the pair of charge reservoirs is depleted.

22. The multiplier of Claim 21, comprising a device loading multiple weighted inputs into the first of the pair of charge reservoirs.

23. The multiplier of Claim 21, comprising multiple charge movement rate controlled inputs coupled to the first of the pair of charge reservoirs each individually gated in time by a programming device.

24. The multiplier of Claim 22, wherein the device is a time weighted crossbar comprising: a plurality of cross bar conductor lines; a plurality of gated current sources coupling to the plurality of conductor lines, whereby the plurality of gated current sources are gated according to one of an input voltage, current and a timeframe or just a time; and

a controller circuit coupled to each of the plurality of gated current sources and enabling each of the plurality of gated current sources in conformance with a desired neural network configuration summing multiple weighted inputs.

25. The multiplier of Claim 24, wherein the time weighted crossbar comprises one of an analog memory, a memristor memory, a floating gate memory, a flash memory or a DNA memory to set a gating time.

26. A weighted summer comprising:

a single charge reservoir,

a plurality of input charge movement devices each one of coupling or removing charges to or from the single charge reservoir in a first cycle;

an output charge movement device one of coupling or removing the charges to or from the single charge reservoir so as to return the single charge reservoir to a starting charge level during a second cycle after which charge movement ends.

27- The weighted summer of Claim 26 wherein:

the plurality of input charge movement devices couple or remove the charges at rates proportional to the rate that the output charge movement device will couple or remove the charges during the second cycle, but where the plurality of input charge movement devices are each individually gated in time in conformance with input information and; wherein during the second cycle a time during which the output charge movement device is coupling or removing the charges to return the single charge reservoir to the starting charge level represents an output of the weighted summer.

28. The weighted summer of Claim 26, wherein the summer comprises a spiking circuit having a comparator, the comparator initiating an interrupt to a controller circuit when the single charge reservoir reaches a predetermined level.

Description:

CHARGE DOMAIN MATHEMATICAL ENGINE AND

METHOD

RELATED APFLCIATIONS

[0001] This patent application is related to U.S. Provisional Application No. 62/637,496 filed March 2, 2018, entitled "CHARGE DOMAIN MATHEMATICAL ENGINE" in the name of David Schie, and which is incorporated herein by reference in its entirety. The present patent application claims the benefit under 35 U.S.C § 119(e).

TECHNICAL FIELD

[0002] The present invention generally relates to image sensing device and, more particularly to, a charge domain mathematical engine wherein a charge store in a reservoir may be directly coupled to a multiplier of a machine learning input layer.

BACKGROUND

[0003] In silicon imaging it is common to rely on the integration or movement of charge using charge domain structures such as spill and fill circuits, CCD shift registers, photodetectors, correlated double sampling circuits, and similar devices. SpiU and fill circuits may rely upon the concept of a buried pinned photodiode. FIG. 1 shows a cross-section view of a buried pin diode structure 10 showing active doping profiles. The buried pinned photodiode 10 may integrate electrons created when light is collected by the buried pinned photodiode 10 into a storage well SW region. A second charge reservoir, the floating diffusion FD, is created on the far side of a transfer gate labelled TG. [0004] Referring to FIGs. 2A, a spill and fill circuit 20 may be seen. The spill and fill circuit 20 uses the concept of a pinned photodiode (PPD) charge receptacle holding electrons in front of a transfer gate TG. The transfer gate TG is lowered and raised in conformance with required electron flow. At some point the transfer gate TG lowers the potential harrier and the electrons spill from the storage well SW charge reservoir into the floating diffusion FD charge reservoir. The devices are created so as to ensure that all electrons move from the storage well SW charge reservoir into the floating diffusion FD charge reservoir. FIG.2B shows the energy diagram from the storage well SW charge reservoir into the floating diffusion FD charge reservoir.

[0005] Referring to FIG. 3, a spill and fill circuit 30 may be seen. The fill and spill circuit 30 uses the uses the spill and fill circuit 20 of FIG. 2 A but includes the concept of a reset device 32 coupled to the transfer gate TG and a source follower SF which converts the charge on the floating diffusion FD charge reservoir to a voltage which may then be read by other circuitry. Typically, correlated double sampling (CDS) may be used to sample the noise and offset on the output of the floating diffusion FD charge reservoir after reset, and then read again after the spill (charge transfer) such that only the difference attributed to the latest integrated charge stored on the storage well SW remains. By doing this any offset charge and certain noise is removed.

[0006] Instead of two charge reservoirs a single charge reservoir might be used to produce a weighted input and sum result or a weighted summer. Initially said reservoir would be reset to a known charge level Thereafter during a first cycle a plurality of input current movement means would couple charge from the charge reservoir with each of said current movement means removing charge at a rate individually proportional (in conformance with desired weight values) to an output current movement means to be used in a second cycle but off during said first cycle. Additionally, each of said plurality of input current movement means would be further gated in time, or allowed to move charge only during a time, conforming to individual input magnitudes. The resulting input charge movement magnitude for a gated period of time would remove a charge conforming to the weighted input magnitude from said charge reservoir. At the end of first cycle, once all input movement means have removed their charges, a second cycle would cause the output charge movement means to return the charge in the charge reservoir to its original level. The time it would take to do so would be proportional to the weighted sum of the inputs. The resulting weighted summer thereby accepting inputs as time, weights are charge movement rate magnitude, and producing an output as a time.

[0007] Once the charge is transferred to the floating diffusion FD charge reservoir mere are a number of other circuits which may be used in place of the source follower SF to convert the charge into a voltage or current and thereafter into a digital value. For example, a row of an imager may rely on a counter which is compared to each pixel follower value and the digital words associated with each specific pixel recorded. There are many circuits which attempt to optimize the speed and power efficiency of the conversion from the charge domain to a digital word.

[0008] Machine vision is a common application of artificial intelligence (AI) or machine learning. Autonomous or machine vision augmented vehicles, handset security such as fingerprints or facial recognition, smart city sensors, security cameras, x-ray, ultrasound and medical diagnosis, robotics, drones, wearable heart rate monitors, behavioral analysis and monitoring arid many other applications are relying upon the analysis of images for a variety of tasks, many of them time and power critical.

[0009] Presently, machine learning systems require that the input be a digital word or at the very least a voltage, current or spiking waveform (which is also a voltage or current waveform). The conversion of charge relies upon a coupling circuit such as the source follower SF shown in FIG. 3 all of which are known to be associated with three undesirable side effects. The first is a loss of image quality due to introduction of noise by the coupling circuit It is well known by those skilled in the art that any circuit which converts charges maintained in reservoirs such as the storage well SW or floating diffusion FD regions described in FIGs 1 -3 to a voltage or current will necessarily introduce noise. This noise for example could degrade an image with 14 bit pixel accuracy to 12 bit equivalent accuracy. In medical or time critical life safety applications this could obscure key information. A second side effect of conversion to the current or voltage domain is the time associated with the conversion. The digitization of the voltage or current on the output of the coupling circuit takes time. In an augmented vehicle, life safety application or high-speed imaging application this latency could literally mean the difference between a desirable result or a very undesirable result such as a death or fatality. Finally, the conversion of the information from the charge domain to the voltage or current domain and finally digital domain further requires additional energy. This energy is associated with the coupling circuit itself, as well as the circuitry related to digitization or creating the desired waveforms such as a neuromorphic spiking waveform.

[0010] Therefore, it would be desirable to provide a system and method mat overcome the above problems. The system and method would couple the charge stored in reservoirs directly to multipliers of a machine learning input layer or weighted summer.

SUMMARY

[0011] In accordance with one embodiment, a multiplier is disclosed. The multiplier has a pair of charge reservoirs. The pair of charge reservoirs are connected in series. A first charge movement device induces charge movement to or from the pair of charge reservoirs at a same rate. A second charge movement device induces charge movement to or from one of the pair of reservoirs, the rate of charge movement programmed to one of add or remove charges at a rate proportional to the first charge movement device. The first charge movement device, or other mechanism, loads a first charge into a first of the pair of charge reservoirs during a first cycle. The first charge movement device and the second charge movement device remove charges at a proportional rate from the pair of charge reservoirs daring a second cycle until the first of the pair of charge reservoirs is depleted of the first charge.

[0012] In accordance with one embodiment, a method of forming a neural network is disclosed. The neural network has an analog multiplier. The analog multiplier has a pair of charge reservoirs, wherein the pair of charge reservoirs are connected in series. A first charge movement device induces charge movement to or from the pair of charge reservoirs at a same rate. A second charge movement device induces charge movement to or from one of the pair of reservoirs, the rate of charge movement programmed to one of add or remove charges at a rate proportional to the first charge movement device. The first charge movement device, or other mechanism, loads an input charge into a first of the pair of charge reservoirs during a first cycle. The first charge movement device and the second charge movement device remove charges at a proportional rate from the pair of charge reservoirs during a second cycle until the first of the pair of charge reservoirs is depleted of the input charge. An input gathering device is used as the mechanism to store charge in the first of the pair of reservoirs in conformance with input information.

[0013] In accordance with one embodiment, an analog multiplier is disclosed. The analog multiplier has an active pixel comprising a pinned photodiode and a photodetector, wherein input information to the active pixel is stored on a first input charge reservoir. A second charge reservoir is coupled to the first reservoir by a transfer gate positioned between the first charge reservoir and the second charge reservoir, wherein a first rate of charge of movement may be controlled by the transfer gate. A second charge movement device is coupled to the second charge reservoir, wherein a second rate of charge movement may be programmed in proportion to that of die first rate of charge movement. An input charge is loaded into the first charge reservoir only during a first cycle and the transfer gate and the second charge movement device charge proportionally during a second cycle until the first charge reservoir is depleted to produce a charge multiplication on the second charge reservoir at the end of the second cycle.

[0014] In accordance with one embodiment, a muitiplier is disclosed. The multiplier has a pair of charge reservoirs, wherein each of the pair of charge reservoirs are coupled to a gated charge movement device. The gated charge movement device is programmed so a rate of charge movement is proportional, the gated charge movement device stopping charge movement once one of the pair of charge reservoirs is depleted.

[0615] In accordance with another embodiment, a weighted summer is disclosed. The weighted summer consists of a single charge reservoir. A plurality of input current movement devices is coupled to the single charge reservoir where for each of the input current movement devices a rate of current movement conform to weight multiplicands and are proportional to an output charge movement device rate of charge movement A conduction time of each output charge movement device during a first cycle conforms to an input value. The charge added or removed during the first cycle into the single charge reservoir represents a weighted sum of the input values. During a second cycle the output charge movement devices will one of add or remove charge to return the single charge reservoir to its original level, a time it takes to return to the original value representing a sum of input value times weighted by rates of movement of the proportional input charge movement devices.

[0016] hi accordance with one embodiment, a weighted summer is disclosed. The weighted summer one of adds or removes charge from a plurality of weight charge movement devices at a rate proportional to an output charge movement device rate, for a time conforming to input values during a first cycle. An output charge movement device one of adds or removes charge to change a charge level to an original charge level during a second cycle. An output time being a weighted sum representation of input times weighted by the proportional to output charge movement device rates of the weight charge movement devices.

BRIEF DESCRIPTION OF THE DRAWINGS

[0017] The present application is further detailed with respect to the following drawings. These figures are not intended to limit the scope of the present application but rather illustrate certain attributes thereof. The same reference numbers will be used throughout the drawings to refer to the same or like parts.

[0018] FIGs. 1 shows a cross-sectional view of a buried pin diode structure showing active doping profiles;

[0019] FIG.2A shows a spill and fill circuit;

[0020] FIG. 2B shows an energy diagram from a storage well SW charge reservoir into a floating diffusion FD charge reservoir for the spill and fill circuit of FIG. 2 A;

[0021] FIG.3 shows the spill and fill circuit of Figure 2A further coupled to a reset mechanism (RST) and to a source follower (SF) for reading out the voltage of the floating diffusion (FD) through a select line (SEL) to a bus (COL BUS).

[0022] FIG. 4 is a block diagram showing an exemplary embodiment of a neural network architecture forming a basis of the charge domain mathematical engine in accordance with one aspect of the present application;

[0023] FIG. 5 is block diagram showing an exemplary embodiment of a pinned photodiode (PPD) with a two-dimensional shift register in accordance with one aspect of the present application;

[0024] FIG, 6A shows a top level view of a CCD shift register in accordance with one aspect of the present application; [0025] FIG.6B shows a cross sectional view of said CCD shift register along the poly finger in accordance with one aspect of the present application;

[0026] FIG. 6C shows a cross section view of said CCD shift register cutting across the poly fingers in accordance with one aspect of the present application;

[0027] FIG. 6D shows different configurations of CCD shift registers where data may be moving vertically and men horizontally to alter the flow of information in different directions;

[0029] FIG. 7 shows the concept of systolic rearrangement of multiplication and sum coefficients to illustrate the efficiency that may be gained using the CCD shift register or other means to provide weights and inputs values to the weighted summers of a neural network in optimized ways (re-arrange the provision of the coefficients to the weighted summers) in accordance with one aspect of the present invention;

[0029] FIGs.8A-8B illustrates the concept of coupled charge reservoirs coupled together by first charge movement means and a proportional charge movement means proportional to the first coupled only to one of the charge movement means, FIG. 8A shows this configuration using capacitors as charge reservoirs and current sources as charge movements devices and FIG.8B shows a similar configuration using a pinned photodiode storage well, floating diffusions and transfer gates;

[0030] FIG. 9 shows a crossbar that might be used to couple pulses or current source weights through a switch fabric.

[0031] FIG. 10 shows a time weighted crossbar that can sum a weighted charge to an output node and also shift me summing of such a charge over several frames in time;

[0032] FIG. 11 shows a depleted junction transfer gate that could be used to reduce charge injection for switching devices or reduce ringing in current sources; and

[0033] FIG. 12 is a block diagram showing an exemplary embodiment of a weighted summer with a pulse output in accordance with one aspect of the present application. DESCRIPTION OF THE APPLICATION

[0034] The description set forth below in connection with the appended drawings is intended as a description of presently preferred embodiments of the disclosure and is not intended to represent the only forms in which Que present disclosure may be constructed and/or utilized. The description sets forth the functions and the sequence of steps for constructing and operating the disclosure in connection with the illustrated embodiments. It is to be understood, however, mat the same or equivalent functions and sequences may be accomplished by different embodiments that are also intended to be encompassed within the spirit and scope of this disclosure.

[0035] It is desirable to couple the charge stored in reservoirs directly to the multipliers of the machine learning input layer. Referring to FIG. 4, a block diagram showing a neural network architecture 40 forming the basis of the present application may be seen. In the neural network architecture 40, the circles 42 are neurons or in the case of the input layer the input voltage, charge, current, waveform, or digital word. The lines 44 are multipliers which multiply the input information by a weight (w). The result is fed to a decision circuit and that output in turn is fed to the next layer. As each neuron, containing a summer of weighted inputs potentially a bias and potentially a decision circuit, maybe connected to many neurons in the following layers, therefore the number of weights can be very large.

[0036] Based on the above, if one could replace the input layer with charge reservoirs such as SW or FD, and utilize this charge within the multipliers connecting this input layer to the first inner layer directly, then one could eliminate the latency, power and information loss due to noise associated with the coupling and digitization circuits. [0037] Referring to FIG. 5, once a charge is stored in a reservoir such as SW or FD _> it is possible to store that charge in a charge coupled shift register 50 as shown in FIG. 5. The shift register 50 may be formed of a plurality of cells 52. The shift register 50 may move the charge without loss of fidelity of the charge information. It is also possible to move the charge along multiple axis and to combine charges held within specific reservoirs. Multiple shift registers 50 can also be used to move multiplicand information in different directions and at different speeds.

[0038] In FIG. 5, a pinned photodiode (PPD) maybe represented by the large rectangle at the top left. It delivers charge through the TG to a charge reservoir It is possible to keep loading the charges vertically and then move them horizontally based upon the construction of the shift register 50. In this way it is possible to temporarily store charge information, in a small area, with high fidelity. It is also possible to induce a charge in one reservoir to flow into a reservoir with an existing charge to perform summation.

[0039] FIG. 6A-6D depict multiple cross-sectional views of a charge coupled device (CCD) shift register 60. The CCD shift register 60 allows for X and Y movement of stored content

[0040] It is common to utilize mathematical constructions to improve the efficiency of multiplies in a machine learning system. For example, in matrix multiplication it is common to move multiplicands through different arrangements such that they may be efficiently re-used without having to re-load the information. Systolic structures are an example which may be used to reduce the number of times multiplicands have to be loaded and which make use of prior calculations. It would be desirable to utilize the charge coupled shift register to organize the charge multiplicands in conformance with these types of mathematical efficiency improvements and in some cases to further utilize the charge coupled shift register to combine the charges for summation. [0041] Referring to FIG, 7A-7B, die concept of a systolic array may be disclosed. Systolic arrays reduce memory loading and organize data into recursive or efficient constructs to increase the efficiency of matrix multiplication. It is useful to implement systolic techniques with CCD shift registers as the operands can easily be moved through the shift register at different speeds and in different directions as required by the different systolic implementations,

[0042] FIGs. 8A-8B shows a charge based analog multiplier in multiple implementations. In FIG. 8A, an implementation 80 where capacitors are used for charge storage is shown where switches SI and S2 are used to load the input charge reservoir CI (SI off, II is off, 12 is off, S2 is turned on, current source 82 is turned on for a time, switch S2 turned off at the end of this cycle) during a first cycle. In FIG, 8B, a pinned photodiode PPD is shown charging a reservoir SW the same way as the current source 82 and switch S2 charge the CI by exposing it to light for a time. In both cases the input charge reservoir (CI in one case and SW in the other) is filled with charge during a first cycle.

[0043] In the second cycle SI in FIG. 8A is closed (S2 being open). The current sources II and 12, which are proportional in magnitude, are turned on. CI is charged by a current magnitude of 11+12 and C2 is charged by a current magnitude of 12. Node is monitored until the charge on CI is completely dissipated (voltage across it reaches zero). This takes Qcl/(I1+I2) time where Qcl was the charge introduced on CI during the first cycle. C2 is charged by 12 only for this Qcl/(H+I2) time which means it will see a charge of I2*Qcl/(Il+I2). This means it will receive the charge CI multiplied by the ratio of 12/(11+12). By controlling 12/(11+12) one has set a multiplier gain.

[0044] In an analogous way, in FIG. 8B, the charge on the pinned photodiode PPD may be moved by a field into the floating diffusion FD at a rate controlled by the TG path. A second charge path through the TGI also fills the FD at a rate proportional to the rate of charge movement from SW. The time it takes to deplete SW is Qcl/il-t. A cairent il+i2 is flowing into FD for a time t, therefore Qcl*(il+i2)/il is the charge on FD. Thus, one has effectively multiplied the charge by (il +i2)/il . Instead of a pinned photodiode a CCD array or other floating diffusion could be the source of il. FD may them be read into voltage or current by a coupling circuit or may be used for further calculation.

[0045] By coupling the output of the PPD multiplier to a CCD shift register, a two dimensional CCD shift register or a second CCD shift register may be used to sum charges as if they were entering a neuron. If a systolic architecture is used then the charge reservoirs may be coupled to appropriate operands as they move through the CCD array and the results may be summed into the input reservoir of a multiplier or the result could be re-injected into another shift register cell for re-use. For broadcast topologies the CCD shift register may be used to create multiple copies of an input to load an Xi operand into multiple multipliers.

[0046] It would therefore be useful to store multiplicands in one or more CCD arrays and then move the information through the arrays in conformance with systolic arrangements so as to minimize memory loading and multiplier efficiency.

[0047] It would be useful to allow the loading of multiple inputs into said first reservoir at the same time. This may be accomplished by summing a known charge from multiple weighted inputs, such as the outputs of multiple neurons from a previous layer, into said first multiplier reservoir. FIG.9 shows the concept of a crossbar 90. The crossbar 90 is an assembly of individual switches between a set of inputs and a set of outputs. The switches may be arratiged in a matrix. If the time weighted crossbar 90 has M inputs and N outputs, then the crossbar has a matrix with M * N cross-points or places where the connections cross. At each cross-point is a switch; when closed, it connects one of the inputs to one of the outputs. A given crossbar is a single layer, non-blocking switch. Non-blocking switch means that other concurrent connections do not prevent connecting other inputs to other outputs. As may be seen in FIG. 9, metal lines 92 come together and switches between them may be turned on to connect lines together on different layers or the same layers of metal or other conductor. Several crossbars can produce a significant fanout. A gated current source 94 may be coupled through such an arrangement.

[0048] It would be Useful to allow the loading of multiple inputs into said first reservoir at the same time. This may be accomplished by summing a known charge from multiple weighted inputs, such as the weighted outputs of multiple neurons from a previous layer, into said first multiplier reservoir. FIG. 10 shows one possible configuration of a time weighted crossbar 100, however, for now assume the second row of switches is closed. In this case connected paths are further coupled to current sources 102 of the same magnitude. Paths do not start charging until a stored voltage, such as an NVM voltage, matches a ramp voltage as determined by comparators 104. This produces a charge proportional to the stored voltage since the currents are proportional and ramp are generated from the same source. These currents may be injected into a floating diffusion or optically coupled into a diffusion or used directly if capacitors or other storage elements are used for the multiplier. This time delay crossbar 100 may be addressed using relative or direct addressing and allows flexible neural network configurations even though the neurons have a specific physical location. The result is that the weights are set by current magnitudes and input values by time. For subsequent layers, or layers with charge inputs such as PPD inputs, the output of each weighted summer can be time and therefore the comparators 104 are not necessary - the output of previous neurons can be applied directly to gate the switches.

[0049] Current magnitudes may come from a dynamic or from an NVM memory. This may be an analog memory such as a ferroelectric memristor. It could be an analog floating gate or flash memory. Or it could be a DNA memory. DNA memory has recently shown great promise in producing analog or digital memory in a very small area like 3 ran with a very long lifetime. Ferroelectric mcmri stars such as those developed by Panasonic have been shown capable of producing accurate analog values.

[0050] Neuromorphic spiking networks are energy efficient because they only turn on neural pathways when the controlling neuron weighted input summer reaches a threshold, leaving neurons which do not accumulate sufficient input charge unused. It would be useful to modify the weighted summer described in this application to allow such an implementation when said weighted summer is used to create a neuron. This can be done by coupling a comparator to the first/input charge reservoir and once the charge on this reservoir reaches a level an interrupt is generated forcing the controller to couple the output of the neuron to its appropriate connection within the desired neural network. Some neuromorphic spiking networks also have the requirement for magnitude and/or time delay information. Time delay may be introduced through repeating the ramp in the time delay crossbar multiple times and through the use of a second set of switches 106 in Figure 10. For example if the ramp were repeated five times and a controller provided a five bit word indicating on which ramp the current should be applied then a simple counter could be used to determine when to turn on said second switch to implement said time delay by only allowing current to flow if it matches the appropriate five (5) bit delay word.

[0051] in certain cases ϋ may be more efficient to separate the charge reservoirs completely rather than coupling them in series, In this case the first charge reservoir charges during a first cycle and during a second cycle is discharged by a charge movement means at a controlled rate until it is depleted. During mis same second cycle a second charge movement means, programmed to be of proportional magnitude to that charging the first, charges the second charge reservoir until the charge on the first charge reservoir is depleted. Now the charge on the second charge reservoir will be that of the first multiplied by the ratio of the rates of charge movement. For example if the charge movement means were current sources, and II were depleting the first charge reservoir and 12 charging the second, then the charge on said second charge reservoir at the end of the second cycle would be I2/I1*Q1 where Ql is the initial charge on the first charge reservoir. The charge movement means could be a MOSFET, transfer gate, a graded junctions or other devices capable of controlling charge while also being started or stopped.

[0052] To reduce charge injection and allow use of extremely small equivalent capacitances, the transfer gates are created with depleted junction MOSFETs whose construction is also designed to minimize overlap capacitance. An example is shown in FIG. 11.

[0053] Referring to Fig. 12 an embodiment showing weighted summer 120 may be considered. Here a single charge reservoir is shown consisting of the gate of MN1 which is also connected to CI (which could be a capacitor or floating diffusion), the drain of the gating MOSFETs 122 and current source 126 also known as lout. Charge movement device 124 labelled wl and wn and bias bl are programmed in conformance with desired weight inputs and in proportion to current source lout 126. These weight inputs are gated by time inputs al, an, and b which are shown connected to a buffer driving the gates of MOSFETs 122. During a reset the gate of MN1 is pulled below its Vt comparator threshold, which will cause the drain of MOSFET Ml to invert and allow inverter 128 to turn on current source 126 until the MN1 gate reaches its switching threshold after which it is switched off. In a first cycle, the time plurality of pulse inputs al ...an and b effectively allow the weights to flow for a given amount of time resulting in a weighted charge to be removed from the charge reservoir at the gate of MN1. Once this current is removed MNl's drain will again flip its state and cause the inverter 128 to turn on the current source lout which will replace the charge removed by the weighted inputs. The time it takes to do so will represent a weighted sum output pulse at aout. [0054] While embodiments of the disclosure have been described in terms of various specific embodiments, those skilled in the art will recognize that the embodiments of the disclosure may be practiced with modifications within the spirit and scope of the claims

Previous Patent: METHOD AND SYSTEM FOR HIGH-THROUGHPUT PARTICLE HANDLING BY USE OF MAGNETIC FIELDS AND DEVICE

Next Patent: MULTISECTION TUBELESS HEAT EXCHANGER, FLUID HEATING SYSTEM INCLUDING THE SAME, AND METHODS OF MANUFA...