Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
DIGITAL COMPENSATOR FOR A NON-LINEAR SYSTEM
Document Type and Number:
WIPO Patent Application WO/2019/217811
Kind Code:
A1
Abstract:
A pre-distorter that both accurately compensates for the non-linearities of a radio frequency transmit chain, and that imposes as few computation requirements in terms of arithmetic operations, uses a diverse set of real- valued signals that are derived from the input signal. The derived real signals are passed through configurable non-linear transformations, which may be adapted during operation, and which may be efficiently implemented using lookup tables. The outputs of the non-linear transformations serve as gain terms for a set of complex signals, which are functions of the input, and which are summed to compute the pre-distorted signal. A small set of the complex signals and derived real signals may be selected for a particular system to match the classes of non- linearities exhibited by the system, thereby providing further computational savings, and reducing complexity of adapting the pre-distortion through adapting of the non-linear transformations.

Inventors:
MEGRETSKI ALEXANDRE (US)
LI YAN (US)
CHUANG KEVIN (US)
MAHMOOD ZOHAIB (US)
KIM HELEN (US)
Application Number:
PCT/US2019/031714
Publication Date:
November 14, 2019
Filing Date:
May 10, 2019
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NANOSEMI INC (US)
International Classes:
H03F1/32; H03F1/02; H03F3/24
Foreign References:
EP1560329A12005-08-03
EP1429453A12004-06-16
US20050001684A12005-01-06
EP1732208A12006-12-13
US9590668B12017-03-07
US10141961B12018-11-27
US201816004594A2018-06-11
Attorney, Agent or Firm:
ROHLICEK, J., Robin (US)
Download PDF:
Claims:
What is claimed is:

1. A method of signal predistortion for linearizing a non-linear circuit, the method comprising: processing an input signal (u ) to produce a plurality of transformed signals ( w ); determining a plurality of phase-invariant derived signals (r ), each derived signal ( r ) of the plurality of derived signals being equal to a non linear function of one or more of the transformed signals, at least some of the derived signals being equal to functions of different one or more of the transformed signals;

transforming each derived signal ( r ) of the plurality of phase-invariant derived signals according to a parametric non-linear transformation to produce a time-varying gain component (gi ) of a plurality of gain components ( g ); forming a distortion term by accumulating a plurality of terms, each term being a product of a transformed signal of the plurality of transformed signals and a time-varying gain, the time-varying gain being a function ( F ) of a one or more of the phase-invariant derived signals, the function of the one or more of the phase-invariant derived signals being decomposable into a combination of one or more parametric functions ( f ) of a corresponding single one of the time- varying gain components ( gt ); and providing an output signal ( v) determined from the distortion term for

application to the non-linear circuit.

2. The method of claim 1 where the non-linear circuit comprises a radio frequency section including a radio-frequency modulator configured to modulate the output signal to a carrier frequency to form a modulated signal and an amplifier for amplifying the modulated signal.

3. The method of claim 2 wherein the input signal (u ) comprises a quadrature components of a baseband signal for transmission via the radio-frequency section.

4. The method of claim 1 wherein the input signal (u ) and the plurality of transformed signals (w) comprise complex- valued signals.

5. The method of any of claims 1 through 4 wherein processing the input signal ( u ) to produce the plurality of transformed signals ( w ) includes forming at least one of the transformed signals as a linear combination of the input signal (u ) and one or more delayed versions of the input signal.

6. The method of claim 5 wherein forming at least one of the transformed signals as a linear combination includes forming a linear combination with at least one imaginary or complex multiple input signal or a delayed version of the input signal.

7. The method of claim 6 wherein forming at least one of the transformed signals, wk to be a multiple of Dawa + jdWf, , where wa and wh are other of the transformed signals, and Da represents a delay by a , and d is an integer between 0 and 3.

8. The method of claim 5 wherein forming the at least one of the transformed signals includes time filtering the input signal to form said transformed signal.

9. The method of claim 8 wherein time filtering the input signal includes applying a finite-impulse-response (FIR) filter to the input signal.

10. The method of claim 8 wherein time filtering the input signal includes applying an infinite-impulse-response (HR) filter to the input signal.

11. The method of any of claims 1 through 4 wherein the plurality of transformed signals (w) includes non-linear functions of the input signal (u ).

12. The method of claim 11 wherein the non-linear functions of the input signal ( u ) comprises at least one function of a form

u[n - r~\\u[n - t^r

for a delay t and an integer power p or

for a set for integer delays zj to 2 -] , where indicates a complex conjugate operation.

13. The method of any of claims 1 through 12 wherein determining a plurality of phase-invariant derived signals (r ) comprises determining real- valued derived signals.

14. The method of any of claims 1 through 13 wherein determining a plurality of phase-invariant derived signals (r ) comprises processing the transformed signals (w) to produce a plurality of phase-invariant derived signals ( r ).

15. The method of claim 14 wherein each of the derived signals is equal to a function of a single one of the transformed signals.

16. The method of claim 14 wherein processing the transformed signals ( w ) to produce a plurality of phase-invariant derived signals includes for at least one derived signal (rp ) computing said derived signal by first computing a phase-invariant non linear function of one of the transformed signals ( wk ) to produce a first derived signal, and then computing a linear combination of the first derived signal and delayed versions of the first derived signal to determine the at least one derived signal.

17. The method of claim 16 wherein computing a phase-invariant non-linear function of one of the transformed signals ( wk ) comprises computing a power of a magnitude of the one of the transformed signals (| ¼¾ \p ) for an integer power p > 1.

18. The method of claim 17 wherein p = 1 or p = 2 .

19. The method of claim 16 wherein computing the linear combination of the first derive signal and delayed versions of the first derived signal comprises time filtering the first derived signal.

20. The method of claim 19 wherein time filtering the first derived signal comprises applying a finite-impulse-response (FIR) filter to the first derived signal.

21. The method of claim 19 wherein time filtering the first derived signal comprises applying an infinite-impulse-response (HR) filter to the first derived signal.

22. The method of claim 14 wherein processing the transformed signals ( w ) to produce a plurality of phase-invariant derived signals includes computing a first signal as a phase-invariant non-linear function of a first signal of the transformed signals, and computing a second signal as a phase-invariant non-linear function of a second of the transformed signals, and then computing a combination of the first signal and the second signal to form at least one of the phase-invariant derived signals.

23. The method of claim 22 wherein at least one of the phase-invariant derived signals is equal to a function for two of the transformed signals wa and¼¾ with a form

\ wa[t] \a \ wb[t - T]

for positive integer powers a and b .

24. The method of claim 14 wherein processing the transformed signals ( w ) to produce a plurality of phase-invariant derived signals includes computing a derived signal rk\t\ using at least one of the following transformations:

¾ M =I wa[/] |“ , where a > 0 for a transformed signal wa [/] ;

¾[/] = 0.5(1 -d + ra[t- a] + 0¾[/]) , where Q e {1,-1} , a,b e {\,...,k-\} , and a is an integer and r [/] and rb[l] are other of the derived signals; rk M = ra [t - a] rb [/] , where a, b e {1, ... , k - 1} and a is an integer and ra [/] and r6[/] are other of the derived signals; and integer d > 0.

25. The method of any of claims 1 through 24 wherein the time-varying gain components comprise complex- valued gain components.

26. The method of any of claims 1 through 25 further comprising transforming a first derived signal {r·) of the plurality of phase-invariant derived signals according to one or more different parametric non-linear transformation to produce a

corresponding time-varying gain components.

27. The method of claim 26 wherein the one or more different parametric non linear transformations comprises multiple different non-linear transformations producing corresponding time-varying gain components.

28. The method of claim 27 wherein each of the corresponding time-varying gain components forms a part of a different term of the plurality of terms of the distortion term.

29. The method of any of claims 1 to 25 wherein forming the distortion term comprises forming a first sum of products, each term in the fist sum being a product of a delayed version of the transformed signal and a second sum of a corresponding subset of the gain components.

30. The method of any of claims 1 to 29 wherein the distortion term />[/] has a form

wherein for each term indexed by k , ak selects the transformed signal, dk determines the delay of said transformed signal, and Ak determines the subset of the gain components.

31. The method of any of claims 1 through 30 wherein transforming a first derived signal of the plurality of derived signals according to a parametric non-linear transformation comprises performing a table lookup in a data table corresponding to said transformation according to the first derived signal to determine a result of the transforming.

32. The method of claim 31 wherein the parametric non-linear transformation comprises a plurality of segments, each segment corresponding to a different range of values of the first derived signal, and wherein transforming the first derived signal according to the parametric non-linear transformation comprises determining a segment of the parametric non-linear transformation from the first derived signal and accessing data from the data table corresponding to a said segment.

33. The method of claim 32 wherein the parametric non-linear transformation comprises a piecewise linear or a piecewise constant transformation, and the data from the data table corresponding to said segment characterizes endpoints of said segment.

34. The method of claim 33 wherein the non-linear transformation comprises a piecewise linear transformation, and transforming the first derived signal comprises interpolating a value on a linear segment of said transformation.

35. The method of any of claims 1 through 34 further comprising adapting configuration parameters of the parametric non-linear transformation according to sensed output of the non-linear circuit.

36. The method of claim 35 further comprising acquiring a sensing signal (y ) dependent on an output of the non-linear circuit, and wherein adapting the configuration parameters includes adjusting said parameters according to a relationship of the sensing signal ( y ) and at least one of the input signal (u ) and the output signal (v).

37. The method of claim 36 wherein adjusting said parameters includes reducing a mean squared value of a signal computed from the sensing signal ( y ) and at least one of the input signal (u ) and the output signal (v) according to said parameters.

38. The method of claim 37 wherein reducing the mean squared value includes applying a stochastic gradient procedure to incrementally update the configuration parameters.

39. The method of claim 37 wherein reducing the mean squared value includes processing a time interval of the sensing signal ( y ) and a corresponding time interval of at least one of the input signal (u ) and the output signal ( v ).

40. The method of claim 39 comprising performing a matrix inverse of a Gramian matrix determined from the time interval of the sensing signal and a corresponding time interval of at least one of the input signal (u ) and the output signal ( v ).

41. The method of claim 40 further comprising forming the Gramian matrix as a time average Gramian.

42. The method of claim 39 comprising performing coordinate descent procedure based on the time interval of the sensing signal and a corresponding time interval of at least one of the input signal (u ) and the output signal (v).

43. The method of any of claims 35 through 38 wherein transforming a first derived signal of the plurality of derived signals according to a parametric non-linear transformation comprises performing a table lookup in a data table corresponding to said transformation according to the first derived signal to determine a result of the transforming, and wherein adapting the configuration parameters comprises updating values in the data table.

44. The method of claim 43 wherein the parametric non-linear transformation comprises a greater number of piecewise linear segments than adjustable parameters characterizing said transformation.

45. The method of claim 44 wherein the non-linear transformation represents a function that is a sum of scaled kernels, a magnitude scaling each kernel being determined by a different one of the adjustable parameters characterizing said transformation.

46. The method of claim 45 wherein each kernel comprises a piecewise linear function.

47. The method of claim 45 wherein each kernel is zero for at least some range of values of the derived signal.

48. A digital predistorter circuit configured to perform all the steps of any of claims 1 to 47.

49. A design structure encoded on a non-transitory machine-readable medium, said design structure comprising elements that, when processed in a computer-aided design system, generate a machine-executable representation of the digital predistortion circuit of claim 48.

50. A non-transitory computer readable media programmed with a set of computer instmctions executable on a processor that, when executed, cause operations comprising the method steps of any of claims 1 to 47.

693639

Description:
DIGITAL COMPENSATOR FOR A NON-LINEAR SYSTEM

CROSS-REFERENCE TO RELATED APPLICATIONS

[001] This application claims the benefit of U.S. Provisional Application No.

62/670,315, filed on May 11 , 2018, and U.S. Provisional Application No. 62/747,994, filed October 19, 2018, each of which is incorporated herein by reference.

BACKGROUND

[002] This invention relates to digital compensation of a non-linear circuit or system, for instance linearizing a non-linear power amplifier and radio transmitter chain, and in particular to effective parameterization of a digital pre-distorter used for digital compensation.

[003] One method for compensation of such a non-linear circuit is to“pre-distort”

(or“pre-invert”) the input. For example, an ideal circuit outputs a desired signal «[.] unchanged, such that y[.\ = «[.] , while the actual non-linear circuit has an input- output transformation y[.\ = A(w[.]) , where the notation y\.] denotes a discrete time signal. A compensation component is introduced before the non-linear circuit that transforms the input «[.] , which represents the desired output, to a predistorted input v[.] according to a transformation v[.] = C(w[.]) . Then this predistorted input is passed through the non-linear circuit, yielding y[.\ = A(v[.]) . The functional form and selectable parameters values that specify the transformation C( ) are chosen such that y[.\ « «[.] as closely as possible in a particular sense (e.g., minimizing mean squared error), thereby linearizing the operation of tandem arrangement of the pre distorter and the non-linear circuit as well as possible.

[004] In some examples, the DPD performs the transformation of the desired signal «[.] to the input y\.] by using delay elements to form a set of delayed versions of the desired signal, and then using a non-linear polynomial function of those delayed inputs. In some examples, the non-linear function is a Volterra series:

In some examples, the non-linear function uses a reduced set of Volterra terms or a delay polynomial:

In these cases, the particular compensation function C is determined by the values of the numerical configuration parameters x p .

[005] In the case of a radio transmitter, the desired input «[.] may be a complex discrete time baseband signal of a transmit band, and y[.] may represent that transmit band as modulated to the carrier frequency of the radio transmitter by the function F( ) that represents the radio transmit chain. That is, the radio transmitter may modulate and amplify the input v[.] to a (real continuous-time) radio frequency signal p(.) which when demodulated back to baseband, limited to the transmit band and sampled, is represented by y[.\ .

[006] There is a need for a pre-distorter with a form that both accurately

compensates for the non-linearities of the transmit chain, and that imposes as few computation requirements in terms of arithmetic operations to be performed to pre distort a signal and in terms of the storage requirements of values of the configuration parameters. There is also a need for the form of the pre-distorter to be robust to variation in the parameter values and/or to variation of the characteristics of the transmit chain so that performance degradation of pre-distortion does not exceed that which may be commensurate with the degree of such variation. SUMMARY

[007] In one aspect, in general, a pre-distorter that both accurately compensates for the non-linearities of a radio frequency transmit chain, and that imposes as few computation requirements in terms of arithmetic operations and storage requirements, uses a diverse set of real-valued signals that are derived from the input signal, as well as optional input envelope and other relevant measurements of the system. The derived real signals are passed through configurable non-linear transformations, which may be adapted during operation based on sensed output of the transmit chain, and which may be efficiently implemented using lookup tables. The outputs of the non-linear transformations serve as gain terms for a set of complex signals, which are transformations of the input. The gain-adjusted complex signals are summed to compute the pre-distorted signal, which is passed to the transmit chain. A small set of the complex signals and derived real signals may be selected for a particular system to match the non-linearities exhibited by the system, thereby providing further computational savings, and reducing complexity of adapting the pre-distortion through adapting of the non-linear transformations.

[008] In another aspect, in general, a method of signal predistortion linearizes a non linear circuit. An input signal (u ) is processed to produce multiple transformed signals ( w). The transformed signals are processed to produce multiple phase- invariant derived signals (r). These phase-invariant derived signals (r ) are determined such that each derived signal ( ) is equal to a non-linear function of one or more of the transformed signals. The derived signals are phase-invariant in the sense that a change in the phase of a transformed signal does not change the value of the derive signal. At least some of the derived signals are equal to functions of different one or more of the transformed signals. Each derived signal ( ) of the phase-invariant derived signals is processed according to a parametric non-linear transformation to produce a time-varying gain component ( g t ) of multiple gain components ( g ). A distortion term is then formed by accumulating multiple terms. Each terai is a product of a transformed signal of the transformed signals and a time- varying gain. The time-varying gain is a function ( F ) of one or more of the phase- invariant derived signals (i.e., a gain signal determined from the derived signals) and the function of the one or more of the phase-invariant derived signals is decomposable into a combination of one or more parametric functions ( f ) of a corresponding single one of the time-varying gain components (i.e., component gain signals). An output signal ( v ) is determined from the distortion term and provided for application to the non-linear circuit.

[009] Aspects may include one or more of the following features.

[010] The non-linear circuit includes a radio-frequency section including a radio frequency modulator configured to modulate the output signal to a carrier frequency to form a modulated signal and an amplifier for amplifying the modulated signal.

[Oil] The input signal (u ) includes quadrature components of a baseband signal for transmission via the radio-frequency section.

[012] The input signal (u ) and the transformed signals ( w) are complex-valued signals.

[013] Processing the input signal ( u ) to produce the transformed signals ( w ) includes forming at least one of the transformed signals as a linear combination of the input signal (u ) and one or more delayed versions of the input signal.

[014] At least one of the transformed signals is formed as a linear combination includes forming a linear combination with at least one imaginary or complex multiple input signal or a delayed version of the input signal.

[015] Forming at least one of the transformed signals, w k to be a multiple of where w a and w h are other of the transformed signals, and D a represents a delay by a , and d is an integer between 0 and 3. [016] Forming the at least one of the transformed signals includes time filtering the input signal to form said transformed signal. The time filtering of the input signal can include includes applying a finite-impulse-response (FIR) filter to the input signal, or applying an infinite-impulse-response (HR) filter to the input signal.

[017] The transformed signals (w) include non-linear functions of the input signal ( u ).

[018] The non-linear functions of the input signal (u ) include at least one function of a form u[n - r] for a delay r and an integer power p or

where indicates a complex conjugate operation.

[019] Determining a plurality of phase-invariant derived signals ( r ) comprises determining real-valued derived signals.

[020] Determining the phase-invariant derived signals ( r ) comprises processing the transformed signals (w) to produce a plurality of phase-invariant derived signals r ).

[021] Each of the derived signals is equal to a function of a single one of the transformed signals.

[022] Processing the transformed signals ( w ) to produce the phase-invariant derived signals includes, for at least one derived signal ( r p ), computing said derived signal by first computing a phase-invariant non-linear function of one of the transformed signals ( vt¾ ) to produce a first derived signal, and then computing a linear combination of the first derived signal and delayed versions of the first derived signal to determine the at least one derived signal.

[023] Computing a phase-invariant non-linear function of one of the transformed signals ( w k ) comprises computing a power of a magnitude of the one of the transformed signals (| ¾ \ p ) for an integer power p > \ . For example, p = 1 or p = 2 [024] Computing the linear combination of the first derived signal and delayed versions of the first derived signal comprises time filtering the first derived signal. Time filtering the first derived signal can include applying a finite-impulse-response (FIR) filter to the first derived signal or applying an infinite-impulse-response (HR) filter to the first derived signal.

[025] Processing the transformed signals ( w ) to produce the phase-invariant derived signals includes computing a first signal as a phase-invariant non-linear function of a first signal of the transformed signals, and computing a second signal as a phase- invariant non-linear function of a second of the transformed signals, and then computing a combination of the first signal and the second signal to form at least one of the phase-invariant derived signals.

[026] At least one of the phase-invariant derived signals is equal to a function for two of the transformed signals w a and ¼¾ with a form | w a [/] \ a | u¾[/ - r] f for positive integer powers a and b .

[027] The transformed signals ( w ) are processed to produce the phase-invariant derived signals by computing a derived signal r k \t\ using at least one of the following transformations : r k M =1 w a M r where a > 0 for a transformed signal w a [/] ;

¾[/] = 0.5(1 - q + ra [t-a ~ \ + 0¾[/]) , where 0 e {l,-l} , a,b e {1, ... , L:— 1} , and a is an integer, and r [/] and r h [l] are other of the derived signals; r k [t\ = r a [t - «] ¾[/] , where a,b <E {l,...,& - l} and a is an integer and r a [/] and r 6 [/] are other of the derived signals; and

/¾ [!] = ¾[/ - l] + 2 _fii (r a [/] - ¾[/ - l]) , where a e {l,...,£ - l} and d is an integer d > 0 .

[028] The time-varying gain components comprise complex-valued gain

components. [029] The method includes transfomiing a first derived signal ( ) of the plurality of phase-invariant derived signals according to one or more different parametric non linear transformation to produce a corresponding time-varying gain components.

[030] The one or more different parametric non-linear transformations comprises multiple different non-linear transformations producing corresponding time-varying gain components.

[031] Each of the corresponding time-varying gain components forms a part of a different term of the plurality of terms of the sum forming the distortion term.

[032] Forming the distortion term comprises forming a first sum of products, each term in the fist sum being a product of a delayed version of the transformed signal and a second sum of a corresponding subset of the gain components.

[033] The distortion term ό ' [ί] has a form 5 [/] wherein

for each term indexed by k , a k selects the transformed signal, d k determines the delay of said transformed signal, and A k determines the subset of the gain components.

[034] Transforming a first derived signal of the derived signals according to a parametric non-linear transformation comprises performing a table lookup in a data table corresponding to said transformation according to the first derived signal to determine a result of the transforming.

[035] The parametric non-linear transformation comprises a plurality of segments, each segment corresponding to a different range of values of the first derived signal, and wherein transforming the first derived signal according to the parametric non linear transformation comprises determining a segment of the parametric non-linear transformation from the first derived signal and accessing data from the data table corresponding to a said segment. [036] The parametric non-linear transformation comprises a piecewise linear or a piecewise constant transformation, and the data from the data table corresponding to the segment characterizes endpoints of said segment.

[037] The non-linear transformation comprises a piecewise linear transformation, and transforming the first derived signal comprises interpolating a value on a linear segment of said transformation.

[038] The method further includes adapting configuration parameters of the parametric non-linear transformation according to sensed output of the non-linear circuit.

[039] The method further includes acquiring a sensing signal (y ) dependent on an output of the non-linear circuit, and wherein adapting the configuration parameters includes adjusting said parameters according to a relationship of the sensing signal ( y

) and at least one of the input signal (u ) and the output signal ( v ).

[040] Adjusting said parameters includes reducing a mean squared value of a signal computed from the sensing signal (y ) and at least one of the input signal (u ) and the output signal ( v ) according to said parameters.

[041] Reducing the mean squared value includes applying a stochastic gradient procedure to incrementally update the configuration parameters.

[042] Reducing the mean squared value includes processing a time interval of the sensing signal ( y ) and a corresponding time interval of at least one of the input signal

(u ) and the output signal (v).

[043] The method includes performing a matrix inverse of a Gramian matrix determined from the time interval of the sensing signal and a corresponding time interval of at least one of the input signal (u ) and the output signal (v).

[044] The method includes forming the Gramian matrix as a time average Gramian. [045] The method includes performing coordinate descent procedure based on the time interval of the sensing signal and a corresponding time interval of at least one of the input signal (u ) and the output signal (v).

[046] Transforming a first derived signal of the plurality of derived signals according to a parametric non-linear transformation comprises performing a table lookup in a data table corresponding to said transformation according to the first derived signal to determine a result of the transforming, and wherein adapting the configuration parameters comprises updating values in the data table.

[047] The parametric non-linear transformation comprises a greater number of piecewise linear segments than adjustable parameters characterizing said

transformation.

[048] The non-linear transformation represents a function that is a sum of scaled kernels, a magnitude scaling each kernel being determined by a different one of the adjustable parameters characterizing said transformation.

[049] Each kernel comprises a piecewise linear function.

[050] Each kernel is zero for at least some range of values of the derived signal.

[051] In another aspect, in general, a digital predistorter circuit is configured to perform all the steps of any of the methods set forth above.

[052] In another aspect, in general, a design structure is encoded on a non-transitory machine-readable medium. The design structure comprises elements that, when processed in a computer-aided design system, generate a machine-executable representation of the digital predistortion circuit that is configured to perform all the steps of any of the methods set forth above.

[053] In another aspect, in general, a non-transitory computer readable media is programmed with a set of computer instmctions executable on a processor. When these instructions are executed, they cause operations including all the steps of any of the methods set forth above.

DESCRIPTION OF DRAWINGS

[054] FIG. 1 is a block diagram of a radio transmitter.

[055] FIG. 2 is a block diagram of the pre-distorter of FIG. 1.

[056] FIG. 3 is a block diagram of a distortion signal combiner of FIG. 2.

[057] FIGS. 4A-E are graphs of example gain functions.

[058] FIG. 5 is a diagram of a table-lookup implementation of a gain lookup section of FIG. 2.

[059] FIG. 6A-B are diagrams of a section of a table lookup for piecewise linear functions.

DESCRIPTION

[060] Referring to FIG. 1, in an exemplary structure of a radio transmitter 100, a desired baseband input signal «[.] passes to a baseband section 110, producing a predistorted signal v[.] . In the description below, unless otherwise indicated, signals such as «[.] and v[.] are described as complex-valued signals, with the real and imaginary parts of the signals representing the in-phase and quadrature terms (i.e., the quadrature components) of the signal. The predistorted signal v[.] then passes through a radio frequency (RF) section 140 to produce an RF signal p(.) , which then drives a transmit antenna 150. In this example, the output signal is monitored (e.g., continuously or from time to time) via a coupler 152, which drives an adaptation section 160. The adaptation section also receives the input to the RF section, v[.] .

The adaptation section 150 determined values of parameters x , which are passed to the baseband section 110, and which affect the transformation from «[.] to v[.] implemented by that section.

[061] The structure of the radio transmitter 100 shown in FIG. 1 includes an optional envelope tracking aspect, which is used to control the power (e.g., the voltage) supplied to a power amplifier of the RF section 140, such that less power is provided when the input «[.] has smaller magnitude over a short term and more power is provided when it has larger magnitude. When such an aspect is included, an envelope signal e[.] is provided from the baseband section 110 to the RF section 140, and may also be provided to the adaptation section 160.

[062] The baseband section 110 has a predistorter 130, which implements the transformation from the baseband input «[.] to the input v[.] to the RF section 140.

This predistorter is configured with the values of the configuration parameters x provided by the adaptation section 160 if such adaptation is provided. Alternatively, the parameter values are set when the transmitter is initially tested, or may be selected based on operating conditions, for example, as generally described in U.S. Pat.

9,590,668,“Digital Compensator.”

[063] In examples that include an envelope-tracking aspect, the baseband section 110 includes an envelope tracker 120, which generates the envelope signal e[.] . For example, this signal tracks the magnitude of the input baseband signal, possibly filtered in the time domain to smooth the envelope. In particular, the values of the envelope signal may be in the range [0,1], representing the fraction of a full range. In some examples, there are N E such components of the signal (i.e.,

e[.] = {eyW,...,e jjE [.] , for example, with e \ [.] may be a conventional envelope signal, and the other components may be other signals, such as environmental measurements, clock measurements (e.g., the time since the last“on” switch, such as a ramp signal synchronized with time-division-multiplex (TDM) intervals), or other user monitoring signals. This envelope signal is optionally provided to the predistorter 130. Because

- l i the envelope signal may be provided to the RF section, thereby controlling power provided to a power amplifier, and because the power provided may change the non linear characteristics of the RF section, in at least some examples, the transformation implemented by the predistorter depends on the envelope signal.

[064] Turning to the RF section 140, the predistorted baseband signal v[.] passes through an RF signal generator 142, which modulates the signal to the target radio frequency band at a center frequency f c . This radio frequency signal passes through a power amplifier (PA) 148 to produce the antenna driving signal p(.) . In the illustrated example, the power amplifier is powered at a supply voltage determined by an envelope conditioner 122, which receives the envelope signal e[.] and outputs a time-varying supply voltage V c to the power amplifier.

[065] As introduced above, the predistorter 130 is configured with a set of fixed parameters z , and values of a set of adaptation parameters x , which in the illustrated embodiment are determined by the adaptation section 160. Very generally, the fixed parameters determined the family of compensation functions that may be

implemented by the predistorter, and the adaptation parameters determine the particular function that is used. The adaptation section 160 receives a sensing of the signal passing between the power amplifier 148 and the antenna 150, for example, with a signal sensor 152 preferably near the antenna (i.e., after the RF signal path between the power amplifier and the antenna, in order to capture non-linear characteristics of the passive signal path). RF sensor circuity 164 demodulates the sensed signal to produce a representation of the signal band y[.] , which is passed to an adapter 162. The adapter 162 essentially uses the inputs to the RF section, namely v[.] and/or the input to the predistorter «[.] (e.g., according to the adaptation approach implemented) and optionally e[.\ , and the representation of sensed output of the RF section, namely y[.\ . In the analysis below, the RF section is treated as implementing a generally non-linear transformation represented as y[.\ = A(v[.],e[.]) in the baseband domain, with a sampling rate sufficiently large to capture not only the bandwidth of the original signal «[.] but also a somewhat extended bandwidth to include significant non-linear components that may have frequencies outside the desired transmission band. In later discussions below, the sampling rate of the discrete time signals in the baseband section 110 is denoted as f s .

[066] In the adapter 162 is illustrated in FIG. 1 and described below as essentially receiving u[t\ and/or v[/] synchronized with _ >[/] . However, there is a delay in the signal path from the input to the RF section 140 to the output of the RF sensor 164. Therefore, a synchronization section (not illustrated) may be used to account for the delay, and optionally to adapt to changes in the delay. For example, the signals are upsampled and correlated, thereby yielding a fractional sample delay compensation, which may be applied to one or the other signal before processing in the adaptation section. Another example of a synchronizer is described in US Pat. 10,141,961, which is incorporated herein by reference.

[067] Although various structures for the transformation implemented by the predistorter 130 may be used, in one or more embodiments described below, the functional form implemented is

v[·] = «[.] + *[·]

where

5[.\ = A(u[.],e[·]) ,

and A( , ) , which may be referred to as the distortion term, is effectively

parameterized by the parameters x . Rather than using a set of terms as outlined above for the Volterra or delay polynomial approaches, the present approach makes use of a multiple stage approach in which a diverse set of targeted distortion terms are combined in a manner that satisfies the requirements of low computation requirement, low storage requirement, and robustness, while achieving a high degree of linearization. [068] Very generally, structure of the function D( , ) is motivated by application of the Kolmogorov Superposition Theorem (KST). One statement of KST is that a non linear function of d arguments cg,...,x d e [0,1]^ may be expressed as

for some functions g t and L· . Proofs of the existence of such functions may concentrate on particular types of non-linear functions, for example, fixing the hy and proving the existence of suitable g t . In application to approaches described in this document, this motivation yields a class of non-linear functions defined by constituent non-linear functions somewhat analogous to the g { and/or the hy in the KST formulation above.

[069] Referring to FIG. 2, the predistorter 130 performs a series of transformations that generate a diverse set of building blocks for forming the distortion term using an efficient table-driven combination. As a first transformation, the predistorter includes a complex transformation component 210, labelled L c and also referred to as the

“complex layer.” Generally, the complex layer receives the input signal, and outputs multiple transformed signals. In the present embodiment, the input to the complex transformation component is the complex input baseband signal, «[.] , and the output is a set of complex baseband signals, ¼{.] , which may be represented as a vector of signals and indexed w l [.],w 2 [.],...,w^ [.] , where N w is the number of such signals.

Very generally, these complex baseband signals form terms for constructing the distortion term. More specifically, the distortion term is constructed as a weighted summation of the set of baseband signals, where the weighting is time varying, and determined based on both the inputs to the predistorter 130, «[.] and e[.] , as well as the values of the configuration parameters, x . Going forward, the denotation of signals with“[.]” is omitted, and the context should make evident when the signal as a whole is referenced versus a particular sample. [070] Note that as illustrated in FIG. 2, the complex layer 210 is configured with values of fixed parameters z , but does not depend of the adaptation parameters x .

For example, the fixed parameters are chosen according to the type of RF section 140 being linearized, and there fixed parameters determine the number N w of the complex signals generated, and their definition.

[071] In one implementation, the set of complex baseband signals includes the input itself, w 1 = u , as well as well as various delays of that signal, for example, w k = u[t - k + \] for k = 1,..., N w . In another implementation, the complex signals output from the complex layer are arithmetic functions of the input, for example (w[/] + w[/ -l]) / 2 ;

In at least some examples, these arithmetic functions are selected to limit the computational resources by having primarily additive operations and multiplicative operations by constants that may be implemented efficiently (e.g., division by 2). In another implementation, a set of relatively short finite-impulse-response (FIR) filters modify the input u[t\ to yield vt¾[/] , where the coefficients may be selected according to time constants and resonance frequencies of the RF section.

[072] In yet another implementation, the set of complex baseband signals includes the input itself, W = u , as well as well as various combinations, for example, of the form

w k = 0.5(D a w a + j d w b ) ,

where D a represents a delay of a signal by an integer number a samples, and d is an integer, generally with d e {0,1, 2, 3} may depend on k , and k > a, b (i.e., each signal w k may be defined in terms of previously defined signals), such that

w k [f\ = 0.5(w e [i - a] + j d w b [f\) . There are various ways of choosing which combinations of signals (e.g., the a , b, d values) determine the signals constructed. One way is essentially by trial and error, for example, adding signals from a set of values in a predetermined range that most improve performance in a greedy manner (e.g., by a directed search) one by one.

[073] Continuing to refer to FIG. 2, a second stage is a real transformation component 220, labelled ½ and also referred to as the“real layer.” The real transformation component receives the N w signals w , optionally as well as the envelope signal e , and outputs N R (generally greater than N w ) real signals r , in a bounded range, in the implementation in a range [0,1] . In some implementations, the real signals are scaled, for example, based on a fixed scale factor that is based on the expected level of the input signal u . In some implementations, the fixed parameters for the system may include a scale (and optionally an offset) in order to achieve a typical range of [0,1]. In yet other implementations, the scale factors may be adapted to maintain the real values in the desired range.

[074] In one implementation, each of the complex signals vt¾ passes to one or more corresponding non-linear functions / ( w ) , which accepts a complex value and outputs a real value r that does not depend on the phase of its input (i.e., the function is phase-invariant). Examples of these non-linear functions, with an input u = u re + ju im include the following:

l \ w | \l/2

In at least some examples, the non-linear function is monotone or non-decreasing in norm (e.g., an increase in | w | corresponds to an increase in r = f ( u ) ).

[075] In some implementations, the output of a non-linear, phase-invariant function may be filtered, for example, with a real linear time-invariant filters. In some examples, each of these filters is an Infinite Impulse-Response (HR) filter implemented as having a rational polynomial Laplace or Z Transform (i.e., characterized by the locations of the poles and zeros of the Transform of the transfer function). An example of a Z transform for an HR filter is:

Y(z) z -q

( z ) z 2 - 2qz + p

where, for example, p = 0.7105 and q = 0.8018 . In other examples, a Finite Impulse-

Response (FIR). An example of a FIR filter with input x and output y is:

for example with k = \ or k = .

[076] In yet another implementation, the particular signals are chosen (e.g., by trial and error, in a directed search, iterative optimization, etc.) from one or more of the following families of signals:

a. r k = e k for k = 1, ... , N E , where q , ... , e N ^ are the optional components of signal e b. r k [t] =\ w a [i\ |“ for all l , where a > 0 (with a = 1 or a = 2 being most common) and a e { 1, ... , N w } may depend on k ; c · ¾ M = 0-5(1 -0 + a [/- a] + $¾[/]) for all /, where 0 e {l,- 1} ,

a, b e {1, ... , k - 1} , and a is an integer that may depend on k ; d. r k [ ] = r a [t - a] r b [ ] for all l , where a, b e {1, ... , k - 1} and a is an integer that may depend on k ;

integer d , d > 0 , may depend on k (equivalently, r k is the response of a first order linear time invariant (LTI) filter with a pole at 1 - 2 ~d , applied to r a for some a < k ; f. r k is the response (appropriately scaled and centered) of a second order LTI filter with complex poles (carefully selected for easy implementability), applied to r a for some a e {1,...,^-1} .

[077] As illustrated in FIG. 2, the real layer 220 is configured by the fixed parameters z , which determine the number of real signals N R , and their definition.

However, as with the complex layer 210, the real layer does not depend on the adaptation parameters x . The choice of real functions may depend on characteristics of the RF section 140 in a general sense, for example, being selected based on manufacturing or design-time considerations, but these functions do not generally change during operation of the system while the adaptation parameters x may be updated on an ongoing basis in at least some implementations.

[078] According to (a), the components of e are automatically treated as real signals (i.e., the components of r ). Option (b) presents a convenient way of converting complex signals to real ones while assuring that scaling the input u by a complex constant with unit absolute value does not change the outcome (i.e., phase- invariance). Options (c) and (d) allow addition, subtraction, and (if needed) multiplication of real signals. Option (e) allows averaging (i.e., cheaply implemented low-pass filtering) of real signals and option (f) offers more advanced spectral shaping, which is needed for some real-world power amplifiers 148, which may exhibit a second order resonance behavior. Note that more generally, the

transformations producing the r components are phase invariant in the original baseband input u , that is, multiplication of u[t\ by cxp( /<9) or cxp(jo t) does not change r p [t\ .

[079] Constructing the signals w and r can provide a diversity of signals from which the distortion term may be formed using a parameterized transformation. In some implementations, the form of the transformation is as follows: The function F jA*(r) takes as an argument the N R components of r , and maps those values to a complex number according to the parameters values of x . That is, each function (r) essentially provides a time-varying complex gain for the k th term in the summation forming the distortion term. With up to D delays (i.e., 0 £ d k ,D ) and N w different w[/] functions, there are up to N W D terms in the sum. The selection of the particular terms (i.e., the values of a k and d k ) is represented in the fixed parameters z that configure the system.

[080] Rather than configuring functions of N R arguments, some embodiments structure the functions as a summation of functions of single arguments as follows: where the summation over j may include all N R terms, or may omit certain terms. Overall, the distortion term is therefore computed to result in the following:

Again, the summation over j may omit certain terms, for example, as chosen by the designer according to their know-how and other experience or experimental measurements. This transformation is implemented by the combination stage 230, labelled L R in FIG. 2. Each term in the sum over k uses a different combination of a selection of a component a k of w and a delay d k for that component. The sum over j yields a complex multiplier for that combination, essentially functioning as a time- varying gain for that combination.

[081] As an example of one term in summation that yields the distortion term,

2

consider w 1 = u , and r =| u | (i.e., applying transformation (b) with a = 1 , and a = 2 ), which together yield a term of the form u f{ \ u \ ) where y{ ) is one of the parameterized scalar functions. Note the contrast of such a term as compared to a 2

simple scalar weighting of a terms u \ u \ , which lack the larger number of degrees of freedom obtainable though the parameterization of f( ) .

[082] Each function implements a parameterized mapping from the real argument r j , which is in the range [0,1] , to a complex number, optionally limited to complex numbers with magnitudes less than or equal to one. These functions are essentially parameterized by the parameters x , which are determined by the adaptation section 160 (see FIG. 1). In principal, if there are N w components of w , and delays from 0 to D- 1 are permitted, and each component of the N R components of r may be used, then there may be up to a total of N w D N R different functions

[083] In practice, a selection of a subset of these terms are used, being selected for instance by trial-and-error or greedy selection. In an example of a greedy iterative selection procedure, a number of possible terms (e.g., w and r combinations) are evaluated according to their usefulness in reducing a measure of distortion (e.g., peak or average RMS error, impact on EVM, etc. on a sample data set) at an iteration and one or possible more best terms are retained before proceeding to the next iteration where further terms may be selected, with a stopping rule, such as a maximum number of terms or a threshold on the reduction of the distortion measure. A result is that for any term k in the sum, only a subset of the N R components of r are generally used. For a highly nonlinear device, a design generally works better employing a variety of /¾ signals. For nonlinear systems with strong memory effect

(i.e., poor harmonic frequency response), the design tends to require more shifts in the vt¾ signals. In an alternative selection approach, the best choices of w k and r k with given constraints starts with a universal compensator model which has a rich selection of vt¾ and ¾ , and then an Ll trimming is used to restrict the terms.

[084] Referring to FIG. 4A, one functional form for the j ( ) functions, generically referred to as f t , is as a piecewise constant function 410. In FIG. 4A, the real part of such a piecewise constant function is shown in which the interval from

0.0 to 1.0 is divided into 8 section (i.e., 2 sections for S = 3). In embodiments that use such form, the adaptive parameters x directly represent the values of these piecewise constant sections 411, 412 - 418. In FIG. 4 A, and in examples below, the r axis is divided in regular intervals, in the figure in equal width intervals. The approaches described herein do not necessarily depend on uniform intervals, and the axis may be divide in unequal intervals, with all functions using the same set of intervals or different functions potentially using different intervals. In some implementations, the intervals are determined by the fixed parameters z of the system.

[085] Referring to FIG. 4B, another form of function is a piecewise linear function 420. Each section 431 - 438 is linear and is defined by the values of its endpoints.

Therefore, the function 420 is defined by the 9 (i.e., 2 + 1 ) endpoints. The function 420 can also be considered to be the weighted sum of predefined kernels (r) for

/ = 0, L - 1 , in this illustrated case with L = 2 + 1 = 9. ln particular, these kernels may be defined as:

The function 420 is then effectively defined by the weighted sum of these kernels as:

where the X [ are the values at the endpoints of the linear segments.

[086] Referring to FIG. 4C, different kernels may be used. For example, a smooth function 440 may be defined as the summation of weighted kernels 441, 442 - 449. In some examples, the kernels are non-zero over a restricted range of values of r , for example, with ty (r) being zero for r outside [(/ - ri) / L, (i + ri) / L] for n = 1 , or some large value of n < L .

[087] Referring to FIG. 4D, in some examples, piecewise linear function forms an approximation of a smooth function. In the example shown in FIG. 4D, a smooth function, such as the function in FIG. 4C, is defined by 9 values, the multiplier for kernel functions /¾ through b 9 . This smooth function is then approximated by a larger number of linear sections 451 - 466, in this case 16 section defined by 17 endpoints.470, 471 - 486. As is discussed below, this results in there being 9 (complex) parameters to estimate, which are then transformed to 17 parameters for configuring the predistorter. Of course, different number of estimated parameters and linear sections may be used. For example, 4 smooth kernels may be used in estimation and then 32 linear sections may be used in the runtime predistorter.

[088] Referring to FIG. 4E, in another example, the kernel functions themselves are piecewise linear. In this example, 9 kernel functions, of which two 491 and 492 are illustrated, are used. Because the kernels have linear segments of length 1/16, the summation of the 9 kernel functions result in a function 490 that has 16 linear segments. One way to form the kernel functions is a 1 / M‘ h band interpolation filter, in this illustration a half-band filter. In another example that is not illustrated, 5 kernels can be used to generate the 16-segment function essentially by using quarter- band interpolation filters. The specific form of the kernels may be determined by other approaches, for example, to optimize smoothness or frequency content of the resulting functions, for example, using linear programming of finite-impulse-response filter design techniques.

[089] It should also be understood that the approximation shown in FIGD. 4D-E do not have to be linear. For example, a low-order spline may be used to approximate the smooth function, with fixed knot locations (e.g., equally spaced along the r axis, or with knots located with unequal spacing and/or at locations determined during the adaptation process, for example, to optimize a degree of fit of the splines to the smooth function.

[090] Referring to FIG. 3, the combination stage 230 is implemented in two parts: a lookup table stage 330, and a modulation stage 340. The lookup table stage 330, labelled L T , implements a mapping from the N R components of r to N G components of a complex vector g . Each component g t corresponds to a unique function <f> k used in the summation shown above. The components of g

corresponding to a particular term k have indices i in a set denoted A k . Therefore, the combination sum may be written as follows:

This summation is implemented in the modulation stage 340 shown in FIG. 3. As introduced above, the values of the a k , d k , and the A k are encoded in the fixed parameters .

[091] Note that the parameterization of the predistorter 130 (see FIG. 1) is focused on the specification of the functions ( ) . In a preferred embodiment, these functions are implemented in the lookup table stage 330. The other parts of the predistorter, including the selection of the particular components of w that are formed in the complex transformation component 210, the particular components of r that are formed in the real transformation component 220, and the selection of the particular functions ( ) that are combined in the combination stage 230, are fixed and do not depend on the values of the adaptation parameters x .

[092] One efficient approach to implementing the lookup table stage 330 is to restrict each of the functions ) to have a piecewise constant or piecewise linear form. Because the argument to each of these functions is one of the components of r , the argument range is restricted to [0,l] , the range can be divided into 2 s sections, for example, 2 s equal sized sections with boundaries at z ' 2 5 for i e {0,1,...,2 s } . In the case of piecewise constant function, the function can be represented in a table with 2 s complex values, such that evaluating the function for a particular value of r j involves retrieving one of the values. In the case of piecewise linear functions, a table with

1 + 2 s values can represent the function, such that evaluating the function for a particular value of r j involves retrieving two values from the table for the boundaries of the section that r j is within, and appropriately linearly interpolating the retrieved values.

[093] Referring to FIG. 5, one implementation of the lookup table stage 330, in this illustration for piecewise constant functions, makes use of a set of tables (or parts of one table) 510-512. Table 510 has one row for each function f> k ) , table 511 has one row for each function 2 (r 2 ) > an d so forth. That is, each row represents the endpoints of the linear segments of the piecewise linear form of the function. In such an arrangement, each of the tables 510-512 will in general have a different number of rows. Also, it should be understood that such an arrangement of separate tables is logical, and the implemented data structures may be different, for example, with a separate array of endpoint values for each function, not necessarily arranged in tables as shown in FIG. 5. To implement the mapping from r to g , each element is used to select a corresponding column in the j th table, and the values in that column are retrieved to form a portion of g . For example, the r{ h column 520 is selected for the first table 410, and the values in that column are retrieved as g \ ,g 2 ,. This process is repeated for the r d column 421 of table 511, the r 2 d column 522 of table 512 and so forth to determine all the component values of g . In an embodiment in which piecewise linear functions are used, two columns may be retrieved, and the values in the columns are linearly interpolated to form the corresponding section of g . It should be understood that the table structure illustrated in FIG. 5 is only one example, and that other analogous data structures may be used within the general approach of using lookup tables rather than extensive use of arithmetic functions to evaluate the functions j ( ) . It should be recognized that while the input r p is real, the output g t is complex. Therefore the cells of the table can be considered to hold pairs of values for the real and imaginary parts of the output, respectively.

[094] The lookup table approach can be applied to piecewise linear function, as illustrated in FIG. 6A for one representative transformation . The value r p is first processed in a quantizer 630, which determines which segment r p falls on, and output m p representing that segment. The quantizer also output a“fractional” part f p , which represents the location of r p in the interval for that segment. Each cell in the column 621 identified by m p has two quantities, which essentially define one endpoint and the slope of the segment. The slope is multiplied in a multiplier 632 by the fractional part f p , and the product is added in an adder 634 to yield the value g k .

Of course this is only one implementation, and different arrangements of the values stored in the table 611, or in multiple tables, and the arrangement of the arithmetic operators on selected values from the table to yield the value g may be used. FIG. 6B shows another arrangement for use with piecewise linear functions. In this arrangement, the output m p selects two adjacent columns of the table, which represent the two endpoint values. Such an arrangement reduces the storage by a factor of two as compared to the arrangement of FIG. 6A. However, because the slope of the linear segments are not stored, an adder 635 is used to take the difference between the endpoint values, and then this difference is multiplied by f and added to one of the endpoint values in the manner of FIG. 6A.

[095] As introduced above, the particular constructions used to assemble the complex signals vt¾ and real signals r k may be based on trial-and-error, analytical prediction of impact of various terms, heuristics, and/or a search or combinatorial optimization to select the subset for a particular situation (e.g., for a particular power amplifier, transmission band, etc.). One possible optimization approach may make use of greedy selection of productions to add to a set of and r k signals according to their impact on an overall distortion measure. [096] Very generally, the parameters x of the predistorter 130 (see FIG. 1), which implements the compensation function C , may be selected to minimize a distortion between a desired output (i.e., the input to the compensator) «[.] , and the sensed output of the power amplifier y[.] . For example, the parameters x , which may be the values defining the piecewise constant or piecewise linear functions f , are updated, for example, in a gradient-based iteration based on a reference pair of signals (¾[.], y[.]) , for example, adjusting the values of the parameters such that «[.] = y[.\ . In some examples that make use of tables, for example with 2 entries, to encode the non-linear functions </¾ ( ) , each entry may be estimated in the gradient procedure. In other examples, a smoothness or other regularity is enforced for these functions by limiting the number of degrees of freedom to less than 2 , for example, by estimating the non-linear function as a being in the span (linear combination) of a set of smooth basis functions. After estimating the combination of such functions, the table is then generated.

[097] Therefore, the adaptation section 160 essentially determines the parameters used to compute the distortion term as d[ί ~ \ = A(u[t - r], ..., u[t - 1]) in the case that t delayed values of the input u are used. More generally, d delayed values of the input and ry look-ahead values of the input are used. This range of inputs is defined for notational conveniences as q u [t\ = {u[t - T d \,...,u[t + zy]) . (Note that with the optional use of the terms e[t\ , these values are also included in the ¾([/]) term.) This term is parameterized by values of a set of complex parameters x , therefore the function of the predistorter can be expressed as

A*] = C(q u [/]) = u[t] + A{q u [/])

One or more approaches to determining the values of the parameter $x$ that define the function d ) are discussed below.

[098] The distortion term can be viewed in a form as being a summation

where the a b are complex scalars, and B h ( ) can be considered to be basis functions evaluated with the argument q u [/] . The quality of the distortion term generally relies on there being sufficient diversity in the basis functions to capture the non-linear effects that may be observed. However, unlike some conventional approaches in which the basis functions are fixed, and the terms a b are estimated directly, or possibly are represented as functions of relatively simple arguments such as | u[t] \ , in approaches described below, the equivalents of the basis functions B b ( ) are themselves parameterized and estimated based on training data. Furthermore, the structure of this parameterization provides both a great deal of diversity that permits capturing a wide variety of non-linear effects, and efficient runtime and estimation approaches using the structure.

[099] As discussed above, the complex input u[t] to produce a set of complex signals w¾[/] using operations such as complex conjugation and multiplication of delayed versions of u[t ~ \ or other w¾[/] . These complex signals are then processed to form a set of phase-invariant real signals r p [t\ using operations such as magnitude, real, or imaginary parts, of various w¾[/] or arithmetic combinations of other r p [t\ signals. In some examples, these real values are in the range [0,1.0] or [-1.0, 1.0], or in some other predetermined bounded range. The result is that the real signals have a great deal of diversity and depend on a history of u[t\ , at least by virtue of at least some of the vt¾[/] depending on multiple delays of u[t\ . Note that computation of the w¾[/] and r p [t] can be performed efficiently. Furthermore, various procedures may be used to retain only the most important of these terms for any particular use case, thereby further increasing efficiency.

[0100] Before turning to the parameter estimation approach, recall that the distortion term can be represented as

where r\t ~ \ represents the entire set of the r p [t] real quantities, and F( ) is a parameterized complex function. For efficiency of computation, this non-linear function is separated into terms that each depend on a single real value as

[0101] For parameter estimation purposes, each of the non-linear functions f{ ) may be considered to be made up of a weighted sum of the fixed kernels (r ) , discussed above with reference to FIGS. 4A-D, such that

Introducing the kernel form of non-linear functions into the definition of the distortion term yields

In this form representing the triple ( k,pj ) as b , the distortion term can be expressed as

where

It should be recognized that for each time t , the complex values ¾[/] depends on the fixed parameters z and the input u over a range of times, but does not depend on the adaptation parameters x . Therefore the complex values ¾[/] for all the

combinations b = ( k , p, /) can be treated used in place of the input in the adaptation procedure.

[0102] An optional approach extends the form of the distortion term to introduce linear dependence on a set of parameter values, p [/],..., p [/] , which may, for example be obtained by monitoring temperature, power level, modulation center frequency, etc. In some cases, the envelope signal e\t\ may be introduced as a parameter. Generally, the approach is to augment the set of non-linear functions according to a set of environmental parameters so that essentially each function <t>k, P ( r )

is replaced with d linear multiples to form d + 1 functions

This essentially forms the set of basis functions

B b (¾ M) = w a k [t - dk] bi (h [/]) p d [/]

where b represents the tuple ( k,p,l,d ) and r ϋ = 1 .

[0103] What should be evident is that this form achieves a high degree of diversity in the functions B b ( ) , without incurring runtime computational cost that may be associated with conventional techniques that have a comparably diverse set of basis functions. Determination of the parameter values x b generally can be implemented in one of two away: direct and indirect estimation. In direct estimation, the goal is to adjust the parameters x according to the minimization:

x < arg mm |C(w) - (v-y + M)|| = arg mm å |D(¾[f - (n[/] - f| where the minimization varies the function A(q u [t]) while the terms q u [t] , v[/] , and y\t\ are fixed and known. In indirect estimation, the goal is to determine the parameters x according to the minimization

x < arg arg

Where q y [t\ is defined in the same manner as q u [t\ , except using y rather than u .

Solutions to both the direct and indirect approaches are similar, and the indirect approach is described in detail below.

[0104] Adding a regularization term, an objective function for minimization in the indirect adaptation case may be expressed as

where e[t\ = v[t] - y[t ~ \ . This can be expressed in vector/matrix form as where

Using the form, following matrices can be computed:

From these, one approach to updating the parameters x is by a solution

x <- ( pl n + G) ~l L

where I n denotes an n / n identity.

[0105] In some examples, the Gramian, G , and related terms above, are accumulated over a sampling interval T , and then the matrix inverse is computed. In some examples, the terms are updated in a continual decaying average using a“memory Gramian” approach. In some such examples, rather than computing the inverse at each step, a coordinate descent procedure is used in which at each iteration, only one of the components of x is updated, thereby avoiding the need to perform a full matrix inverse, which may not be computationally feasible in some applications.

[0106] As an alternative to the solution above, a stochastic gradient approach may be used implementing:

where is a step size that is selected adaptively. For example, one or more samples of the signals are selected a fixed or random intervals to update the parameters or a buffer of past pairs (q y [t\, v[/]) is maintained, for example, by periodic updating, and random samples from the buffer are selected to update the parameter values using the gradient update equation above.

[0107] Yet other adaptation procedures that may be used in conjunction with the approaches presented in this document are described in co-pending U.S. Application No. 16/004,594, titled“Linearization System,” filed on June 11, 2018, which is incorporated herein by reference.

[0108] Returning to the selection of the particular terms to be used for a device to be linearized, which are represented in the fixed parameters z , which includes the selection of the particular w k terms to generate, and then the particular r p to generate from the w k , and then the particular subset of r p to use to weight each of the w k in the sum yielding the distortion term, uses a systematic methodology. One such methodology is performed when a new device (a“device under test”, DUT) is evaluated for linearization. For this evaluation, recorded data sequences («[.], y[.]) and/or (v[.],y[.]) are collected. A predistorter structure that includes a large number of terms, possibly an exhaustive set of terms within a constrain on delays, number of w k and r p terms etc. is constructed. The least mean squared (LMS) criterion discussed above is used to determine the values of the exhaustive set of parameters x . Then, a variable selection procedure is used and this set of parameters is reduced, essentially, by omitting terms that have relatively little impact on the distortion term c [.] . One way to make this selection uses the LASSO (least absolute shrinkage and selection operator) technique, which is a regression analysis method that performs both variable selection and regularization, to determine which terms to retain for use in the runtime system. In some implementations, the runtime system is configured with the parameter values x determined at this stage. Note that it should be understood that there are some uses of the techniques described above that omit the adapter completely (i.e., the adapter is a non-essential part of the system), and the parameters are set one (e.g., at manufacturing time), and not adapted during operation, or may be updated from time to time using an offline parameter estimation procedure.

[0109] An example of applying the techniques described above starts with the general description of the distortion term The complex signals derived from the input, and the real signals derived from the complex signals are chosen do that the full distortion term has the following full form:

This form creates a total of 198 (=121+22+55) terms. In an experimental example, this set of terms is reduced from 198 terms to 6 terms using a LASSO procedure. These remaining 6 terms result in the distortion term having the form:

This form is computationally efficient because only 6 w k complex signals and 6 real signals r p terms that must be computed at each time step. If each non-linear transformation is represented by 32 linear segments, then the lookup tables have a total of 6 times 33, or 198 complex values. If each non-linear function is represented by 32 piecewise segments defined by 6 kernels, then there are only 36 complex parameter values that need to be adapted (i.e., 6 scale factors for the kernels of each non-linear function, and 6 such non-linear functions).

[0110] The techniques described above may be applied in a wide range of radio frequency communication systems. For example, approach illustrated in FIG. 1 may be used for wide area (e.g., cellular) base stations to linearize transmission of one or more channels in a system adhering to standard, such as 3GPP or IEEE standards (implemented over licensed and unlicensed frequency bands), pre-5G and 5G New Radio (NR), etc. Similarly, the approach can be implemented in a mobile station (e.g., a smartphone, handset, mobile client device (e.g., a vehicle), fixed client device, etc.). Furthermore, the techniques are equally applicable to local area communication (e.g., “WiFi”, the family of 802.11 protocols, etc.) as they are to wide area communication. Furthermore, the approaches can be applied to wired rather than wireless

communication, for example, to linearize transmitters in coaxial network distribution, for instance to linearize amplification and transmission stages for DOCSIS (Data Over Cable Service Interface Specification) head ends system and client modems. Yet other applications are not necessarily related to electrical signals, and the techniques may be used to linearize mechanical or acoustic actuators (e.g., audio speakers), and optical transmission systems. Finally, although described above in the context of linearizing a transmission path, with a suitable reference signal representing a transmission (e.g. predefine pilot signal patterns) the approach may be used to linearize a receiver, or to linearize a combined transmitter-channel-receiver path.

[0111] A summary of a typical use case of the approaches described above is as follows. First, initial data sequences («[.], [.]) and/or (v[.],y[.]) , as well as corresponding sequences e[.] and p[.\ in implementations that make use of these optional inputs, are obtained for a new type of device, for example, for a new cellular base station or a smartphone handset. Using this data, a set of complex signals w k and real signals r p are selected for the runtime system, for example, based on an ad hoc selection approach, or an optimization such as using the LASSO approach. In this selection stage, computational constraints for the runtime system are taken into account so that the computational limitations are not exceeded and/or performance requirements are met. Such computational requirements may be expressed, for example, in terms computational operations per second, storage requirements, and/or for hardware implementations in terms of circuit area or power requirements. Note that there may be separate limits on the computational constraints for the predistorter 130, which operates on every input value, and on the adapter, which may operate only from time to time to update the parameters of the system. Having determined the terms to be used in the runtime system, a specification of that system is produced. In some implementations, that specification includes code that will execute on a processor, for example, an embedded processor for the system. In some

implementations, the specification includes a design structure that specifies a hardware implementation of the predistorter and/or the adapter. For example, the design structure may include configuration data for a field-programmable gate array (FPGA), or may include a hardware description language specific of an application- specific integrated circuit (ASIC). In such hardware implementations, the hardware device includes input and output ports for the inputs and outputs shown in FIG. 1 for the predistorter and the adapter. In some examples, the memory for the predistorter is external to the device, while in other examples, it is integrated into the device. In some examples, the adapter is implemented in a separate device than the predistorter, in which case the predistorter may have a port for receiving updated values of the adaption parameters.

[0112] In some implementations, a computer accessible non-transitory storage medium includes instructions for causing a digital processor to execute instructions implementing procedures described above. The digital processor may be a general- purpose processor, a special purpose processor, such as an embedded processor or a controller, and may be a processor core integrated in a hardware device that implements at least some of the functions in dedicated circuitry (e.g., with dedicated arithmetic units, storage registers, etc.). In some implementations, a computer accessible non-transitory storage medium includes a database representative of a system including some or all of the components of the linearization system. Generally speaking, a computer accessible storage medium may include any non-transitory storage media accessible by a computer during use to provide instructions and/or data to the computer. For example, a computer accessible storage medium may include storage media such as magnetic or optical disks and semiconductor memories.

Generally, the database representative of the system may be a database or other data structure which can be read by a program and used, directly or indirectly, to fabricate the hardware comprising the system. For example, the database may be a behavioral- level description or register-transfer level (RTL) description of the hardware functionality in a high-level design language (HDL) such as Verilog or VHDL. The description may be read by a synthesis tool which may synthesize the description to produce a netlist comprising a list of gates from a synthesis library. The netlist comprises a set of gates that also represent the functionality of the hardware comprising the system. The netlist may then be placed and routed to produce a data set describing geometric shapes to be applied to masks. The masks may then be used in various semiconductor fabrication steps to produce a semiconductor circuit or circuits corresponding to the system. In other examples, the database may itself be the netlist (with or without the synthesis library) or the data set.

[0113] It is to be understood that the foregoing description is intended to illustrate and not to limit the scope of the invention, which is defined by the scope of the appended claims. Reference signs, including drawing reference numerals and/or algebraic symbols, in parentheses in the claims should not be seen as limiting the extent of the matter protected by the claims; their sole function is to make claims easier to understand by providing a connection between the features mentioned in the claims and one or more embodiments disclosed in the Description and Drawings. Other embodiments are within the scope of the following claims.