Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
REFERENCE SIGNAL PACKING FOR WIRELESS COMMUNICATIONS
Document Type and Number:
WIPO Patent Application WO/2017/147439
Kind Code:
A1
Abstract:
In a wireless communication network, pilot signals are transmitted over a wireless communication channel by determining a maximum delay spread for a transmission channel, determining a maximum Doppler frequency spread for the transmission channel, and allocating a set of transmission resources in a time-frequency domain to a number of pilot signals based on the maximum delay spread and the maximum Doppler frequency spread.

Inventors:
HADANI RONNY (US)
RAKIB SHLOMO SELIM (US)
MONK ANTON (US)
TSATSANIS MICHAIL (US)
HEBRON YOAV (US)
Application Number:
PCT/US2017/019376
Publication Date:
August 31, 2017
Filing Date:
February 24, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
COHERE TECH INC (US)
International Classes:
H04B1/10; H04B1/46; H04B1/69; H04B1/707; H04L12/28; H04W24/08
Domestic Patent References:
WO2014126519A12014-08-21
WO2016014596A12016-01-28
Foreign References:
US20040131110A12004-07-08
US20060133381A12006-06-22
US20060276143A12006-12-07
US20120269232A12012-10-25
US20140269357A12014-09-18
US20080137788A12008-06-12
US20040131110A12004-07-08
Other References:
"3GPP DRAFT; RWS-150034", 3 September 2015, 3RD GENERATION PARTNERSHIP PROJECT (3GPP, article "5G Air Interface Waveforms"
See also references of EP 3420641A4
Attorney, Agent or Firm:
SATHE, Vinay (US)
Download PDF:
Claims:
Claims

1 . A wireless communication method, implemented by a wireless communication device, comprising:

determining a maximum delay spread for a transmission channel;

determining a maximum Doppler frequency spread for the transmission channel; allocating a set of transmission resources in a time-frequency domain to a number of pilot signals based on the maximum delay spread and the maximum Doppler frequency spread; and

transmitting the pilot signals over a wireless communication channel using transmission resources.

2. The method of claim 1 , wherein the allocating the set of transmission resources includes:

staggering transmission resources for the number of pilots with respect to each other such that at least some pilots occupy transmission resources that do not occur on a rectangular grid in the delay-Doppler domain.

3. The method of claim 2, wherein the staggering includes staggering every other pilot.

4. The method of claim 1 , wherein the set of transmission resources in the time- frequency domain occupied by any given pilot signal corresponds to a lattice comprising time instances uniformly distributed along a time axis and having a first step size and frequencies uniformly distributed along a frequency axis and having a second step size.

5. The method of claim 1 , wherein the set of transmission resources in the time- frequency domain occupied by the pilot signal correspond to a lattice comprising time instances non-uniform ly distributed along a time axis.

6. The method of claim 1 , wherein the set of transmission resources in the time- frequency domain occupied by at least one pilot signal correspond to a lattice comprising frequencies that are non-uniformly distributed along a frequency axis.

7. The method of claim 1 , wherein the set of transmission resources in the time- frequency domain occupied by at least one pilot signal are non-overlapping with another set of resources in the time-frequency domain over which user data is transmitted by the wireless communication device.

8. The method of claim 1 , wherein the transmitting the pilot signal includes transmitting the pilot signal to a given user equipment prior to transmitting data to the user equipment.

9. The method of claim 1 , wherein the generating the pilot signal includes:

scrambling a basis signal using a two-dimensional (2-D) chirp sequence.

10. The method of claim 1 , wherein each pilot signal corresponds to a delta function in a delay-Doppler domain.

1 1 . The method of claim 1 , wherein each pilot signal corresponds to a different cyclic shift in a time domain and. /or frequency domain of a root 2-D Zadoff-Chu sequence.

12. The method of claim 1 , wherein the transmitting the pilot signal is performed continuously, regardless of data transmissions.

13. The method of claim 1 , wherein the wireless communication device includes a base station, the method further including pre-coding data prior to data transmissions.

14. The method of claim 1 , wherein the wireless communication device includes a base station, the method further including generating at least two pilot signals occupying two sets of transmission resources non-overlapping in the time-frequency domain.

15. The method of claim 14, further including:

individually transmitting the at least two pilot signals to two different user equipment at time instances that are non-overlapping with each other.

16. The method of claim 14, further including:

individually transmitting the at least two pilot signals from two different user equipment at time instances that are non-overlapping with each other.

17. The method of claim 1 , wherein the wireless communication device includes a base station, the method further including generating at least two pilot signals occupying two sets of transmission resources are non-overlapping in the delay-Doppler domain.

18. The method of claim 14, wherein the at least two pilot signals use non-overlapping delay domain resources.

19. The method of claim 14, wherein the at least two pilot signals use non-overlapping Doppler-domain resources.

20. The method of claim 1 , wherein the wireless communication device includes a user equipment, and wherein the set of transmission resources are specified to the wireless communication device in a upper layer message.

21 . A wireless communication method, implemented by a wireless communication device, comprising:

determining a maximum delay spread for a transmission channel;

determining a maximum Doppler frequency spread for the transmission channel; determining a number of pilot signals that can be transmitted using a set of two- dimensional transmission resources at least based on the maximum delay spread and the maximum Doppler frequency spread;

allocating the set of transmission resources from a two-dimensional set of resources to the number of pilot; and transmitting the pilot signals over a wireless communication channel using transmission resources.

22. The method of claim 21 , wherein the determining the number of pilot signals further includes determining the number of pilot signals based on one or more of a number of receivers to send the pilot signals to, a number of transmission layers used for

transmissions to the receivers, a number of receivers that are also transmitting pilot signals, and possible interference from another cell's pilot signals.

23. The method of claim 21 , wherein the allocating the pilot signals includes

determining an observation window in a time-frequency domain for the pilot signals.

24. The method of claim 21 , wherein the allocating the set of transmission resources includes:

staggering transmission resources for the number of pilots with respect to each other such that at least some pilots occupy transmission resources that do not occur on a rectangular grid in the delay-Doppler domain.

25. The method of claim 24, wherein the staggering is performed by shifting locations of staggered pilots from non-staggered pilots to maximize a distance in a dimension of the shift.

26. A wireless communication apparatus, comprising:

a memory storing instructions;

a processor; and

a transmitter communicatively coupled to the memory and the processor; wherein the memory stores instructions for causing the processor to implement a method recited in any of claims 1 to 25.

27. An apparatus for receiving reference signals disclosed in the present document, processing the received reference signals, and modifying an operation of the apparatus based on the processing of the reference signal.

28. A method, apparatus or computer program product disclosed in the present document.

Description:
REFERENCE SIGNAL PACKING FOR WIRELESS COMMUNICATIONS CROSS-REFERENCE TO RELATED APPLICATION(S)

[1] This patent document claims the benefit of priority from U. S. Provisional

Patent Applications 62/303,318, filed March 3, 2016, and 62/299,985, filed February 25, 2016. All of the aforementioned patent applications are incorporated by reference herein in their entirety.

TECHNICAL FIELD

[2] This document relates to the field of telecommunications, in particular, estimation and compensation of impairments in telecommunications data channels.

BACKGROUND

[3] Due to an explosive growth in the number of wireless user devices and the amount of wireless data that these devices can generate or consume, current wireless communication networks are fast running out of bandwidth to accommodate such a high growth in data traffic and provide high quality of service to users.

[4] Various efforts are underway in the telecommunication industry to come up with next generation of wireless technologies that can keep up with the demand on performance of wireless devices and networks.

SUMMARY

[5] Various techniques for pilot packing, e.g., assigning transmission resources to a number of pilot signals for transmission, are disclosed. The disclosed techniques provide various operational advantages, including, for example, pilot packing by staggering pilots to achieve improved separation among the pilots, providing a number of pilot that is commensurate with the target delay-Doppler spread to be combated in the wireless channel, thereby optimally using the available transmission bandwidth, and so on.

[6] In one example aspect, a wireless communication method is disclosed. Using the method, pilot signals are transmitted over a wireless communication channel by determining a maximum delay spread for a transmission channel, determining a maximum Doppler frequency spread for the transmission channel, and allocating a set of transmission resources in a time-frequency domain to a number of pilot signals based on the maximum delay spread and the maximum Doppler frequency spread.

[7] In another aspect, a method of wireless communication includes determining a maximum delay spread for a transmission channel, determining a maximum Doppler frequency spread for the transmission channel, determining a number of pilot signals that can be transmitted using a set of two-dimensional transmission resources at least based on the maximum delay spread and the maximum Doppler frequency spread, allocating the set of transmission resources from a two-dimensional set of resources to the number of pilot, and transmitting the pilot signals over a wireless communication channel using transmission resources.

[8] In yet another aspect, a wireless communication apparatus comprising a memory, a processor and a transmitter is disclosed. The wireless communication apparatus may implement any of the above-described methods and other associated techniques described in the present document.

[9] These, and other aspects, are described in greater detail in the present document.

BRIEF DESCRIPTION OF THE DRAWINGS

[10] Figure 1 shows an example trajectory of Time Varying Impulse Response for

Accelerating Reflector.

[11] Figure 2 shows an example of Delay-Doppler Representation for an

Accelerating Reflector Channel.

[12] Figure 3 depicts example Levels of Abstraction: Signaling over the (i) actual channel with a signaling waveform (ii) the time-frequency Domain (iii) the delay-Doppler Domain.

[13] Figure 4 shows examples of notation Used to Denote Signals at Various

Stages of Transmitter and Receiver. [14] Figure 5 depicts an example of a conceptual Implementation of the

Heisenberg Transform in the Transmitter and the Wigner Transform in the Receiver.

[15] Figure 6 shows an example of cross-correlation between g tr t) and g r (t) for

OFDM Systems.

[16] Figure 7 shows an example of Information Symbols in the Information (Delay-

Doppler) Domain (Right), and Corresponding Basis Functions in the Time-Frequency Domain (Left).

[17] Figure 8 shows a One Dimensional Multipath Channel Example: (i) Sampled

Frequency Response at Af = 1 Hz (ii) Periodic Fourier Transform with Period l/Af = 1 sec (iii) Sampled Fourier Transform with Period l/Af and Resolution l/MAf.

[18] Figure 9 shows a One Dimensional Doppler Channel Example: (i) Sampled

Frequency Response at T s = 1 sec (ii) Periodic Fourier Transform with Period 1/T S = 1 Hz (iii) Sampled Fourier Transform with Period 1/T S and Resolution 1/NT S .

[19] Figure 10 depicts an example of a time-Varying Channel Response in the

Time-Frequency Domain

[20] Figure 1 1 depicts an example SDFT of Channel response - (τ, ν) Delay-

Doppler Domain.

[21] Figure 12 depicts an example SFFT of Channel Response - Sampled (τ, ν)

Delay-Doppler Domain.

[22] Figure 13 depicts an example of Transformation of the Time-Frequency Plane to the Doppler-Delay Plane.

[23] Figure 14 depicts an example of a Discrete Impulse in the OTFS Domain Used for Channel Estimation.

[24] Figure 15 shows an example of Different Basis Functions, Assigned to

Different Users, Span the Whole Time-Frequency Frame.

[25] Figure 16 shows an example embodiment of multiplexing three users in the

Time-Frequency Domain. [26] Figure 17 shows an example embodiment of Multiplexing three users in the

Time-Frequency Domain with Interleaving.

[27] Figure 18 shows an example of an OTFS architecture block diagram.

[28] Figure 19 shows an example of a t-f pilot lattice (1902) superimposed on a data lattice (1904) when N=14, M=2.

[29] Figure 20 shows examples of OTFS based pilots, [a] shows 10 pilots on the

Delay-Doppler plane associated with the data plane, [b] shows the samples of the real portion of P3 = δ ° ' °^ 57 ) on the t-f data lattice of Figure 19, [c] shows the representation of the same 10 pilots on the Delay-Doppler plane associated with the coarser pilot lattice of Figure 19 (N=14, M=2), and d shows the samples of the real portion of P3 on the t-f pilot lattice.

[30] Figure 21 shows an example of 4x2 pilots in a staggered structure on Delay

Doppler plane.

[31] Figure 22 shows an example of a pilot lattice with N=28 and M=1 superimposed on the data lattice.

[32] Figure 23 shows an example embodiment with 12 pilot lattices with N=28,

M=12 each, superimposed on the data lattice.

[33] Figure 24 shows an example embodiment with 2 pilot lattices with N=28 and

M=2 superimposed on the data lattice.

[34] Figure 25 shows an example of UL (2502) and DL (2504) pilot sample points on a data lattice (2506).

[35] Figure 26 shows an example of packing 10x4=40 pilots on the Delay-Doppler plane associated with the pilot lattice of N=28, M=1 (2504 points of the lattice of Figure 25)

[36] Figure 27 shows an example of a torus.

[37] Figure 28 shows an example of a sampled Delay-Doppler data plane showing the absolute value of an instantiation of the time-domain cyclic shift portion of the 4 LTE UL DM RSs. [38] Figure 29 shows an example staggered version of the 4 LTE DM RSs shown in Figure 28.

[39] Figure 30 shows a graphical depiction of an example of average SNR of estimated 4 ETU-50 channels (using MMSE interpolation) for the 4 DM RSs of Figure 28 (3004) and Figure 29 (3002) when receiver input SNR is 50dB.

[40] Figure 31 shows an example of pilots' sample points (3102) on a data lattice

(3104).

[41] Figure 32 shows an example communication network in which the disclosed technology can be embodied.

[42] Figure 33 shows a flowchart of an example method of wireless communication.

[43] Figure 34 is a block diagram of an example of a wireless communication apparatus.

[44] Figure 35 shows a flowchart of an example method of wireless communication.

[45] Figure 36 is a block diagram of an example of a wireless communication apparatus that can be used for embodying some techniques disclosed in this patent document.

DETAILED DESCRIPTION

[46] Section headings are used in this document to help improve readability and do not limit scope of the technology discussed in each section only to that section. Furthermore, for ease of explanation, a number of simplifying assumptions have been made. Although these simplifying assumptions are intended to help convey ideas, they are not intended to be limiting. Some of these simplifying assumptions are:

[47] 1. Introduction

[48] 4G wireless networks have served the public well, providing ubiquitous access to the internet and enabling the explosion of mobile apps, smartphones and sophisticated data intensive applications like mobile video. This continues an honorable tradition in the evolution of cellular technologies, where each new generation brings enormous benefits to the public, enabling astonishing gains in productivity, convenience, and quality of life.

[49] Looking ahead to the demands that the ever increasing and diverse data usage is putting on the network, it is becoming clear to the industry that current 4G networks will not be able to support the foreseen needs in the near term future. The data traffic volume has been and continues to increase exponentially. AT&T reports that its network has seen an increase in data traffic of 100,000% in the period 2007-2015. Looking into the future, new applications like immersive reality, and remote robotic operation (tactile internet) as well as the expansion of mobile video are expected to overwhelm the carrying capacity of current systems. One of the goals of 5G system design is to be able to economically scale the network to 750 Gbps per sq. Km in dense urban settings, something that is not possible with today's technology.

[50] Beyond the sheer volume of data, the quality of data delivery will need to improve in next generation systems. The public has become accustomed to the ubiquity of wireless networks and is demanding a wireline experience when untethered. This translates to a requirement of 50+ Mbps everywhere (at the cell edge), which will require advanced interference mitigation technologies to be achieved.

[51] Another aspect of the quality of user experience is mobility. Current systems' throughput is dramatically reduced with increased mobile speeds due to Doppler effects which evaporate MIMO capacity gains. Future 5G systems aim to not only increase supported speeds up to 500 Km/h for high speed trains and aviation, but also support a host of new automotive applications for vehicle-to-vehicle and vehicle-to-infrastructure communications.

[52] While the support of increased and higher quality data traffic is necessary for the network to continue supporting the user needs, carriers are also exploring new applications that will enable new revenues and innovative use cases. The example of automotive and smart infrastructure applications discussed above is one of several. Others include the deployment of public safety ultra-reliable networks, the use of cellular networks to support the sunset of the PSTN, etc. The biggest revenue opportunity however, is arguably the deployment of large number of internet connected devices, also known as the internet of things (loT). Current networks however are not designed to support a very large number of connected devices with very low traffic per device.

[53] In summary, current LTE networks cannot achieve the cost/performance targets required to support the above objectives, necessitating a new generation of networks involving advanced PHY technologies. There are numerous technical challenges that will have to be overcome in 5G networks as discussed next.

[54] 1.1 4G Technical Challenged

[55] In order to enable machine-to-machine communications and the realization of the internet of things, the spectral efficiency for short bursts will have to be improved, as well as the energy consumption of these devices (allowing for 10 years operation on the equivalent of 2 AA batteries). In current LTE systems, the network synchronization requirements place a burden on the devices to be almost continuously on. In addition, the efficiency goes down as the utilization per UE (user equipment, or mobile device) goes down. The PHY requirements for strict synchronization between UE and eNB (Evolved Node B, or LTE base station) will have to be relaxed, enabling a re-designing of the MAC for loT connections that will simplify transitions from idle state to connected state.

[56] Another important use case for cellular loT (CloT) is deep building penetration to sensors and other devices, requiring an additional 20dB or more of dynamic range. 5G CloT solutions should be able to coexist with the traditional high-throughput applications by dynamically adjusting parameters based on application context.

[57] The path to higher spectral efficiency points towards a larger number of antennas. A lot of research work has gone into full dimension and massive MIMO architectures with promising results. However, the benefits of larger MIMO systems may be hindered by the increased overhead for training, channel estimation and channel tracking for each antenna. A PHY that is robust to channel variations will be needed as well as innovative ways to reduce the channel estimation overhead.

[58] Robustness to time variations is usually connected to the challenges present in high Doppler use cases such as in vehicle-to-infrastructure and vehicle-to-vehicle automotive applications. With the expected use of spectrum up to 60GHz for 5G applications, this Doppler impact will be an order of magnitude greater than with current solutions. The ability to handle mobility at these higher frequencies would be extremely valuable.

[59] 1.2 OTFS Based Solution

[60] OTFS is a modulation technique that modulates each information (e.g., QAM) symbol onto one of a set of two dimensional (2D) orthogonal basis functions that span the bandwidth and time duration of the transmission burst or packet. The modulation basis function set is specifically derived to best represent the dynamics of the time varying multipath channel.

[61] OTFS transforms the time-varying multipath channel into a time invariant delay-Doppler two dimensional convolution channel. In this way, it eliminates the difficulties in tracking time-varying fading, for example in high speed vehicle communications.

[62] OTFS increases the coherence time of the channel by orders of magnitude. It simplifies signaling over the channel using well studied AWGN codes over the average channel SNR. More importantly, it enables linear scaling of throughput with the number of antennas in moving vehicle applications due to the inherently accurate and efficient estimation of channel state information (CSI). In addition, since the delay-doppler channel representation is very compact, OTFS enables massive MIMO and beamforming with CSI at the transmitter for four, eight, and more antennas in moving vehicle applications. The CSI information needed in OTFS is a fraction of what is needed to track a time varying channel.

[63] In deep building penetration use cases, one QAM symbol may be spread over multiple time and/or frequency points. This is a key technique to increase processing gain and in building penetration capabilities for CloT deployment and PSTN replacement applications. Spreading in the OTFS domain allows spreading over wider bandwidth and time durations while maintaining a stationary channel that does not need to be tracked over time. [64] Loose synchronization: CoMP and network MIMO techniques have stringent clock synchronization requirements for the cooperating eNBs. If clock frequencies are not well synchronized, the UE will receive each signal from each eNB with an apparent "Doppler" shift. OTFS's reliable signaling over severe Doppler channels can enable CoMP deployments while minimizing the associated synchronization difficulties.

[65] These benefits of OTFS will become apparent once the basic concepts behind

OTFS are understood. There is a rich mathematical foundation of OTFS that leads to several variations; for example it can be combined with OFDM or with multicarrier filter banks. In this paper we navigate the challenges of balancing generality with ease of understanding as follows:

[66] In Section 2 we start by describing the wireless Doppler multipath channel and its effects on multicarrier modulation.

[67] In Section 3, we develop OTFS as a modulation that matches the characteristics of the time varying channel. We show OTFS as consisting of two processing steps:

[68] A step that allows transmission over the time frequency plane, via orthogonal waveforms generated by translations in time and/or frequency. In this way, the (time- varying) channel response is sampled over points of the time-frequency plane.

[69] A pre-processing step using carefully crafted orthogonal functions employed over the time-frequency plane, which translate the time-varying channel in the time- frequency plane, to a time-invariant one in the new information domain defined by these orthogonal functions.

[70] In Section 4 we develop some more intuition on the new modulation scheme by exploring the behavior of the channel in the new modulation domain in terms of coherence, time and frequency resolution etc.

[71] In Sections 5 and 6 we explore aspects of channel estimation in the new information domain and multiplexing multiple users respectively, while in Section 7 we address complexity and implementation issues. [72] In Sections 8, we provide some performance results and we put the OTFS modulation in the context of cellular systems, discuss its attributes and its benefits for 5G systems.

[73] 2. The Wireless Channel

[74] The multipath fading channel is commonly modeled in the baseband as a convolution channel with a time varying impulse response

[75] where s(t) and r(t) represent the complex baseband channel input and output respectively and where fa( , t) is the complex baseband time varying channel response.

[76] This representation, while general, does not give us insight into the behavior and variations of the time varying impulse response. A more useful and insightful model, which is also commonly used for Doppler multipath doubly fading channels is

[77] In this representation, the received signal is a superposition of reflected copies of the transmitted signal, where each copy is delayed by the path delay τ, frequency shifted by the Doppler shift v and weighted by the time-invariant delay-Doppler impulse response /ι(τ, ν) for that τ and v. In addition to the intuitive nature of this representation, Eq. ( 2 ) maintains the generality of Eq. ( 1 ). In other words it can represent complex Doppler trajectories, like accelerating vehicles, reflectors etc. This can be seen if we express the time varying impulse response as a Fourier expansion with respect to the time variable t

Α(τ, ί) = j h(x, v)ej 2nvt dt (3 }

[78] Substituting ( 3 ) in ( 1 ) we obtain Eq. ( 2 ) after some manipulation 1 . More specifically, we obtain y(t) = jj e j2nvT h(T, v)e j2nv ^ ~ ^x(t - τ)άνάτ which differs from the

1 More specifically we obtain y(t) = // e j2nvT h(r, v)e j2nv(J: T) x(t - τ)άνάτ which differs from Error! Reference source not found, by an exponential factor; however, we can above equations by an exponential factor; however, we can absorb the exponential factor in the definition of the impulse response /ι(τ, ν) making the two representations equivalent.

[79] As an example, Figure 1 shows the time-varying impulse response for an accelerating reflector in the (τ, t) coordinate system, while Figure 2 shows the same channel represented as a time invariant impulse response in the (τ,ν) coordinate system.

[80] An important feature revealed by these two figures is how compact the (τ,ν) representation is compared to the (τ, t) representation. This has important implications for channel estimation, equalization and tracking as will be discussed later.

[81] Notice that while /ι(τ,ν) is, in fact, time-invariant, the operation on s(t) is still time varying, as can be seen by the effect of the explicit complex exponential function of time in Eq. ( 2 ). The technical efforts in this paper are focused on developing a modulation scheme based on appropriate choice of orthogonal basis functions that render the effects of this channel truly time-invariant in the domain defined by those basis functions. Let us motivate those efforts with a high level outline of the structure of the proposed scheme here.

[82] Let us consider a set of orthonormal basis functions 0 TjV (t) indexed by τ, ν which are orthogonal to translation and modulation, i.e.,

e ;'27rVo t r,v(t) = 0τ,ν-ν ο (

[83] and let us consider the transmitted signal as a superposition of these basis functions

absorb the exponential factor in the definition of the impulse response /ι(τ, ν) making the two representations equivalent. [84] where the weights χ(τ, ν) represent the information bearing signal to be transmitted. After the transmitted signal of ( 5 ) goes through the time varying channel of Eq. ( 2 ) we obtain a superposition of delayed and modulated versions of the basis functions, which due to ( 4 ) results in

= r IrI φ τ ν (ΐ){1ι(τ, v) * χ(τ, v)}dzdv ( 6 )

[85] where * denotes two dimensional convolution. Eq. ( 6 ) can be thought of as a generalization of the derivation of the convolution relationship for linear time invariant systems, using one dimensional exponentials as basis functions. Notice that the term in brackets can be recovered at the receiver by matched filtering against each basis function 0 T V (t) . In this way a two dimensional channel relationship is established in the (τ, ν) domain y(r, v) = h(r, v) * χ(τ, ν) , where y(r, v) is the receiver two dimensional matched filter output. Notice also, that in this domain the channel is described by a time invariant two-dimensional convolution.

[86] A final different interpretation of the wireless channel will also be useful in what follows. Let us consider s(t) and r(t) as elements of the Hilbert space of square integrable functions Ή. Then Eq. ( 2 ) can be interpreted as a linear operator on Ή acting on the input s(t) , parametrized by the impulse response h(r, v), and producing the output r(t) r = n ft (s) : s(t) G K ^→ r(t) G K l 7 }

[87] Notice that although the operator is linear, it is not time-invariant. In the no

Doppler case, i.e., if /ι(ν, τ) = ι(0, τ)<5(ν) , then Eq. ( 2 ) reduces to a time invariant convolution. Also notice that while for time invariant systems the impulse response is parameterized by one dimension, in the time varying case we have a two dimensional impulse response. While in the time invariant case the convolution operator produces a superposition of delays of the input s(t) , (hence the parameterization is along the one dimensional delay axis) in the time varying case we have a superposition of delay-and- modulate operations as seen in Eq. ( 2 ) (hence the parameterization is along the two dimensional delay and Doppler axes). This is a major difference which makes the time varying representation non-commutative (in contrast to the convolution operation which is commutative), and complicates the treatment of time varying systems.

[88] The important point of Eq. ( 7 ) is that the operator h (-) can be compactly parametrized in a two dimensional space h r, v), providing an efficient, time invariant description of the channel. Typical channel delay spreads and Doppler spreads are a very small fraction of the symbol duration and subcarrier spacing of multicarrier systems.

[89] In the mathematics literature, the representation of time varying systems of

( 2 ) and ( 7 ) is called the Heisenberg representation [1 ]. It can actually be shown that every linear operator ( 7 ) can be parameterized by some impulse response as in ( 2 ).

[90] 3. OTFS modulation over the Doppler multipath channel

[91] The time variation of the channel introduces significant difficulties in wireless communications related to channel acquisition, tracking, equalization and transmission of channel state information (CSI) to the transmit side for beamforming and MIMO processing. In this paper, we develop a modulation domain based on a set of orthonormal basis functions over which we can transmit the information symbols, and over which the information symbols experience a static, time invariant, two dimensional channel for the duration of the packet or burst transmission. In that modulation domain, the channel coherence time is increased by orders of magnitude and the issues associated with channel fading in the time or frequency domain in SISO or MIMO systems are significantly reduced.

[92] Orthogonal Time Frequency Space (OTFS) modulation is comprised of a cascade of two transformations. The first transformation maps the two dimensional plane where the information symbols reside (and which we call the delay-Doppler plane) to the time frequency plane. The second one transforms the time frequency domain to the waveform time domain where actual transmitted signal is constructed. This transform can be thought of as a generalization of multicarrier modulation schemes.

[93] Figure 3 provides a pictorial view of the two transformations that constitute the

OTFS modulation. It shows at a high level the signal processing steps that are required at the transmitter and receiver. It also includes the parameters that define each step, which will become apparent as we further expose each step. Further, Figure 4 shows a block diagram of the different processing stages at the transmitter and receiver and establishes the notation that will be used for the various signals.

[94] We start our description with the transform which relates the waveform domain to the time-frequency domain.

[95] 3.1 The Heisenberg Transform

[96] Our purpose in this section is to construct an appropriate transmit waveform which carries information provided by symbols on a grid in the time-frequency plane. Our intent in developing this modulation scheme is to transform the channel operation to an equivalent operation on the time-frequency domain with two important properties:

[97] The channel is orthogonalized on the time-frequency grid.

[98] The channel time variation is simplified on the time-frequency grid and can be addressed with an additional transform.

[99] Fortunately, these goals can be accomplished with a scheme that is very close to well-known multicarrier modulation techniques, as explained next. We will start with a general framework for multicarrier modulation and then give examples of OFDM and multicarrier filter bank implementations.

[100] Let us consider the following components of a time frequency modulation:

[101] A lattice or grid on the time frequency plane, that is a sampling of the time axis with sampling period T and the frequency axis with sampling period Δ .

Λ = {(nT, mAf), n,m £ ¾ ( 8 )

[102] A packet burst with total duration NT sees and total bandwidth MAf Hz

[103] A set of modulation symbols X[n, m], n = 0, ..., N - 1, m = 0, ..., M - 1 we wish to transmit over this burst [104] A transmit pulse g tr (t) with the property 2 of being orthogonal to translations by T and modulations by Δ

[105] Given the above components, the time-frequency modulator is a Heisenberg operator on the lattice Λ, that is, it maps the two dimensional symbols X[n. m] to a transmitted waveform, via a superposition of delay-and-modulate operations on the pulse waveform g tr (t)

M/2-1 w-1

s(t) = ^ ^ [n, m]^ tr (t - nr)e^ '27rm ^ (t - nT) ( 10 ) m=-M/2 n=0

[106] More formally

[107] where we denote by Η χ (·) the "discrete" Heisenberg operator, parameterized by discrete values X[n, m] .

[108] Notice the similarity of ( 1 1 ) with the channel equation ( 7 ). This is not by coincidence, but rather because we apply a modulation effect that mimics the channel effect, so that the end effect of the cascade of modulation and channel is more tractable at the receiver. It is not uncommon practice; for example, linear modulation (aimed at time invariant channels) is in its simplest form a convolution of the transmit pulse g(t) with a delta train of QAM information symbols sampled at the Baud rate T.

W-1

X[n]g t - nT) ( 12 )

71=0

[109] In our case, aimed at the time varying channel, we convolve-and-modulate the transmit pulse (c.f. the channel Eq. ( 2 )) with a two dimensional delta train which samples the time frequency domain at a certain Baud rate and subcarrier spacing.

2 This orthogonality property is required if the receiver uses the same pulse as the transmitter. We will generalize it to a bi-orthogonality property in later sections. [110] The sampling rate in the time-frequency domain is related to the bandwidth and time duration of the pulse g tr (t) namely its time-frequency localization. In order for the orthogonality condition of ( 9 ) to hold for a frequency spacing M, the time spacing must be T > 1/M. The critical sampling case of T = 1/M is generally not practical and refers to limiting cases, for example to OFDM systems with cyclic prefix length equal to zero or to filter banks with g tr (t) equal to the ideal Nyquist pulse.

[111] Some examples are as follows:

[112] Example 1 : OFDM Modulation: Let us consider an OFDM system with M subcarriers, symbol length T 0FDM , cyclic prefix length T CP and subcarrier spacing 1/T 0FDM . If we substitute in Equation ( 10 ) symbol duration T = T 0FDM + T CP , number of symbols N = 1, subcarrier spacing Δ = 1/T 0FDM and g tr (t) a square window that limits the duration of the subcarriers to the symbol length T

[113] then we obtain the OFDM formula 3

M/2-1

x(t) = ^ X[n, m]g tr (t)e j2mnA r t ( 14 )

m=-M/2

[114] Example 2: Single Carrier Modulation: Equation ( 10 ) reduces to single carrier modulation if we substitute M = 1 subcarrier, T equal to the Baud period and g tr (t) equal to a square root raised cosine Nyquist pulse.

[115] Example 3: Multicarrier Filter Banks (MCFB): Equation ( 10 ) describes a MCFB if g tr (t) is a square root raised cosine Nyquist pulse with excess bandwith , T is equal to the Baud period and Δ = (1 + )/T.

[116] Expressing the modulation operation as a Heisenberg transform as in Eq. ( 1 1 ) may be counterintuitive. We usually think of modulation as a transformation of the modulation symbols X[m, n] to a transmit waveform s(t). The Heisenberg transform instead, uses X[m, n] as weights/parameters of an operator that produces s(t) when

3 Technically, the pulse of Eq. ( 13 ) is not orthonormal but is orthogonal to the receive filter (where the CP samples are discarded) as we will see shortly. applied to the prototype transmit filter response g tr (t) - c.f. Eq. ( 1 1 ). While counterintuitive, this formulation is useful in pursuing an abstraction of the modulation- channel-demodulation cascade effects in a two dimensional domain where the channel can be described as time invariant.

[117] We next turn our attention to the processing on the receiver side needed to go back from the waveform domain to the time-frequency domain. Since the received signal has undergone the cascade of two Heisenberg transforms (one by the modulation effect and one by the channel effect), it is natural to inquire what the end-to-end effect of this cascade is. The answer to this question is given by the following result:

[118] Proposition 1 : Let two Heisenberg transforms as defined by Eqs. ( 7 ), ( 2 ) be parametrized by impulse responses ^(τ, ν), h 2 , v) and be applied in cascade to a waveform g(t) ε Ή. Then

Π Λ2 Λι (0(ί))) = Π Λ (0(ί)) ( 15 )

[119] where /ι(τ,ν) = /ι 2 (τ,ν) 0 /ι 1 (τ,ν) is the "twisted" convolution of ^(τ, ν), h 2 , v) defined by the following convolve-and-modulate operation h( , v) = jj ^(τ', ν'^τ - τ', ν - v')e j2nv '^- T '^ dr'dv' ( 16 ) [120] Proof: See Appendix 0.

[121] Applying the above result to the cascade of the modulation and channel Heisenberg transforms of ( 1 1 ) and ( 7 ), we can show that the received signal is given by the Heisenberg transform r(t) = n f (g tr (t ) + v(t) = jj f(x, v ei 2 ™^g tr t - x)dvdx + v(t) ( 17)

[122] where v(t) is additive noise and /(τ, v), the impulse response of the combined transform, is given by the twisted convolution of X[n, m] and /ι(τ, ν)

/(τ, ν) = Λ(τ, ν) QX[n, m]

M/2-1 W _! ( 1Q ) m=-M/2 n=0 [123] This result can be considered an extension of the single carrier modulation case, where the received signal through a time invariant channel is given by the convolution of the QAM symbols with a composite pulse, that pulse being the convolution of the transmitter pulse and the channel impulse response.

[124] With this result established we are ready to examine the receiver processing steps.

[125] 3.2 Receiver processing and the Wigner transform

[126] Typical communication system design dictates that the receiver performs a matched filtering operation, taking the inner product of the received waveform with the transmitter pulse, appropriately delayed or otherwise distorted by the channel. In our case, we have used a collection of delayed and modulated transmit pulses, and we need to perform a matched filter on each one of them. Figure 5 provides a conceptual view of this processing. On the transmitter, we modulate a set of M subcarriers for each symbol we transmit, while on the receiver we perform matched filtering on each of those subcarrier pulses. We define a receiver pulse g r (t) and take the inner product with a collection of delayed and modulated versions of it. The receiver pulse g r (t) is in many cases identical to the transmitter pulse, but we keep the separate notation to cover some cases where it is not (most notably in OFDM where the CP samples have to be discarded).

[127] While this approach will yield the sufficient statistics for data detection in the case of an ideal channel, a concern can be raised here for the case of non-ideal channel effects. In this case, the sufficient statistics for symbol detection are obtained by matched filtering with the channel-distorted, information-carrying pulses (assuming that the additive noise is white and Gaussian). In many well designed multicarrier systems however (e.g. , OFDM and MCFB), the channel distorted version of each subcarrier signal is only a scalar version of the transmitted signal, allowing for a matched filter design that is independent of the channel and uses the original transmitted subcarrier pulse. We will make these statements more precise shortly and examine the required conditions for this to be true.

[128] Figure 5 is only a conceptual illustration and does not point to the actual implementation of the receiver. Typically this matched filtering is implemented in the digital domain using an FFT or a polyphase transform for OFDM and MCFB respectively. In this paper we are rather more interested in the theoretical understanding of this modulation. To this end, we will consider a generalization of this matched filtering by taking the inner product < g r {t - T)e J'27rv(t_T) , r(t) > of the received waveform with the delayed and modulated versions of the receiver pulse for arbitrary time and frequency offset (τ,ν). While this is not a practical implementation, it allows us to view the operations of Figure 5 as a two dimensional sampling of this more general inner product.

[129] Let us define the inner product

A 9r ,r( , v) = < g r t - r(t) > = j g r * (t - ( 19 )

[130] The function A gr r (r, v) is known as the cross-ambiguity function in the radar and math communities and yields the matched filter output if sampled at τ = nT, v = m f (on the lattice Λ), i.e.,

Y [n, m] = A gr (τ, ν) | T=nTiV=mAf ( 20 )

[131] In the math community, the ambiguity function is related to the inverse of the Heisenberg transform, namely the Wigner transform. Figure 5 provides an intuitive feel for that, as the receiver appears to invert the operations of the transmitter 4 .

[132] The key question here is what the relationship is between the matched filter output Y[n, m] (or more generally Y(j, v)) and the transmitter input X[n, m] . We have already established in ( 17 ) that the input to the matched filter r(t) can be expressed as a Heisenberg representation with impulse response /(τ,ν) (plus noise). The output of the matched filter then has two contributions

Y(T, V) = A grir (j, v) = A G F 3TR)+V] (T, v) = v) ( 21 )

4 More formally, if we take the cross-ambiguity or the transmit and receive pulses

Ag r g TR {T, v), and use it as the impulse response of the Heisenberg operator, then we obtain the orthogonal cross-projection operator

n ½rStr (y(t)) = g tr (t) < g r (t), y(t) >

In words, the coefficients that come out of the matched filter, if used in a Heisenberg representation, will provide the best approximation to the original y(t) in the sense of minimum square error. [133] The last term is the contribution of noise, which we will denote (τ, ν) = Ag r V r, v). The first term on the right hand side is the matched filter output to the (noiseless) input comprising of a superposition of delayed and modulated versions of the transmit pulse. We next establish that this term can be expressed as the twisted convolution of the two dimensional impulse response /(τ,ν) with the cross-ambiguity function (or two dimensional cross correlation) of the transmit and receive pulses.

[134] The following theorem summarizes the key result.

[135] Theorem 1 : (Fundamental time-frequency domain channel equation). If the received signal can be expressed as

Π / ½,·(0) = jj - τ)άνάτ ( 22 )

[136] Then the cross-ambiguity of that signal with the receive pulse g tr (t) can be expressed as

A 9 r , f (g tr ) ( τ > v ) = f< > v ) O A 9r tr (τ, v) ( 23 )

[137] Proof: See Appendix 0.

[138] Recall from ( 18 ) that /(τ, v) = h( , v) QX[n, m], that is, the composite impulse response is itself a twisted convolution of the channel response and the modulation sumbols.

[139] Substituting /(τ,ν) from ( 18 ) into ( 21 ) we obtain the end-to-end channel description in the time frequency domain

Υ(τ,ν) = A gr ji r(Str) (T, v) + V(r,v) ( 24 )

= h r,v) O X[n, m] Q A gri9tr { , v) + V r, v)

[140] where (τ,ν) is the additive noise term. Eq. ( 24 ) provides an abstraction of the time varying channel on the time-frequency plane. It states that the matched filter output at any time and frequency point (τ, ν) is given by the delay-Doppler impulse response of the channel twist-convolved with the impulse response of the modulation operator twist-convolved with the cross-ambiguity (or two dimensional cross correlation) function of the transmit and receive pulses.

[141 ] Evaluating Eq. ( 24 ) on the lattice Λ we obtain the matched filter output modulation symbol estimates

X\m, n] = [n, ml = (T, V) | { 25 }

[142] In order to get more intuition on Equations ( 24 ), ( 25 ) let us first consider the case of an ideal channel, i.e. , /ι(τ, ν) = δ(τ)δ(ν). In this case by direct substitution we get the convolution relationship

M/2-1 w-1

Y[n, m] = ^ ^ X[n', m']A gr gtr ((n - n')T, (m - m')Af) + V[m, n] ( 26

m'=-M/2 n'=0

[143] In order to simplify Eq. ( 26 ) we will use the orthogonality properties of the ambiguity function. Since we use a different transmit and receive pulses we will modify the orthogonality condition on the design of the transmit pulse we stated in ( 9) to a bi- orthogonality condition

[144] Under this condition, only one term survives in ( 26 ) and we obtain

Y [n, m] = X [n, m] + V [n, m] I )

[145] where [n, m] is the additive white noise. Eq. ( 28 ) shows that the matched filter output does recover the transmitted symbols (plus noise) under ideal channel conditions. Of more interest of course is the case of non-ideal time varying channel effects. We next show that even in this case, the channel orthogonalization is maintained (no intersymbol or intercarrier interference), while the channel complex gain distortion has a closed form expression. [146] The following theorem summarizes the result as a generalization of ( 28 ).

[147] Theorem 2: (End-to-end time-frequency domain channel equation):

[148] If /ι(τ, ν) has finite support bounded by (T max , v max ) and if Ag rigtr ( , v) = 0 for τ ε (nT — x max , nT + x max ), v G {wAf - v max , mAf + v max ), that is, the ambiguity function bi-orthogonality property of ( 27 ) is true in a neighborhood of each grid point (m f, nT) of the lattice Λ at least as large as the support of the channel response h r, v), then the following equation holds

Y [n, m] = H[n, m]X [n, m]

[149] If the ambiguity function is only approximately bi-orthogonal in the neighborhood of Λ (by continuity), then ( 29 ) is only approximately true.

[150] Proof: See Appendix 0.

[151] Eq. ( 29 ) is a fundamental equation that describes the channel behavior in the time-frequency domain. It is the basis for understanding the nature of the channel and its variations along the time and frequency dimensions.

[152] Some observations are now in order on Eq. ( 29 ). As mentioned before, there is no interference across X[n, m] in either time n or frequency m.

[153] The end-to-end channel distortion in the modulation domain is a (complex) scalar that needs to be equalized

[154] If there is no Doppler, i.e. h(r, v) = h( , 0)<5(v), then Eq. ( 29 ) becomes r [n,m] = x [n, m] / f c( T, o )e ~ dT

= X[n, m]H(0, mAf)

[155] which is the well-known multicarrier result, that each subcarrier symbol is multiplied by the frequency response of the time invariant channel evaluated at the frequency of that subcarrier. [156] If there is no multipath, i.e. /ι(τ, ν) = ι(0,ν)<5(τ), then Eq. ( 29 ) becomes Y[n, m] = X[n, m] f h(v, 0)εί 2πνηΤ άτ (31 )

[157] Notice that the fading each subcarrier experiences as a function of time nT has a complicated expression as a weighted superposition of exponentials. This is a major complication in the design of wireless systems with mobility like LTE; it necessitates the transmission of pilots and the continuous tracking of the channel, which becomes more difficult the higher the vehicle speed or Doppler bandwidth is.

[158] We close this section with some examples of this general framework.

[159] Example 3: (OFDM modulation). In this case the fundamental transmit pulse is given by ( 13) and the fundamental receive pulse is

[160] i.e., the receiver zeroes out the CP samples and applies a square window to the symbols comprising the OFDM symbol. It is worth noting that in this case, the bi- orthogonality property holds exactly along the time dimension. Figure 6 shows the cross correlation between the transmit and receive pulses of ( 13 ) and ( 32 ). Notice that the cross correlation is exactly equal to one and zero in the vicinity of zero and ±T respectively, while holding those values for the duration of T CP . Hence, as long as the support of the channel on the time dimension is less than T CP the bi-orthogonality condition is satisfied along the time dimension. Across the frequency dimension the condition is only approximate, as the ambiguity takes the form of a sine function as a function of frequency and the nulls are not identically zero for the whole support of the Doppler spread.

[161] Example 4: (MCFB modulation). In the case of multicarrier filter banks 9tr X = 9r X = # ( - There are several designs for the fundamental pulse g(t). A square root raised cosine pulse provides good localization along the frequency dimension at the expense of less localization along the time dimension. If T is much larger than the support of the channel in the time dimension, then each subchannel sees a flat channel and the bi- orthogonality property holds approximately.

[162] In summary, in this section we described the one of the two transforms that define OTFS. We explained how the transmitter and receiver apply appropriate operators on the fundamental transmit and receive pulses and orthogonalize the channel according to Eq. ( 29 ). We further saw via examples how the choice of the fundamental pulse affect the time and frequency localization of the transmitted modulation symbols and the quality of the channel orthogonalization that is achieved. However, Eq. ( 29 ) shows that the channel in this domain, while free of intersymbol interference, suffers from fading across both the time and the frequency dimensions via a complicated superposition of linear phase factors.

[163] In the next section we will start from Eq. ( 29 ) and describe the second transform that defines OTFS; we will show how that transform defines an information domain where the channel does not fade in either dimension.

[164] 3.3 The 2D OTFS Transform

[165] Notice that the time-frequency response H[n, m] in ( 29 ) is related to the channel delay-Doppler response /ι(τ,ν) by an expression that resembles a Fourier transform. However, there are two important differences: (i) the transform is two dimensional (along delay and Doppler) and (ii) the exponentials defining the transforms for the two dimensions have opposing signs. Despite these difficulties, Eq. ( 29 ) points in the direction of using complex exponentials as basis functions on which to modulate the information symbols; and only transmit on the time-frequency domain the superposition of those modulated complex exponential bases. This is the approach we will pursue in this section.

[166] This is akin to the SC-FDMA modulation scheme, where in the frequency domain we transmit a superposition of modulated exponentials (the output of the DFT preprocessing block). The reason we pursue this direction is to exploit Fourier transform properties and translate a multiplicative channel in one Fourier domain to a convolution channel in the other Fourier domain. [167] Given the difficulties of Eq. ( 29 ) mentioned above we need to develop a suitable version of Fourier transform and associated sampling theory results. Let us start with the following definitions:

[168] Definition 1 : Symplectic Discrete Fourier Transform: Given a square summable two dimensional sequence X[m, n] ε (C(A) we define

x( T ,v) = ^ X[n, m]e- j27t(vnT - TmA n

m,n ( 33 }

= SDFT(X[n, m])

[169] Notice that the above 2D Fourier transform (known as the Symplectic Discrete Fourier Transform in the math community) differs from the more well known Cartesian Fourier transform in that the exponential functions across each of the two dimensions have opposing signs. This is necessary in this case, as it matches the behavior of the channel equation.

[170] Further notice that the resulting χ(τ,ν) is periodic with periods (1/Δ , 1/T) . This transform defines a new two dimensional plane, which we will call the delay-Doppler plane, and which can represent a max delay of 1/Δ and a max Doppler of 1/T. A one dimensional periodic function is also called a function on a circle, while a 2D periodic function is called a function on a torus (or donut). In this case χ(τ, v) is defined on a torus Z with circumferences (dimensions) (1/Δ , 1/T).

[171] The periodicity of χ(τ,ν) (or sampling rate of the time-frequency plane) also defines a lattice on the dela -Doppler plane, which we will call the reciprocal lattice

[172] The points on the reciprocal lattice have the property of making the exponent in ( 33 ), an integer multiple of 2π.

[173] The inverse transform is given by:

[174] where c = TAf.

[175] We next define a sampled version of χ(τ,ν). In particular, we wish to take M samples on the delay dimension (spaced at 1/ΜΔ " ) and N samples on the Doppler dimension (spaced at 1/NT). More formally we define a denser version of the reciprocal lattice

A " = {{ m W' n w' n ' mEl \ (36)

[176] So that Λ 1 _≡ A Q . We define discrete periodic functions on this dense lattice with period (1/Δ , 1/Γ), or equivalently we define functions on a discrete torus with these dimensions

Z ^ = {{ m Mf' n r)' m = ° M _ 1 ' n = 0 > -N-l,}

[177] These functions are related via Fourier transform relationships to discrete periodic functions on the lattice Λ, or equivalently, functions on the discrete torus

Z 0 = {(nT,mAf), m = 0,...,M-l, n = 0, ...N-l,} W

[178] We wish to develop an expression for sampling Eq. ( 33 ) on the lattice of ( 37 ). First, we start with the following definition.

[179] Definition 2: Svmplectic Finite Fourier Transform: If X p [k, I] is periodic with period (N, ), then we define

W-l 2

Σ ν- ' .» ,nk mL

) X p [k,l]e~ j2n( T~M-> \ >f!! j

Ar-0 ,__M

2

SFFT(X[k,l]) [180] Notice that x p [m,n] is also periodic with period [M,N] or equivalently, it is defined on the discrete torus Z Q . Formally, the SFFT(X[n,m ) is a linear transformation from <C(Z 0 )→ <C(Z ).

[181] Let us now consider generating x p [m,n] as a sampled version of ( 33 ), i.e., x p [m,n] = x[m,n] = χ(τ,ν)\ m Then we can show that ( 39 ) still holds where

T MAf' V NT

X p [m,n] is a periodization of X[n,m] with period (Λί, )

X p [n,m] = ^ [n-/cN,m-/ ] W

l,k=-∞

[182] This is similar to the well-known result that sampling in one Fourier domain creates aliasing in the other domain.

[183] The inverse discrete (symplectic) Fourier transform is given by

}

SFFT -1 (*[/,£])

[184] where 1 = 0, ...,M -1, k = 0, ...,N - 1. If the support of X[n,m] is time- frequency limited to Z 0 (no aliasing in ( 40 ) ), then X p [n,m] = X[n,m] for n,m ε Z 0 , and the inverse transform ( 41 ) recovers the original signal.

[185] In the math community, the SDFT is called "discrete" because it represents a signal using a discrete set of exponentials, while the SFFT is called "finite" because it represents a signal using a finite set of exponentials.

[186] Arguably the most important property of the symplectic Fourier transform is that it transforms a multiplicative channel effect in one domain to a circular convolution effect in the transformed domain. This is summarized in the following proposition:

[187] Proposition 2: Let X^n.m] ε <C(Z 0 ), X 2 [n,m] ε £(Z 0 ) be periodic 2D sequences. Then

SFFT(X 1 [n, m] * X 2 [n, m] ) = SFFT(X 1 [n, m] ) SFFT(X 2 [n, m] ) ( 4 }

[188] where * denotes two dimensional circular convolution. [189] Proof: See Appendix 0.

[190] With this framework established we are ready to define the OTFS modulation.

[191] Discrete OTFS modulation: Consider a set of NM QAM information symbols arranged on a 2D grid x[l, k], k = 0, ..., N - 1, I = 0, ... , M - 1 we wish to transmit. We will consider x[l, k] to be two dimensional periodic with period [N, M]. Further, assume a multicarrier modulation system defined by

[192] A lattice on the time frequency plane, that is a sampling of the time axis with sampling period T and the frequency axis with sampling period Δ (c.f. Eq. ( 8 ) ).

[193] A packet burst with total duration NT sees and total bandwidth MAf Hz.

[194] Transmit and receive pulses g tr {t), g tr {t) ε L 2 (W) satisfying the bi- orthogonality property of ( 27 ).

[195] A transmit windowing square summable function W tr [n, m] ε (C(A) multiplying the modulation symbols in the time-frequency domain

[196] A set of modulation symbols X[n, m], n = 0, ..., N - 1, m = 0, ..., M - 1 related to the information symbols x[k, I] by a set of basis functions b k l [n, m

N-lM-l

X[n, m] =—W tr [n, m] ^ ^ x[l, k] b l [n, m] ( 43 )

k=0 1=0

. ( ml nk

b kil [n, m] = e J M N >

[197] where the basis functions b k l [n, m are related to the inverse symplectic Fourier transform (c.f. , Eq. ( 41 ) )

[198] Given the above components, we define the discrete OTFS modulation via the following two steps

X[n, m] = I^ tr [n, m] 5FF7 , - 1 (x[/c, /])

[199] The first equation in ( 44 ) describes the OTFS transform, which combines an inverse symplectic transform with a widowing operation. The second equation describes the transmission of the modulation symbols X[n, m] via a Heisenberg transform of g tr (t) parameterized by X[n, m] . More explicit formulas for the modulation steps are given by Equations ( 41 ) and ( 10 ).

[200] While the expression of the OTFS modulation via the symplectic Fourier transform reveals important properties, it is easier to understand the modulation via Eq. ( 43 ), that is, transmitting each information symbol x[k, I] by modulating a 2D basis function b k l [n, m] on the time-frequency plane.

[201] Figure 7 visualizes this interpretation by isolating each symbol in the information domain and showing its contribution to the time-frequency modulation domain. Of course the transmitted signal is the superposition of all the symbols on the right (in the information domain) or all the basis functions on the left (in the modulation domain).

[202] Figure 7 uses the trivial window W tr [n, m] = l for all n = 0, ... , N - 1, m =

~, ... γ - 1 and zero else. This may seem superfluous but there is a technical reason for this window: recall that 5FFr _1 (x[/c, I]) is a periodic sequence that extends to infinite time and bandwidth. By applying the window we limit the modulation symbols to the available finite time and bandwidth. The window in general could extend beyond the period of the information symbols [M, N] and could have a shape different from a rectangular pulse. This would be akin to adding cyclic prefix/suffix in the dimensions of both time and frequency with or without shaping. The choice of window has implications on the shape and resolution of the channel response in the information domain as we will discuss later. It also has implications on the receiver processing as the potential cyclic prefix/suffix has to either be removed or otherwise handled as we see next.

[203] Discrete OTFS demodulation: Let us assume that the transmitted signal s(t) undergoes channel distortion according to ( 7 ), ( 2 ) yielding r(t) at the receiver. Further, let the receiver employ a receive windowing square summable function W r [n, m] . Then, the demodulation operation consists of the following steps:

[204] Matched filtering with the receive pulse, or more formally, evaluating the ambiguity function on Λ (Wigner transform) to obtain estimates of the time-frequency modulation symbols

[205] windowing and periodization of Y[n, m]

Y w [n, m] = W r [n, m] Y [n, m]

Y p [n, m] = ^ Y w [n - kN, m - M] ( 46 )

k,l=-∞

[206] and applying the symplectic Fourier transform on the periodic sequence

Y p [n, m]

x[l, k] = y[l, k] = SFFT(Y p [n, m]) ( 4 j j

[207] The first step of the demodulation operation can be interpreted as a matched filtering operation on the time-frequency domain as we discussed earlier. The second step is there to ensure that the input to the SFFT is a periodic sequence. If the trivial window is used, this step can be skipped. The third step can also be interpreted as a projection of the time-frequency modulation symbols on the orthogonal basis functions

M-l N-l

•T 1

x[l, k] = X(ji, m) b k * l {n, m)

m=0 n=0 ( 1 48 ')

,lm kn.

b u * (n, m) = e - j2n( T-— )

[208] The discrete OTFS modulation defined above points to efficient implementation via discrete-and-periodic FFT type processing. However, it does not provide insight into the time and bandwidth resolution of these operations in the context of two dimensional Fourier sampling theory. We next introduce the continouse OTFS modulation and relate the more practical discrete OTFS as a sampled version of the continuous modulation.

[209] Continuous OTFS modulation: Consider a two dimensional periodic function χ(τ, ν) with period [1/Δ , 1/T] we wish to transmit; the choice of the period may seem arbitrary at this point, but it will become clear after the discussion in the next section. Further, assume a multicarrier modulation system defined by [210] A lattice on the time frequency plane, that is a sampling of the time axis with sampling period T and the frequency axis with sampling period Δ (c.f. Eq. ( 8 ) ).

[211] Transmit and receive pulses g tr {t), g tr {t) ε L 2 (W) satisfying the bi- orthogonality property of ( 27 )

[212] A transmit windowing function W tr [n, m] ε (C(A) multiplying the modulation symbols in the time-frequency domain

[213] Given the above components, we define the continuous OTFS modulation via the following two steps

X[n, m] = W tr [n, m] SOFT '1 ^^

[214] The first equation describes the inverse discrete time-frequency symplectic Fourier transform [c.f. Eq. ( 35 )] and the windowing function, while the second equation describes the transmission of the modulation symbols via a Heisenberg transform [c.f. Eq. ( 10 )].

[215] Continuous OTFS demodulation: Let us assume that the transmitted signal s(t) undergoes channel distortion according to ( 7 ), ( 2 ) yielding r(t) at the receiver. Further, let the receiver employ a receive windowing function W r [n, m] ε (C(A). Then, the demodulation operation consists of two steps:

[216] Evaluating the ambiguity function on Λ (Wigner transform) to obtain estimates of the time-frequency modulation symbols

[217] Windowing and applying the symplectic Fourier transform on the modulation symbols (τ, v) = SDFT(W r [n, m]Y[n, rn]) ( 51 )

[218] Notice that in ( 50 ), ( 51 ) there is no periodization of Y[n, m] , since the SDFT is defined on aperiodic square summable sequences. The periodization step needed in discrete OTFS can be understood as follows. Suppose we wish to recover the transmitted information symbols by performing a continuous OTFS demodulation and then sampling on the delay-Doppler grid x(l, k) = (τ, ν) | m _ n

T ~MAp V ~NT

[219] Since performing a continuous symplectic Fourier transform is not practical we consider whether the same result can be obtained using SFFT. The answer is that SFFT processing will produce exactly the samples we are looking for if the input sequence is first periodized (aliased) - see also ( 39 ) ( 40 ).

[220] We have now described all the steps of the OTFS modulation as depicted in Figure 3. We have also discussed how the Wigner transform at the receiver inverts the Heisenberg transform at the transmitter [c.f. Eqs. ( 26 ), ( 28 )], and similarly for the forward and inverse symplectic Fourier transforms. The key question is what form the end-to-end signal relationship takes when a non-ideal channel is between the transmitter and receiver. The answer to this question is addressed next.

[221] 3.4 Channel Equation in the OTFS Domain

[222] The main result in this section shows how the time varying channel in ( 2 ), ( 7 ), is transformed to a time invariant convolution channel in the delay Doppler domain.

[223] Proposition 3: Consider a set of NM QAM information symbols arranged in a 2D periodic sequence x[l, k] with period [M, N]. The sequence x[k, I] undergoes the following transformations:

[224] It is modulated using the discrete OTFS modulation of Eq. ( 44 ).

[225] It is distorted by the delay-Doppler channel of Eqs.( 2 ), ( 7 ).

[226] It is demodulated by the discrete OTFS demodulation of Eqs. ( 45 ), ( 47 ).

[227] The estimated sequence x[l, k] obtained after demodulation is given by the two dimensional periodic convolution [228] of the input QAM sequence x[m,n] and a sampled version of the windowed impulse response h w ),

[229] where h w (r',v') denotes the circular convolution of the channel response with a windowing function 5

K ',v') = jj e-j 27TVT h(T,v)w(x' - τ,ν' - v)dxdv

[230] where the windowing function W(T, v) is the symplectic Fourier transform of the time-frequency window V [n,m]

Af-l N-l

νν(τ,ν) = ^ ^ W[n,m e ~}2n{ynT~Tm n

m=0 n=0

[231] and where W[n, m] is the product of the transmit and receive window.

W [n, m] = W tr [n, m] W r [n, m] { }

[232] Proof: See Appendix 0.

[233] In many cases, the windows in the transmitter and receiver are matched, i.e., W tr [n,m] = V 0 [n,m] and W r [n,m] = V 0 * [n,m], hence V [n,m] = |V 0 [n,m]| 2 .

[234] The window effect is to produce a blurred version of the original channel with a resolution that depends on the span of the frequency and time samples available as will be discussed in the next section. If we consider the rectangular (or trivial) window, i.e., W[n,m] = 1, n = 0, ...,N - 1, m = -M/2, ...,M/2 - 1 and zero else, then its SDFT νν(τ,ν) in ( 55 ) is the two dimensional Dirichlet kernel with bandwidth inversely proportional to N and M.

To be precise, in the window w( , v) is circularly convolved with a slightly modified version of the channel impulse response e ~j2nvT h(r, v) (by a complex exponential) as can be seen in the equation. [235] There are several other uses of the window function. The system can be designed with a window function aimed at randomizing the phases of the transmitted symbols, akin to how QAM symbol phases are randomized in WiFi and Multimedia-Over- Coax communication systems. This randomization may be more important for pilot symbols than data carrying symbols. For example, if neighboring cells use different window functions, the problem of pilot contamination is avoided.

[236] A different use of the window is the ability to implement random access systems over OTFS using spread spectrum/CDMA type techniques as will be discussed later.

[237] 4. Channel Time/Frequency coherence and OTFS resolution

[238] In this section we examine certain OTFS design issues, like the choice of data frame length, bandwidth, symbol length and number of subcarriers. We study the tradeoffs among these parameters and gain more insight on the capabilities of OTFS technology.

[239] Since OTFS is based on Fourier representation theory similar spectral analysis concepts apply like frequency resolution vs Fourier transform length, sidelobes vs windowing shape etc. One difference that can be a source of confusion comes from the naming of the two Fourier transform domains in the current framework.

[240] OTFS transforms the time-frequency domain to the delay-Doppler domain creating the Fourier pairs: (i) time <= Doppler and (ii) frequency <= delay. The "spectral" resolution of interest here therefore is either on the Doppler or on the delay dimensions.

[241] These issues can be easier clarified with an example. Let us consider a time- invariant multipath channel (zero Doppler) with frequency response H(f, 0) for all t. In the first plot of Figure 8 we show the real part of (/, 0) as well as a sampled version of it on a grid of M = 8 subcarriers. The second plot of Figure 8 shows the SDFT of the sampled H(m f, 0), i.e., h(r, 0) along the delay dimension. Notice that taking this frequency response to the "delay" domain reveals the structure of this multipath channel, that is, the existence of two reflectors with equal power in this example. Further, notice that the delay dimension of the SDFT is periodic with period 1/Δ as expected due to the nature of the discrete Fourier transform. Finally, in the third plot of Figure 8 we show the SFFT of the frequency response, which as expected is a sampled version of the SDFT of the second plot. Notice that the SFFT has M = 8 points in each period 1/Δ leading to a resolution in the delay domain of 1/MAf = 1/BW.

[242] In the current example, the reflectors are separated by more than 1/MAf and are resolvable. If they were not, then the system would experience a flat channel within the bandwidth of observation, and in the delay domain the two reflectors would have been blurred into one.

[243] Figure 9 shows similar results for a flat Doppler channel with time varying frequency response H(0, t) for all /. The first plot shows the the response as a function of time, while the second plot shown the SDFT along the Doppler dimension. Finally the third plot shows the SFFT, that is the sampled version of the transform. Notice that the SDFT is periodic with period 1/T while the SFFT is periodic with period 1/T and has resolution of 1/NT.

[244] The conclusion one can draw from Figure 9 is that as long as there is sufficient variability of the channel within the observation time NT, that is as long as reflectors have Doppler frequency difference larger than 1/NT, the OTFS system will resolve these reflectors and will produce an equivalent channel in the delay-Doppler domain that is not fading. In other words, OTFS can take a channel that inherently has a coherence time of only T and produce an equivalent channel in the delay Doppler domain that has coherence time NT. This is an important property of OTFS as it can increase the coherence time of the channel by orders of magnitude and enable MIMO processing and beamforming under Doppler channel conditions.

[245] The two one-dimensional channel examples we have examined are special cases of the more general two-dimensional channel of Figure 10. The time-frequency response and its sampled version are shown in this figure, where the sampling period is (Γ, Δ/). Figure 1 1 shows the SDFT of this sampled response which is periodic with period (1/T, 1/Δ/), across the Doppler and delay dimensions respectively.

[246] Let us now examine the Nyquist sampling requirements for this channel response. 1/T is generally on the order of Δ/ (for an OFDM system with zero length CP it is exactly 1/T = Δ/) so the period of the channel response in Figure 1 1 is approximately (Δ/, Γ), and aliasing can be avoided as long as the support of the channel response is less than +Δ//2 in the Doppler dimension and ±T/2 in the delay dimension.

[247] Figure 12 shows the SFFT, that is, the sampled version of Figure 1 1 . The resolution of Figure 1 1 is l/NT, l/M f across the Doppler and delay dimensions respectively.

[248] We summarize the sampling aspects of the OTFS modulation in Figure 13. The OTFS modulation consists of two steps shown in this figure:

[249] A Heisenberg transform translates a time-varying convolution channel in the waveform domain to an orthogonal but still time varying channel in the time frequency domain. For a total bandwidth BW and M subcarriers the frequency resolution is Δ = BW/M. For a total frame duration T f and N symbols the time resolution is T = T f /N.

[250] A SFFT transform translates the time-varying channel in the time-frequency domain to a time invariant one in the delay-Doppler domain. The Doppler resolution is 1/7} and the delay resolution is 1/BW.

[251] The choice of window can provide a tradeoff between main lobe width (resolution) and side lobe suppression, as in classical spectral analysis.

[252] 5 Channel Estimation in the OTFS Domain

[253] There is a variety of different ways a channel estimation scheme could be designed for an OTFS system, and a variety of different implementation options and details. In the section we will only present a high level summary and highlight the key concepts.

[254] A straightforward way to perform channel estimation entails transmitting a soudning OTFS frame containing a discrete delta function in the OTFS domain or equivalently a set of unmodulated carriers in the time frequency domain. From a practical standpoint, the carriers may be modulated with known, say BPSK, symbols which are removed at the receiver as is common in many OFDM systems. This approach could be considered an extension of the channel estimation symbols used in WiFi and Multimedia- Over-Coax modems. Figure 14 shows an OTFS symbol containing such an impulse.

[255] This approach may however be wasteful as the extend of the channel response is only a fraction of the full extend of the OTFS frame (1/T, 1/Δ/). For example, in LTE systems 1/T « 15 KHz while the maximum Doppler shift f diTnax is typically one to two orders of magnitude smaller. Similarly 1/Δ « 67 usee, while maximum delay spread T max is again one to two orders of magnitude less. We therefore can have a much smaller region of the OTFS frame devoted to channel estimation while the rest of the frame carries useful data. More specifically, for a channel with support (±f d ,max > ± T max) we nee d an OTFS subframe of length (2f dimax /T, 2T max /Af).

[256] In the case of multiuser transmission, each UE can have its own channel estimation subframe positioned in different parts of the OTFS frame. This is akin to multiplexing of multiple users when transmitting Uplink Sounding Reference Signals in LTE. The difference is that OTFS benefits from the virtuous effects of its two dimensional nature. For example, if r max is 5% of the extend of the delay dimension and f dimax is 5% of the Doppler dimesion, the channel estimation subframe need only be 5% x 5% = 0.25% of the OTFS frame.

[257] Notice that although the channel estimation symbols are limited to a small part of the OTFS frame, they actually sound the whole time-frequency domain via the corresponding basis functions associated with these symbols.

[258] A different approach to channel estimation is to devote pilot symbols on a subgrid in the time-frequency domain. This is akin to CRS pilots in downlink LTE subframes. The key question in this approach is the determination of the density of pilots that is sufficient for channel estimation without introducing aliasing. Assume that the pilots occupy the subgrid (n 0 T, m 0 f) for some integers n 0 , m 0 . Recall that for this grid the SDFT will be periodic with period (l/n 0 7\ 1/τη 0 Δ/). Then, applying the aliasing results discussed earlier to this grid, we obtain an alias free Nyquist channel support region of (±f d ,max > ± T ma ) = (±1/2η 0 Γ, +1/2τη 0 Δ/). The density of the pilots can then be determined from this relation given the maximum support of the channel. The pilot subgrid should extend to the whole time-frequency frame, so that the resolution of the channel is not compromised.

[259] 6 OTFS-Access: Multiplexing More than one User

[260] There is a variety of ways to multiplex several uplink or downlink transmissions in one OTFS frame. This is a rich topic whose full treatment is outside the scope of this paper. Here we will briefly review the following multiplexing methods:

• Multiplexing in the OTFS delay-Doppler domain

• Multiplexing in the time-frequency domain

• Multiplexing in the code speading domain

• Multiplexing in the spatial domain

[261] Multiplexing in the delay-Doppler domain: This is the most natural multiplexing scheme for downlink transmissions. Different sets of OTFS basis functions, or sets of information symbols or resource blocks are given to different users. Given the orthogonality of the basis functions, the users can be separated at the UE receiver. The UE need only demodulate the portion of the OTFS frame that is assigned to it.

[262] This approach is similar to the allocation of PRBs to different UEs in LTE. One difference is that in OTFS, even a small subframe or resource block in the OTFS domain will be transmitted over the whole time-frequency frame via the basis functions and will experience the average channel response. Figure 15 illustrates this point by showing two different basis functions belonging to different users. Because of this, there is no compromise on channel resolution for each user, regardless of the resource block or subframe size.

[263] In the uplink direction, transmissions from different users experience different channel responses. Hence, the different subframes in the OTFS domain will experience a different convolution channel. This can potentially introduce inter-user interference at the edges where two user subframes are adjacent, and would require guard gaps to eliminate it. In order to avoid this overhead, a different multiplexing scheme can be used in the uplink as explained next. [264] Multiplexing in the time-frequency domain: In this approach, resource blocks or subframes are allocated to different users in the time-freqeuncy domain. Figure 16 illustrates this for a three user case. In this figure, User 1 (blue, 1602) occupies the whole frame length but only half the available subcarriers. Users 2 and 3 (red, 1604, and black, 1606, respectively) occupy the other half subcarriers, and divide the total length of the frame between them.

[265] Notice that in this case, each user employs a slightly different version of the OTFS modulation described in Section 3. One difference is that each user i performs an SFFT on a subframe (N^ ;), Ni≤ N, M t < M. This reduces the resolution of the channel, or in other words reduces the extent of the time-frequency plane in which each user will experience its channel variation. On the other side, this also gives the scheduler the opportunity to schedule users in parts of the time-frequency plane where their channel is best.

[266] If we wish to extract the maximum diversity of the channel and allocate users across the whole time-frequency frame, we can multiplex users via interleaving. In this case, one user occupies a subsampled grid of the time-frequency frame, while another user occupies another subsampled grid adjacent to it. Figure 17 shows the same three users as before but interleaved on the subcarrier dimension. Of course, interleaving is possible in the time dimension as well, and/or in both dimensions. The degree of interleaving, or subsampling the grip per user is only limited by the spread of the channel that we need to handle.

[267] Multiplexing in the time-frequency spreading code domain: Let us assume that we wish to design a random access PHY and MAC layer where users can access the network without having to undergo elaborate RACH and other synchronization procedures. There have been several discussions on the need for such a system to support Internet of Things (loT) deployments. OTFS can support such a system by employing a spread-spectrum approach. Each user is assigned a different two-dimensional window function that is designed as a randomizer. The windows of different users are designed to be nearly orthogonal to each other and nearly orthogonal to time and frequency shifts. Each user then only transmits on one or a few basis functions and uses the window as a means to randomize interference and provide processing gain. This can result in a much simplified system that may be attractive for low cost, short burst type of loT applications.

[268] Multiplexing in the spatial domain: Finally, like other OFDM multicarrier systems, a multi-antenna OTFS system can support multiple users transmitting on the same basis functions across the whole time-frequency frame. The users are separated by appropriate transmitter and receiver beamforming operations. A detailed treatment of MIMO-OTFS architectures however is outside the scope of this paper.

[269] 7. Implementation Issues

[270] OTFS is a novel modulation technique with numerous benefits and a strong mathematical foundation. From an implementation standpoint, its added benefit is the compatibility with OFDM and the need for only incremental change in the transmitter and receiver architecture.

[271] Recall that OTFS consists of two steps. The Heisenberg transform (which takes the time-frequency domain to the waveform domain) is already implemented in today's systems in the form of OFDM/OFDMA. In the formulation of this paper, this corresponds to a prototype filter g(t) which is a square pulse. Other filtered OFDM and filter bank variations have been proposed for 5G, which can can also be accommodated in this general framework with different choices of g(t).

[272] The second step of OTFS is the two dimensional Fourier transform (SFFT). This can be thought of as a pre- and post-processing step at the transmitter and receiver respectively as illustrated in Figure 18. In that sense it is similar, from an implementation standpoint, to the SC-FDMA pre-processing step.

[273] From a complexity comparison standpoint, we can calculate that for a frame of N OFDM symbols of M subcarriers, SC-FDMA adds N DFTs of M point each (assuming worse case M subcarriers given to a single user). The additional complexity of SC-FDMA is then NMlog 2 (M) over the baseline OFDM architecture. For OTFS, the 2D SFFT has complexity NMlog 2 (NM) = NMlog 2 (M) + NMlog 2 (N), so the term NMlog 2 (N) is the OTFS additional complexity compared to SC-FDMA. For an LTE subframe with M = 1200 subcarriers and N = 14 symbols, the additional complexity is 37% more compared to the additional complexity of SC-FDMA

[274] Notice also that from an architectural and implementation standpoint, OTFS augments the PHY capabilities of an existing LTE modem architecture and does not introduce co-existence and compatibility issues.

[275] 8. Example Benefits of OTFS modulation

[276] The OTFS modulation has numerous benefits that tie into the challenges that 5G systems are trying to overcome. Arguably, the biggest benefit and the main reason to study this modulation is its ability to communicate over a channel that randomly fades within the time-frequency frame and still provide a stationary, deterministic and non-fading channel interaction between the transmitter and the receiver. In the OTFS domain all information symbols experience the same channel and same SNR.

[277] Further, OTFS best utilizes the fades and power fluctuations in the received signal to maximize capacity. To illustrate this point assume that the channel consists of two reflectors which introduce peaks and valleys in the channel response either across time or across frequency or both. An OFDM system can theoretically address this problem by allocating power resources according to the waterfilling principle. However, due to practical difficulties such approaches are not pursued in wireless OFDM systems, leading to wasteful parts of the time-frequency frame having excess received energy, followed by other parts with too low received energy. An OTFS system would resolve the two reflectors and the receiver equalizer would employ coherent combining of the energy of the two reflectors, providing a non-fading channel with the same SNR for each symbol. It therefore provides a channel interaction that is designed to maximize capacity under the transmit assumption of equal power allocation across symbols (which is common in existing wireless systems), using only standard AWGN codes.

[278] In addition, OTFS provides a domain in which the channel can be characterized in a very compact form. This has significant implications for addressing the channel estimation bottlenecks that plague current multi-antenna systems and can be a key enabling technology for addressing similar problems in future massive MIMO systems. [279] One benefit of OTFS is its ability to easily handle extreme Doppler channels.

We have verified in the field 2x2 and 4x4, two and four stream MIMO transmission respectively in 90 Km/h moving vehicle setups. This is not only useful in vehicle-to-vehicle, high speed train and other 5G applications that are Doppler intensive, but can also be an enabling technology for mm wave systems where Doppler effects will be significantly amplified.

[280] Further, OTFS provides a natural way to apply spreading codes and deliver processing gain, and spread-spectrum based CDMA random access to multicarrier systems. It eliminates the time and frequency fades common to multicarrier systems and simplifies the receiver maximal ratio combining subsystem. The processing gain can address the challenge of deep building penetration needed for loT and PSTN replacement applications, while the CDMA multiple access scheme can address the battery life challenges and short burst efficiency needed for IOT deployments.

[281] Last but not least, the compact channel estimation process that OTFS provides can be essential to the successful deployment of advanced technologies like Cooperative Multipoint (Co-MP) and distributed interference mitigation or network MIMO.

[282] Appendix 0

[283] Proof of Proposition 1 : Let

g 2 (t) = \\ h 2 (τ, v) e^ '27rv(t - T) g x (t - τ) dvdr

[284] Substituting ( 58 ) into ( 57 ) we obtain after some manipulation

[285] with /(τ, ν) given by ( 16 ).

[286] Proof of Theorem 1 : The theorem can be proven by straightforward but tedious substitution of the left hand side of ( 23 ); by definition = < g r (t - T)e j2nvt ,U f (g tr ) >

= j g;{t- T)e-i 2 ™ {g tr (t))dt

= j g r * (t ( G °)

[287] By changing the order of integration and the variable of integration (t - τ')→ t we obtain

= jj /(T 27n ' (t - T,) j gr*(t- T)g tr (t

- T ') e -j2nvt dt dv > dT > ,

= jj f(x',V *™'^A arigtr {x-x',v

[288] where

A 9r, 9t - τ - ν ') = f 9r(t - (τ - x') g tr {t e-W-^-^'Ht

(62)

[289] Notice that the right second line of ( 61 ) is exactly the right hand side of ( 23 ), which is what we wanted to prove. □

[290] Proof of Theorem 2: Substituting into ( 23 ) and evaluating on the lattice Λ we obtain:

X[m,n] = ^ ^ X[m',n']

m, = _Mn> = 0 h(r -ηΤ,ν- mAf) A grgtr (nT - x, mAf - v e n™>(nT-T)t + V [ m> n ] [291] Using the bi-orthogonality condition in ( 63 ) only one term survives in the right hand side and we obtain the desired result of ( 29 ).

[292] Proof of Proposition 2: Based on the definition of SFFT, it is not hard to verify that a delay translates into a linear phase

nk ml . ,

SFFT(X 2 [n - k, m - I]) = SFFT(X 2 [n, m])e ~;27r(" ^ ( t>4 )

[293] Based on this result we can evaluate the SFFT of a circular convolution

nk ml

X 1 [k, l]SFFT(X 2 [n, m]) e ~;27r(" )

= SFFT(X 1 [n, m])SFFT(X 2 [n, m]) [294] yielding the desired result.

[295] Proof of Proposition 3: We have already proven that on the time-frequency domain we have a multiplicative frequency selective channel given by ( 29 ). This result, combined with the interchange of convolution and multiplication property of the symplectic Fourier transform [c.f. Proposition 1 and Eq. ( 42 )] leads to the desired result.

[296] In particular, if we substitute Y(n, m) in the demodulation equation ( 48 ) from the time-frequency channel equation ( 29 ) and X[n, m] in ( 29 ) from the modulation equation ( 43 ) we get a (complicated) end-to-end expression

W-l Af-1

, -]2πντ X

L J / /

[297] Recognizing the factor in brackets as the discrete symplectic Fourier transform of W(n, m) we have N-l M-l

l - V k - k'

- τ,-

N7 ,' 67 }

k'=0 l' = 0

— v) dvdr

[298] Further recognizing the double integral as a convolution of the channel impulse response (multiplied by an exponential) with the transformed window we obtain

N-l M-l

1 " l - V k - k'

*[*· <] = MA? Σ∑*[*'.'']Aw( ' K "

fc'=0 i' = 0

[299] which is the desired result. [300] 9. Reference signals

[301] Unless otherwise specifically mentioned, the terms reference signals and pilot signals are used interchangeably in the present document.

[302] 9.1 The OTFS-based reference signals

[303] Assume the time-frequency (t-f) lattice defined by the following discrete points:

A D t f = Idt 0 TLdf = {(Kdt. Ldf): K, L Ε Έ} (69)

[304] Where dt (in sec) and df (in Hz) are the physical distances between the lattice points in the time and frequency dimensions respectfully, and K and L are integers 6 . We will call this lattice the data lattice as most of the points on this lattice will be occupied by data samples. The reference signals (pilots) will occupy a subset of the data lattice. When the pilot samples occupy a regular subset of the data lattice they form a regular (coarser) pilot lattice defined by:

A p t = INdt 0 IMdf N, M E Έ≥ 1 (70)

In OFDM terminology, df may be the subcarrier spacing and dt may be the OFDM symbol time. [305] As an example, for N = 14, M = 2, the t-f plane will look as shown in Figure 19.

[306] As shown in this document, the data lattice (69) is associated, through the symplectic Fourier transform, with a Delay-Doppler (τ, ν) torus which has the following delay and Doppler circumferences respectfully:

C T D = l/df , C° = 1/dt

[307] The Delay-Doppler torus associated with the (coarser) pilot lattice (70) is a torus with the following smaller circumferences

C T P = l/(Mdf) , C = l/(Ndt)

[308] It can be shown that a 2-D function χ(τ,ν) on the continuous Delay-Doppler torus associated with the t-f lattice defined in (69) can be transformed to a 2-D discrete function X[i,j] on the t-f lattice using an inverse symplectic discrete Fourier transform defined as

1_ J_

df dt

X[i,j] = SDFT- 1 {X{T, V)) = -^j j x(T, v)e^ 2n( - vidt - T ^dv dz (71)

0 0

[309] If the pilot χ ρ (τ, ν) is chosen as the delta function δ(τ ρ ρ ) on the Delay- Doppler torus, then the representation of this pilot on the time-frequency lattice defined in (69) will be

X =——e j2 v v idt -^ d f) i,j ε τ (72)

[310] Figure 20 shows an example of positioning 10 OTFS-based pilots on the Delay-Doppler plane and how one of the pilots look after it goes through the inverse symplectic discrete Fourier transform (71 ).

[311] Each OTFS-based reference signal is a delta function placed on the Delay- Doppler torus at a different point (τ ρ ρ ). The sum of these delta functions is then transformed to the t-f plane using the inverse symplectic discrete Fourier transform, and a subset of the samples in the t-f plane are selected to be transmitted. Stated differently, an OTFS-based reference signal is a symplectic exponential which is restricted to a subset of points on the data lattice.

[312] If P pilots are sent using a subset of the data lattice which forms a regular lattice as represented in (70), the t-f samples of the n pilots will be:

p

X ^'^ = dtd?∑ e j2 i r(y P idt-T p jdf) i = kN = lM . k E i (7 3 )

p=i

[313] Where N and M are fixed positive integers representing the size of the coarser pilot lattice.

[314] When the pilot samples in the t-f plane form a regular lattice, the number of pilots that can be packed in that lattice can be calculated from the circumferences of the torus associated with the pilot lattice and the maximum delay and Doppler spreads of the channels that each of the pilots is expected to experience. To avoid leakage between the pilots, the maximum number of pilots that can be packed in each dimension is the circumference of the torus divided by the maximum spread. Noting the delay and Doppler spreads as Δ τ and Δ ν respectfully, the maximum number of pilots that can be packed in each of the dimensions is:

[315] As an example, for channels with average maximum delay spread of 5 and a maximum Doppler frequency of 50Hz (maximum Doppler spread of 100Hz), a torus with a delay circumference of = 67 με and Doppler circumference of Οξ = 200 Hz can support up to 13 pilots in the delay dimension and 2 pilots in the Doppler dimension, for a total of 26 pilots. How close a system can get to the maximum achievable pilot packing will depend on the pilot observation window. A finite observation window of the pilots translates to convolving the pilots in the Delay-Doppler plane with the symplectic Fourier transform of the window (which, in the case of a rectangular window, is a two-dimensional sine function). Hence, a larger observation window will result in lower leakage between the received pilots, which will enable:

• improved accuracy of the channel estimation and

• Tighter packing of pilots, up to the maximum number stated in (74) for infinitely

large window.

[316] Note that staggering the pilots (e.g., not placing them on a rectangular grid in the Delay-Doppler plane) may improve the separation between the pilots and hence could provide better, or denser, packing. Figure 21 shows an example of staggered pilots. As can be seen from the figure, all the even pilots are staggered. An example of how staggering the LTE UL DM reference signals can improve the channel estimation is shown in the present document.

[317] 9.2 Reference Signal Packing

[318] Packing orthogonal OTFS-based pilots can be done using one of the following schemes:

[319] Delay-Doppler Pilot Packing (DDPP): Arranging the pilots in the Delay-Doppler plane keeping the distances large enough to minimize the leakage of the received pilots onto each other after going through the worst case delay and Doppler shifts of the channels each of the pilots may experience, and taking into consideration the impact of the (sub) sampling of the pilots in the t-f domain, and the pilot observation window.

• Time-Frequency Pilot Packing (TFPP): Assigning for each pilot (with no overlap) a coarse enough lattice that can support the relevant channel (with a regular pilot lattice, the circumferences of the associated torus have to be larger than the largest expected delay and Doppler shifts through the channel).

• Mixed Pilot Packing (MPP): A combination of DDPP and TFPP.

[320] OTFS-based pilots can be packed very efficiently, and hence can support the simultaneous transmission of a very large number of orthogonal pilots without using a significant percentage of the channel capacity.

[321] Here are a few pilot packing examples: [322] Example 1 (DDPP): [323] Assume the following:

• Data lattice parameters (LTE numerology):

o dt = 1/14 ms

o df = 15 kHz

• Channel parameters:

o Delay spread: 5 (ETU)

o Max Doppler frequency: 50 Hz (100 Hz spread)

[324] If we chose a pilot lattice with N = 28 (Ndt = 2ms) and M = 1, as shown in Figure 22, the circumferences of the pilot torus will be

C T P = 66.67US , C = 500 Hz

[325] The maximum number of pilots that can be supported in this configuration is 13x5 = 65. This is by placing 13 pilots in the delay dimension of the torus (spaced 5.13 apart) and 5 replicas of these pilots in the Doppler dimension of the torus (spaced 100 Hz apart). In practice with 10 MHz channel bandwidth and allowing for a reasonable size window in the time dimension this configuration can support at least 40 pilots (10x4).

[326] Example 2 (TFPP):

[327] For the same data lattice as in Example 1 assume the following channel parameters:

• Delay spread: 5 (ETU)

• Max Doppler frequency: 200 Hz (400 Hz spread)

[328] If we split the pilot lattice of Example 1 into 12 different pilot lattices represented by the different color diamonds in Figure 23, then each of these pilot lattices will have N = 28 {Ndt = 2ms) and M = 12 (Mdf = 180KHz). The circumferences of all the tori associated with these lattices will be:

C T P = 5.56 us , C ? = 500 Hz

[329] As can be seen from the circumferences, each pilot lattice can support only a single pilot, for a total of 12 pilots that can be supported by the 12 pilot lattices of Figure 23. [330] Example 3 (MPP):

[331] If we split the lattice of Example 1 into two lattices as shown in Figure 24, each of the lattices will have Ndt = 2 ms and Mdf = 30 kHz. The tori associated with the two lattices will both have the following circumferences:

C T P = 33.33 μ$ , εξ = 500 Hz

[332] Assuming the same channel parameters as in Example 1 , the maximum number of pilots that can be supported by each of these tori is 6x5 = 30 for a total of 60 pilots on the two lattices. A practical number with 10 MHz channel bandwidth and a reasonable size window in the time dimension is expected to be 40 (4x5x2) or 50 (5x5x2) pilots.

[333] Another example of MPP is shown later in the document. [334] The advantage of using DDPP is that it provides:

• More flexibility in supporting different channel delay and Doppler spreads. With DDPP the pilots can be placed anywhere on the continuous torus whereas when multiplexing the pilots in the time-frequency plane the options are limited to using discrete lattices.

• Lower latency than TFPP when the pilots are used for demodulating data, since in TFPP the lattice used for each pilot is coarser, and hence the average time between the data and the last pilot used for interpolation (the pilot following the data) is larger.

[335] TFPP has an advantage when trying to use a short pilot observation window as an equivalent quality of the channel estimation as is achieved with DDPP can be achieved with TFPP using a shorter observation window.

[336] 9.3 Potential Use of OTFS Based Reference Signals in LTE

[337] To support massive MIMO using channel reciprocity, all active UEs need to send pilots on the UL, so that the eNodeB can predict the channel for pre-coding its DL transmissions to these UEs. This requires supporting a large number of pilots. [338] One way to support a large number of pilots is to send OTFS-based reference signals using the resources allocated to the Sounding Reference Signals (SRSs). The SRSs in the LTE system are transmitted on the last symbol of the UL sub-frame. In TDD mode the SRSs can be scheduled with the shortest configuration period being 5 sub- frames (5 ms). With this configuration, the SRSs use a pilot lattice with N = 70, = 1 (see section 9.1 ). The torus associated with this lattice has the following delay and Doppler circumferences respectfully:

C T P = 66.67 με, = 200 Hz

[339] Assuming, as an example, an ETU channel with maximum Doppler frequency of up to 10 Hz, the maximum number of pilots that can be supported on this lattice is 13x10 = 130. With a practical finite observation window, the number of pilots that can be supported with good enough channel prediction of 5 ms (the distance between the pilots) into the future will be smaller, but is still expected to be very large.

[340] 9.4 Examples of Reference Signals for 5G Communications

[341] The proposed structure of the reference signals supports pre-coding of the downlink (DL) transmissions using channel reciprocity in the presence of time varying channels. We refer to pre-coding as a generalized beamforming scheme for supporting multi-layer transmission in a MIMO system.

[342] For the 5G reference signals it is proposed:

• To dedicate a subset of the time-frequency data lattice to reference signals

• To use the OTFS-based reference signals described in section 9.1 .

• To scramble the reference signals by multiplying their time-frequency samples by 2-D chirp sequences (e.g. 2-D Zadoff-Chu), for the purpose of limiting the inter-cell interference between the reference signals. The 2-D sequences will have a much richer selection of sequences with good cross correlation characteristics than single dimension sequences. • To pack all the reference signals (pilots) required for the operation of the system on the lattice dedicated to the reference signals (except maybe for demodulation reference signals, when needed, that could be sent with the data).

• To have the eNodeB (base station) transmit the DL reference signals continuously on the dedicated time-frequency DL pilot lattice.

• To have each UE (subscriber device) transmit its uplink (UL) reference signals on the dedicated UL pilot lattice before the eNodeB starts to pre-code its

transmissions to the UE.

[343] Separating the pilots from the data enables starting the transmission of the pilots before the data transmission starts, which enables the receiver to use a large pilot observation window resulting in higher channel observation resolution. The higher channel observation resolution enables:

• A better channel estimation

• A better pilot packing (due to reduced leakage between the received pilots)

• Improved predictability of the channel, which will improve the precoding in the

presence of Doppler spreads,

[344] all without impacting data transmission latency.

[345] To enable using the channel reciprocity for pre-coding, the channel response information has to be current during the DL transmission time. To achieve that, the eNodeB has to receive pilots from all active UEs on a regular basis so that the eNodeB has an up to date channel information whenever it needs to transmit to a UE. It can be shown that using the proposed pilot lattice that meets the conditions in (74) with a long enough observation window, provides good channels prediction that can be used for precoding.

[346] The number of pilots that need to be supported in the UL is equal to the number of active UEs in a cell and at the edges of the neighboring cells (to minimize interference between pilots and to allow support of interference cancelation in the DL) times the number of spatially multiplexed layers per UE. For the purpose of pilot transmissions, an active UE will be a UE that started sending or receiving data (alternatively it can be the time it wakes up to start sending or receiving data). At that point the UE will start sending the pilots. Until the eNodeB collects enough pilots from the UE to support pre-coding the eNodeB will send data to that UE without pre-coding. Also, to limit the number of UEs that need to send pilots continuously, the support for pre-coding per UE can be configurable. In that case the UE could send demodulation pilots only with the data.

[347] To support pre-coding on the DL, the proposal is to have the active UEs transmit their pilots on a regular basis. This will allow the eNodeB to collect a history of pilot information from all the active UEs. When a packet needs to be transmitted to a specific UE, the eNodeB can use the pilot history of the specific UE to calculate the pre- coder, and apply it to the transmitted packet. In addition to using the pilots for pre-coding DL transmissions, the eNodeB can use the regularly transmitted UL pilots to estimate the channel for demodulating packets transmitted on the UL. Using pilot history will also help improve the separation of the desired pilot from the other pilots and the quality of the channel estimation for demodulating the received signal. It is assumed that the transmissions on the UL are either not pre-coded or that the eNodeB has knowledge of the pre-coders used.

[348] In the DL, assuming the pre-coding is good enough, the UEs will not need demodulation reference signals (DM RSs) on the pre-coded layers. With that assumption, the eNodeB will only need to send reference signals on the spatial layers that do not use pre-coding. Hence the number of pilots on the DL will be much smaller than on the UL. The proposal is to send all the DL pilots on a regular basis. These pilots will be used by the UE both as DM RSs (for the non pre-coded transmissions) and for measuring the Observed Time Difference Of Arrival (OTDOA). If it is perceived that DM reference signals are still needed after the pre-coding, then the DM reference signals can be sent with the data.

[349] The number of pilots that need to be supported in the DL is equal to the number of non pre-coded layers per cell times the number of neighboring cells. This is to prevent the pilots from interfering with the pilots of the neighboring cells and to support measuring the OTDOA. [350] 9.4.1 Downlink Reference Signals

[351] For the LTE numerology it is proposed to use a pilot lattice with N = 28 and M = 1. This will support up to 40 pilots for ETU channels with average maximum Doppler frequency of 50 Hz (ETU-50), as shown in Example 1 in section 9.2. If a smaller number of pilots are needed this lattice will support higher Doppler spreads (e.g. with 20 pilots it can support a maximum Doppler frequency of 100 Hz) and vice versa.

[352] If the subcarrier spacing changes to 150 KHz, the data lattice parameters will be:

• dt = 1/140 ms

• df = 150 Hz

[353] This numerology also supports 40 pilots for ETU-50 channels using the same pilot lattice (N = 28, = 1). In this case all the pilots will be packed in the Doppler dimension.

[354] The DL pilots will be transmitted continuously on the pilot lattice. The UEs should collect a long enough history of the pilots to support good enough channel estimation for the purpose of receiving non pre-coded (data or control) transmissions from the eNodeB, and for improving the measured TOA.

[355] 9.4.2 Uplink Reference Signals

[356] For the LTE numerology it is proposed to use one or more adjacent pilot lattices with N = 28 and M = 1, and/or one or more adjacent pilot lattices with N = 14 and M = 1. Each of the first lattices will support up to 40 pilots for ETU-50 (as shown in Example 1 in section 9.2), and each of the second lattices will support 80 pilots. A good example is using one lattice with N = 28, M = 1 and one lattice with N = 14, = 1, in combination with a DL pilot lattice of N = 28, = 1. This example, demonstrated in Figure 25, supports 120 UL pilots and 40 DL pilots for ETU channels with an average Doppler frequency of 50 Hz. Figure 26 shows the representation of 40 equally spaced pilots on the Delay-Doppler plane that is associated with the pilot lattice of N = 28, = 1. [357] The pilot structure of Figure 25 supports both symmetric and asymmetric DL/UL transmissions. Note that with this pilot structure a switching guard period (GP) is required after every DL sub-frame. Hence, the more asymmetric the transmissions are the more switching guard periods (GPs) will be required. If the downlink-to-uplink switch-point periodicity is N sub-frames, then (N - 1) GPs will be required per N sub-frames.

[358] The pilot structure of Figure 25 adds overhead of 14.3% (2/14) for supporting 160 pilots. This is an overhead of 0.09% per pilot. Note that this overhead per pilot depends on the delay and Doppler spreads of the channels. The number of supported pilots in this pilot configuration will be doubled and the overhead per pilot will be cut by half for a maximum Doppler frequency of 25 Hz (instead of 50 Hz).

[359] 9.5 Comparison with LTE Pilot Packing

[360] As shown in section 9.4.2, the reference signal structure proposed in section 9.3 can accommodate 40 pilots on the DL and 120 pilots on the UL, all supporting an ETU- 50 channel. The DL pilots use 3.6% of the total PHY resources (data lattice), and the UL pilots occupy 10.7% of the total PHY resources.

[361] In LTE, the cell specific reference signals occupy 14.3% of the DL PHY resources. With these reference signals LTE supports up to 4 non pre-coded DL spatial layers. On the UL, to support 8 spatial layers, the UEs can be configured (in TDD mode) to send SRSs with a configuration period of 5ms. In this mode, to support ETU channels, a total of 8 SRSs can be supported. These SRSs occupy 1.43% of the UL PHY resources. These reference signals can't support any significant Doppler spread in ETU channels.

[362] The following Table 1 shows a summary of the comparison between the proposed reference signals and the LTE reference signals. Note that for supporting lower Doppler channels than shown in the table for the OTFS RSS, the number of OTFS RSs could either be increased proportionally to the decrease in the Doppler spread or the overhead could decrease proportionally. As an example, for ETU-5 channels the overhead of the OTFS RSs could decrease 10 times (to around 0.02% per RS) while still supporting 140 pilots on the DL and 20 pilots on the UL. Table 1

[363] Appendix A - Mathematical background

[364] A function g of a discrete variable ndt where η ε ΐ (the set of integer numbers) and dt ε R (the set of real numbers) is a function on the one dimensional lattice A t = TLdt = {ndt-. n E TL. dt E R}. It is well known that the Fourier transform of the discrete function g(ndt) is a continuous periodic function with period 1/dt. The discrete Fourier transform transforms the function g(ndt) to a continuous function G( ) that resides on = [0,1/ dt). Since the discrete Fourier transform translates a multiplication of two functions on the lattice A t to a circular convolution, it is convenient to refer to G(/) as residing on a circle with a circumference of 1/dt.

[365] Similar to the one dimensional case, it can be shown that the discrete symplectic Fourier transform (a twisted version of the two dimensional discrete Fourier transform) transforms a function g of two discrete variables to a function of two continuous periodic variables. Assume that the function g resides on the following lattice:

A t = TLdt 0 TLdf = {(ndt, mdf): n, m E TL, dt, df E R} (75)

[366] The discrete symplectic Fourier transform of gindt. mdf) is given by:

SF(g)(r, v) = G(r, v) = dtdf■ e -2nj{vndt-rmdf) i g(ndt, ndf)

(76) n,m [367] The function G(r,v) resides on a two-dimensional plane τ χ ν = [0,1/ df)x[0,l/dt) or equivalently on a torus with circumferences 1/df in the τ dimension and 1/dt in the v dimension. This torus is referred to as the torus associated with the lattice A t f . An example is depicted in Figure 27.

[368] Appendix B - Effect of Staggering the LTE UL DM Reference Signals

[369] The uplink demodulation reference signals in LTE are defined by the time- domain cyclic shift τ(Τ) of the base sequence r uv (k) according to r uv (mL RS + k, T = w m {X)e^ k r uv {k) | 0 < k≤ L RS - 1, m = 0,1 (77)

[370] Where L RS is the length of the reference signal sequence (in number of subcarriers), λ is the spatial layer index, [w Q (X) w^X)] = [+1 + 1] is the Orthogonal Cover Code (OCC), and r uv (k) is the base (Zadoff-Chu) sequence. The term e jTWk represents the layer dependent cyclic shift which separates the pilots of the different layers. For rank 4 UL transmission the 4 UL reference signals can be represented in the Delay-Doppler plane as shown in Figure 28. As can be seen from Figure 28, all 4 reference signals have the same Doppler shift. Staggering the reference signals as shown in Figure 29 enables better estimation of the channel when using a small number of PRBs (small observation window) as shown in Figure 30.

[371] Appendix C MPP Examples

[372] Using the same numerology and channel parameters of example 1 in section 9.2, chose the lattice points as shown in Figure 31 . The red columns form the same torus as in example 1 and can support a maximum of 65 pilots. The remaining red points can be viewed as 13 coarse lattices, each with N=28 and M=12. The circumferences of torus associated with this lattice are

C T P = 5.56iis , = 500 Hz

[373] This torus supports 1x5 pilots, so the total number of pilots that can be supported by these 13 coarse lattices is 5x13=65. Hence the total maximum number of pilots that can be supported by the pilots' sample points in Figure 31 is 130. A more practical number is 40 on the first lattice (same as in Example 1 ) and 4x13 = 52 on the coarser lattices, for a total of 92 pilots.

[374] For the same example we can partition the pilots' sample points differently, into 25 lattices with N = 28 and M = 12. Each such lattice supports a maximum of 1x5 pilots for a total of 5x25 = 125 pilots and a more practical number of 4x25 = 100 pilots.

[375] Figure 32 shows an example communication network in which the disclosed technology can be embodied. The network 3200 may include a base station transmitter that transmits wireless signals s(t) (downlink signals) to one or more receivers r(t) which may be located in a variety of locations, including inside or outside a building and in a moving vehicle. The receivers may transmit uplink transmissions to the base station, typically located near the wireless transmitter.

[376] Figure 33 shows a flowchart of an example method 3300 of wireless communication. The method 3300 includes the following operations: (3302) determining a maximum delay spread for a transmission channel, (3304) determining a maximum Doppler frequency spread for the transmission channel, (3306) allocating a set of transmission resources in a time-frequency domain to a number of pilot signals based on the maximum delay spread and the maximum Doppler frequency spread, and (3308) transmitting the pilot signals over a wireless communication channel using transmission resources. Various examples and options are disclosed in the present document, in particular, in Section 9.

[377] In some embodiments, each pilot signal may correspond to a delta function in the delay-Doppler domain.

[378] In some embodiments, the allocating operation 3306 may include staggering transmission resources for the number of pilots with respect to each other such that at least some pilots occupy transmission resources that do not occur on a rectangular grid in the delay-Doppler domain. In some embodiments, every other pilot signal position may be staggered from n original position on the rectangular grid. For example, in various embodiments, all even-numbered (or odd-numbered) pilot signals may be staggered. [379] In some embodiments, the set of transmission resources in the time-frequency domain occupied by any given pilot signal corresponds to a lattice comprising time instances uniformly distributed along a time axis and having a first step size and frequencies uniformly distributed along a frequency axis and having a second step size. It will be understood that the step sizes in the time-frequency domain are different from frequency domain spacing of pilot signals.

[380] In some embodiments, the set of transmission resources in the time-frequency domain occupied by the pilot signal correspond to a lattice comprising time instances nonuniform ly distributed along a time axis.

[381] In some embodiments, the set of transmission resources in the time-frequency domain occupied by at least one pilot signal correspond to a lattice comprising frequencies that are non-uniformly distributed along a frequency axis.

[382] In some embodiments, the set of transmission resources in the time-frequency domain occupied by at least one pilot signal are non-overlapping with another set of resources in the time-frequency domain over which user data is transmitted by the wireless communication device.

[383] In some embodiments, the operation 3308 of transmitting includes transmitting the pilot signal to a given user equipment prior to transmitting data to the user equipment.

[384] In some embodiments, the pilot signals may be generated by scrambling a basis signal using a two-dimensional (2-D) chirp sequence. In some embodiments, the pilot signals may be generated by cyclically shifting by a different amount a root 2-D Zadoff-Chu sequence. The shift may be performed in the time domain and/or in the frequency domain.

[385] In some embodiments, the transmission 3308 may be performed on a continuous basis from the transmitter to a user equipment, regardless of there is data transmission going on from the transmitter to the UE. In some embodiments, data may be pre-coded prior to the transmission.

[386] In some embodiments, each pilot signal generated by the wireless device using the method 3300 may occupy non-overlapping and distinct transmission resources. [387] In some embodiments, the wireless communication device includes a base station, the method 3300 further including generating at least two pilot signals occupying two sets of transmission resources are non-overlapping in the delay-Doppler domain. In some embodiments, the at least two pilot signals use non-overlapping delay domain resources. In some embodiments, the at least two pilot signals use non-overlapping Doppler-domain resources.

[388] In some embodiments, the wireless communication device includes a user equipment, and wherein the set of transmission resources are specified to the wireless communication device in a upper layer message.

[389] Figure 34 is a block diagram of an example of a wireless communication apparatus 3400 that includes a memory 3402 for storing instructions, a processor 3404 and a transmitter 3406. The transmitter 3406 is communicatively coupled with the processor 3404 and the memory 3402. The memory 3402 stores instructions for the processor 3404 to generate a pilot signal according to the methods described here (e.g., method 3300 and method 3500). The transmitter 3404 transmits the pilot signal over a wireless communication channel using transmission resources that are designated for pilot signal transmission.

[390] Figure 35 shows a flowchart of an example method 3500 of wireless communication. The method 3500 includes the following operations: determining (3502) a maximum delay spread for a transmission channel, determining (3504) a maximum Doppler frequency spread for the transmission channel, determining (3506) a number of pilot signals that can be transmitted using a set of two-dimensional transmission resources at least based on the maximum delay spread and the maximum Doppler frequency spread, allocating (3508) the set of transmission resources from a two-dimensional set of resources to the number of pilot, and transmitting (3510) the pilot signals over a wireless communication channel using transmission resources. Various examples and options are disclosed in the present document, in particular, in Section 9.

[391] In some embodiments, the operation 3506 may include determining the number of pilot signals based on one or more of a number of receivers to send the pilot signals to, a number of transmission layers used for transmissions to the receivers, a number of receivers that are also transmitting pilot signals, and possible interference from another cell's pilot signals. As previously described, a target observation window may be determined in the time-frequency domain based on the desired resolution and observation time.

[392] In some embodiments, pilot signals may be staggered. Some examples are shown and described with respect to Figure 29 and Figure 30. The staggering may be achieve by re-locating pilots from a position on a grid to a position along one of the dimensions (time or frequency) to maximize the separation from the non-staggered pilots.

[393] Figure 36 is a block diagram of an example of a wireless communication apparatus that can be used for embodying some techniques disclosed in this patent document. The apparatus 3600 may be used to implement method 3300 or 3500. The apparatus 3600 includes a processor 3602, a memory 3604 that stores processor- executable instructions and data during computations performed by the processor. The apparatus 3600 includes reception and/or transmission circuitry 3606, e.g., including radio frequency operations for receiving or transmitting signals.

[394] It will be appreciated that various techniques are disclosed for pilot packing in an OTFS-based communication network.

[395] The disclosed and other embodiments and the functional operations described in this document can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this document and their structural equivalents, or in combinations of one or more of them. The disclosed and other embodiments can be implemented as one or more computer program products, i.e., one or more modules of computer program instructions encoded on a computer readable medium for execution by, or to control the operation of, data processing apparatus. The computer readable medium can be a machine-readable storage device, a machine-readable storage substrate, a memory device, a composition of matter effecting a machine-readable propagated signal, or a combination of one or more them. The term "data processing apparatus" encompasses all apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, or multiple processors or computers. The apparatus can include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them. A propagated signal is an artificially generated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal, that is generated to encode information for transmission to suitable receiver apparatus.

[396] A computer program (also known as a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program does not necessarily correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.

[397] The processes and logic flows described in this document can be performed by one or more programmable processors executing one or more computer programs to perform functions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application specific integrated circuit).

[398] Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read only memory or a random access memory or both. The essential elements of a computer are a processor for performing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto optical disks, or optical disks. However, a computer need not have such devices. Computer readable media suitable for storing computer program instructions and data include all forms of non volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto optical disks; and CD ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.

[399] While this document contains many specifics, these should not be construed as limitations on the scope of an invention that is claimed or of what may be claimed, but rather as descriptions of features specific to particular embodiments. Certain features that are described in this document in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable sub-combination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a sub-combination or a variation of a sub-combination. Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results.

[400] Only a few examples and implementations are disclosed. Variations, modifications, and enhancements to the described examples and implementations and other implementations can be made based on what is disclosed.