Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
APPARATUSES AND METHODS FOR PROVIDING FEEDBACK
Document Type and Number:
WIPO Patent Application WO/2021/083521
Kind Code:
A1
Abstract:
A retransmission method in a communication system, wherein the communication system comprises at least one transmitter and at least one receiver with a communication channel between the transmitter and the receiver, the method comprising: utilising in the transmitter a transmitter algorithm with trainable weights and in the receiver a receiver algorithm with trainable weights; generating (302) by the transmitter symbols to be transmitted based on a message to be sent and feedback received from the receiver and transmitting the symbols; generating (304) by the receiver a predicted message based on the received symbols, evaluating (306) the predicted message based on a criterion and providing (312), utilising an algorithm with trainable weights, feedback symbols as a response to the transmitter if the evaluation indicates the predicted message is not acceptable.

Inventors:
GOUTAY MATHIEU (FR)
AIT AOUDIA FAYCAL (FR)
HOYDIS JAKOB (FR)
Application Number:
PCT/EP2019/079805
Publication Date:
May 06, 2021
Filing Date:
October 31, 2019
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NOKIA TECHNOLOGIES OY (FI)
International Classes:
H04L1/16
Other References:
GOEKTEPE BARIS ET AL: "Subcode-Based Early HARQ for 5G", 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), IEEE, 20 May 2018 (2018-05-20), pages 1 - 6, XP033369802, DOI: 10.1109/ICCW.2018.8403491
STRODTHOFF NILS ET AL: "Machine Learning for Early HARQ Feedback Prediction in 5G", 2018 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), IEEE, 9 December 2018 (2018-12-09), pages 1 - 6, XP033519181, DOI: 10.1109/GLOCOMW.2018.8644343
Attorney, Agent or Firm:
KOLSTER OY AB (FI)
Download PDF:
Claims:
Claims

1. A retransmission method in a communication system, wherein the communication system comprises at least one transmitter and at least one receiver with a communication channel between the transmitter and the receiver, the method comprising: utilising in the transmitter a transmitter algorithm with trainable weights and in the receiver a receiver algorithm with trainable weights; generating by the transmitter symbols to be transmitted based on a message to be sent and feedback received from the receiver and transmitting the symbols; generating by the receiver a predicted message based on the received symbols, evaluating the predicted message based on a criterion and providing, utilising an algorithm with trainable weights, feedback symbols as a response to the transmitter if the evaluation indicates the predicted message is not acceptable.

2. The method of claim 1, wherein training of the weights of the transmitter and receiver algorithms comprises: selecting initial weights for transmitter and receiver algorithms; sampling a given number of messages for transmission; computing a predicted message and feedback, for each message for each training iteration round; update the weights for transmitter and receiver algorithms based on a loss function selected on the basis of the type of messages; repeat the three previous steps a desired number of iteration rounds to obtain weights for the transmitter and receiver algorithms.

3. The method of claim 1, wherein evaluating utilises an algorithm with trainable weights and the training of evaluator algorithm with trainable weights comprises: selecting initial weights for trainable weights of the evaluator algorithm; selecting a given number of messages; computing forward pass through transmitter, channel and receiver to obtain predicted message for each message; computing evaluation output utilising the evaluator algorithm; update the weights for evaluator algorithm by performing a gradient descent on a loss function selected on the basis of the type of the measure; repeat the three previous steps until a predetermined criterion fulfils to obtain weights for the evaluator algorithm.

4. The method of claim 2 or 3, further comprising: repeating the steps a predetermined number of times.

5. The method of claim 2 or 3, further comprising: repeating the steps until the loss function has reached a given threshold.

6. The method of claim 2 or 3, wherein the selected initial weights are random. 7. The method of claim 1, wherein the messages to be sent are sequences of bits encoded with a cyclic redundancy check, CRC, and the evaluation comprises comparing bits of the predicted message to the CRC.

8. The method of claim 1, wherein the messages to be sent are multimedia data and the evaluation comprises evaluating the quality of the received message.

9. The method of claim 2, further comprising: updating the weights by performing gradient descent on the loss where θT, θR are transmitter and receiver weights, B is the number of messages, s and ŝ are the transmitted and predicted messages, I is the number of iteration rounds and loss (·) is a loss function selected on the basis of the type of messages. 10. An apparatus comprising at least one processor; at least one memory including computer program code; the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to perform: utilising a transmitter algorithm with trainable weights; generating and transmitting symbols based on a message to be sent; receiving feedback symbols from a receiver; generating new symbols to be transmitted based on the message to be sent and the received feedback symbols, the new symbols being different than previously transmitted; and transmitting the new symbols.

11. An apparatus comprising at least one processor; at least one memory including computer program code; the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to perform: utilising a receiver algorithm with trainable weights; controlling reception of symbols from a transmitter; evaluating the predicted message based on a criterion; providing, utilising an algorithm with trainable weights, feedback symbols as a response to the transmitter if the predicted message is not acceptable; and controlling transmission of the feedback symbols to the transmitter.

12. The apparatus of claim 10 or 11, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus further to perform: selecting initial weights for transmitter and receiver algorithms; sampling a given number of messages for transmission; computing a predicted message and feedback, for each message for each training iteration round; update the weights for transmitter and receiver algorithms based on a loss function selected on the basis of the type of messages; repeat the three previous steps a desired number of iteration rounds to obtain weights for the transmitter and receiver algorithms.

13. The apparatus of claim 11, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus further to perform: utilising an algorithm with trainable weights in evaluating and training of the algorithm comprises selecting initial weights for evaluator algorithm; selecting a given number of messages; computing forward pass through transmitter, channel and receiver to obtain predicted message for each message; computing evaluation output utilising the evaluator algorithm; update the weights for evaluator algorithm by performing a gradient descent on a loss function selected on the basis of the type of the measure; repeat the three previous steps until a predetermined criterion fulfils to obtain weights for the evaluator algorithm.

14. The apparatus of claim 12 or 13, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus further to perform: repeating the steps a predetermined number of times.

15. The apparatus of claim 12 or 13, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus further to perform: repeating the steps until the loss function has reached a given threshold.

16. The apparatus of claim 12 or 13, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus further to perform: selecting random initial weights.

17. The apparatus of claim 10 or 11, wherein the messages to be sent are sequences of bits encoded with a cyclic redundancy check, CRC, and the evaluation comprises comparing bits of the predicted message to the CRC.

18. The apparatus of claim 11, wherein the messages to be sent are multimedia data and the evaluation comprises evaluating the quality of the received message.

19. A computer program comprising instructions for causing an apparatus of a communication system to perform any of the method steps of claims 1 to 9.

Description:
APPARATUSES AND METHODS FOR PROVIDING FEEDBACK

Technical Field

The exemplary and non-limiting embodiments of the invention relate generally to communications.

Background

Wireless telecommunication systems are under constant development. There is a constant need for higher data rates and high quality of service. Reliability requirements are constantly rising and ways and means to ensure reliable connections and data traffic while keeping transmission delays minimal are constantly under development.

Hybrid Automatic Repeat Request (HARQ) is an error-control method for data transmission which combines standard Automatic Repeat Request (ARQ) with error-correcting codes. ARQ is a process where each message is transmitted along with a short error-detecting code. If the receiver detects an error in the message, it can either send a negative-acknowledgement to the transmitter or just choose not to send an acknowledgement, depending on the implementation. The transmitter then sends the exact same message again. The use of error-correcting codes in HARQ allows the receiver to correct a certain number of errors in a received message.

One drawback of HARQ is that the feedback sent by the receiver is only binary. When an error is detected in a received message, the receiver does not send any information to the transmitter apart from a possible negative- acknowledgement.

Brief description

According to an aspect of the present invention, there are provided method of claim 1.

According to an aspect of the present invention, there are provided apparatuses of claims 10 and 11.

One or more examples of implementations are set forth in more detail in the accompanying drawings and the description below. Other features will be apparent from the description and drawings, and from the claims. The embodiments and or examples and features, if any, described in this specification that do not fall under the scope of the independent claims are to be interpreted as examples useful for understanding various embodiments of the invention.

Brief description of the drawings

In the following the invention will be described in greater detail by means of preferred embodiments with reference to the accompanying drawings, in which

Figure 1 illustrates a general architecture of an exemplary communication system;

Figure 2 illustrates an example of a simple communication system;

Figure 3 is a flowchart illustrating an example of the operation of the a communication system;

Figure 4 illustrates an example of a simple communication system implemented using an autoencoder;

Figure 5 illustrates an example of an unfolded autoencoder;

Figures 6 and 7 are flowcharts illustrating examples of training a neural network; and

Figure 8 illustrates an example of an apparatus.

Detailed description of some embodiments

In the following, different exemplifying embodiments will be described using, as an example of an access architecture to which the embodiments may be applied, a radio access architecture based on long term evolution advanced (LTE Advanced, LTE-A) , enhanced LTE (eLTE), or new radio (NR, 5G), without restricting the embodiments to such an architecture, however. It is obvious for a person skilled in the art that the embodiments may also be applied to other kinds of communications networks having suitable means by adjusting parameters and procedures appropriately. Some examples of other options for suitable systems are the universal mobile telecommunications system (UMTS) radio access network (UTRAN or E-UTRAN), long term evolution (LTE, the same as E-UTRA), wireless local area network (WLAN or WiFi), worldwide interoperability for microwave access (WiMAX), Bluetooth®, personal communications services (PCS), ZigBee®, wideband code division multiple access (WCDMA), systems using ultra-wideband (UWB) technology, sensor networks, mobile ad-hoc networks (MANETs) and Internet Protocol multimedia subsystems (IMS) or any combination thereof.

Fig. 1 depicts examples of simplified system architectures only showing some elements and functional entities, all being logical units, whose implementation may differ from what is shown. The connections shown in Fig. 1 are logical connections; the actual physical connections may be different. It is apparent to a person skilled in the art that the system typically comprises also other functions and structures than those shown in Fig. 1.

The embodiments are not, however, restricted to the system given as an example but a person skilled in the art may apply the solution to other communication systems provided with necessary properties.

The example of Fig. 1 shows a part of an exemplifying radio access network.

Fig. 1 shows user devices 100 and 102 configured to be in a wireless connection on one or more communication channels in a cell with an access node, a distributed unit or an (e/g)NodeB) 104 providing the cell. The physical link from a user device to a (e/g)NodeB is called uplink or reverse link and the physical link from the (e/g)NodeB to the user device is called downlink or forward link. It should be appreciated that (e/g)NodeBs or their functionalities may be implemented by using any node, host, server or access point etc. entity suitable for such a usage.

A communications system typically comprises more than one (e/g)NodeB in which case the (e/g)NodeBs may also be configured to communicate with one another over links, wired or wireless, designed for the purpose. These links may be used for data and signaling purposes. The (e/g)NodeB is a computing device configured to control the radio resources of communication system it is coupled to. The (e/g)NodeB may also be referred to as a base station, an access point or any other type of interfacing device including a relay station capable of operating in a wireless environment. The (e/g)NodeB includes or is coupled to transceivers. From the transceivers of the (e/g)NodeB, a connection is provided to an antenna unit that establishes bi-directional radio links to user devices. The antenna unit may comprise a plurality of antennas or antenna elements. The (e/g)NodeB is further connected to core network 106 (CN or next generation core NGC). Depending on the system, the counterpart on the CN side can be a serving gateway (S-GW, routing and forwarding user data packets), packet data network gateway (P-GW), for providing connectivity of user devices (UEs) to external packet data networks, or mobile management entity (MME), etc. The core network further comprises a Core Access and Mobility Management Function, AMF, a Session Management Function, SMF, which is responsible for subscriber sessions, and User Plane Function, UPF. The user device (also called UE, user equipment, user terminal, terminal device, etc.) illustrates one type of an apparatus to which resources on the air interface are allocated and assigned, and thus any feature described herein with a user device may be implemented with a corresponding apparatus, such as a relay node. An example of such a relay node is a layer 3 relay (self-backhauling relay) or a layer 2 relay (Integrated Access and Backhaul) towards the base station.

The user device typically refers to a portable computing device that includes wireless mobile communication devices operating with or without a subscriber identification module (SIM), including, but not limited to, the following types of devices: a mobile station (mobile phone), smartphone, personal digital assistant (PDA), handset, device using a wireless modem (alarm or measurement device, etc.), laptop and/or touch screen computer, tablet, game console, notebook, and multimedia device. It should be appreciated that a user device may also be a nearly exclusive uplink only device, of which an example is a camera or video camera loading images or video clips to a network. A user device may also be a device having capability to operate in Internet of Things (loT) network which is a scenario in which objects are provided with the ability to transfer data over a network without requiring human-to-human or human-to-computer interaction. One technology in the above network may be denoted as narrowband Internet of Things (NB-lot). The user device may also be a device having capability to operate utilizing enhanced machine-type communication (eMTC). The user device may also utilize cloud. In some applications, a user device may comprise a small portable device with radio parts (such as a watch, earphones or eyeglasses) and the computation is carried out in the cloud. The user device (or in some embodiments a layer 3 relay node) is configured to perform one or more of user equipment functionalities. The user device may also be called a subscriber unit, mobile station, remote terminal, access terminal, user terminal or user equipment (UE) just to mention but a few names or apparatuses.

Various techniques described herein may also be applied to a cyber- physical system (CPS) (a system of collaborating computational elements controlling physical entities). CPS may enable the implementation and exploitation of massive amounts of interconnected ICT devices (sensors, actuators, processors microcontrollers, etc.) embedded in physical objects at different locations. Mobile cyber physical systems, in which the physical system in question has inherent mobility, are a subcategory of cyber-physical systems. Examples of mobile physical systems include mobile robotics and electronics transported by humans or animals.

Additionally, although the apparatuses have been depicted as single entities, different units, processors and/or memory units (not all shown in Fig. 1) may be implemented.

5G enables using multiple input - multiple output (MIMO) antennas, perhaps more base stations or nodes than the LTE (a so-called small cell concept), including macro sites operating in co-operation with smaller stations and employing a variety of radio technologies depending on service needs, use cases and/or spectrum available. 5G mobile communications support a wide range of use cases and related applications including video streaming, augmented reality, different ways of data sharing and various forms of machine type applications (such as (massive) machine-type communications (mMTC), including vehicular safety, different sensors and real-time control. 5G is expected to have multiple radio interfaces, namely below 6GHz, and mmWave, and also being integrable with existing legacy radio access technologies, such as the LTE. Integration with the LTE may be implemented, at least in the early phase, as a system, where macro coverage is provided by the LTE and 5G radio interface access comes from small cells by aggregation to the LTE. In other words, 5G is planned to support both inter-RAT operability (such as LTE-5G) and inter-Rl operability (inter-radio interface operability, such as below 6GHz - cmWave, above 6GHz -mmWave). One of the concepts considered to be used in 5G networks is network slicing in which multiple independent and dedicated virtual sub-networks (network instances) may be created within the same infrastructure to run services that have different requirements on latency, reliability, throughput and mobility.

The current architecture in LTE networks is fully distributed in the radio and fully centralized in the core network. The low latency applications and services in 5G require to bring the content close to the radio which leads to local break out and mobile edge computing (MEC). 5G enables analytics and knowledge generation to occur at the source of the data. This approach requires leveraging resources that may not be continuously connected to a network such as laptops, smartphones, tablets and sensors. MEC provides a distributed computing environment for application and service hosting. It also has the ability to store and process content in close proximity to cellular subscribers for faster response time. Edge computing covers a wide range of technologies such as wireless sensor networks, mobile data acquisition, mobile signature analysis, cooperative distributed peer-to-peer ad hoc networking and processing also classifiable as local cloud/fog computing and grid/mesh computing, dew computing, mobile edge computing, cloudlet, distributed data storage and retrieval, autonomic self-healing networks, remote cloud services, augmented and virtual reality, data caching, Internet of Things (massive connectivity and/or latency critical), critical communications (autonomous vehicles, traffic safety, real-time analytics, time- critical control, healthcare applications).

The communication system is also able to communicate with other networks, such as a public switched telephone network or the Internet 112, or utilize services provided by them. The communication network may also be able to support the usage of cloud services, for example at least part of core network operations may be carried out as a cloud service (this is depicted in Fig. 1 by "cloud" 114). The communication system may also comprise a central control entity, or a like, providing facilities for networks of different operators to cooperate for example in spectrum sharing.

Edge cloud may be brought into radio access network (RAN) by utilizing network function virtualization (NVF) and software defined networking (SDN). Using edge cloud may mean access node operations to be carried out, at least partly, in a server, host or node operationally coupled to a remote radio head or base station comprising radio parts. It is also possible that node operations will be distributed among a plurality of servers, nodes or hosts. Application of cloudRAN architecture enables RAN real time functions being carried out at the RAN side (in a distributed unit, DU 104) and non-real time functions being carried out in a centralized manner (in a centralized unit, CU 108).

It should also be understood that the distribution of labor between core network operations and base station operations may differ from that of the LTE or even be non-existent. Some other technology advancements probably to be used are Big Data and all-IP, which may change the way networks are being constructed and managed. 5G (or new radio, NR) networks are being designed to support multiple hierarchies, where MEC servers can be placed between the core and the base station or nodeB (gNB). It should be appreciated that MEC can be applied in 4G networks as well.

In an embodiment, 5G may also utilize satellite communication to enhance or complement the coverage of 5G service, for example by providing backhauling. Possible use cases are providing service continuity for machine-to- machine (M2M) or Internet of Things (loT) devices or for passengers on board of vehicles, or ensuring service availability for critical communications, and future railway/maritime/aeronautical communications. Satellite communication may utilize geostationary earth orbit (GEO) satellite systems, but also low earth orbit (LEO) satellite systems, in particular mega-constellations (systems in which hundreds of (nano)satellites are deployed). Each satellite 110 in the mega- constellation may cover several satellite-enabled network entities that create on- ground cells. The on-ground cells may be created through an on-ground relay node 104 or by a gNB located on-ground or in a satellite.

It is obvious for a person skilled in the art that the depicted system is only an example of a part of a radio access system and in practice, the system may comprise a plurality of (e/g)NodeBs, the user device may have an access to a plurality of radio cells and the system may comprise also other apparatuses, such as physical layer relay nodes or other network elements, etc. At least one of the (e/g)NodeBs may be a Home(e/g)nodeB. Additionally, in a geographical area of a radio communication system a plurality of different kinds of radio cells as well as a plurality of radio cells may be provided. Radio cells may be macro cells (or umbrella cells) which are large cells, usually having a diameter of up to tens of kilometers, or smaller cells such as micro-, femto- or picocells. The (e/g)NodeBs of Figure 1 may provide any kind of these cells. A cellular radio system may be implemented as a multilayer network including several kinds of cells. Typically, in multilayer networks, one access node provides one kind of a cell or cells, and thus a plurality of (e/g)NodeBs are required to provide such a network structure.

For fulfilling the need for improving the deployment and performance of communication systems, the concept of "plug-and-play" (e/g)NodeBs has been introduced. Typically, a network which is able to use "plug-and-play" (e/g)Node Bs, includes, in addition to Home (e/g)NodeBs (H(e/g)nodeBs), a home node B gateway, or HNB-GW (not shown in Fig. 1). A HNB Gateway (HNB-GW), which is typically installed within an operator’s network may aggregate traffic from a large number of HNBs back to a core network. As mentioned, radio access network may be split into two logical entities called Central Unit (CU) and Distributed Unit (DU). In prior art, both CU and DU supplied by the same vendor. Thus, they are designed together and interworking between the units is easy. The interface between CU and DU is currently being standardized by 3GPP and it is denoted FI interface. Therefore, in the future the network operators may have the flexibility to choose different vendors for CU and DU. Different vendors can provide different failure and recovery characteristics for the units. If the failure and recovery scenarios of the units are not handled in a coordinated manner, it will result in inconsistent states in the CU and DU (which may lead to subsequent call failures, for example). Thus there is a need to enable the CU and DU from different vendors to coordinate operation to handle failure conditions and recovery, taking into account the potential differences in resiliency capabilities between the CU and DU.

Fig. 2 illustrates an example of communication system in a simple form. The system comprises a transmitter 200, a receiver 202 and a channel 204 between the transmitter and the receiver. One way of interpreting a communication system is to implement it as an autoencoder. An autoencoder-based communication system implements the transmitter 200, channel 204, and receiver 202 as a single Neural Network (NN), which aims to reconstruct its inputs at its outputs. In an embodiment, the transmitter 200 may be implemented as a transmitter neural network and the receiver 202 as a receiver neural network. The channel 204 may also be implemented as a neural network. In general, an algorithm with trainable parameters may be used as well. The communication system of Fig. 2 may be implemented as an autoencoder in a following manner: the transmitter 200 encodes a message s from a finite discrete set into a finite-length vector of channel symbols denoted by x and transmits x to the channel 204. The receiver 202 takes as input from the channel a finite-length vector of samples denoted by y and generates a prediction of the sent messages denoted by s. A channel model, which generates probability density function y~Pr(y|x), may be implemented as non- trainable layers between the transmitter and the receiver. If the channel model is differentiable, then the end-to-end system can be trained by backpropagation, using stochastic gradient descent, SGD, or a variant, for example.

As mentioned, one drawback of HARQ is that the feedback sent by the receiver is only binary. When an error is detected in a received message, the receiver does not send any information to the transmitter apart from a possible negative-acknowledgement. In an autoencoder-based communication system the system may be trained to send and receive a more informative feedback, thus increasing the quality of service of the system.

Fig. 3 is a flowchart illustrating an example of the operation of a communication system of Fig. 2 implemented as an autoencoder-based communication system. In step 300, a message to be transmitted is selected. In an embodiment, the message s is drawn from a set S containing all possible messages that may be sent over a channel.

In step 302, the message s and the previous feedback from the receiver are fed to the transmitter which generates symbols x to be transmitted based on the message s and the feedback. At the first iteration, when the message is transmitted for the first time, the transmitter has not received any feedback and the feedback value is set to 0. The transmitter then transmits the symbols to the channel.

In step 304, the receiver receives the symbols y from the channel and generates or decodes a corresponding predicted message ŝ.

In step 306, the predicted message ŝ is evaluated at the receiver by an evaluator function. In an embodiment, an evaluation value e is obtained.

In step 308, if the evaluation value is good enough, then the predicted message s may be taken as the final decoded message 310. Otherwise, the receiver generates utilising an algorithm with trainable weights feedback symbols as a response to the transmitter for the next iteration and transmits 312 the response.

The transceiver receives the feedback, generates new symbols based on the message to be sent and the feedback symbols and transmits the message in step 302. The transmitter thus transmits the same information as on the previous iteration round but the generated symbols are different due to the received feedback.

Fig.4 illustrates an example of the communication system of Fig. 2 when implemented as an autoencoder based communication system. The figure shows transmitter 200, receiver 202, forward channel 204, feedback channel 400 and evaluator 402. The evaluator 402 has here shown outside the receiver 202, although it may also be a part of the receiver. In Fig. 4, z -1 denotes delay.

The transmitter 200 and the receiver 202, may at least partly be implemented as algorithms or neural networks comprising trainable parameters or weights θ T , θ R .

The transmitter neural network with trainable parameters θ T may be described by mapping where s is the message to be sent, is the vector of feedback symbols received by the transmitter from the receiver, n m and n f are the number of channels used to transmit the message and the feedback, respectively, and xt is the vector of message symbols the transmitter sends over the channel. The feedback symbol vector is a vector of complex baseband symbols.

The receiver neural network with trainable parameters θ R may be described by mapping where y t is the vector of symbols received by the receiver, ŝ is the decoded message, and f t is the vector of feedback symbols sent by the receiver. The vector h t can be interpreted as a receiver’s internal state and takes value in denote the sets of complex and real numbers.

As mentioned, the forward channel 204 from the transmitter to the receiver, a channel model which generates y t ~Pr(y t | x t ) may be implemented as a neural network or an algorithm with trainable parameters. Likewise, on the opposite direction 400, a channel model which generates may also be also implemented as a neural network or an algorithm with trainable parameters.

The evaluator 402 may be modelled by a function E: ŝ t Є S → e Є R which evaluates the predicted message ŝ t . If evaluation value e is good enough, then the predicted message ŝ t may be taken as the final decoded message. Otherwise, feedback f t is to be transmitted to the transmitter as an input for the next iteration. The evaluator may inform 404 the receiver that feedback is to be sent to the transmitter-

The function E may depend on the messages sent. In an embodiment, when the messages comprise integers from a finite set, then ŝ t may be a probability vector over the possible integers. The evaluator may then be a threshold, and the integer having the highest probability of having been sent is taken as the message if the probability is over the threshold.

In an embodiment, the messages maybe sequences of bits encoded with a cyclic redundancy check, CRC. Now the evaluation may be a decoding algorithm checking if the CRC matches the sequence.

In an embodiment, the messages to be sent may be multimedia data. In this case, the evaluator may be a function that evaluates the quality of the received data.

In an embodiment, the transmitter neural network and the receiver neural network are trained to produce correct feedback and to react to the feedback. The autoencoder-based communication system may be viewed as a recurrent neural network which can be unfolded. Unfolding means here a technique where the multiple iteration rounds through the network at multiple time steps are unfolded into a single larger neural network, where the weights of the network are shared between every transmitter and receiver. Fig. 5 illustrates an example of an unfolded autoencoder of Fig.4, where the number of iterations of times steps are assumed to be three (time steps t= 0,1,2).

As before, the message to be transmitted is s, the symbols to be transmitted are denoted as x and they are based on the message s and the feedback from the receiver. At the first iteration, when the message is transmitted for the first time, the transmitter has not received any feedback and the feedback value is set to 0. Symbols arriving to the receiver through the channel are denoted as y, feedback generated by the receiver f,feedback received by the receiver through the channel is denoted with and h is the internal state of the receiver. The unfolded autoencoder comprises the first transmission 500 (t=0), and two successive transmissions 502 (t= 1), 504 (t= 2).

On the first iteration round or first transmission, 500 (t=0), there are transmitter 200A, forward channel 204A, receiver 202A and feedback channel 400A.

On the second iteration round or transmission, 502 ( t= 1), there are transmitter 200B, forward channel 204B, receiver 202B and feedback channel 400B.

Correspondingly, on the third iteration round or transmission, 504 (t= 2), there are transmitter 200C, forward channel 204C, receiver 202C and feedback channel 400C.

Figure 6 is a flowchart illustrating an embodiment. The flowchart describes training of the transmitter neural network and the receiver neural network (or training of algorithms with trainable parameters). The steps of the flowchart may be performed by an apparatus configured to perform the training of the neural networks. In an embodiment, regular backpropagation with Stochastic Gradient Descent, SGD, or a variant may be utilised in the training.

In step 600 of Fig. 6, the number B of messages to be used at each iteration round is selected and random initial weights are selected for transmitter and receiver neural networks, the same weights being used on each iteration round. Thus the same weights are shared between every transmitter (200A, 200B, 200C), and similarly between every receiver (202A, 202B, 202C), in the unfolded architecture of the autoencoder. In step 602 of Fig. 6, a given number B of messages {s i Є S}i=1,...,B is sampled for transmission.

In step 604 of Fig. 6, a forward pass through the autoencoder, i.e predicted message, feedback and receiver’s internal state for each message are computed through the unfolded autoencoder (for each iteration round 500, 502, 504);

In step 606 of Fig. 6, the weights (θ T , θ R ) for transmitter and receiver neural networks are updated based on a loss function selected on the basis of the type of messages. In an embodiment, gradient descent (using SGD or a variant) is performed on the loss: where loss(·) is a loss function that depends on the nature of the messages sent. I is the number of iterations. For example, if the messages are images, then the loss function could be the Mean Squared Error (MSE) of the pixel values between the sent and the received images.

The three previous steps are repeated 608 a desired number of times to obtain weights for the transmitter and receiver neural networks. The number of iteration rounds in the training may be fixed or it may be based on some criteria. For example, the training may be stopped after a given number of iterations or when the loss or a related performance metric (such as block error rate) have reached a given performance threshold.

Above, the number of messages B, learning rate and variant of SGD may be parameters of the training algorithm.

The evaluator 402 can be given or can be implemented as a trainable evaluator algorithm or a neural network. The evaluator neural network is trained in a similar manner like transmitter and receiver neural networks. In an embodiment, evaluator neural network is trained after training the transmitter and receiver neural networks. The evaluator neural network comprises trainable parameters θ E , i.e., For a given transmitter, channel, and receiver, the training can also be done with regular backpropagation with SGD, or a variant.

An example of the training of the evaluator neural network is illustrated in the flowchart of Fig. 7. In step 700 of Fig. 7, the number B E of messages to be used at each iteration round is selected and random initial weights are selected for the evaluator.

In step 702 of Fig. 7, a given number BE of messages {s i Є S}i=1,...,B E is sampled for transmission.

In step 704 of Fig. 7, a forward pass through the autoencoder to obtain predicted message for each message.

In step 706 of Fig. 7, a forward pass through the evaluator is computed to obtain i.e the evaluator output.

In step 708 of Fig. 7, a measure of difference is calculated between the generated evaluation and a true evaluation, such as a Peak Signal to Noise Ratio, PSNR, or Mean Squared Error, MSE, between s and ŝ ,.

In step 710 of Fig. 7, the weights (θ E ) for the evaluator neural network are updated by performing a gradient descent (using SGD or a variant) on the measure of difference, the updating of weights utilising a loss function selected on the basis of the type of the measure. For example, if the measure of difference is the MSE, then the evaluator loss can be computed as:

The three previous steps may be repeated 712 a desired number of times to obtain weights for the evaluator neural network. The stop criterion may be the same as in the case of training transmitter and receiver neural networks. The number of messages B E , learning rate and variant of SGD may be parameters of the training algorithm.

Fig. 8 illustrates an embodiment. The figure illustrates a simplified example of an apparatus 800 applying embodiments of the invention. It should be understood that the apparatus is depicted herein as an example illustrating some embodiments. It is apparent to a person skilled in the art that the apparatus may also comprise other functions and/or structures and not all described functions and structures are required. Although the apparatus has been depicted as one entity, different modules and memory may be implemented in one or more physical or logical entities.

The apparatus 800 of the example includes a control circuitry 802 configured to control at least part of the operation of the apparatus.

The apparatus may comprise a memory 804 for storing data. Furthermore the memory may store software 806 executable by the control circuitry 802. The memory may be integrated in the control circuitry. The apparatus may comprise one or more interface circuitries 808, 810. The apparatus may comprise an interface 808 for connecting the apparatus to a network or communication system. The interface may provide a wired or wireless connection to the communication system. The apparatus may further comprise a user interface 810 realised with known methods. The interfaces may be operationally connected to the control circuitry 802.

The software 806 may comprise a computer program comprising program code means adapted to cause the control circuitry 802 of the apparatus to perform the embodiments described above and in the claims.

The steps and related functions described in the above and attached figures are in no absolute chronological order, and some of the steps may be performed simultaneously or in an order differing from the given one. Other functions can also be executed between the steps or within the steps. Some of the steps can also be left out or replaced with a corresponding step.

The apparatuses or controllers able to perform the above-described steps may be implemented as an electronic digital computer, which may comprise a working memory (RAM), a central processing unit (CPU), and a system clock. The CPU may comprise a set of registers, an arithmetic logic unit, and a controller. The controller is controlled by a sequence of program instructions transferred to the CPU from the RAM. The controller may contain a number of microinstructions for basic operations. The implementation of microinstructions may vary depending on the CPU design. The program instructions may be coded by a programming language, which may be a high-level programming language, such as C, Java, etc., or a low-level programming language, such as a machine language, or an assembler. The electronic digital computer may also have an operating system, which may provide system services to a computer program written with the program instructions.

As used in this application, the term 'circuitry' refers to all of the following: (a) hardware -only circuit implementations, such as implementations in only analog and/or digital circuitry, and (b) combinations of circuits and software (and/or firmware), such as (as applicable): (i) a combination of processor(s) or (ii) portions of processor(s)/software including digital signal processor(s), software, and memory(ies) that work together to cause an apparatus to perform various functions, and (c) circuits, such as a microprocessor(s) or a portion of a microprocessor(s), that require software or firmware for operation, even if the software or firmware is not physically present. This definition of 'circuitry' applies to all uses of this term in this application. As a further example, as used in this application, the term 'circuitry' would also cover an implementation of merely a processor (or multiple processors) or a portion of a processor and its (or their) accompanying software and/or firmware. The term 'circuitry' would also cover, for example and if applicable to the particular element, a baseband integrated circuit or applications processor integrated circuit for a mobile phone or a similar integrated circuit in a server, a cellular network device, or another network device.

The computer program may be in source code form, object code form, or in some intermediate form, and it may be stored in some sort of carrier, which may be any entity or device capable of carrying the program. Such carriers include a record medium, computer memory, read-only memory, and a software distribution package, for example. Depending on the processing power needed, the computer program may be executed in a single electronic digital computer or it may be distributed amongst a number of computers.

The apparatus may also be implemented as one or more integrated circuits, such as application-specific integrated circuits ASIC. Other hardware embodiments are also feasible, such as a circuit built of separate logic components. A hybrid of these different implementations is also feasible. When selecting the method of implementation, a person skilled in the art will consider the requirements set for the size and power consumption of the apparatus, the necessary processing capacity, production costs, and production volumes, for example.

In an embodiment, the apparatus comprises means for utilising a receiver algorithm with trainable weights; means for controlling reception of symbols from a transmitter; means for evaluating the predicted message based on a criterion; means for providing, utilising an algorithm with trainable weights, feedback symbols as a response to the transmitter if the predicted message is not acceptable; and means for controlling transmission of the feedback symbols to the transmitter.

In an embodiment, the apparatus comprises means for utilising a receiver algorithm with trainable weights; means for controlling reception of symbols from a transmitter; means for evaluating the predicted message based on a criterion; means for providing, utilising an algorithm with trainable weights, feedback symbols as a response to the transmitter if the predicted message is not acceptable; and means for controlling transmission of the feedback symbols to the transmitter.

It will be obvious to a person skilled in the art that, as the technology advances, the inventive concept can be implemented in various ways. The invention and its embodiments are not limited to the examples described above but may vary within the scope of the claims.