Login| Sign Up| Help| Contact|

Patent Searching and Data


Matches 701 - 750 out of 171,548

Document Document Title
WO/2022/085197A1
The present invention is a voice signal conversion model learning device provided with: a learning data acquisition unit for acquiring learning input data, which is an input voice signal; and a learning stage conversion unit for executin...  
WO/2022/086252A1
Disclosed are an electronic device and a method for controlling the electronic device. Particularly, the electronic device comprises: a microphone; memory for storing data relating to a voice recognition model; and a processor which, whe...  
WO/2022/085506A1
A content output device (20) comprises: a content acquisition unit (30) that acquires content; a state detection unit (34) that detects the psychological state of a user in response to the content; a storage unit (24) that stores psychol...  
WO/2022/085296A1
Provided is an information processing device which performs a punctuation mark recovery process on text data obtained by automatic sound recognition. The information processing device comprises: a modifier which inserts a modification ...  
WO/2022/083039A1
A speech processing method and apparatus, a computer storage medium, and an electronic device, relating to the technical field of speech processing. The speech processing method comprises: acquiring a speech sequence, performing framing ...  
WO/2022/087117A2
A method (300) for evaluating a verification model (146) includes receiving first and second sets of verification results (148) where each verification result indicates whether a primary model or an alternative model verifies an identity...  
WO/2022/081590A1
Techniques are described herein for identifying a failed hotword attempt. A method includes: receiving first audio data; processing the first audio data to generate a first predicted output; determining that the first predicted output sa...  
WO/2022/081602A1
A method of determining an alignment sequence between a reference sequence of symbols and a hypothesis sequence of symbols includes loading a reference sequence of symbols to a computing system and creating a reference finite state autom...  
WO/2022/078189A1
Provided are a control method and apparatus for supporting a dynamic intention, and a storage medium. The method comprises: firstly, a robot device acquiring input information from a user; next, identifying intention information represen...  
WO/2022/081937A1
A technique for semantic search and retrieval that is event-based, wherein is event is composed of a sequence of observations that are user speech or physical actions. Using a first set of conversations, a machine learning model is train...  
WO/2022/079049A2
Apparatus for encoding a plurality of audio objects, comprising: an object parameter calculator (100) configured for calculating, for one or more frequency bins of a plurality of frequency bins related to a time frame, parameter data for...  
WO/2022/081186A1
Techniques are described herein for detecting and handling failures in other automated assistants. A method includes: executing a first automated assistant in an inactive state at least in part on a computing device operated by a user; w...  
WO/2022/080659A1
Disclosed is a control method for an electronic device. The control method according to the present disclosure comprises the steps of: extracting log data related to a set function from log data including a plurality of functions perform...  
WO/2022/080395A1
An audio synthesizing method according to one aspect of the present invention is realized by using a computer, wherein score data and acoustic data are received via a user interface, and on the basis of a score encoder and the acoustic d...  
WO/2022/079853A1
This music data processing system comprises: a recording unit 12, 13, in which previously stored first music data is played back, and second music data that is performed in conjunction with the playback of the first music data is recorde...  
WO/2022/081678A1
Described is a method of training a deep-learning-based system for sound source separation. The system comprises a separation stage for frame-wise extraction of representations of sound sources from a representation of an audio signal, a...  
WO/2022/081185A1
Implementations relate to automatic generation of speaker features for each of one or more particular text-dependent speaker verifications (TD-SVs) for a user. Implementations can generate speaker features for a particular TD-SV using in...  
WO/2022/079589A1
A ventilation system with an inlet and outlet, the ventilation system being mounted to a wall, the ventilation system including: an acoustic member absorbing sound from the outlet; the acoustic member includes a film-like body portion op...  
WO/2022/079165A1
Described herein is a method for training a machine learning algorithm. The method may comprise receiving a first input multichannel audio signal. The method may comprise generating, using the machine learning algorithm, an intermediate ...  
WO/2022/081891A1
A computer system for generating graphics content receives text or information specifying an amount of spoken language and uses NLP to extract linguistic structures associated with the text or the amount of spoken language to determine m...  
WO/2022/077883A1
Provided are a material fulfilment system and method, the system comprising a call information receiving module used for receiving call information, a voice tag module used for voice recognition of tag information in the call information...  
WO/2022/078905A1
According to embodiments, similarity values of the voice signals may be obtained, wherein a similarity value may indicate a level of similarity between two voice signals. According to embodiments, the audio signal may be rendered by spat...  
WO/2022/080774A1
The present invention relates to a speech disorder assessment device, and comprises: a communication unit which receives recording data including an utterance voice recorded while performing at least one utterance task by a person subjec...  
WO/2022/081962A1
Example implementations of the present disclosure relate to machine learning for microphone style transfer, for example, to facilitate augmentation of audio data such as speech data to improve robustness of machine learning models traine...  
WO/2022/081688A1
A method (300) of generating an accurate speaker representation for an audio sample (202) includes receiving a first audio sample from a first speaker (10) and a second audio sample from a second speaker. The method includes dividing a r...  
WO/2022/078634A1
There are disclosed techniques for generating an audio signal and training an audio generator. An audio generator (10) may generate an audio Signal (16) from an input Signal (14) and target data (12) representing the audio Signal (16). I...  
WO/2022/078164A1
A sound quality evaluation method and apparatus, and an electronic device (20). The method comprises: performing recording during the playback of a standard audio to obtain a signal to be evaluated (S11, S21, S31); determining a first po...  
WO/2022/079264A2
Described herein is a method of processing an audio signal using a deep-learning-based generator, wherein the method includes the steps of: (a) inputting the audio signal into the generator for processing the audio signal; (b) mapping a ...  
WO/2022/078506A1
A computer-implemented method of building a multilingual acoustic model for automatic speech recognition in a low resource setting includes training a multilingual network on a set of training languages with an original transcribed train...  
WO/2022/078960A1
The present disclosure provides a decoder configured to receive a finite bitrate stream that includes a quantized latent frame, where the quantized latent frame includes a quantized representation of a current frame of a signal in a late...  
WO/2022/079164A2
The present disclosure relates to a method and system for performing packet loss concealment using a neural network system. The method comprises obtaining a representation of an incomplete audio signal, inputting the representation of th...  
WO/2022/079776A1
An audio signal processing device according to an embodiment comprises a plurality of audio signal processing units and a plurality of buffers. Each of the plurality of audio signal processing units belongs to any of a plurality of group...  
WO/2022/078760A1
The invention relates to a device for transmitting mechanical vibrations to flowable media. The device is characterized in that the amplitudes to normal of the contact surface points of a resonator are substantially uniform during a reso...  
WO/2022/079854A1
This acoustic signal enhancement device comprises: a temporal-spatial covariance matrix estimation unit 2 that uses power λt,f (n) of a sound source n and an observation signal vector Xt,f configured from an observation signal xm,t,f of...  
WO/2022/079937A1
This invention relates generally to speech processing and more particularly to end-to-end automatic speech recognition (ASR) that utilizes long contextual information. Some embodiments of the invention provide a system and a method for e...  
WO/2022/079129A1
There are disclosed techniques for generating an audio signal and training an audio generator. An audio generator (10) may generate an audio signal (16) from an input signal (14) and target data (12) representing the audio signal (16). T...  
WO/2022/079848A1
A speech enhancement means 81 determines an enhancement mask generated based on a mask for speech enhancement, when a test utterance is input as speech data. A first hyper-parameter optimization means 82 determines, when the test utteran...  
WO/2022/078146A1
A speech recognition method and apparatus, a device, and a storage medium, relating to the field of speech recognition. Speech recognition is performed using attention coding and time series decoding methods, and attention coding is perf...  
WO/2022/081915A1
Described herein is a method of processing an audio signal using a neural network or using a first and a second neural network. Described is further a method of training said neural network or of jointly training a set of said first and ...  
WO/2022/081669A1
The method S200 can include: at an aircraft, receiving an audio utterance from air traffic control S210, converting the audio utterance to text, determining commands from the text using a question-and-answer model S240, and optionally co...  
WO/2022/080788A1
The present invention relates to a surgical robot system using a headset-based voice recognition microphone and, specifically, to a surgical robot system in which a user, while checking endoscopic images through a virtual reality (VR) he...  
WO/2022/079263A1
A neural network system is provided, implementing a generative model for autoregressively generating a distribution for a plurality of current filter-bank samples of an audio signal, wherein the current samples correspond to a current ti...  
WO/2022/081374A1
Processor(s) of a client device can: identify a textual segment stored locally at the client device; process the textual segment, using an on-device TTS generator model, to generate synthesized speech audio data that includes synthesized...  
WO/2022/081141A1
According to an aspect, a method for distributed sound/image recognition using a wearable device includes receiving, via at least one sensor device, sensor data, and detecting, by a classifier of the wearable device, whether or not the s...  
WO/2022/081595A1
Techniques are described herein for cross-device data synchronization. A method includes: executing a first instance of an automated assistant at least in part on a first computing device; receiving audio data that captures a spoken utte...  
WO/2022/082021A1
The present invention relates to a method for predicting transform coefficients representing frequency content of an adaptive block length media signal, by receiving a frame and receiving block length information indicating a number of q...  
WO/2022/082036A1
The embodiments execute machine-learning architectures for biometric-based identity recognition (e.g., speaker recognition, facial recognition) and deepfake detection (e.g., speaker deepfake detection, facial deepfake detection). The mac...  
WO/2022/081167A1
In example implementations, an apparatus is provided. The apparatus includes a plurality of microphones to record background sounds, a noise cancellation component to generate an inverted signal to negate the background sounds from an ou...  
WO/2022/078728A1
Disclosed is an assembly (1) for active control of the rolling noise for a motor vehicle, comprising a path control device (2), comprising another sensor (22, 22') fastened on the steering knuckle (20, 20'), such as an accelerometer, the...  
WO/2022/079365A1
The processing of a signal y(t) from a microphone (MIC) of a device further comprising at least one loudspeaker (HP) intended to be powered by a signal x(t) aims to limit an echo effect induced by the microphone (MIC) picking up a sound ...  

Matches 701 - 750 out of 171,548