
Patent Searching and Data


Matches 401 - 450 out of 171,548

Document Document Title
WO/2022/121182A1
The present application relates to artificial intelligence, and provides a voice activity detection method and apparatus, and a device and a computer-readable storage medium. The method comprises: obtaining an audio signal to be detected...  
WO/2022/124164A1
When an interest reaction by a child is detected based on input data from a child camera (25) and a child sensor, an HCU (1) identifies an attention object that is something outside a vehicle that a child shows interest in based on the c...  
WO/2022/124498A1
Disclosed are a lip sync video generation apparatus and method. The lip sync video generation apparatus, according to a disclosed embodiment, is a lip sync video generation apparatus comprising one or more processors and memory storing o...  
WO/2022/125290A1
A linguistic content and speaking style disentanglement model (100) includes a content encoder (110), a style encoder (130), and a decoder (150). The content encoder is configured to receive input speech (102) as input and generate a lat...  
WO/2022/122361A1
A noise cancellation enabled headphone to be worn on or over an ear of a user comprises a speaker (SP), a feed-forward microphone (FF_MIC) predominantly sensing ambient sound, an error microphone (ERR_MIC) being arranged in front of the ...  
WO/2022/121157A1
A speech synthesis method, which relates to the field of artificial intelligence, and comprises: acquiring training text, and performing position vector encoding and sound spectrum encoding on the training text by using a pre-built speec...  
WO/2022/121155A1
A meta learning-based adaptive speech recognition method and apparatus, a device and a medium, relating to the technical field of artificial intelligence and capable of solving the problems that, when an adaptive speaker adjustment is ma...  
WO/2022/121188A1
A keyword detection method, comprising: acquiring a speech sentence to be detected that is inputted by a current user (S1); extracting speech feature parameters corresponding to the speech sentence (S2); inputting the speech feature para...  
WO/2022/121684A1
An approach to identifying alternate soft labels for training a student model may be provided. A teaching model may generate a soft label for a labeled training data. The training data can be an acoustic file for speech or a spoken natur...  
WO/2022/125284A1
A method (400) for combining hotwords (24) in a single utterance (20) includes receiving, at a first assistant-enabled device (AED) (110), audio data (14) corresponding to an utterance directed toward the first AED and a second AED among...  
WO/2022/116825A1
The present application relates to cloud technology and artificial intelligence technology, and provides an artificial intelligence-based audio processing method and apparatus, an electronic device, a computer readable storage medium, an...  
WO/2022/117292A1
A method of training a neural network to generate conversational replies, the method comprising: providing a first dataset of stored phrases linked to form a plurality of conversational sequences; training the neural network to generate ...  
WO/2022/116277A1
A sound-generating assembly (03) and an electronic device comprising the sound-generating assembly (03). The sound-generating assembly (03) comprises a support (2) provided with an electric connection part (3), wherein two surfaces of th...  
WO/2022/119121A1
An electronic device is disclosed. The electronic device comprises a communication circuit, a memory, and a processor operatively connected to the communication circuit and the memory, wherein the memory can store instructions that allow...  
WO/2022/116969A1
A general voice instruction generating method and apparatus. The method comprises: obtaining View tree content of a display interface of an application program (S11); traversing information nodes in the View tree content, and configuring...  
WO/2022/116644A1
The present application relates to the technical field of terminals, and provides an anti-vibration sound reception device, a terminal, a signal processing method and a signal processing module. The device comprises a base plate, a cover...  
WO/2022/117968A1
A multicellular acoustic-attenuation panel (220) comprises several rows of acoustic cells (240, 250, 260, 270, 280) each extending in a circumferential direction (DC), each acoustic cell being delimited by a wall (241, 251, 261, 271, 281...  
WO/2022/120093A1
Disclosed is an audio signal encoding/decoding method that uses an encoding downmix strategy applied at an encoder that is different than a decoding re-mix/upmix strategy applied at a decoder. Based on the type of downmix coding scheme, ...  
WO/2022/118389A1
A sonic wave generator (10) is configured to generate ultrasonic waves of a predetermined frequency for driving away harmful animals. The sonic wave generator (10) is provided with a power source (11), a switch (12), an oscillation circu...  
WO/2022/117291A1
A device for generating conversational replies, comprising a processor with a memory; a speech input module, a user input module; a natural language processing module including one or more encoder-decoder modules; the device being configu...  
WO/2022/119585A1
A process for compressing an audio speech signal utilizes ASR processing to generate a corresponding text representation and, depending on confidence in the corresponding text representation, selectively applies more, less, or no compres...  
WO/2022/120011A1
Method for encoding scene-based audio is provided. In some implementations, the method involves determining, by an encoder, a spatial direction of a dominant sound component in a frame of an input audio signal. In some implementations, t...  
WO/2022/116487A1
A voice processing method and apparatus based on a generative adversarial network, a device, and a medium, relating to the technical field of voice processing, the method comprising: acquiring a voice fragment to be processed, segmenting...  
WO/2022/119699A1
A method (600) for determining synthetic speech includes receiving audio data (120) characterizing speech in streaming audio (118) obtained by a user device (102). The method also includes generating, using a trained self-supervised mode...  
WO/2022/120082A1
An attenuation or "gap" may be inserted into at least a first frequency range of at least first and second audio playback signals of a content stream during at least a first time interval to generate at least first and second modified au...  
WO/2022/119705A1
A method (300) for decaying speech processing includes receiving, at a voice-enabled device (110), an indication of a microphone trigger event (202) indicating a possible interaction with the device through speech where the device has a...  
WO/2022/119946A1
Embodiments are disclosed for spatial noise filling in multi-channel codecs. In an embodiment, a method of regenerating background noise ambience in a multi-channel codec by generating spatial hole filling noise comprises: computing nois...  
WO/2022/120203A1
The present disclosure provides systems and methods for enhancing audio communications. In one aspect, the present disclosure provides a method for enhancing audio communications. The method may comprise (a) detecting one or more paramet...  
WO/2022/119752A1
Systems and methods for dynamic voice accentuation and reinforcement are presented herein. One embodiment comprises one or more audio input sources; one or more audio output sources; one or more band pass filters; and a processing contro...  
WO/2022/117444A1
A computer-implemented method (1) of detecting cognitive impairment comprising: receiving audio data (21) representing recorded utterances of a patient; processing the audio data using a speech-to-text engine (30) to produce a text trans...  
WO/2022/118700A1
[Problem] To provide a compact keyboard instrument having the dynamic characteristics of keys similar to those of a grand piano. [Solution] A keyboard instrument 1 comprises: a first member 41 having a first fixed support part 31, a firs...  
WO/2022/120085A1
Some implementations involve receiving, from a first subband domain acoustic echo canceller (AEC) of a first audio device in an audio environment, first adaptive filter management data from each of a plurality of first adaptive filter ma...  
WO/2022/116442A1
A speech sample screening method and apparatus (100) based on geometry, and a computer device (500) and a storage medium, which relate to artificial intelligence technology. The method comprises: acquiring an initial speech sample set, a...  
WO/2022/119023A1
The present invention relates to an effector-integrated guitar. The effector-integrated guitar comprises: a body which forms part of the guitar (11); and an effector (12) which is disposed inside the body, wherein the effector (12) is el...  
WO/2022/116432A1
The present application relates to the field of artificial intelligence. Disclosed are a multi-style audio synthesis method, apparatus and device, and a storage medium. The method comprises: acquiring text data to be processed and a firs...  
WO/2022/119088A1
An electronic device according to various embodiments of the present disclosure may comprise a motor, a microphone, an expandable and/or reducible display, and a processor, wherein the processor: identifies whether the display is in an e...  
WO/2022/119598A1
Systems and methods for audio privacy in network video surveillance systems are described. A video camera may include an image sensor and a microphone to generate a video stream. Responsive to detecting a human speaking condition in the ...  
WO/2022/119850A1
Techniques are described herein for detecting and suppressing commands in media that may trigger another automated assistant. A method includes: determining, for each of a plurality of automated assistant devices in an environment that a...  
WO/2022/119115A1
According to various embodiments, an electronic apparatus comprises: a first housing; a second housing connected to at least a portion of the first housing and capable of moving relative to the first housing; at least one display connect...  
WO/2022/117480A1
A method and device for audio steering from a loudspeaker line array of a display device toward a user direction is disclosed. Data corresponding to a viewer gesture is obtained from at least one sensor of a display device. A distance an...  
WO/2022/119536A1
The present invention relates to a system (1) for calculating the probability of a person to damage the said company by analysing the mood (emotional state) of a person having a conversation with companies in the financial area during th...  
WO/2022/119942A1
A software-based system and method that provides a generalized scheme to voice-enable text-oriented chatbots. The system can be configured to adapt to a plurality of different types of chatbots, a plurality of different speech-to-text an...  
WO/2022/119212A1
An electronic device is provided. The electronic device may comprise: a voice input device; a communication circuit; a display; a processor operatively connected to the voice input device, the communication circuit, and the display; and ...  
WO/2022/116420A1
A speech event detection method, a speech event detection apparatus (100), an electronic device (1), and a computer readable storage medium, relating to the artificial intelligence technology. The method comprises: obtaining an audio und...  
WO/2022/119673A1
A vehicle includes a cabin, an internal-loudspeaker set, an external-microphone set, and a signal processor that filters a raw audio signal that has been received by the external-microphone set and broadcasts the resulting filtered audio sign...  
WO/2022/119580A1
Implementations are directed to providing a voice bot development platform that enables a third-party developer to train a voice bot based on training instance(s). The training instance(s) can each include training input and training out...  
WO/2022/111168A1
The present application discloses a video classification method and apparatus, belonging to the technical field of data processing. The method comprises: acquiring a target audio and a corresponding target video comprising human body act...  
WO/2022/111579A1
A voice wakeup method and an electronic device, relating to the field of terminal artificial intelligence. The method comprises: by utilizing ambient sound acquired by each device, on one hand, relative positions of a user and the multip...  
WO/2022/110723A1
An audio encoding and decoding method and apparatus, and a readable storage medium. The encoding method comprises: selecting a first target virtual speaker from a preset virtual speaker set according to a current scene audio signal (401)...  
WO/2022/112594A2
Described herein is a computer-implemented deep-learning-based system for determining an indication of an audio quality of an input audio frame. The system comprises at least one inception block configured to receive at least one represe...  
