Document |
Document Title |
WO/2024/080699A1 |
A neural network model is trained by, in an initial training iteration, training the neural network model in a teacher forcing mode in which an autoregressive channel includes a ground-truth shifted waveform, and outputting predictions of...
|
WO/2024/081567A2 |
The 3D audio perception of a listener such as a computer gamer is tested "stereoscopically" and the results input to a source of audio such as a computer game. Audio (802) from the source of audio (such as a head-mounted display of a com...
|
WO/2024/077906A1 |
Provided in the present disclosure is a speech text generation method, which can be applied to the technical field of artificial intelligence and the field of intelligent customer service. The speech text generation method comprises: pe...
|
WO/2024/080590A1 |
An electronic device is provided. The electronic device may comprise a microphone, a speaker, a communication circuit, and at least one processor. The at least one processor can be configured to: acquire a second signal on the basis of d...
|
WO/2024/077452A1 |
The present application relates to the field of audio processing, and discloses an audio processing method and apparatus, a device, and a storage medium. The method comprises: performing short-time Fourier transform on audio data to obta...
|
WO/2024/081720A1 |
To enhance the sensory experience of voice, in some cases at a later time than the speech was spoken (300) to enable reliving emotions and experiences, vocal sounds captured by a microphone are processed (304) by a computer game controll...
|
WO/2024/032035A9 |
Embodiments of the present application provide a voice signal output method and an electronic device. The method comprises: generating a first voice signal, the first voice signal being an interference signal generated according to a dow...
|
WO/2024/080422A1 |
The present specification discloses an AI-based specialized human resources platform service method for providing a remote recruitment service. According to the present invention, users can improve mock interview skills through AI analys...
|
WO/2024/078296A1 |
An audio mixing method and a related device. The method comprises: an audio sink-side device may receive a plurality of pieces of indication information from a plurality of audio source devices, the plurality of pieces of indication info...
|
WO/2024/081031A1 |
A method (500) includes, for each training sample (410) of a plurality of training samples: processing, using a sequence transduction model (200), corresponding training input features (415) to obtain one or more output token sequence hy...
|
WO/2024/081741A1 |
The method (S100) can include: generating pilot monitoring data (S110); determining a pilot attention state based on the sensor data (S120); optionally determining an aircraft state (S130); responding to an event based on the pilot atten...
|
WO/2024/080421A1 |
This embodiment relates to a non face-to-face interview method using an artificial intelligence algorithm. Particularly, this method: first selects a non face-to-face interview type as a service (native app) for an interviewee using a mo...
|
WO/2024/080709A1 |
An electronic device may comprise a microphone, a display, and a processor. The processor may be configured to: display, via the display, a screen including a plurality of executable objects; on the basis of a focused executable object a...
|
WO/2024/080160A1 |
Provided is an information processing device of which at least a part is configured to be able to be mounted in the outer ear canal of a user, the information processing device comprising: an output unit that outputs a measurement sound ...
|
WO/2024/059700A3 |
A light induction microphone barrier apparatus includes a masking signal generator configured to generate a masking signal. The masking signal generator provides the masking signal to a driver configured to receive the masking signal fro...
|
WO/2024/080745A1 |
According to an embodiment, there may be provided an electronic device comprising: a communication interface; a memory including a speech cache; and at least one processor operatively connected to the communication interface and the memo...
|
WO/2024/080527A1 |
A display apparatus is disclosed. The display apparatus comprises: a display; a communication apparatus that receives stream data corresponding to image content in real time; a memory for storing the received stream data; and a processor...
|
WO/2024/080468A1 |
A method for adjusting privacy level automatically by a voice assistant is disclosed. The method includes identifying one or more devices present in an environment around the voice assistant based on a beacon signal transmitted by the de...
|
WO/2024/079284A1 |
The invention relates to an ultrasonic transducer (1; 101; 201) for producing high-frequency vibrations, the ultrasonic transducer (1; 101; 201) being formed as a planar single piece with a thickness (D) in a thickness direction (DR), a ...
|
WO/2024/078670A1 |
The present invention relates to a panel for attachment to boundaries of a room, such as a concert hall, the panel comprising a panel board (1, 28) provided with through openings (2, 29) between a first surface (1', 28') and a second sur...
|
WO/2024/079865A1 |
Provided is a musical composition generation device that comprises an encoder that converts audio data for first and second musical compositions to first and second feature vectors within a first feature space and a decoder that generate...
|
WO/2024/080495A1 |
A method includes obtaining a speech signal. The method also includes predicting a first likelihood of a wake word or phrase being spoken in the speech signal using a first machine learning model trained to receive the speech signal as i...
|
WO/2024/080044A1 |
An information processing system that receives input sound and pitch information; extracts a timbre feature amount from the input sound; and generates information of a musical instrument sound with a pitch based on the timbre feature amo...
|
WO/2024/079264A1 |
The disclosure relates to a method for Wiener-filter-based signal restoration, comprising the following method steps: receiving a signal (g); estimating a signal-to-noise ratio for a Wiener-filter-based restoration algorithm (v) by a pro...
|
WO/2024/081733A2 |
Improved reeds for a reed-blown musical instrument include silk fibroin coating or impregnation. An example reed includes a reed body having a reed tip adapted for engagement by a mouth of a player and a vibrating part that extends from...
|
WO/2024/059801A3 |
The disclosed systems and methods provide a novel technical solution via mechanisms for identifying which models are truly high-performing and the set of models that would provide the most accurate single prediction for a signal data sig...
|
WO/2024/079446A1 |
A flexible transducer designed with high volume manufacturing in mind is described, where the flexible transducer mount can be panelized into a grid form and the transducers assembled into the grid using commercially available pick-and-p...
|
WO/2024/078419A1 |
Embodiments of the present application provide a voice interaction method, a voice interaction apparatus and an electronic device. The method comprises: receiving a first voice input of a user, the first voice input comprising a first sl...
|
WO/2024/081332A1 |
A method (500) includes receiving a sequence of acoustic frames (100) as input to a multilingual automated speech recognition (ASR) model (200) configured to recognize speech in a plurality of different supported languages and generating...
|
WO/2023/241254A9 |
The present application provides an audio encoding and decoding method and apparatus, an electronic device, and a storage medium, capable of being applied to in-vehicle scenarios. The audio decoding method comprises: obtaining a code str...
|
WO/2024/081131A1 |
Implementations relate to an automated assistant that can determine whether to respond to inputs in an environment according to whether radar data indicates a user is present. When user presence is detected, the automated assistant can v...
|
WO/2024/077511A1 |
Disclosed in embodiments of the present application are an interaction counting method, apparatus, device, and system, and a storage medium. The method comprises: extracting first audio features of all audio frames in classroom audio dat...
|
WO/2024/080723A1 |
An embodiment of the present disclosure relates to a device and a method for minimizing quantization noise by reflecting a user's individual hearing characteristics when quantizing or de-quantizing an audio signal. A control method there...
|
WO/2024/054990A3 |
An acoustic filter can include a first substrate including a first plurality of holes directed therethrough, a second substrate including a second plurality of holes directed therethrough, a chamber defined between the first substrate a...
|
WO/2024/081836A1 |
Techniques are disclosed relating to adjusting, via a user interface, parameters (e.g., the gain) of musical phrases in generative music content. A computing system may select a set of musical phrases to include in generative music conte...
|
WO/2024/081203A1 |
A method (600) includes obtaining a multi-utterance training sample (410) that includes audio data (412) characterizing utterances spoken by two or more different speakers (10) and obtaining ground-truth speaker change intervals (414) in...
|
WO/2024/080597A1 |
An electronic device is provided. The electronic device may comprise a communication circuit, a speaker, and a processor. The processor may be configured to identify a bitrate of a first audio bitstream received from an external electron...
|
WO/2024/078028A1 |
The present application relates to a howling suppression system and method for an ANC/PSAP system, and a storage medium. The howling suppression system comprises a system-on-chip. The system-on-chip is configured in such a way that an ad...
|
WO/2024/078460A1 |
A speech processing method, comprising: receiving wake-up speech zone information which is forwarded by a vehicle and is for a user to wake up a vehicle speech function in a vehicle cabin; determining an initial false rejection mode of e...
|
WO/2024/078565A1 |
Methods, systems (126), and computer program products for domain adaptive speech recognition using artificial intelligence are provided herein. A computer-implemented method includes generating a set of language data candidates, each lan...
|
WO/2024/078435A1 |
A method for dynamically switching speech zones in a vehicle, a speech interaction method, a device, a medium, and a vehicle, relating to the technical field of intelligent vehicles, and aiming to solve the problem of poor effect on the ...
|
WO/2024/079625A1 |
A computer assisted method for classifying digital audio files based on features of a digital audio signal comprised in the file, comprising: storing the audio file in a digital memory; determining a portion (p) of drop of the audio file...
|
WO/2024/080633A1 |
An electronic device according to one embodiment of the present document comprises: a microphone; a communication module comprising a first communication circuit providing a call channel for communicating with a first external electronic...
|
WO/2024/080796A1 |
According to embodiments of the present disclosure, disclosed is a method for remotely monitoring a learning situation of a learner, comprising the steps in which: a learning guidance device acquires first captured data by photographing ...
|
WO/2024/080729A1 |
An electronic device according to an embodiment may comprise an input module, a memory including a plurality of DBs, and a processor. The processor according to an embodiment may identify, as a short instruction including a plurality of ...
|
WO/2024/081502A1 |
A voice-based authentication system receives uttered words from a user (e.g., a human speaker); compares the uttered words with an authentication text that includes high-confidence corpus words and one or more low-confidence corpus words...
|
WO/2024/079605A1 |
A computing device implemented method for assisting a speaker (300) during training or actual performance of a speech, comprising the steps of: (10) inputting a text (200) to be spoken; (20) using a machine learning system to detect occu...
|
WO/2024/076810A1 |
Systems, methods, and computer program products for performing gain control on audio signals are provided. An automatic gain control system obtains a downmixed audio signal of an audio signal to be encoded. The system determines that an ...
|
WO/2024/076452A1 |
A method (500) includes receiving a first query (116) issued by a first user, the first query including a command (111) for a digital assistant (105) to perform a first action, and enabling a round robin mode (350) to control performance...
|
WO/2024/076830A1 |
A method, performed by a device with one or more microphones, for generating an encoded bitstream, the method comprising, capturing, by the one or more microphones, one or more audio signals, analyzing the captured audio signals to deter...
|