Document |
Document Title |
WO/2023/166352A2 |
In some aspects, each participant in a conversation can record their audio content separately, at their own time and post it to a conversation thread. This is an asynchronous format that does not rely on all participants being available ...
|
WO/2023/165946A1 |
The invention relates to a method for encoding an audio signal, comprising the following steps: - decomposing (102) the audio signal into at least amplitude components and sign or phase components; - analysing (104) the amplitude compone...
|
WO/2023/164954A1 |
One or more embodiments of the present description relate to a hearing assistance device. The hearing assistance device comprises: a plurality of microphones, configured to receive initial sound signals and convert the initial sound sign...
|
WO/2023/163963A1 |
Methods, systems, and computer-readable media are provided for detecting voice activity. A primary signal is configured to include a speech component representative of a user's speech when the user is speaking in a detection region, or e...
|
WO/2023/162513A1 |
This language model learning device, for providing a language model learning device whereby a large-scale language model can be learned with low computational cost independent of speech synthesis and speech recognition performance, inclu...
|
WO/2023/163383A1 |
The present disclosure provides a multimodal-based method and apparatus for recognizing emotions in real time. According to one aspect of the present disclosure, provided is a method by which an emotion recognition apparatus recognizes e...
|
WO/2023/163804A1 |
An acoustic metamaterial (AMM) passive impedance matching device for headphone-type devices for matching the complex acoustic impedance load of a human ear to enhance acoustic performance of a headphone is disclosed. The device includes ...
|
WO/2023/163422A1 |
Disclosed is an operation method of a display device, the operation method comprising the steps of: acquiring context information for selection of a speech recognizer; selecting at least one speech recognizer from among a plurality of sp...
|
WO/2023/160994A1 |
The present disclosure relates to a computer-implemented method for transforming speech into visual text, comprising obtaining a speech signal encoding a text of a speech; selecting a portion of the text at least based on a projection mo...
|
WO/2023/159881A1 |
A speech intent recognition method and apparatus, and an electronic device. The method comprises: obtaining speech data from a terminal device and state data of the terminal device (S102); inputting the speech data and the state data int...
|
WO/2023/163895A1 |
Systems and techniques are provided for processing audio data. For instance, a process can include detecting a first audio data between two or more in-person participants of a plurality of in-person participants of a group communication ...
|
WO/2023/160553A1 |
A speech synthesis method and apparatus, and a computer-readable medium and an electronic device. The method comprises: acquiring a phoneme sequence corresponding to text to be synthesized (S101); according to the phoneme sequence and sa...
|
WO/2023/163489A1 |
Disclosed is a method for processing a user's audio input in an electronic device. Particularly, disclosed is a method of processing a user's audio input in an electronic device, comprising the steps of: acquiring a first audio signal fr...
|
WO/2023/163265A1 |
The present invention, in order to enable a foreign language learner to reduce time and costs for foreign language learning and to receive an adequate feedback result for the foreign language learning, uses a user terminal held by a user...
|
WO/2023/162108A1 |
A learning device (10) acquires: information concerning a speaker and a utterance data of the speaker; and information concerning a listener and conversation data of the listener. Then, the learning device (10), by using the acquired inf...
|
WO/2023/163991A1 |
Embodiments can take brain activity measurement and decode into continuous language. Embodiment can use non-invasive brain recordings, such as functional magnetic resonance imaging (fMRI) and functional near-infrared spectroscopy (fNIRS)...
|
WO/2023/134550A9 |
The present disclosure relates to a feature encoding model generation method, an audio determination method, and a related device. The feature encoding model generation method comprises: obtaining a plurality of sample audios marked with...
|
WO/2023/161151A1 |
The invention relates to a sound-generating reed (10) for wind instruments, said reed being designed with a structuring (22).
|
WO/2023/162508A1 |
The present invention creates experiential value of content, for example. This signal processing device comprises: a feature extracting unit for extracting a signal of a specific sound from an input signal using a learning model obtain...
|
WO/2023/159582A1 |
Embodiments of the present application provide an earphone control method, an earphone, an apparatus, and a storage medium. The method comprises: collecting ambient information, and determining key sound detection sensitivity according t...
|
WO/2023/161554A1 |
An apparatus for assisting spatial rendering in at least one acoustic environment, the apparatus comprising means configured to: determine a source-listener distance; determine an attenuation parameter value, the attenuation parameter as...
|
WO/2023/164392A1 |
A system for generating enhanced speech data using robust audio features is disclosed. In some embodiments, a system is programmed to use a self-supervised deep learning model to generate a set of feature vectors from given audio data th...
|
WO/2023/162107A1 |
A learning device (10) acquires: information concerning a speaker and utterance data of the speaker; information concerning a listener and conversation data of the listener; and a classification label of a response that is included in th...
|
WO/2023/164176A1 |
An audio processing and streaming server system includes an input bus, a digital signal processor (DSP), a first and second stream generators. The input bus includes a plurality of input audio channels, each corresponding to a respective...
|
WO/2023/160545A1 |
The present invention provides an electronic keyboard instrument, and relates to the technical field of electronics. The electronic keyboard instrument comprises keys, carbon film contacts under the keys and sound source chips on a mainb...
|
WO/2023/161673A1 |
The present disclosure is enclosed in the area of musical instruments, in particular including the combination of acoustic and digital sound generation, including an apparatus suitable for providing a digital sound output and couplable t...
|
WO/2023/159716A1 |
The present application belongs to the technical field of active noise cancellation (ANC). Disclosed are an adaptive adjustment method and device for an ANC parameter, and a storage medium. The method comprises: acquiring frequencies at ...
|
WO/2023/160087A1 |
A prompting method for a response state of a voice instruction and a display device. The method comprises: receiving a voice instruction, monitoring a response state of the voice instruction, and obtaining screen brightness (S281); and c...
|
WO/2023/163427A1 |
A method for adjusting the volume of an electronic device, according to one embodiment, may comprise the operations of: adjusting, if a user's voice is input into a first electronic device, the output volume of a second electronic device...
|
WO/2023/162347A1 |
This information processing device comprises: a reception unit for receiving performance sound of a stringed instrument that has a string and a peg; an estimation unit for using the received performance sound to estimate positional infor...
|
WO/2023/163942A1 |
A method for adjusting the clarity of an audio output in a changing environment, including: receiving a content signal; applying a customized gain to the content signal; and outputting the content signal with the customized gain to at le...
|
WO/2023/164380A1 |
A method (800) for training a memorized neural network (300) includes receiving a training input audio sequence (400) including a sequence of input frames defining a hotword that initiates a wake-up process on a user device (102). The me...
|
WO/2023/164332A1 |
There is provided a method that includes (a) obtaining a first voice vector that was derived from a signal of a voice that was sampled at a first sampling frequency, (b) obtaining a second voice vector that was derived from a signal of a...
|
WO/2023/162114A1 |
A training device (10) acquires utterance data of a speaker and information on the speaker, conversation data of a listener and information on the listener, and emotion information relating to the listener. The training device (10) then ...
|
WO/2023/163314A1 |
A method, performed in a metaverse system, for providing an evolvable avatar according to various embodiments of the present disclosure may comprise the steps of: creating a basic avatar used in a metaverse virtual space; collecting at l...
|
WO/2023/160447A1 |
A tuning device (100), comprising an adjusting assembly (10), an adjusting rod (20), and a connecting member (30). The adjusting assembly (10) comprises a housing (11), a transmission rod (12), and a transmission wheel (13) engaged with ...
|
WO/2023/160713A1 |
Music generation methods and apparatuses (1100, 1200), a device, a storage medium, and a program. A method comprises: determining a music template, the music template comprising a plurality of tracks, and each track being divided into at...
|
WO/2023/163270A1 |
An electronic device according to various embodiments disclosed in the present document comprises a processor and memory operatively connected to the processor. The memory can store instructions which, when executed, cause the processor ...
|
WO/2023/162581A1 |
This sound production device (100) comprises: an acquisition unit (131) that acquires spatial information indicating a region of a three-dimensional virtual space including a sound source object and one or more three-dimensional objects ...
|
WO/2023/161778A1 |
Low frequency acoustic room and environment. An inner wall portion (120) comprising a plurality of porous-but-resistive membranes (125) operatively configured along the wall portions (130) of the inner wall portion (120) wherein the poro...
|
WO/2023/163896A1 |
Systems and techniques are provided for processing audio data. For instance, a process can include obtaining a primary audio signal from a user computing device and obtaining first audio data from an additional computing device, wherein ...
|
WO/2023/157650A1 |
The present technology relates to a signal processing device and a signal processing method which can make it possible to preferably synthesize a plurality of pieces of encoded data. The signal processing device of the present technology...
|
WO/2023/156176A1 |
An apparatus comprising means for: obtaining a bitstream comprising encoded spatial metadata and encoded transport audio signals; decoding transport audio signals from the bitstream encoded transport audio signals; decoding spatial metad...
|
WO/2023/158784A1 |
A multi-mode hearing stimulation method for stimulating the perception of hearing in a subject. The method includes generating, based on sound signals representative of multi-channel sound, two or more of multi-channel electrical stimula...
|
WO/2023/158226A1 |
A speech synthesis method using an adversarial training technique according to an embodiment may comprise the steps of: receiving speech data input; training an adversarial model for speech synthesis on the basis of the speech data input...
|
WO/2023/157207A1 |
This signal analysis system comprises: an acquisition unit that acquires a conversion network that is learned using a first mel spectrogram sequence in a machine learning technique for acoustic conversion based on a discriminator-equippe...
|
WO/2023/156841A1 |
The present invention relates to a chromatic bass musical instrument, whose format is non-cumbersome and portable, played with the feet diagonally: from left to right and vice-versa, from top to bottom and vice-versa; which musical instr...
|
WO/2023/158282A1 |
An electronic apparatus is disclosed. The electronic apparatus comprises a dust collection filter, a voltage supply unit for applying a voltage to the dust collection filter, a microphone disposed within a critical distance from the dust...
|
WO/2023/157066A1 |
A computer executes a first learning procedure to learn a second model by updating a first model, to which a speaker vector representing a speaker, text, and a first acoustic feature related to speech obtained by the speaker uttering the...
|
WO/2023/158553A1 |
An automated speech recognition (ASR) transcript of at least a portion of a media content is obtained from an ASR tool. Suggested words are received for corrected words of the ASR transcript of the media content. Features are obtained us...
|