Document |
Document Title |
WO/2024/093557A1 |
A data processing method and apparatus, an electronic device, a computer-readable storage medium, and a computer program product. The data processing method comprises: performing emotion prediction on information to be identified of a cu...
|
WO/2024/097380A1 |
A system for platform-independent visualization of audio content, in particular audio tracks utilizing a central computer system in communication with user devices via a computer network. The central system utilizes various algorithms to...
|
WO/2024/097002A1 |
Methods and systems provide for extracting next step sentences from a communication session. In one embodiment, the system defines a set of annotation guidelines for labeling training data; receives a set of labeled training data includi...
|
WO/2024/096364A1 |
An electronic device according to an embodiment may, on the basis of receiving a first signal for call establishment via a communication circuit, run a first software application for processing the first signal. The electronic device may...
|
WO/2024/093264A1 |
The present application provides an audio detection method, apparatus and device. The method comprises: acquiring a first audio code stream sent by an audio sender to an audio receiver; acquiring a second audio code stream returned by th...
|
WO/2024/097684A1 |
A system and method to identify filler speech and deliver real-time feedback to the speaker for correction. The system receives a live audio signal from a user's speech, analyzes the audio signal for filler speech, verifies the speaker's...
|
WO/2024/097568A1 |
A device includes one or more processors configured to detect single-stream data and generate multi-stream augmented data that includes one or more modified versions of the single-stream data. The one or more processors are configured to...
|
WO/2024/093442A1 |
According to the embodiments of the present disclosure, provided are a method and apparatus for checking audiovisual content, and a device and a storage medium. The method comprises: providing a check interface for audiovisual content, w...
|
WO/2024/093525A1 |
Provided in the embodiments of the present application are an information processing method and an electronic device. The method may be used for an electronic device to generate notes, record information in the notes and process the reco...
|
WO/2024/096314A1 |
An electronic device is disclosed. The electronic device comprises: a speaker array including a plurality of speaker units; a memory storing at least one instruction; and one or more processors connected to the speaker array and the memo...
|
WO/2024/093588A1 |
The present application provides a method and apparatus for training a speech synthesis model, a speech synthesis method and apparatus, an electronic device, a computer-readable storage medium and a computer program product. The method f...
|
WO/2024/093515A1 |
The present application provides a voice interaction method and a related electronic device. The method comprises: receiving a first voice signal; when determining that voice detection is to be performed on the first voice signal, obtain...
|
WO/2024/094513A1 |
A system for automatically selecting a sound recognition model for an environment based on audio data and image data associated with the environment. The system includes a camera, a microphone, a memory including a plurality of sound rec...
|
WO/2024/096827A1 |
The present invention relates to a system (1) comprising a collective decision-making mechanism which enables to solve dynamic communication problem in the process of voting by integrating different artificial intelligence algorithms in ...
|
WO/2024/096969A1 |
Techniques for causing an LLM to generate semantically related phrase variations for an identified phrase are disclosed. An LLM that is generally pre-trained on an arbitrary corpus of language training data is accessed. Seed data is fed ...
|
WO/2024/094006A1 |
An audio signal coding method and apparatus, and an audio signal decoding method and apparatus. The audio signal coding method comprises: acquiring a high-frequency residual signal and a low-frequency residual signal of a target audio fr...
|
WO/2024/094001A1 |
Disclosed in the embodiments of the present application are a speech recognition method and a related apparatus. When speech recognition is performed, speech recognition is not necessarily performed on the basis of all hidden layers in a...
|
WO/2024/096641A1 |
An electronic device according to one embodiment disclosed in the present document may comprise a communication circuit, a memory and a processor. The processor can be configured to: receive, from an external device, voice signals accord...
|
WO/2024/096968A1 |
Techniques for facilitating voice based dictation of programming code within a context of an IDE are disclosed. Programming code is fed to a text-to-speech (TTS) model. The TTS model generates an audio file associated with the code. The ...
|
WO/2024/093490A1 |
A method and apparatus for processing an audio coding data packet. The method comprises: parsing an audio coding data packet to acquire data packet information of the audio coding data packet, wherein the data packet information comprise...
|
WO/2024/093460A1 |
The present application relates to the field of audio processing. Provided are a voice detection method and a related device thereof. The voice detection method comprises: acquiring audio data, the audio data being data collected by a fi...
|
WO/2024/094604A1 |
The invention relates to a foldable sonar system (20) comprising a sonar antenna (22). The sonar antenna comprises a plurality of planar sub-arrays, the sub-arrays each having a multiplicity of waterborne sound transducers (26). A joint ...
|
WO/2024/096096A1 |
A soundproof heat-dissipating cover (1) that is to cover an object (2) comprises: a soundproof heat-dissipating sheet (3) that has a covering surface (10) that covers the surface of the object (2) and a heat dissipation surface (11) that...
|
WO/2024/095543A1 |
This sound absorption characteristic measuring device measures a sound absorption characteristic of a structure surface at a measurement site. The sound absorption characteristic measuring device comprises: a drive circuit for generating...
|
WO/2024/097015A1 |
Some disclosed embodiments are directed to obtaining a decoded audio data including a spoken language utterance recognized in audio data and identifying a disfluency in the decoded audio data. Upon determining that correcting the disflue...
|
WO/2024/093748A1 |
Provided in the present application are a signal collection method, and an electronic device and a storage medium. The method comprises: collecting an audio signal of a user by means of a microphone; performing determination on the colle...
|
WO/2024/093443A1 |
An information display method and apparatus based on voice interaction, and an electronic device. A specific embodiment of the method comprises: on the basis of operation information of an interaction-related document for real-time voice...
|
WO/2024/097360A1 |
Disclosed are systems, methods, and other implementations, including a method for sound processing that includes obtaining, by a device (e.g., a hearing device), sound signals from two or more sound sources in an acoustic scene in which ...
|
WO/2024/093578A1 |
A voice recognition method and apparatus, and an electronic device, a storage medium and a computer program product, which are applied to the fields of artificial intelligence and games. The method is executed by the electronic device. T...
|
WO/2024/096828A1 |
The present invention relates to a system (1) which enables to suggest an appropriate product to persons in accordance with the input data received from persons by analysing the text, image, audio or motion input data –that are receive...
|
WO/2024/095384A1 |
This situation display device comprises a display unit 5 that displays, in the vicinity of a two-dimensional graph indicating the amount of speech by a subject per unit time in a prescribed time segment, pictures indicating the situation...
|
WO/2024/096253A1 |
The present electronic device comprises: a memory that stores a plurality of sample prompts; a communication interface that communicates with a server including a large language model; and at least one processor that acquires a user inpu...
|
WO/2024/097485A1 |
Enclosed are embodiments for very low bit rate scene-based audio (LBRSBA) coding with combined SPAR and DIRAC. In some embodiments, a method comprises: receiving scene based audio metadata; creating from the scene based audio metadata, S...
|
WO/2024/095383A1 |
This voice recognition result display device comprises a display unit 7 that displays, in an utterance content display area on a screen, a voice recognition result text, which is a text resulting from voice recognition of a latest uttera...
|
WO/2024/095535A1 |
This speech recognition result display device comprises a display unit 7 that displays, in a display area for utterance content on a screen, speech recognition result text which is text of speech recognition results of a newest utterance...
|
WO/2024/093798A1 |
A music composition method and apparatus, and an electronic device and a readable storage medium. The method comprises: dividing a first audio track into a plurality of candidate track clips according to a timeline, wherein each candidat...
|
WO/2024/095550A1 |
This situation display device comprises a display unit 5 that, in the vicinity of a two-dimensional graph indicative of the amount of speech of a subject who is a person subject to status display and per unit time in a prescribed time se...
|
WO/2024/093648A1 |
The present application provides a method for multi-instruction execution. The method is applied to an electronic device or a cabin of a vehicle. The method comprises: receiving a first input, the first input comprising a first instructi...
|
WO/2024/096600A1 |
An electronic device according to various embodiments may comprise a speaker, an external microphone, an internal microphone, a first filter, a second filter, and a processor operatively connected to the speaker, the external microphone,...
|
WO/2024/090882A1 |
The present invention relates to a transient-based sidechain audio watermark coding system and comprises: a bit code generation unit that generates watermark code for a watermark message, i.e., generates watermark code comprising a plura...
|
WO/2024/091313A2 |
A computer-implemented system and method for operating one or more uncrewed vehicles (UxSs) in autonomous navigation modes. A software module is provided and executed for operation in a portable user computer device for enabling control ...
|
WO/2024/088720A1 |
The present invention relates to a method for measuring at least one point of impact between at least one member and a playing zone for the onset and development of a musical gesture (1). For this purpose, computing means receive a first...
|
WO/2024/090076A1 |
Provided is a silencer-equipped air duct which makes it possible to improve silencing performance for low-frequency sounds while suppressing increase in size. This silencer-equipped air duct is configured by disposing a silencer at an ...
|
WO/2024/090017A1 |
A display method involves receiving an acoustic space and a target sound pressure distribution in the acoustic space, using a prescribed model as a basis to calculate a speaker or microphone placement distribution corresponding to the re...
|
WO/2024/089962A1 |
A system for performing end-to-end automatic speech recognition (ASR). The system configured to collect a sequence of acoustic frames associated with a mixture of speeches performed by multiple speakers. Each frame from the sequence of a...
|
WO/2024/089198A1 |
The invention relates to an electric musical instrument, in particular an electric guitar, having: electronics (100) configured to process a sound signal (101), which is generated when the musical instrument is played, and to output said...
|
WO/2024/091426A1 |
A method (500) includes obtaining an ASR model (200) trained to recognize speech in a first language and receiving transcribed training utterances (304) in a second language. The method also includes integrating the ASR model with an inp...
|
WO/2024/091526A1 |
A method (600) for residual adapters for few-shot text-to-speech speaker adaptation includes obtaining a text-to-speech (TTS) model (200) configured to convert text (152) into representations of synthetic speech (261), the TTS model pre-...
|
WO/2024/090778A1 |
Disclosed is an electronic device. The present electronic device comprises: a memory in which a neural network model is stored; and at least one processor that is connected to the memory to control the electronic device, wherein the proc...
|
WO/2024/091564A1 |
A method (600) includes receiving training data (301) that includes a plurality of sets of text-to-speech (TTS) spoken utterances (510) each associated with a respective language and including TTS utterances of synthetic speech spoken th...
|