| Document |
Document Title |
|
WO/2012/004650 |
The present application is directed towards systems and methods for dynamic, distributed creation of music compositions to accompany video compositions. Visual compositions may be uploaded to a server cloud and analyzed in a distributed ...
|
|
WO/2012/005210 |
When a number of samples smaller than a first reference value is smaller than a second reference value or not greater than the second reference value and if a subtraction value calculated by subtracting a value corresponding to a quantiz...
|
|
WO/2012/006024 |
One or more embodiments present a script to a user in an interactive script environment. A digital representation of a manuscript is analyzed. This digital representation includes a set of roles and a set of information associated with e...
|
|
WO/2012/005211 |
When encoding, index information is output indicating a set of coefficients, from among sets of predetermined coefficients corresponding to each sample position, for minimising the sum, with regards to each sample position, of the error ...
|
|
WO/2012/004998 |
Disclosed are a device and a method for efficiently encoding the quantization parameters of split multirate lattice vector quantization. By means of performing spectral analysis of a split multirate vector quantized spectrum, the aboveme...
|
|
WO/2012/005970 |
A system, method, and computer readable storage medium generates an audio fingerprint for an input audio clip that is robust to differences in key, instrumentation, and other performance variations. The audio fingerprint comprises a sequ...
|
|
WO/2012/003602 |
A method for reconstructing electronic larynx speech and a system thereof are provided. The method includes the following steps: firstly, extracting model parameters from the collected speech as a parameter library; then collecting facia...
|
|
WO/2012/005074 |
An audio signal processing device capable of separating audio signal components which heighten immersiveness from two channels of audio signals and increasing or decreasing pressure for the components is provided. An audio signal process...
|
|
WO/2012/003523 |
Mental state of a person is classified in an automated manner by analysing natural speech of the person. A glottal waveform is extracted from a natural speech signal. Pre-determined parameters defining at least one diagnostic class of a ...
|
|
WO/2012/004999 |
Disclosed is an action for an upright piano capable of expressing rich tone variation and improving the continuous press performance of the same key. In the action (1) for an upright piano, there are formed a guide member (67) fixed to a...
|
|
WO/2012/003762 |
An splicing head with thin film sticker is made up of a sticking patch and a pipe fitting, the main body of the sticking patch is composed of thin film, there is a sound transmitting hole on the thin film, an adhesive sticker tack coat i...
|
|
WO/2012/004349 |
A codec supporting switching between time-domain aliasing cancellation transform coding mode and time-domain coding mode is made less liable to frame loss by adding a further syntax portion to the frames, depending on which the parser of...
|
|
WO/2012/005209 |
An encoding method obtains a quantized normalized value by quantizing a normalized value, said normalized value being a value representing samples, and a normalized value quantization index corresponding to the quantized normalized value...
|
|
WO/2012/003974 |
The invention relates to an ultrasonic transducer module for an ultrasonic sensor for detecting and/or examining valuable documents, comprising at least one piezoelectric ultrasonic transducer having electrical connecting wires and an in...
|
|
WO/2012/004955 |
Provided is a device that, with respect to recognized sentences that include errors and that are the output of voice recognition or the like, efficiently performs presentation of estimation/correction candidates of misrecognized sections...
|
|
WO/2012/000882 |
In one aspect, the invention provides an audio encoding method characterized by a decision being made as to whether the device which will decode the resulting bit stream should apply post filtering including attenuation of interharmonic ...
|
|
WO/2012/003269 |
A speech processing engine is provided that in some embodiments, employs Kalman filtering with a particular speaker's glottal information to clean up an audio speech signal for more efficient automatic speech recognition.
|
|
WO/2012/002348 |
The value 1 is added to each bit counter corresponding to indices k(i), ..., k(i)-(L-1) for each of i = 0, 1, ..., N-1. k(i)-Thres+1 bits or L bits, whichever is fewer, are allocated to each sample i having an index greater than or equal...
|
|
WO/2012/001458 |
The invention provides a voice-tag method and apparatus based on confidence score. The voice-tag method based on confidence score comprises: performing phoneme recognition on a registration speech to obtain a plurality of pronunciation t...
|
|
WO/2012/001216 |
A method, devices, computer program products and an internet service is disclosed for adapting a context model. In the method a media clip is received. Also sensor data captured at least partly when the media clip was captured is receive...
|
|
WO/2012/001804 |
A telephone call apparatus comprises: a sound input means to which received sound is input; an acoustic characteristic acquiring means that acquires an acoustic characteristic in an acoustic space different from an acoustic space in whic...
|
|
WO/2012/000341 |
The present invention discloses a reception processing method and apparatus for a downlink speech frame, and a baseband. Wherein, the method includes: the baseband judges periodically whether a downlink speech frame has been received; if...
|
|
WO/2012/002467 |
The purpose of the present invention is to facilitate improved recognition when a person with a cochlear implant hears music or other sounds. MIDI data of a musical composition from a sound source device (12) is processed with a music in...
|
|
WO/2012/002702 |
The objective of the present invention to provide a speech recognition apparatus for an elevator, which can be installed regardless of the elevator manufacturer, and which improves use convenience for visually-impaired or hearing-impaire...
|
|
WO/2012/003125 |
A system for identification of video content in a video signal is provided via a sound track audio signal. The audio signal is processed with filtering, frequency translation, and or non linear transformations to extract voice signals fr...
|
|
WO/2012/001730 |
There are included a speech recognition unit (3) that speech-recognizes an input speech; a speech recognition dictionary (4) in which the words of the speech-recognized input speech are registered; a response speech data storage unit (6)...
|
|
WO/2012/000043 |
The present invention generally concerns a method and a system for providing a computer-generated response in response to natural language inputs. The response includes, but is not limited to, visual, audio, and textual forms. The respon...
|
|
WO/2012/002841 |
What is proposed is: a method for identifying a person's spoken and/or non-spoken communications by means of using an individual correspondence algorithm, which comprises: а) an individual algorithm for a person's speech and/or b) an in...
|
|
WO/2012/001187 |
Low-consumption sound recognition system. The present invention relates to a low-consumption multi-purpose sound detection system which is simple to integrate and is intended to be used in any type of system. One example of systems in wh...
|
|
WO/2012/001928 |
Disclosed is a conversation detection device which uses a head-mounted microphone array and can determine with a high degree of accuracy whether or not a speaker in front of the person wearing the microphone array is a conversation partn...
|
|
WO/2012/002537 |
Disclosed is a portable electronic apparatus able to suitably correct audio sound to output and for outputting audio sound which is easier to hear for a user. The disclosed portable electronic apparatus resolves the above-mentioned probl...
|
|
WO/2012/002768 |
The present invention relates to a method for processing an audio signal, and the method comprises the steps of: receiving an audio signal; determining a coding mode corresponding to a current frame, by receiving network information for ...
|
|
WO/2012/001457 |
The present invention provides a method and apparatus for fusing voiced phoneme units in Text-To-Speech. An apparatus for fusing voiced phoneme units of the present invention comprises: a unit input module configured to input a plurality...
|
|
WO/2012/001463 |
Apparatus comprising at least one processor and at least one memory including computer code, the at least one memory and the computer code configured to with the at least one processor cause the apparatus to at least perform: transformin...
|
|
WO/2012/001261 |
The present invention relates to a method for detecting acoustic shocks in an audio stream, characterized in that it comprises the following steps: breaking down the audio stream into audio frames; analyzing said audio frames in order to...
|
|
WO/2012/000404 |
A method for suppressing a fan noise is provided, which includes: acquiring a fan speed, generating a control signal to a sound module according to the fan speed, and driving the sound module to generate a suppression signal of the fan n...
|
|
WO/2012/003098 |
The subject matter of this specification can be embodied in, among other things, a computer-implemented method for removing noise from audio that includes building a sound model that represents noises which result from activations of inp...
|
|
WO/2012/001260 |
The present invention relates to a coding/decoding of a digital audio signal comprising a succession of consecutive blocks of data, on the basis of a predictive filter. Within the meaning of the invention, a modified predictive filter (A...
|
|
WO/2012/001447 |
The invention involves a device that enables deaf people to perceive sound which comprises: a microphone configured to receive the analog audio signal from the user and from his or her teacher and output the said analog audio signal; an ...
|
|
WO/2011/160741 |
It comprises analyzing audio content of multimedia files and performing a speech to text transcription thereof automatically by means of an ASR process, and selecting acoustic and language models adapted for the ASR process at least befo...
|
|
WO/2011/160651 |
Industrial environments are traditionally noisy, this makes it difficult to do condition monitoring of bearings in such environments. One method according to the invention uses acoustic signals to determine bearing condition. To reduce t...
|
|
WO/2011/161362 |
The invention relates to a method for controlling the shaping of encoding noise during the ADPCM encoding of a digital audio input signal. The noise-shaping is carried out through the use of feedback that comprises filtering noise. Said ...
|
|
WO/2011/162723 |
Embodiments provide an entropy encoder arrangement, including an input configured to receive an input signal, wherein the input signal includes a plurality of signal blocks and each signal block includes a plurality of signal sample valu...
|
|
WO/2011/161372 |
The invention relates to a digital audio synthesizer that includes: an input memory for receiving a sequence of digital data representing the amplitude spectrum of an audio signal over consecutive and overlapping time windows; a computer...
|
|
WO/2011/162740 |
Sound damping compositions and methods for their application are described herein. The compositions can include a polymer, a polyacrylate rheology modifier, and a polyurethane rheology modifier. The compositions can alternatively include...
|
|
WO/2011/161487 |
An apparatus comprising: at least one processor; and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, allow the apparatus to perfor...
|
|
WO/2011/161849 |
A regeneration information acquisition unit (182) acquires information identifying a content stream to be regenerated by an information processing device (100). A content information acquisition unit (184) acquires information identifyin...
|
|
WO/2011/160966 |
A method of providing a digital watermark in an audio signal comprises selecting a key frequency value determining how watermark information is to be embedded into a first time frame of the audio signal. A plurality of discrete frequency...
|
|
WO/2011/161886 |
Disclosed is a decoding device which can efficiently encode/decode spectral data in a high pass section of a broadband signal, can achieve a substantial reduction in the amount of processing computations, and can improve the quality of a...
|
|
WO/2011/163538 |
A vehicle based system and method for receiving voice inputs and determining whether to perform a voice recognition analysis using in-vehicle resources or resources external to the vehicle.
|