Title:
VOICE PROCESSING LEARNING METHOD, VOICE PROCESSING LEARNING DEVICE, AND PROGRAM
Document Type and Number:
WIPO Patent Application WO/2023/135788
Kind Code:
A1
Abstract:
A feature extraction unit 11 receives, as an enrolment voice, a voice identical to the subject voice (the voice spoken by a target speaker) and extracts a feature amount from it. A speaker expression extraction unit 13 extracts a speaker expression from the extracted feature amount. A target speaker extraction unit 15 uses the extracted speaker expression to extract an inferred subject voice from a mixed sound composed of the subject voice, a non-subject voice (the voice of a speaker other than the target speaker), and noise. An optimization unit 16 calculates a loss function from the inferred voice and the subject voice, and optimizes the target speaker extraction process so that the calculated value is minimized.
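
The training flow in the abstract can be illustrated with a toy sketch. It is not the applicant's implementation: every function name, the frame-energy feature, the scalar "speaker expression", the single trainable gain `w`, and the mean-squared-error loss are assumptions chosen only to keep the example small and runnable. The abstract does not specify the loss or the model form.

```python
import math
import random


def make_toy_signals(n=64, seed=0):
    """Build a synthetic subject voice and a mixture (hypothetical data)."""
    rng = random.Random(seed)
    target = [math.sin(2 * math.pi * 5 * k / n) for k in range(n)]
    interferer = [0.5 * math.sin(2 * math.pi * 11 * k / n) for k in range(n)]
    noise = [rng.gauss(0.0, 0.05) for _ in range(n)]
    mixture = [t + i + z for t, i, z in zip(target, interferer, noise)]
    return target, mixture


def extract_features(voice, frame=4):
    """Stand-in for feature extraction unit 11: mean energy per frame."""
    return [sum(x * x for x in voice[i:i + frame]) / frame
            for i in range(0, len(voice) - frame + 1, frame)]


def extract_speaker_expression(features):
    """Stand-in for unit 13: collapse the features into one scalar."""
    return sum(features) / len(features)


def extract_target(mixture, expression, w):
    """Stand-in for unit 15: scale the mixture by a gain conditioned on
    the speaker expression. w is the only trainable parameter here."""
    gain = w * expression
    return [gain * m for m in mixture]


def mse_loss(estimate, target):
    """Assumed loss: mean squared error between inferred and subject voice."""
    return sum((e - t) ** 2 for e, t in zip(estimate, target)) / len(target)


def train(target, mixture, steps=50, lr=1.0):
    """Stand-in for optimization unit 16: gradient descent on w."""
    # As in the abstract, the enrolment voice is the subject voice itself.
    expression = extract_speaker_expression(extract_features(target))
    w = 0.0
    losses = []
    for _ in range(steps):
        estimate = extract_target(mixture, expression, w)
        losses.append(mse_loss(estimate, target))
        # Analytic gradient of the MSE with respect to w.
        grad = sum(2 * (e - t) * expression * m
                   for e, t, m in zip(estimate, target, mixture)) / len(target)
        w -= lr * grad
    return w, losses


if __name__ == "__main__":
    target, mixture = make_toy_signals()
    w, losses = train(target, mixture)
    print("first loss:", losses[0], "last loss:", losses[-1])
```

A real system would replace the scalar gain with a neural extraction network and the frame energies with learned or spectral features; the sketch only shows how the four units connect during optimization.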

Inventors:
SATO HIROSHI (JP)
MAKISHIMA NAOKI (JP)
Application Number:
PCT/JP2022/001315
Publication Date:
July 20, 2023
Filing Date:
January 17, 2022
Assignee:
NIPPON TELEGRAPH & TELEPHONE (JP)
International Classes:
G10L17/04; G10L17/18
Foreign References:
JP2019219574A (2019-12-26)
JPH05143094A (1993-06-11)
Attorney, Agent or Firm:
NAKAO, Naoki et al. (JP)