VOICE PROCESSING LEARNING METHOD, VOICE PROCESSING LEARNING DEVICE, AND PROGRAM

Title:

VOICE PROCESSING LEARNING METHOD, VOICE PROCESSING LEARNING DEVICE, AND PROGRAM

Document Type and Number:

WIPO Patent Application WO/2023/135788

Kind Code:

A1

Abstract:

A feature extraction unit 11 extracts a feature amount from a voice that is the same as a subject voice, which is the voice spoken by a target speaker, received as an enrolment voice. A speaker expression extraction unit 13 extracts a speaker expression from the extracted feature amount. A target speaker extraction unit 15 uses the extracted speaker expression to extract a voice inferred to be a subject voice from mixed sound constituted by the subject voice, a non-subject voice that is the voice of a different speaker to the target speaker, and noise. An optimization unit 16 calculates a loss function using the inferred voice and the subject voice, and optimizes the target speaker extraction process so that the calculated value is minimized.

Inventors:

SATO HIROSHI (JP)
MAKISHIMA NAOKI (JP)

Application Number:

PCT/JP2022/001315

Publication Date:

July 20, 2023

Filing Date:

January 17, 2022

Export Citation:

Click for automatic bibliography generation Help

Assignee:

NIPPON TELEGRAPH & TELEPHONE (JP)

International Classes:

G10L17/04; G10L17/18

Foreign References:

JP2019219574A	2019-12-26
JPH05143094A	1993-06-11

Attorney, Agent or Firm:

NAKAO, Naoki et al. (JP)

Download PDF:

View/Download PDF PDF Help

Previous Patent: REFRIGERATOR

Next Patent: WARNING DEVICE, WARNING METHOD, AND PROGRAM