首页 /研究 /TTS Skins: Speaker Conversion via ASR

OTHER

TTS Skins: Speaker Conversion via ASR

Adam Polyak, Lior Wolf, Yaniv Taigman

发表年份: 2019
访问权限: 开放获取

摘要

We present a fully convolutional wav-to-wav network for converting between speakers' voices, without relying on text. Our network is based on an encoder-decoder architecture, where the encoder is pre-trained for the task of Automatic Speech Recognition, and a multi-speaker waveform decoder is trained to reconstruct the original signal in an autoregressive manner. We train the network on narrated audiobooks, and demonstrate multi-voice TTS in those voices, by converting the voice of a TTS robot.

关键词

cs.SDcs.LGstat.ML

相关论文

OTHER

📊 1 引用

一种面向线弧增材制造的电动汽车结构可制造性拓扑优化的双环框架

Qiang Cui, Chuan Yu, Daoqian Yang 等 5 位作者

Robotics and Computer-Integrated Manufacturing · 2026

OTHER

📊 0 引用

几何数字孪生：一种用于航空发动机装配精度预测的数字智能模型

Ke Shang, Xin Jin, Teli Xu 等 7 位作者

Robotics and Computer-Integrated Manufacturing · 2026

OTHER

📊 0 引用

通过人工智能驱动的机器人技术革新产业

Aryan Chaudhary

Recent Advances in Computer Science and Communications · 2026

OTHER

📊 0 引用

新型大口径偏置馈电可展开天线设计与动态性能预测

Chuang Shi, Tianming Liu, Ning Xue 等 9 位作者

Aerospace Science and Technology · 2026

查看 OTHER 分类全部论文