SlideShare a Scribd company logo
Project Linne 徵音梅林
–– Virtual Singer Sound-bankVirtual Singer Sound-bank
Processing with PythonProcessing with Python
Yuan CHAO ( 趙元 )
PyCon TW
2016/06/03-05
ㄓ ˇ
Who am I ?
Yuan CHAO (John)
YChao
...
A researcher
working on HEP
using OSS...
No physics today
Let's talk about
Virtual Singer (VS)...
Familiar with TTS?
(Text-to-Speech synth.)
Siri, Ok Google
Virtual Singer –
Note-to-Vocal synth.
YAMAHA Vocaloid™
初音未來
Sound-bank for VS?
Sound bank
From commercial company
or volunteers
Editor
Notes and Lyrics
Vocal Synth.
Synthesized
song
Block diagram for
a VS system
Songs
飴屋 P - UTAU の基本的アルゴリズムと開発経緯
http://udn.utau-synth.com/documents/kouen/20120325/
Step 1: cut the recored sounds through into sound
elements (phonons)
飴屋 P - UTAU の基本的アルゴリズムと開発経緯
http://udn.utau-synth.com/documents/kouen/20120325/
Step 2: connect the elements following the lyrics
飴屋 P - UTAU の基本的アルゴリズムと開発経緯
http://udn.utau-synth.com/documents/kouen/20120325/
Step 3: Adjust the pitches and lengths of the lyrics
Sound-bank Parameters
Consonants / Vowel
子音 / 母音
聲母 / 韻母
(fixed / variable length)
Start
Beat matching
position
Beginning of Vowel
Consonants
End
Next sound
Sound Parameters
Sound bank for Japanese
The 50 Sounds
五十音
Japanese
50 sounds x 2
(voiced/unvoiced)
( 清濁 )
Japanese
Fifty sounds x 2
(voiced/unvoiced) +
Half-voiced, palatalized...
拗音、半濁音及其他輔音
Japanese
Fifty sounds x 2 +
others
~150 basic sounds
Japanese
with connected vowels
~150 sounds x 6
Japanese
With connected vowels
~150 sounds x 6
Total ~ 1000 sounds
Japanese
With connected vowels
~150 sounds x 6
Total ~ 1000 sounds
(~10 samples/hr. for well trained people)
Chinese
Chinese
If counted with bopomofo
ㄅㄆㄇㄈ
Chinese
21 consonants, 16 vowels
聲母 21 個、韻母 16 個
Chinese
All possible sound
combinations
~ 450
Chinese
with connected vowels
~450 sounds x 9
一ㄨㄩㄚㄛㄜㄝㄦ n ( ㄣㄥㄢㄤ )
( ㄞㄟㄠㄡ )
Chinese
With connected vowels
~450 sounds x 9
~4000 sounds
https://github.com/benlau/linne-analyzer
http://www.gnu.org/software/octave/
https://github.com/jsawruk/pymir
Analysis framework by
Ben Lau
PyMIR lib
.wav I/O
Feature extraction
GNU Octave
visualization
Simple Analyzer
http://guhy.csie.ntust.edu.tw/pap/07_TWN_Mandarin_SingingVoice_Synthesis_BasedOn_ExpressionParameter_Analyzing.pdf
過零率
Zero-cross
rate
頻譜
變異數
Spectrum
variance
Simple Analyzer
http://guhy.csie.ntust.edu.tw/pap/07_TWN_Mandarin_SingingVoice_Synthesis_BasedOn_ExpressionParameter_Analyzing.pdf
過零率
Zero-cross
rate
頻譜
變異數
Spectrum
variance
Threshold method doesn't
give good results
Try in frequency domain
Fourier Transformation
https://en.wikipedia.org/wiki/Fourier_transform
https://en.wikipedia.org/wiki/Vowel
Spectrum Patterns
Time domain vs. Frequency domain
Thousands of samples
to be processed...
Try with ML tools –
SciKit Learn
Sampling with
sliding window
Using SVM to determine
the vowel positions
http://www.cmlab.csie.ntu.edu.tw/~cyy/learning/tutorials/SVM3.pdf
https://en.wikipedia.org/wiki/Support_vector_machine
http://www.csie.ntu.edu.tw/~cjlin/libsvm/index.html
Using SVM to set vowels
あいうえおん
Using SVM to set vowels
ㄚㄛㄜㄝ一ㄨㄩㄦ
ㄢㄣㄤㄥ (n)
Supervised learning -
Training sample?
https://github.com/yuanchao/linne-analyzer/blob/vowel_det/src/linne/analyzer/cmd/linne-train2.py
Take part of the data as
the training sample –
Data-driven Analysis
https://github.com/yuanchao/linne-analyzer/blob/vowel_det/src/linne/analyzer/cmd/linne-spect2.py
N
U
E
O
I
A
N
U
E
O
I
A
A-A-I-A-U
N
U
E
O
I
A
https://github.com/yuanchao/linne-analyzer/blob/vowel_det/src/linne/analyzer/cmd/linne-test2.py
Detecting connected vowels
fen-fia-fou-a-fe
ㄈㄣ - ㄈ一ㄚ - ㄈㄡ - ㄚ - ㄈㄜ
N
ㄩ
ㄨ
一
ㄝ
ㄜ
ㄛ
ㄚ
Still some room for
improvements
Fork Me on GitHub!
https://github.com/yuanchao/linne-analyzer/tree/vowel_det/src/linne/analyzer/cmd
ㄓ ˇ ㄧㄣㄇㄟ ˊ ㄌㄧㄣ ˊ
徵音梅林開發計畫
https://github.com/ProjectMeilin
ち おんメイリン
徵音梅林開發計畫
Free and open VS platform
痴音
ㄓ ˇ
徵音梅林開發計畫
Software: Paul Liu, MGDesigner,
Ben Lau, Atsushieno, Yuan Chao
ㄓ ˇ
徵音梅林開發計畫
Vocal: 羅竺 License: CC-BY
https://www.youtube.com/watch?v=OZNrVq50wEY
Welcome
示範曲播放
Live DEMO!!!
https://soundcloud.com/ychao/umbrella-linne2
http://www.nicovideo.jp/watch/sm26831479
https://soundcloud.com/ychao/utau-celluloid-linne-zh
以上
謝謝
Remerci de
Votre
Attention

More Related Content

PDF
【実践】Python入門_第一回【岩手県立大学_競プロサークル】
PDF
Python basics_ part1
PPTX
RubyConf Bangladesh 2017 - Core Ruby: How it works
PDF
Donnez des couleurs a votre terminal
PDF
pyconjp2015_talk_Translation of Python Program__
PDF
How to develop a rich terminal UI application
PDF
Mp25: Audio Fingerprinting and metadata correction with Python
PDF
Golangにおける端末制御 リッチなターミナルUIの実現方法
【実践】Python入門_第一回【岩手県立大学_競プロサークル】
Python basics_ part1
RubyConf Bangladesh 2017 - Core Ruby: How it works
Donnez des couleurs a votre terminal
pyconjp2015_talk_Translation of Python Program__
How to develop a rich terminal UI application
Mp25: Audio Fingerprinting and metadata correction with Python
Golangにおける端末制御 リッチなターミナルUIの実現方法

Similar to Project Linne 徵音梅林 -- Virtual Singer Sound-bank Processed with Python (10)

PDF
Py conjp2019 renyuanlyu_3
PPTX
Py conjp2019 renyuanlyu_3
PDF
Introduction of ToySynth
PPTX
Ok shazam, "la la-lalaa"!
PDF
Py conjp2019 renyuanlyu_3
PDF
2023-1117 AI Music Intro.pdf
ODP
(2014-05-24) [Taubaté Perl Mongers] AudioLazy Python DSP (Digital Signal Proc...
PDF
"All you need is AI and music" by Keunwoo Choi
PPT
Audio Production
PDF
Research on Automatic Music Composition at the Taiwan AI Labs, April 2020
Py conjp2019 renyuanlyu_3
Py conjp2019 renyuanlyu_3
Introduction of ToySynth
Ok shazam, "la la-lalaa"!
Py conjp2019 renyuanlyu_3
2023-1117 AI Music Intro.pdf
(2014-05-24) [Taubaté Perl Mongers] AudioLazy Python DSP (Digital Signal Proc...
"All you need is AI and music" by Keunwoo Choi
Audio Production
Research on Automatic Music Composition at the Taiwan AI Labs, April 2020
Ad

More from Yuan CHAO (15)

PDF
基礎科學研究連結跨領域的未來 The Link between Fundamental Science Research and the Future o...
PDF
玩轉 LHC 公開數據 (Play around with the LHC open data)
PDF
介紹 TrackML 挑戰 (Introduction to TrackML Kaggle challenge)
PDF
淺嚐 LHCb 數據分析的滋味 Play around the LHCb Data on Kaggle with SK-Learn and MatPlotLib
PDF
蒙地卡羅模擬與志願運算
PDF
粒子物理與天文物理學簡介
PDF
「大數據」時代的「小問題」-- 以數據分析的手法處理虛擬歌手聲源參數
PDF
用 Python 玩 LHC 公開數據
PDF
Mass Resconstruction with HEP detectors
PDF
Detector Simulation for HEP
PDF
巨量資料分析輕鬆上手_教您玩大強子對撞機公開數據
PDF
大強子計算網格與OSS
PDF
No more fractures MingLiu font!
PDF
Introduce powertop to end users
PDF
Introduction to Soas
基礎科學研究連結跨領域的未來 The Link between Fundamental Science Research and the Future o...
玩轉 LHC 公開數據 (Play around with the LHC open data)
介紹 TrackML 挑戰 (Introduction to TrackML Kaggle challenge)
淺嚐 LHCb 數據分析的滋味 Play around the LHCb Data on Kaggle with SK-Learn and MatPlotLib
蒙地卡羅模擬與志願運算
粒子物理與天文物理學簡介
「大數據」時代的「小問題」-- 以數據分析的手法處理虛擬歌手聲源參數
用 Python 玩 LHC 公開數據
Mass Resconstruction with HEP detectors
Detector Simulation for HEP
巨量資料分析輕鬆上手_教您玩大強子對撞機公開數據
大強子計算網格與OSS
No more fractures MingLiu font!
Introduce powertop to end users
Introduction to Soas
Ad

Recently uploaded (20)

PDF
Lecture1 pattern recognition............
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPT
Predictive modeling basics in data cleaning process
PPTX
Managing Community Partner Relationships
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Transcultural that can help you someday.
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Lecture1 pattern recognition............
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
IB Computer Science - Internal Assessment.pptx
Introduction-to-Cloud-ComputingFinal.pptx
Clinical guidelines as a resource for EBP(1).pdf
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Introduction to Knowledge Engineering Part 1
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Galatica Smart Energy Infrastructure Startup Pitch Deck
Predictive modeling basics in data cleaning process
Managing Community Partner Relationships
STUDY DESIGN details- Lt Col Maksud (21).pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
[EN] Industrial Machine Downtime Prediction
Supervised vs unsupervised machine learning algorithms
Qualitative Qantitative and Mixed Methods.pptx
.pdf is not working space design for the following data for the following dat...
Transcultural that can help you someday.
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...

Project Linne 徵音梅林 -- Virtual Singer Sound-bank Processed with Python