ICASSP 2019 Duration Controllable TTS: attention alignment and PDC model architecture

Emotional Text-to-Speech and Voice Conversion Systems

Led development of duration-controllable TTS and emotional voice conversion at Humelo, producing two ICASSP publications (2019 Oral 1st author, 2020). Won Minister of Science and ICT Special Award at K-Startup 2018.

May 15, 2020 · 2 min · Jungbae Park
K-Startup Minister Award

Government & Public R&D Grant Management

Secured ~US$875K across three competitive Korean government R&D grants (IITP, TIPS, Seoul R&BD) at Humelo, covering brain-inspired AI, emotional TTS, and voice conversion research.

March 15, 2020 · 1 min · Jungbae Park
CBRNN architecture with transfer learning for polyphonic sound event detection (ICASSP 2019)

Polyphonic Sound Event Detection with Transfer Learning

Developed convolutional bidirectional LSTM with synthetic data-based transfer learning for polyphonic sound event detection at Humelo, achieving +28.4% F1 improvement. Published at ICASSP 2019 as corresponding author.

April 15, 2019 · 1 min · Jungbae Park
Music X AI Collaboration Poster

AI Music Composition and SM Entertainment Collaboration

Led AI music composition and rap synthesis at Humelo, presented at SXSW 2019, collaborated with SM Entertainment and rapper Sleepy (KBS Documentary), and received coverage from 10+ national media outlets.

March 15, 2019 · 2 min · Jungbae Park
Speech Emotion Recognition Pipeline

Speech Emotion Recognition & Classification System

Built a multi-class speech emotion recognition system at Humelo using SpeechCNN and CRNN architectures with MFCC/Mel-spectrogram features, integrated into the Emotional TTS pipeline.

February 1, 2019 · 1 min · Jungbae Park