About | Zhepei Wang

About

Research profile

I build machine learning systems for audio and multimodal understanding, with a particular emphasis on music, speech, and generative modeling.

Across academia and industry, I have worked on source separation, speech enhancement, audio-language representations, sound event detection, and music captioning. I enjoy research that connects strong modeling ideas with practical media tools.

Feel free to look through my publications, browse the posts, or reach out via the contact page.