About
Research profile
I build machine learning systems for audio and multimodal understanding, with a particular emphasis on music, speech, and generative modeling.
Across academia and industry, I have worked on source separation, speech enhancement, audio-language representations, sound event detection, and music captioning. I enjoy research that connects strong modeling ideas with practical media tools.
Feel free to look through my publications, browse the posts, or reach out via the contact page.