News
- Nov 2025: FoleyBench released
- Nov 2025: MMAU-Pro accepted to AAAI 2026
- Sep 2025: Mellow accepted to NeurIPS 2025
- Jul 2025: Work on Morphing accepted to WASPAA 2025
- Jan 2025: MACE accepted to ICASSP SALMA 2025
Selected Publications and Preprints
Vision Language Models Are Few-Shot Audio Spectrogram Classifiers
NeurIPS 2024 Audio Imagination Workshop