Audio deepfakes, by definition, are synthetic audio recordings generated using deep learning-based systems for malicious, artistic, or entertainment ...
We introduce MMAR, a new benchmark designed to evaluate the deep reasoning capabilities of Audio-Language Models (ALMs) across massive multi-disciplinary tasks. MMAR comprises 1,000 meticulously ...
Abstract: Motivated by depression's significant impact on global health, this work proposes MultiDepNet, a novel multi-modal interpretable depression detection system integrating visual, physiological ...
Abstract: Flood mapping using remote sensing data is critical to disaster response, especially in real-time monitoring and edge deployment. However, existing deep-learning (DL) models often face ...