Audio deepfakes, by definition, are synthetic audio recordings generated using deep learning-based systems for either malicious, artistic, or entertainment ...
We introduce MMAR, a new benchmark designed to evaluate the deep reasoning capabilities of Audio-Language Models (ALMs) across massive multi-disciplinary tasks. MMAR comprises 1,000 meticulously ...
Abstract: In traditional audio captioning methods, a model is usually trained in a fully supervised manner using a human-annotated dataset containing audio-text pairs and then evaluated on the test ...
Abstract: Ship dangerous situations result from the fact that some ships encounter complex conditions in confined waters and, therefore, ship collision risk assessment is important to support maritime ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results