AudioFingerprint is a production-ready, local audio fingerprinting and song identification system inspired by Shazam and Google Sound Search. It uses spectral peak extraction and combinatorial hashing ...
Abstract: Over the past few decades, convolutional neural networks (CNNs) have found broad application in image recognition. Nevertheless, the operational environment of CNNs is facing significant ...
Abstract: This paper presents a novel approach incorporating Facial Expression Recognition (FER) to improve emotional and contextual understanding in Vision-Language Pretraining (VLP) model-generated ...
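The AudioFingerprint entry above mentions spectral peak extraction and combinatorial hashing. As a rough illustration of that general idea, the sketch below finds local maxima in a precomputed magnitude spectrogram and hashes pairs of peaks together with their time offset. This is not AudioFingerprint's actual code or API: the function names `find_peaks` and `hash_peak_pairs`, the parameters `min_magnitude`, `fan_out`, and `max_dt`, and the choice of a truncated SHA-1 as the hash are all assumptions made for illustration, and the spectrogram is assumed to be given as a list of time frames.

```python
import hashlib


def find_peaks(spectrogram, min_magnitude=0.0):
    """Return (time_idx, freq_idx) cells that are strict local maxima.

    `spectrogram` is assumed to be a list of time frames, each a list of
    magnitudes per frequency bin (hypothetical input layout).
    """
    peaks = []
    n_t = len(spectrogram)
    n_f = len(spectrogram[0]) if n_t else 0
    for t in range(n_t):
        for f in range(n_f):
            value = spectrogram[t][f]
            if value <= min_magnitude:
                continue
            # Compare against the (up to) 8 surrounding cells.
            neighbours = [
                spectrogram[tt][ff]
                for tt in range(max(0, t - 1), min(n_t, t + 2))
                for ff in range(max(0, f - 1), min(n_f, f + 2))
                if (tt, ff) != (t, f)
            ]
            if all(value > n for n in neighbours):
                peaks.append((t, f))
    return peaks


def hash_peak_pairs(peaks, fan_out=3, max_dt=10):
    """Pair each anchor peak with up to `fan_out` later peaks.

    Each pair is hashed from (anchor freq, target freq, time delta), so the
    hash does not depend on where in the recording the pair occurs.
    Returns a list of (hash, anchor_time) tuples.
    """
    peaks = sorted(peaks)
    fingerprints = []
    for i, (t1, f1) in enumerate(peaks):
        paired = 0
        for t2, f2 in peaks[i + 1:]:
            dt = t2 - t1
            if dt > max_dt:
                break  # peaks are time-sorted, so later ones are farther away
            if dt <= 0:
                continue  # skip peaks in the same time frame
            key = f"{f1}|{f2}|{dt}".encode()
            fingerprints.append((hashlib.sha1(key).hexdigest()[:10], t1))
            paired += 1
            if paired >= fan_out:
                break
    return fingerprints
```

Because only the frequency pair and the time delta enter the hash, the same peak pattern produces the same hashes wherever it appears in a recording; in a scheme of this kind, the stored anchor times would then be used to check that matching hashes line up at a consistent offset.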