International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 7, Issue 1 (January-February 2025) Submit your research before last 3 days of February to publish your research paper in the issue of January-February.

Emotion Detection in Speech: A Deep Learning-Driven Approach Leveraging Acoustic Features within Intelligent Computing Systems

Author(s) Sulaxman Kaja, Vijaya Chandra Jadala
Country India
Abstract As digital image collections grow in size, there is an increasing need for robust image retrieval systems capable of managing large datasets effectively. This work offerings a novel Content-Based Image Retrieval (CBIR) system designed to enhance retrieval accuracy across both general and medical image
datasets. The proposed system leverages U-Net for feature extraction, integrated with an improved weight-learning approach to enhance retrieval performance. A Convolutional neural network, U-Net, a network design famous for its picture segmentation ability, is utilized to capture complex, high-level
image features. The approach includes an adaptive, query-aware feature weighting mechanism that applies weight re-scaling to parameterized features, assigning optimized weights to top-ranked images.
This CBIR system comprises three main components: image pre-processing, U-Net-based feature extraction, and feature re-weighting. During pre-processing, images undergo augmentation and normalization to increase model robustness. The feature re-weighting process evaluates feature importance using cosine similarity, which further improves discriminative power at retrieval. The
proposed CBIR system was tested across various image datasets through extensive experiments, with performance measured in terms of recall, precision and F1-score. The results indicate that integrating feature re-weighting with U-Net-based extraction significantly enhances retrieval effectiveness. This
work represents a step forward in developing adaptive image retrieval systems that better respond to diverse retrieval scenarios.
Keywords A Deep Learning-Driven Approach Leveraging Acoustic Features within Intelligent Computing Systems
Field Computer > Artificial Intelligence / Simulation / Virtual Reality
Published In Volume 7, Issue 1, January-February 2025
Published On 2025-01-28
Cite This Emotion Detection in Speech: A Deep Learning-Driven Approach Leveraging Acoustic Features within Intelligent Computing Systems - Sulaxman Kaja, Vijaya Chandra Jadala - IJFMR Volume 7, Issue 1, January-February 2025. DOI 10.36948/ijfmr.2025.v07i01.35643
DOI https://doi.org/10.36948/ijfmr.2025.v07i01.35643
Short DOI https://doi.org/g829r6

Share this