International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 6 Issue 4 July-August 2024 Submit your research before last 3 days of August to publish your research paper in the issue of July-August.

Improving AI Model Performance by Augmenting Synthetic Data

Author(s) Monojit Banerjee
Country United States
Abstract In recent years, supervised learning has improved many computer vision problems. However, data scarcity, lack of labeled data, and imbalanced datasets have created issues in adopting this improvement in the medical imaging domain. With the recent advancement in other large language and vision language models(eg: chatgpt, DALL-E) generating synthetic data has become easier. However, this is still cost-prohibitive for large-scale datasets specifically image dataset generation. This approach can also may not be suitable for privacy-first datasets. In this work, the proposed methodology is to generate synthetic images based on available labeled images and then use these generated images along with the existing data to solve above mentioned issues. Chest X-ray datasets are one of the complex datasets that suffer from label imbalance problems and strict data privacy is required for handling any such kind of data. In this work, a simplified generative adversarial network-based solution is used which is cost-effective and provides better results than only using available datasets. This proposed method is especially useful for privacy-first, imbalanced datasets. Finally, this solution was compared with some existing proposals. The promising result obtained using this methodology shows that this proposed solution can be expanded to other domains.
Keywords Artificial Intelligence, Machine Learning, Synthetic Data, Model training, GAN, MLOps
Field Computer > Artificial Intelligence / Simulation / Virtual Reality
Published In Volume 6, Issue 2, March-April 2024
Published On 2024-04-30
Cite This Improving AI Model Performance by Augmenting Synthetic Data - Monojit Banerjee - IJFMR Volume 6, Issue 2, March-April 2024. DOI 10.36948/ijfmr.2024.v06i02.18972
DOI https://doi.org/10.36948/ijfmr.2024.v06i02.18972
Short DOI https://doi.org/gts4rx

Share this