Aiding Medical Diagnosis through Image Synthesis and Classification
Journal:
arXiv
Published Date:
Jun 1, 2025
Abstract
Medical professionals, especially those in training, often depend on visual
reference materials to support an accurate diagnosis and develop pattern
recognition skills. However, existing resources may lack the diversity and
accessibility needed for broad and effective clinical learning. This paper
presents a system designed to generate realistic medical images from textual
descriptions and validate their accuracy through a classification model. A
pretrained stable diffusion model was fine-tuned using Low-Rank Adaptation
(LoRA) on the PathMNIST dataset, consisting of nine colorectal histopathology
tissue types. The generative model was trained multiple times using different
training parameter configurations, guided by domain-specific prompts to capture
meaningful features. To ensure quality control, a ResNet-18 classification
model was trained on the same dataset, achieving 99.76% accuracy in detecting
the correct label of a colorectal histopathological medical image. Generated
images were then filtered using the trained classifier and an iterative
process, where inaccurate outputs were discarded and regenerated until they
were correctly classified. The highest performing version of the generative
model from experimentation achieved an F1 score of 0.6727, with precision and
recall scores of 0.6817 and 0.7111, respectively. Some types of tissue, such as
adipose tissue and lymphocytes, reached perfect classification scores, while
others proved more challenging due to structural complexity. The
self-validating approach created demonstrates a reliable method for
synthesizing domain-specific medical images because of high accuracy in both
the generation and classification portions of the system, with potential
applications in both diagnostic support and clinical education. Future work
includes improving prompt-specific accuracy and extending the system to other
areas of medical imaging.