Vision language models in ophthalmology.

Journal: Current opinion in ophthalmology
Published Date:

Abstract

PURPOSE OF REVIEW: Vision Language Models are an emerging paradigm in artificial intelligence that offers the potential to natively analyze both image and textual data simultaneously, within a single model. The fusion of these two modalities is of particular relevance to ophthalmology, which has historically involved specialized imaging techniques such as angiography, optical coherence tomography, and fundus photography, while also interfacing with electronic health records that include free text descriptions. This review then surveys the fast-evolving field of Vision Language Models as they apply to current ophthalmologic research and practice.

Authors

  • Gilbert Lim
    School of Computing, National University of Singapore.
  • Kabilan Elangovan
    Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research Institute, Singapore.
  • Liyuan Jin
    Duke-NUS Medical School, Singapore, Singapore.