RetinoCardioNet: Multi-Modal Deep Learning Framework for Cardiovascular Risk Assessment Using Retinal Fundus Imaging
Main Article Content
Abstract
Retinal fundus imaging offers a non-invasive window into microvascular health, with growing evidence linking retinal abnormalities to systemic cardiovascular conditions. However, most computational models in this domain rely on isolated image features and fail to incorporate vascular geometry or clinical metadata. This study introduces RetinoCardioNet, a unified multi-modal deep learning framework designed for cardiovascular risk prediction using retinal fundus images, vascular graphs, and structured clinical data. The proposed system integrates three data modalities: high-resolution retinal images processed through a ResNet-50 encoder with self-supervised SimCLR pretraining, vessel topology encoded by graph convolutional networks (GCNs), and structured clinical data processed by a clinical metadata encoder. The resulting features are fused via a multi-head cross-attention mechanism. The framework was trained on public datasets (EyePACS, Messidor, UK Biobank) and evaluated using a 5-fold cross-validation protocol. Model optimization used Adam with a learning rate of [value missing], cosine annealing, and early stopping based on validation AUC. RetinoCardioNet achieved an AUC of 0.942, an F1-score of 0.916, a precision of 0.906, and a recall of 0.927. Ablation studies showed that performance dropped by up to [value missing] when key components were removed, confirming the contribution of each modality. Visual attention maps further improved interpretability. In conclusion, RetinoCardioNet offers a clinically relevant, interpretable, and scalable framework for non-invasive cardiovascular risk screening, showing potential for deployment in preventive cardiology, especially in resource-limited settings.
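The optimization recipe named in the abstract (cosine annealing of the learning rate combined with early stopping on validation AUC) can be illustrated with a minimal, framework-free sketch. It is not the authors' training code: the names `cosine_annealing_lr`, `EarlyStopping`, and `run_schedule` are hypothetical, the single-cycle cosine form and the `patience` rule are assumptions, and `base_lr` is a placeholder because no learning-rate value is given here.

```python
import math


def cosine_annealing_lr(step, total_steps, base_lr, min_lr=0.0):
    """Learning rate at `step` for one cosine decay from base_lr down to min_lr."""
    # Clamp so that steps outside [0, total_steps] stay on the curve's endpoints.
    progress = min(max(step, 0), total_steps) / total_steps
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


class EarlyStopping:
    """Signal a stop once validation AUC has not improved for `patience` epochs."""

    def __init__(self, patience=5, min_delta=0.0):
        self.patience = patience      # assumed hyperparameter, not from the paper
        self.min_delta = min_delta    # minimum AUC gain that counts as improvement
        self.best_auc = float("-inf")
        self.best_epoch = -1
        self.bad_epochs = 0

    def update(self, epoch, val_auc):
        """Record one epoch's validation AUC; return True if training should stop."""
        if val_auc > self.best_auc + self.min_delta:
            self.best_auc = val_auc
            self.best_epoch = epoch
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience


def run_schedule(val_aucs, base_lr, patience=3):
    """Replay a list of per-epoch validation AUCs through the schedule.

    Returns (best_epoch, best_auc, learning_rates_used).
    """
    stopper = EarlyStopping(patience=patience)
    lrs = []
    for epoch, auc in enumerate(val_aucs):
        lrs.append(cosine_annealing_lr(epoch, len(val_aucs), base_lr))
        if stopper.update(epoch, auc):
            break
    return stopper.best_epoch, stopper.best_auc, lrs
```

In a real training loop, `val_aucs` would be produced epoch by epoch on the held-out fold of each cross-validation split, and the checkpoint from `best_epoch` would be the one kept for evaluation.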
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
IJCERT Policy:
The published work presented in this paper is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. This means that the content of this paper can be shared, copied, and redistributed in any medium or format, as long as the original author is properly attributed. Additionally, any derivative works based on this paper must also be licensed under the same terms. This licensing agreement allows for broad dissemination and use of the work while maintaining the author's rights and recognition.
By submitting this paper to IJCERT, the author(s) agree to these licensing terms and confirm that the work is original and does not infringe on any third-party copyright or intellectual property rights.