Saghir Alfasly

Assistant Professor of Biomedical Informatics · Mayo Clinic College of Medicine and Science
Research Associate, Department of AI & Informatics, Mayo Clinic

I am an Assistant Professor of Biomedical Informatics at Mayo Clinic College of Medicine and Science and a Research Associate in the Department of AI & Informatics, Mayo Clinic, Rochester, MN. My current research interests are (1) computational pathology and foundation models for histopathology, (2) generative AI for synthetic tissue synthesis, (3) multimodal foundation models and agentic AI for clinical applications, and (4) active learning and continual/lifelong learning for adaptive clinical AI systems.

My publications appear in NeurIPS, CVPR, IEEE TNNLS, IEEE TITS, Mayo Clinic Proceedings, and IEEE Reviews in Biomedical Engineering, among others.

Previously, I was a Postdoctoral Research Fellow at the Shenzhen Key Laboratory of Advanced Machine Learning and Applications, Shenzhen University, and the Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen, China.

I received my Ph.D. in Information and Communication Engineering (Computer Vision & Machine Learning) from the School of Electronics and Information Engineering, South China University of Technology, China, supported by the Chinese Government Scholarship. I also hold a B.Sc. and M.Sc. in Computer Science.

News

July 2026	NeurIPS workshop proposal accepted: AI at Scale for Clinical Impact (ASCI): Cancer Pathology Foundation Models.
July 2026	Awarded the Travel award - Pathology Visions 2026 - Digital Pathology Association
Apr 2026	Tutorial proposal accepted at ICHI 2026: Synthetic Biomedical Data Generation Across Modalities.
Jan 2026	Appointed as Assistant Professor of Biomedical Informatics at Mayo Clinic College of Medicine and Science.
Dec 2025	Paper accepted at NeurIPS 2025: HeteroTissue-Diffuse: Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology.
Nov 2025	Co-organized the NeurIPS 2025 Competition on Self-Supervised Learning for Cancer Pathology Foundation Models (SLC-PFM). [Website]
Nov 2025	Delivered an invited seminar at Kaiko-AI on generative AI for histopathology.
Dec 2025	Named AI Award Finalist — VIBE Summit, Mayo Clinic, December 9, 2025.
Sep 2025	Received the Excellence Award in AI, Data Science, and Computational Biology — Mayo Research Fellows' Association (MRFA), Rochester, Minnesota.
Jun 2025	Finalist in the Research Pitch Competition — Mayo Clinic, June 25, 2025.
Jul 2025	Organized and delivered hands-on workshop: Practical AI in Healthcare: From Fine-tuning Large Models to Deployment with Open Source Tools — Mayo Clinic AI Summit 2025.
Apr 2025	Invited seminar at AbbVie (Computer Vision Research Team): Histopathology Image Analysis — Challenges and Limitations.
Feb 2025	Paper published in Scientific Reports: Validation of Histopathology Foundation Models through Whole Slide Image Retrieval.
Jan 2025	Paper published in IEEE Reviews in Biomedical Engineering: Analysis and Validation of Image Search Engines in Histopathology.

Awards & Recognition

2026

Travel Award — Pathology Visions 2026 - Digital Pathology Association

2025

Excellence Award in AI, Data Science, and Computational Biology — MRFA, Mayo Clinic, Rochester, Minnesota.

2025

AI Award Finalist — VIBE Summit, Mayo Clinic, Rochester, Minnesota, December 2025.

2025

Research Pitch Competition Finalist — Mayo Clinic, Rochester, Minnesota. June 25, 2025.

2016

Chinese Government Scholarship — South China University of Technology, Guangzhou, China. (2016–2020)

2015

Award for Excellent Performance — Department of Computer Science, Kuvempu University, India.

2014

First Place, Programming Competition (Java, C, C++) — University Of Mysore, India.

Talks & Presentations

Invited Talks

HeteroTissue-Diffuse: Visual Generative Models in Histopathology

Research Seminar, Kaiko-AI. November 2025.

Agentic AI for Medicine: Potential and Challenges

Department of AI & Informatics, Mayo Clinic. April 2025.

Histopathology Image Analysis — Challenges and Limitations

Computer Vision Research Team Imaging Seminar, AbbVie. April 2025.

Intelligent Video Analysis Solutions with Deep Learning

Annual Training Programme, GRG Banking Equipment Co., Ltd., Guangzhou, China. January 2018.

Conference Presentations

HeteroTissue-Diffuse: Semantic and Visual Crop-Guided Diffusion Models … (Oral)

Deep Learning & Machine Learning Journal Club, Mayo Clinic. November 2025.

Rotation-Agnostic Image Representation Learning for Digital Pathology (Poster)

CVPR 2024, Seattle, WA. June 2024.

Decoding the Foundation Models Revolution: Data vs. Model Structure (Poster)

Mayo Clinic AI Summit 2024, Rochester, MN. July 2024.

Overfitting in Histopathology Image Model Training: The Need for Customized Architectures (Oral)

Mayo Clinic AI Summit 2024, Rochester, MN. July 2024.

Learnable Irrelevant Modality Dropout for Multimodal Action Recognition (Poster)

CVPR 2022, New Orleans, LA. June 2022.

Variational Representation Learning for Vehicle Re-Identification (Oral)

IEEE ICIP 2019, Taipei, Taiwan. September 2019.

Workshops and Tutorials

Practical AI in Healthcare: From Fine-tuning Large Models to Deployment with Open Source Tools

Mayo Clinic AI Summit 2025, Rochester, MN. July 2025.

Publications

NeurIPS 2025

Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology

Saghir Alfasly, Wataru Uegami, MD Enamul Hoq, Ghazal Alabtah, H.R. Tizhoosh

Neural Information Processing Systems (NeurIPS), 2025

Project Video PDF Code

HeteroTissue-Diffuse is a latent diffusion framework for synthesizing histopathology images that preserve tissue heterogeneity and fine morphological detail. A novel conditioning mechanism scales to both annotated and unannotated datasets, enabling realistic, diverse, and annotated synthetic tissue slides for training data augmentation and privacy-preserving model development.
CVPR 2024

Rotation-Agnostic Image Representation Learning for Digital Pathology

Saghir Alfasly, Abubakr Shafique, Peyman Nejat, Jibran Khan, Areej Alsaafin, Ghazal Alabtah, H.R. Tizhoosh

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

Project Demo PDF Code

PathDino is a compact histopathology Transformer (~9M parameters) with rotation-invariant representations. Introduces HistoRotate (360° augmentation) and a fast patch selection (FPS) method that preserves spatial distribution while reducing computation. Achieves competitive performance with models 10× larger.
MCP:DH 2024

Foundation Models for Histopathology — Fanfare or Flair

Saghir Alfasly, Peyman Nejat, Sobhan Hemati, Jibran Khan, Isaiah Lahr, Areej Alsaafin, Abubakr Shafique, Nneka Comfere, Dennis Murphree, Chady Meroueh, Saba Yasir, Aaron Mangold, Lisa Boardman, Vijay H. Shah, Joaquin J. Garcia, H.R. Tizhoosh

Mayo Clinic Proceedings: Digital Health, 2024

PDF Supp

Rigorous evaluation of CLIP-based foundation models (PLIP, BiomedCLIP) against domain-specific histopathology models across 8 diverse datasets (4 Mayo Clinic internal + 4 public: PANDA, BRACS, CAMELYON16, DigestPath). Domain-specific models such as DinoSSLPath and KimiaNet consistently outperform general-purpose foundation models, underscoring the importance of curated, domain-specific training data.
Preprint 2023

Selection of Distinct Morphologies to Divide & Conquer Gigapixel Pathology Images

Abubakr Shafique, Saghir Alfasly, Areej Alsaafin, Peyman Nejat, Jibran Khan, H.R. Tizhoosh

Preprint, 2023

PDF

SDM: a patch selection method for whole-slide images that minimizes patch count while capturing all morphological variations. Outperforms state-of-the-art methods without requiring parameter tuning, enabling more efficient representation learning on gigapixel images.
IEEE TITS 2023

OSRE: Object-to-Spot Rotation Estimation for Bike Parking Assessment

Saghir Alfasly, Zaid Al-Huda, Saifullahi Bello, Ahmed Elazab, Jian Lu, Chen Xu

IEEE Transactions on Intelligent Transportation Systems, 25(6):6013–6022, 2023

Project Video PDF Code Data

OSRE estimates a parked bike's rotation relative to its designated parking spot using 3D graphics and computer vision. Enables intelligent surveillance systems, bike-sharing management, and smart city applications. Introduces a synthetic dataset (SynthBRSet) for training and evaluation.
IEEE TNNLS 2024

An Effective Video Transformer with Synchronized Spatiotemporal and Spatial Self-Attention for Action Recognition

Saghir Alfasly, Charles K. Chui, Qingtang Jiang, Jian Lu, Chen Xu

IEEE Transactions on Neural Networks and Learning Systems, 35(2):2496–2509, 2024

PDF Video

SSTSA: a novel spatiotemporal attention scheme combining temporal and spatial multi-headed self-attention modules in a synchronized manner for efficient and effective video action recognition, outperforming prior transformer-based methods.
CVPR 2022

Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos

Saghir Alfasly, Jian Lu, Chen Xu, Yuru Zou

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022

PDF Supp

IMD (Irrelevant Modality Dropout): a multimodal learning approach for video action recognition that adaptively identifies and drops audio modalities irrelevant to the visual content, improving recognition on datasets with modality-specific annotations.
Neurocomp. 2022

FastPicker: Adaptive Independent Two-Stage Video-to-Video Summarization for Efficient Action Recognition

Saghir Alfasly, Jian Lu, Chen Xu, Zaid Al-Huda, Qingtang Jiang, Zhaosong Lu, Charles K. Chui

Neurocomputing, 516:231–244, 2022

PDF Video

A fast, independent, adaptive two-stage algorithm for selecting the most discriminative and representative video frames, significantly reducing dataset size while improving action recognition performance.
Appl. Intel. 2022

Weakly Supervised Pavement Crack Semantic Segmentation Based on Multi-Scale Object Localization and Incremental Annotation Refinement

Zaid Al-Huda, Bo Peng, Riyadh Nazar Ali Algburi, Saghir Alfasly, Tianrui Li

Applied Intelligence, 53(11):14527–14546, 2022

PDF

Weakly supervised segmentation of pavement cracks using multi-scale object localization and iterative annotation refinement, reducing the need for expensive pixel-level labels while achieving competitive segmentation accuracy.
IEEE Access 2019

Multi-Label-Based Similarity Learning for Vehicle Re-Identification

Saghir Alfasly, Yongjian Hu, Haoliang Li, Tiancai Liang, Xiaofeng Jin, BeiBei Liu, Qingli Zhao

IEEE Access, 2019

PDF Video

Multi-label similarity learning framework for vehicle re-identification that exploits label correlations and multi-granularity features for robust cross-camera vehicle matching.
ICIP 2019

Variational Representation Learning for Vehicle Re-Identification

Saghir Alfasly, Yongjian Hu, Tiancai Liang, Xiaofeng Jin, Qingli Zhao, Beibei Liu

IEEE International Conference on Image Processing (ICIP), 2019

PDF Code

Variational representation learning for object re-identification. Evaluated on vehicle re-ID, person re-ID, and face recognition benchmarks, demonstrating that variational approaches improve generalization across unseen identities.
IEEE Access 2019

Auto-Zooming CNN-Based Framework for Real-Time Pedestrian Detection in Outdoor Surveillance Videos

Saghir Alfasly, BeiBei Liu, Yongjian Hu, Yufei Wang, Chang-Tsun Li

IEEE Access, 2019

PDF Code Video

A fast, lightweight, auto-zooming CNN framework for detecting small pedestrians in outdoor surveillance videos, using adaptive zoom and multi-scale feature fusion to improve detection accuracy at varying distances.

Additional Publications

Validation of Histopathology Foundation Models through Whole Slide Image Retrieval

Alfasly S., Alabtah G., Hemati S., Kalari K.R., Garcia J.J., Tizhoosh H.R.

Scientific Reports, 15(1):3990, 2025
Analysis and Validation of Image Search Engines in Histopathology

Lahr I., Alfasly S., Nejat P., Khan J., Kottom L., et al.

IEEE Reviews in Biomedical Engineering, 18:350–367, 2025
Geometric Edge Convolution for Rigid Transformation Invariant Features in 3D Point Clouds

Saifullahi, Alfasly S., et al.

Neurocomputing, 622:129313, 2025
Auxiliary Audio-Textual Modalities for Better Action Recognition on Vision-Specific Annotated Videos

Alfasly S., Jian Lu, Chen Xu, Yuru Zou

Pattern Recognition, 156:110808, 2024
Sequential Patching Lattice for Image Classification and Enquiry

Alsaafin A., Nejat P., Shafique A., Khan J., Alfasly S., Alabtah G., Tizhoosh H.R.

American Journal of Pathology, 194(10):1898–1912, 2024
Model-Agnostic Binary Patch Grouping for Bone Marrow Whole Slide Image Representation

Mu Y., Tizhoosh H.R., Dehkharghanian T., Alfasly S., Campbell C.J.V.

American Journal of Pathology, 194(5):721–734, 2024
AlcLaM: Arabic Dialectal Language Model

Murtadha, Alfasly S., et al.

ACL — Proceedings of the Second Arabic NLP Conference, 2024
When is a Foundation Model a Foundation Model?

Alfasly S., Nejat P., Hemati S., Khan J., Lahr I., et al.

arXiv Preprint, arXiv:2309.11510, 2023

Teaching & Mentoring

Teaching

Practical AI in Healthcare: From Fine-tuning Large Models to Deployment with Open Source Tools

Workshop · Mayo Clinic AI Summit · Rochester, MN · July 2025

AI for Video Understanding

Lab seminar series · Shenzhen Key Laboratory of Advanced Machine Learning and Applications, Shenzhen University, China · Fall 2021

Foundations of Computer Vision

Lab seminar series · School of Electronic and Information Engineering, South China University of Technology, China · Fall 2018

Mentoring

Mentored 15+ students and research interns across multiple institutions. Selected outcomes:

MD Enamul Hoq

PhD Student · Mayo Clinic · 2025

Co-authored two papers at NeurIPS 2025 and Mayo Clinic Proceedings.

→ PhD Research Assistant @ UAMS

Jibran Khan

Research Intern · Mayo Clinic · 2023–2024

Co-authored papers at CVPR 2024 and Mayo Clinic Proceedings.

→ Data Analyst & AI/ML Developer @ Delta

Yuyi Lin

Graduate Student · SCUT · 2018–2019

Deep learning for human attribute recognition in surveillance systems.

→ R&D Engineer @ ByteDance

Luo Xin

Graduate Student · Shenzhen Univ. · 2021–2022

Object detection in videos and 3D synthetic data generation.

→ Assisted Driving Engineer @ BYD

Chen Hao

Graduate Student · SCUT · 2017–2019

Detection algorithms for partially masked objects in surveillance videos.

→ Software Engineer @ Honor Terminal Co. (Huawei)

Polash Das

Graduate Student · SCUT · 2017–2019

Face anti-spoofing using handcrafted and deep network features.

→ R&D Manager @ Dragon Tech Pte Ltd, Singapore

Service

Organizing

Tutorial Co-organizer — Synthetic Biomedical Data Generation Across Modalities, ICHI 2026.
Competition/Workshop Co-organizer — Self-Supervised Learning for Cancer Pathology Foundation Models (SLC-PFM) NeurIPS 2025.
Workshop Co-organizer — Practical AI in Healthcare: From Fine-Tuning Large Models to Deployment with Open Source Tools, Mayo Clinic AI Summit 2025.

Editorial

Area Chair — IJCNN (International Joint Conference on Neural Networks). 2025–Present.

Reviewer — Journals

Reviewer — Conferences

CVPR 2021, 2022, 2023, 2024, 2025
ICCV 2023, 2025
ECCV 2024
AMIA Annual Symposium 2025–Present
ACM Multimedia (ACM MM)
NeurIPS 2025
MICCAI 2026