Pipeline
Pose estimation from unordered images is fundamental for 3D reconstruction, robotics, and scientific imaging. Recent geometric foundation models enable end-to-end dense 3D reconstruction but remain underexplored for cryo-electron microscopy (cryo-EM), where reconstruction still depends on slow iterative optimization. We introduce CryoFastAR, the first geometric foundation model that directly predicts poses from cryo-EM particle images for fast ab initio reconstruction. CryoFastAR integrates multi-view features and is trained on large-scale simulated cryo-EM data with realistic noise and CTF modulation. A progressive training strategy stabilizes learning by starting from simplified settings before gradually increasing difficulty. Experiments on synthetic and real datasets show that CryoFastAR matches or exceeds reconstruction quality while significantly accelerating inference compared with traditional pipelines. Code, models, and datasets are released to spur further research.
CryoFastAR preserves fine structural details across simulated complexes while avoiding the artifacts that iterative pose-bootstrapping methods sometimes introduce. Visual comparisons highlight sharper density recovery for flexible regions and cleaner background suppression on challenging molecules.
On the experimental spliceosome dataset, CryoFastAR reconstructs cohesive densities that trace helices and peripheral domains, while iterative baselines struggle with missing or noisy regions. Feed-forward predictions offer a clean starting volume that downstream refinement can further polish.
@inproceedings{zhang2025cryofastar,
author = {Zhang, Jiakai and Zhou, Shouchen and Dai, Haizhao and Liu, Xinhang and Wang, Peihao and Fan, Zhiwen and Pei, Yuan and Yu, Jingyi},
title = {CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
year = {2025}
}