Efficient lightweight single-photon three-dimensional imaging method

doi:10.3969/j.issn.1007-5461.2026.03.006

Chinese Journal of Quantum Electronics, 2026, Vol. 43, Issue (3): 384-393.

ZHENG Jiekai 1, LIU Weiyue 1, LIU Teng 1, LIN Zehong 2*

1 Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo 315211, China; 2 School of Electronic Information, Lishui Vocational & Technical College, Lishui 323000, China

Received: 2024-03-26 Revised: 2024-05-16 Published: 2026-05-28 Online: 2026-05-28
*Corresponding author: E-mail: linzehong@yeah.net
About the author: ZHENG Jiekai (2000 - ), from Lishui, Zhejiang, graduate student, mainly engaged in research on quantum communication and its applications. E-mail: 752091756@qq.com
Supported by:
Zhejiang Provincial Natural Science Foundation (LY21F050003, LY23F010003), "Pioneer" and "Leading Goose" R&D Program of Zhejiang (2024C01105)

Abstract

Abstract: With the advancement of deep learning, single-photon imaging has become an important and challenging research direction, one that helps guide the 3D reconstruction of single-photon images. A single-photon image is a sparse, noisy 3D volume whose time channel contains only a few valid signal echoes. Existing architectures, however, generally obtain better results by building larger backbone networks, at the cost of higher GPU memory usage, so designing a lightweight yet effective model that can be deployed on edge devices is also a pressing concern in the research community. This paper proposes a lightweight architecture that achieves results comparable to other methods at a greatly reduced computational cost. Specifically, a Swin Transformer network first extracts the temporal-domain features of the single-photon image, and this time prediction network greatly reduces the dimensionality of the image. A densely cascaded multi-scale network (DCMNet) then extracts the spatial-domain features to further reconstruct the image. DCMNet improves the interconnection between decoding layers through a top-down cascade pathway and dense connections, producing high-quality multi-resolution depth outputs. Experiments show that the proposed network achieves good results while greatly reducing resource consumption.

Key words: computer vision, three-dimensional reconstruction of single-photon images, Swin Transformer and densely cascaded multi-scale network, lightweight architecture, edge computing
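
As a rough illustration of why handling the time axis first shrinks the data, the sketch below collapses a toy photon-count cube indexed as `cube[t][y][x]` into a single value per pixel using a fixed soft-argmax over the time bins. This is not the paper's Swin Transformer time prediction network, which is learned; the functions `make_toy_cube` and `soft_argmax_over_time`, the `temperature` parameter, and the toy data are all assumptions introduced here for illustration.

```python
import math


def make_toy_cube(true_bins, num_bins, signal=8.0, background=0.0):
    """Build a toy photon-count cube indexed as cube[t][y][x].

    Each pixel holds `signal` counts in its true time bin and
    `background` counts everywhere else (hypothetical toy data).
    """
    return [
        [
            [signal if t == true_bin else background for true_bin in row]
            for row in true_bins
        ]
        for t in range(num_bins)
    ]


def soft_argmax_over_time(cube, temperature=1.0):
    """Collapse cube[t][y][x] into map[y][x] of expected time-bin indices.

    A fixed, non-learned stand-in for a time prediction step: each pixel's
    counts are turned into softmax weights and used to average bin indices.
    """
    num_bins = len(cube)
    height, width = len(cube[0]), len(cube[0][0])
    depth_map = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            counts = [cube[t][y][x] for t in range(num_bins)]
            peak = max(counts)
            # Subtract the peak before exponentiating for numerical stability.
            weights = [math.exp((c - peak) / temperature) for c in counts]
            total = sum(weights)
            depth_map[y][x] = sum(t * w for t, w in enumerate(weights)) / total
    return depth_map


# Toy 2x2 scene with 16 time bins per pixel (values chosen arbitrarily).
true_bins = [[3, 12], [7, 0]]
cube = make_toy_cube(true_bins, num_bins=16)
depth_map = soft_argmax_over_time(cube)
```

Here the 16 x 2 x 2 cube becomes a 2 x 2 map, and each estimate lies close to its pixel's true bin. Any later spatial stage then works on a 2D map rather than the full 3D volume, which is the kind of saving the abstract attributes to reducing the time dimension first.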

CLC number: O431.2

ZHENG Jiekai, LIU Weiyue, LIU Teng, LIN Zehong. Efficient lightweight single-photon three-dimensional imaging method[J]. Chinese Journal of Quantum Electronics, 2026, 43(3): 384-393.
