量子电子学报 ›› 2026, Vol. 43 ›› Issue (3): 384-393.doi: 10.3969/j.issn.1007-5461.2026.03.006

• 图像与信息处理 • 上一篇    下一篇

高效轻量化的单光子三维成像方法

郑杰凯 1, 刘尉悦 1, 刘 腾 1, 林泽洪 2*   

  1. 1 宁波大学信息科学与工程学院, 浙江 宁波 315211;2 丽水职业技术学院电子信息学院, 浙江 丽水 323000
  • 收稿日期:2024-03-26 修回日期:2024-05-16 出版日期:2026-05-28 发布日期:2026-05-28
  • 通讯作者: E-mail: linzehong@yeah.net E-mail:E-mail: linzehong@yeah.net
  • 作者简介:郑杰凯 ( 2000 - ), 浙江丽水人, 研究生, 主要从事量子通信与应用方面的研究。 E-mail: 752091756@qq.com
  • 基金资助:
    浙江省自然科学基金 (LY21F050003, LY23F010003), 浙江省"尖兵''"领雁"研发攻关计划 (2024C01105)

Efficient lightweight single‑photon three‑dimensionalimaging method

ZHENG Jiekai 1 , LIU Weiyue 1 , LIU Teng 1 , LIN Zehong2*   

  1. 1 Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo 315211, China;2 School of Electronic Information, Lishui Vocational & Technical College, Lishui 323000, China
  • Received:2024-03-26 Revised:2024-05-16 Published:2026-05-28 Online:2026-05-28
  • Supported by:

摘要: 随着深度学习的发展,单光子成像慢慢成为了一个重要且具有挑战性的研究方向。它有助于指导单光子图像的3D重建。单光子图像是一种稀疏的充满噪声的3D图像,在它的时间通道中仅还有少量的有效信号回波。然而,在目前现有的架构当中,都是以建立更庞大的主干网络来取得更好的效果的,而代价是占用更高的显卡内存,因此设计一个轻量级但有效的模型使其能够部署到边缘设备上同样是目前科研界十分关心的事,本文提出了一种轻量级的架构,在计算量大大降低的情况下取得了和其他方法相近的效果。具体来说,本文先利用swin transformer网络提取了单光子图像的时间域特征,通过这个时间预测网络将单光子图像的维度大大的降低,再通过一个密集级联多尺度网络(DCMNet)来提取单光子图像的空间域特征进一步对单光子图像重建。它通过自上而下的级联路径和密集连接改进了解码层之间的互连,以产生高质量的多分辨率深度输出。实验证明,本文提出的网络在大大减少占用的资源的同时,还能取得不错的效果。

关键词: 计算机视觉, 单光子图像三维重建, Swin Transformer与密集级联多尺度网络, 轻量级架构, 边缘计算

Abstract: With the advancement of deep learning, single-photon imaging has gradually become an important and challenging research direction. It contributes to the 3D reconstruction of single-photon images, which are sparse, noise-filled 3D images with only a few valid signal echoes in their time channels. However, current architectures generally achieve better results by establishing larger backbone networks, which comes at the cost of higher GPU memory usage. Designing a lightweight yet effective model that can be deployed on edge devices is also a significant concern in the research community. Therefore, this paper proposes a lightweight architecture that achieves comparable results to other methods with significantly reduced computational requirements. Specifically, this work utilizes a Swin Transformer network to extract temporal features of single-photon images, using this time prediction network to significantly reduce the dimensions of single-photon images. Then, a Densely Cascaded Multi-scale Network (DCMNet) is employed to extract spatial domain features of single-photon images for further reconstruction. It improves the interconnection between decoding layers through a top-down cascade pathway and dense connections, producing high-quality multi-resolution depth outputs. Experimental results demonstrate that our network can achieve commendable results while significantly reducing resource consumption.

Key words: computer vision, three-dimensional reconstruction of single-photon images, Swin Transformer and dense cascaded multi-scale network, lightweight architecture, edge computing

中图分类号: