Asynchronous Single-Photon 3D Imaging

Abstract -- 摘要

Single-photon avalanche diodes (SPADs) are becoming popular in time-of-flight depth-ranging due to their unique ability to capture individual photons with picosecond timing resolution. However, ambient light (e.g., sunlight) incident on a SPAD-based 3D camera leads to severe non-linear distortions (pileup) in the measured waveform, resulting in large depth errors. The authors propose asynchronous acquisition schemes that temporally misalign SPAD measurement windows and the laser cycles through deterministically predefined or randomized offsets. Their approach demonstrates "improvement in depth accuracy of up to an order of magnitude" compared to existing methods across a wide range of imaging scenarios with high ambient flux.

單光子雪崩二極體（SPAD）因其以皮秒時間解析度捕捉個別光子的獨特能力，在飛時測距領域日益普及。然而，入射在 SPAD 三維相機上的環境光（如日光）會導致量測波形產生嚴重的非線性失真（堆積效應），造成顯著的深度誤差。作者提出非同步擷取方案，透過確定性預設或隨機化的偏移量，使 SPAD 量測窗口與雷射週期在時間上錯開。其方法在高環境光通量的各種成像場景中，展現了「深度準確度提升可達一個數量級」的成果。

段落功能全文總覽——從 SPAD 的優勢出發，揭示環境光挑戰，提出非同步解方。

邏輯角色摘要以「技術潛力 -> 實際瓶頸 -> 解決方案」的三段式結構組織，最後以「一個數量級改進」的量化結果收尾。

論證技巧 / 潛在漏洞「一個數量級」的宣稱極為吸引人，但需考量這是在特定高環境光條件下的結果。在低環境光場景中，堆積效應本身不嚴重，改進幅度可能大幅縮減。

1. Single-Photon Cameras -- 緒論

Single-photon avalanche diodes (SPADs) offer unprecedented photon detection sensitivity and picosecond timing precision, yet remain "restricted to a limited set of niche applications" primarily due to performance degradation under ambient light conditions. The fundamental problem is photon pileup: in conventional synchronous acquisition, "early arriving ambient photons prevent the SPAD from measuring the signal (laser) photons that may arrive at a later time bin." This creates a systematic bias toward shorter depth estimates, which worsens as ambient light intensity increases.

單光子雪崩二極體（SPAD）提供前所未有的光子偵測靈敏度與皮秒等級的時間精度，但仍「受限於有限的利基應用」，主要原因是在環境光條件下的性能退化。根本問題是光子堆積：在傳統的同步擷取中，「先到達的環境光子阻止了 SPAD 量測可能在較晚時段到達的訊號（雷射）光子」。這造成了對較短深度估計的系統性偏差，且隨環境光強度增加而加劇。

段落功能建立研究場域——定義 SPAD 的光子堆積問題。

邏輯角色以物理機制（先到的光子佔據偵測窗口）解釋為何環境光會造成深度偏差，為後續的非同步方案提供物理直覺。

論證技巧 / 潛在漏洞以淺顯的「先到先得」概念解釋複雜的物理現象，降低了理解門檻。但實際的堆積效應比「先到先得」更複雜，涉及 SPAD 的死時間、恢復時間等硬體參數。

Prior approaches include optical attenuation (reducing incident photon count), computational pileup correction (post-processing algorithms), temporally-shifted gated acquisition, and photon-driven acquisition modes. Optical attenuation sacrifices signal photons along with ambient photons, reducing the signal-to-background ratio. Computational correction relies on accurate noise models that may not hold in practice. The authors distinguish their contribution by focusing on acquisition strategy design that fundamentally prevents rather than corrects pileup effects.

先前的方法包括光學衰減（減少入射光子數）、計算堆積校正（後處理演算法）、時間偏移閘控擷取，以及光子驅動擷取模式。光學衰減在減少環境光子的同時也犧牲了訊號光子，降低了訊雜比。計算校正依賴可能在實際中不成立的精確雜訊模型。作者以「聚焦於根本預防而非校正堆積效應的擷取策略設計」來區分其貢獻。

段落功能文獻回顧——列舉四類現有方法及其局限。

邏輯角色「預防 vs 校正」的二分法精準定位了本文的差異化：現有方法試圖在堆積發生後修補，本文則從擷取階段就避免堆積。

論證技巧 / 潛在漏洞「預防優於治療」的修辭非常有說服力。但非同步擷取也有其代價——需要更多的擷取週期來覆蓋完整的深度範圍，可能降低幀率。

3. Single-Photon 3D Imaging Model -- 單光子三維成像模型

The paper establishes mathematical foundations using Poisson-distributed photon arrival models, defining the incident photon flux waveform and describing how synchronous acquisition fails under ambient illumination. In conventional operation, the SPAD measurement window is synchronized with each laser pulse. When ambient flux is high, photons from background illumination trigger the SPAD before the signal photon arrives, causing the measured histogram to be distorted toward earlier time bins. This distortion is fundamentally non-linear and cannot be removed by simple averaging.

本文以泊松分布的光子到達模型建立數學基礎，定義入射光子通量波形，並描述同步擷取在環境光照下如何失效。在傳統操作中，SPAD 量測窗口與每個雷射脈衝同步。當環境光通量高時，來自背景照明的光子在訊號光子到達前觸發 SPAD，導致量測直方圖向較早的時間格偏移。此失真本質上是非線性的，無法透過簡單平均消除。

段落功能模型建立——以泊松統計定義問題的數學結構。

邏輯角色此段將物理直覺（早到的光子佔據偵測器）形式化為數學語言，為後續的理論分析提供嚴謹的基礎。「非線性」的強調說明了為何簡單方法無效。

論證技巧 / 潛在漏洞泊松模型是光子統計的標準假設，數學上無可挑剔。但實際 SPAD 可能表現出非泊松行為（如串擾、後脈衝），這些效應在模型中被忽略。

4. Theory of Asynchronous Image Formation -- 非同步成像理論

The core theoretical contribution is the derivation of a Poisson-Multinomial Distribution for measured histograms under asynchronous acquisition, and the generalization of Coates's estimator for arbitrary temporal offsets. The formulation introduces the "denominator sequence" concept representing photon detection opportunities per bin. Two foundational results establish that uniform shifting strategies minimize depth estimation error bounds by achieving constant expected denominator sequences across all histogram bins, thereby distributing ambient light effects uniformly rather than concentrating them in early bins.

核心理論貢獻包括推導非同步擷取下量測直方圖的泊松-多項式分布，以及將 Coates 估計器推廣至任意時間偏移。此公式化引入了「分母序列」概念，代表每個時間格的光子偵測機會。兩個基礎性結果確立了均勻偏移策略能最小化深度估計的誤差界限，方式是使所有直方圖時間格的期望分母序列保持恆定，從而將環境光效應均勻分布而非集中於早期時間格。

段落功能核心理論——建立非同步擷取的數學基礎。

邏輯角色「分母序列」概念是本文最重要的理論創新——它提供了一個統一的數學框架來分析任何擷取策略的堆積行為，並自然地導出「均勻偏移最佳」的結論。

論證技巧 / 潛在漏洞理論推導的嚴謹性是本文的重大優勢。但「均勻分布環境光」的最佳性基於特定的誤差度量（RMSE），在其他度量下（如最大誤差）最佳策略可能不同。

5. Practically Optimal Acquisition -- 實際最佳化擷取

Three practical implementations are presented. SPAD Active Time Optimization derives the optimal active window duration, achieving RMSE improvements up to six-fold. Photon-Driven Shifting uses a free-running mode where cycle lengths vary stochastically based on photon detection timing, automatically achieving uniform shift distributions and providing adaptive per-pixel flux utilization. Combination with Flux Attenuation yields higher optimal attenuation fractions than synchronous methods. The photon-driven approach is particularly elegant as it requires no external synchronization hardware.

提出三種實際實作方式。SPAD 活躍時間最佳化推導出最佳活躍窗口時長，達到 RMSE 改善高達六倍。光子驅動偏移使用自由運行模式，週期長度根據光子偵測時序隨機變化，自動達成均勻偏移分布，並提供自適應的逐像素通量利用。結合光通量衰減可達到比同步方法更高的最佳衰減比例。光子驅動方法尤為優雅，因為它不需要外部同步硬體。

段落功能從理論到實作——提供三種可行的實現路徑。

邏輯角色光子驅動偏移是理論與實作的完美結合：堆積問題本身（環境光觸發 SPAD）被轉化為解決方案的一部分（觸發事件產生天然的隨機偏移）。

論證技巧 / 潛在漏洞「問題本身就是解決方案」的設計哲學極為優雅。但光子驅動模式的擷取時間可能因環境光強度而大幅變動，在實際系統中需要自適應控制。

6. Experiments -- 實驗

Hardware implementation uses a 405 nm pulsed laser, TCSPC module, and fast-gated SPAD operating at 10 MHz repetition frequency with 1000 histogram bins covering 15-meter unambiguous range. Results demonstrate sub-centimeter depth accuracy on test scenes under high ambient illumination (11 photons background per cycle at signal-to-background ratio of 0.02), with successful adaptation to varying surface reflectivities. The asynchronous schemes achieve depth accuracy improvements of up to an order of magnitude compared to synchronous baselines under identical ambient conditions.

硬體實作使用 405 奈米脈衝雷射、TCSPC 模組與 10 MHz 重複頻率的快速閘控 SPAD，以 1000 個直方圖時間格涵蓋 15 公尺無模糊範圍。結果展示在高環境光照下（每週期 11 個背景光子，訊雜比為 0.02）的測試場景中達到次公分等級的深度準確度，且能成功適應不同的表面反射率。在相同環境條件下，非同步方案相比同步基線達到了深度準確度提升高達一個數量級的成果。

段落功能提供硬體驗證——以實際 SPAD 系統確認理論預測。

邏輯角色訊雜比 0.02（訊號光子為背景的 2%）是極端惡劣的條件，在此條件下仍達到次公分精度，有力地證明了方法在實際環境中的可行性。

論證技巧 / 潛在漏洞以具體的硬體參數（405 nm、10 MHz、1000 bins）增添實驗的可重現性。但實驗場景似乎局限於靜態場景，對動態目標的適用性未被驗證。

7. Conclusion -- 結論

This work demonstrates that asynchronous acquisition strategies can fundamentally mitigate photon pileup in SPAD-based 3D imaging, achieving up to an order of magnitude improvement in depth accuracy under high ambient light. The generalized image formation model and denominator sequence framework provide theoretical tools for analyzing and optimizing any acquisition scheme. Future extensions to non-uniform shifting schemes for time-domain fluorescence imaging and non-line-of-sight applications are noted as promising directions.

本研究證明非同步擷取策略能從根本上緩解 SPAD 三維成像中的光子堆積，在高環境光下達到深度準確度提升一個數量級的成果。推廣的成像模型與分母序列框架提供了分析與最佳化任何擷取方案的理論工具。論文指出非均勻偏移方案用於時域螢光成像與非視線應用是有前景的未來方向。

段落功能總結全文——重申核心貢獻並指出應用延伸。

邏輯角色結論以「理論框架的通用性」作為最終定位，將具體的堆積問題提升至「任何擷取方案的設計原則」，拉高研究的影響力。

論證技巧 / 潛在漏洞螢光成像與非視線應用的延伸暗示了極廣的應用範圍。但從三維測距到螢光壽命量測的跨領域推廣，涉及不同的物理模型假設，實際可行性需要進一步驗證。

論證結構總覽

問題
環境光導致 SPAD
光子堆積失真

→

論點
非同步擷取
預防堆積效應

→

證據
深度精度提升
一個數量級

→

反駁
分母序列理論
證明均勻偏移最佳

→

結論
通用的擷取策略
設計原則

作者核心主張（一句話）

透過使 SPAD 量測窗口與雷射週期非同步化，以均勻偏移策略將環境光的堆積效應均勻分散至所有時間格，從根本上避免深度估計偏差。

論證最強處

理論與實作的完美融合：分母序列框架提供了分析任何擷取策略的統一數學工具，而光子驅動偏移則將堆積的「問題」（環境光觸發）巧妙地轉化為「解決方案」（天然的隨機偏移），在無需外部同步硬體的情況下自動實現理論最佳的均勻偏移。

論證最弱處

實驗場景的侷限性：驗證主要在受控的實驗室環境中進行，以靜態場景為主。在真實的室外環境中（環境光動態變化、多路徑干擾、不同材質的反射特性），方法的穩健性尚未被充分驗證。此外，逐像素的深度估計未利用空間正則化，在低訊噪比下可能產生空間不連續的雜訊。