摘要 1. 緒論 2. 方法 3. 微邊緣 4. 實驗 5. 結論 論證總覽

Abstract — 摘要

Computing gradients of visibility with respect to scene parameters is a long-standing challenge in differentiable rendering. Existing approaches either approximate the rendering process to make it differentiable, or rely on auxiliary data structures that add complexity. We present Rasterized Edge Gradients (REG), a method that computes accurate gradients at visibility discontinuities directly within a standard rasterization pipeline. Our key idea is to introduce micro-edges — infinitesimal edge segments along silhouette boundaries — that allow us to treat the rasterized image as the outcome of a differentiable, continuous process while remaining fully compatible with discrete-pixel rasterization. REG requires no modifications to the forward rendering pass and is applicable to masks, depth, normals, and color images.
計算可見性相對於場景參數的梯度是可微分渲染中的長期挑戰。現有方法要麼對渲染過程進行近似使其可微分,要麼依賴增加複雜度的輔助資料結構。我們提出光柵化邊緣梯度(REG),一種在標準光柵化管線中直接於可見性不連續處計算精確梯度的方法。核心概念是引入微邊緣——沿輪廓邊界的無窮小邊段——使我們得以將光柵化影像視為可微分連續過程的結果,同時完全相容於離散像素光柵化。REG 無需修改前向渲染流程,且適用於遮罩、深度、法向量和彩色影像
段落功能 全文總覽——定義問題、提出解決方案、列舉關鍵特性。
邏輯角色 以「長期挑戰」開場確立問題的重要性,再以對比現有方法的不足引出 REG 方法,最後以「無需修改前向流程」的實用性收束。
論證技巧 / 潛在漏洞 以「長期挑戰」定位增強了研究價值感。「無需修改前向渲染」是極具吸引力的特性,但尚未提及計算開銷。

1. Introduction — 緒論

Differentiable rendering has become a cornerstone of modern computer vision and graphics, enabling inverse rendering, 3D reconstruction from images, neural radiance fields, and physics-based material estimation. The fundamental requirement is to compute gradients of the rendered image with respect to scene parameters such as vertex positions, camera parameters, and material properties. While gradients at smooth, continuous regions of the image are straightforward to compute, the challenge lies at visibility discontinuities — silhouette edges where objects occlude each other. At these boundaries, small changes in scene parameters cause discrete pixel ownership changes, creating step-function discontinuities where standard differentiation fails.
可微分渲染已成為現代電腦視覺與圖學的基石,使得逆向渲染、從影像進行三維重建、神經輻射場以及基於物理的材質估計得以實現。基本需求是計算渲染影像相對於頂點位置、相機參數及材質屬性等場景參數的梯度。儘管在影像平滑連續區域的梯度計算相對直觀,但挑戰在於可見性不連續處——物體相互遮擋的輪廓邊緣。在這些邊界上,場景參數的微小變化會導致離散的像素歸屬改變,產生階躍函數不連續性,使標準微分失效。
段落功能 建立研究場域——介紹可微分渲染的重要性及其核心瓶頸。
邏輯角色 論證鏈的起點:先確立可微分渲染的廣泛應用,再聚焦到可見性不連續的根本問題。
論證技巧 / 潛在漏洞 列舉四大應用領域有效建立了問題的重要性。對「階躍函數不連續性」的精確描述展現了數學嚴謹性。
Existing approaches to differentiable rendering can be broadly categorized into two families. The first family modifies the forward rendering process by blurring or softening visibility, effectively replacing the hard visibility function with a smooth approximation. While this makes gradients available everywhere, it introduces artifacts in the rendered images and biased gradients. The second family uses edge-sampling techniques that explicitly identify silhouette edges and compute gradients along them. These methods produce accurate gradients but require additional data structures, ray casting, or multi-pass rendering, significantly increasing complexity and computational cost. Our approach, REG, combines the simplicity of rasterization-based methods with the accuracy of edge-based gradient computation.
現有可微分渲染方法大致可歸為兩類。第一類修改前向渲染過程,透過模糊或軟化可見性,將硬可見性函數替換為平滑近似。雖然這使得梯度在各處均可得,但會在渲染影像中引入瑕疵和有偏梯度。第二類使用邊緣取樣技術,明確辨識輪廓邊緣並沿邊緣計算梯度。這些方法能產生精確梯度,但需要額外的資料結構、光線投射或多次渲染,顯著增加複雜度與計算成本。我們的方法 REG 結合了光柵化方法的簡潔性與基於邊緣的梯度計算之精確性
段落功能 分析現有方法的不足——建立「簡潔性 vs 精確性」的兩難局面。
邏輯角色 典型的「兩類現有方法各有不足」論證模式,為 REG 的定位(兩者優點的結合)做鋪墊。
論證技巧 / 潛在漏洞 二分法的歸納清晰有效,但現有方法的分類可能過於簡化,部分方法可能兼具兩類特性。

2. Method — 方法

The core idea of REG is to decompose the gradient computation at visibility discontinuities into a sum of contributions from micro-edges. Consider a silhouette edge in the rendered image where a foreground triangle occludes a background triangle. As the silhouette edge moves by an infinitesimal amount, some pixels transition from showing the background to showing the foreground (or vice versa). The gradient contribution from each such pixel transition can be computed as the product of the pixel's value difference (foreground minus background) and the rate at which the edge sweeps across the pixel. By aggregating these contributions along the entire edge, we obtain the correct visibility gradient.
REG 的核心概念是將可見性不連續處的梯度計算分解為微邊緣貢獻的總和。考慮渲染影像中一條輪廓邊緣,其中前景三角形遮擋背景三角形。當輪廓邊緣移動無窮小量時,部分像素從顯示背景轉為顯示前景(或反之)。每個像素轉換的梯度貢獻可計算為像素值差(前景減去背景)與邊緣掃過像素之速率的乘積。沿整條邊緣聚合這些貢獻,即可得到正確的可見性梯度。
段落功能 闡述核心演算法——微邊緣梯度分解的數學直覺。
邏輯角色 方法論的基石段落,用直觀的幾何描述解釋微邊緣的工作原理。
論證技巧 / 潛在漏洞 以「無窮小移動」的思想實驗引入梯度概念,數學直覺清晰。但實際實現中邊緣的離散化可能引入誤差。

3. Micro-Edges — 微邊緣

We introduce the concept of micro-edges to make this idea practical. A micro-edge is a small segment of a silhouette edge that passes through a single pixel. For each pixel that lies on a silhouette boundary, we identify the micro-edge passing through it and compute: (1) the local edge direction, which determines the axis of the gradient; (2) the foreground and background values at that pixel from the two adjacent triangles; and (3) the edge's sensitivity to each scene parameter, obtained from the screen-space projection of the mesh edge. The final gradient for each parameter is the sum of all micro-edge contributions across all silhouette pixels.
我們引入微邊緣的概念以使此想法付諸實踐。微邊緣是穿過單一像素的輪廓邊緣小段。對於位於輪廓邊界上的每個像素,我們辨識穿過它的微邊緣並計算:(1) 局部邊緣方向,決定梯度的軸向;(2) 來自兩個相鄰三角形的該像素處前景與背景值;(3) 邊緣對每個場景參數的靈敏度,由網格邊緣的螢幕空間投影得出。每個參數的最終梯度為所有輪廓像素上所有微邊緣貢獻的總和
段落功能 細化核心概念——定義微邊緣的具體計算步驟。
邏輯角色 從抽象概念到具體演算法,三步計算流程清晰可循。
論證技巧 / 潛在漏洞 分步描述使方法易於理解和實現。但三角形相交等特殊情況的處理尚未在此闡述。
A critical advantage of the micro-edge formulation is that it works entirely within the rasterization framework. All information needed — silhouette pixel locations, foreground/background values, and edge directions — can be extracted from a standard rasterization pass with an additional silhouette detection step. Unlike ray-tracing-based methods, REG does not require shooting additional rays or maintaining acceleration structures. Furthermore, REG handles geometry intersections correctly, which many existing methods struggle with, by treating intersection edges as additional silhouette edges with their own micro-edge decomposition.
微邊緣公式化的一個關鍵優勢在於它完全在光柵化框架內運作。所有所需資訊——輪廓像素位置、前景/背景值和邊緣方向——均可從標準光柵化流程中擷取,僅需一個額外的輪廓偵測步驟。不同於基於光線追蹤的方法,REG 不需要發射額外光線或維護加速結構。此外,REG 能正確處理幾何相交,這是許多現有方法所困難的,其做法是將相交邊緣視為具有自身微邊緣分解的額外輪廓邊緣。
段落功能 強調方法優勢——與現有方法的差異化比較。
邏輯角色 回應前文提出的兩類方法各自的不足,論證 REG 確實結合了兩者的優點。
論證技巧 / 潛在漏洞 「無需額外光線」和「正確處理幾何相交」是強有力的差異化論點。但「額外的輪廓偵測步驟」的成本仍待量化。

4. Experiments — 實驗

We evaluate REG on dynamic human head scene reconstruction, a challenging application that requires accurate gradients for both camera images and segmentation masks. Our method successfully reconstructs fine geometric details including hair strands, ear geometry, and facial wrinkles by leveraging accurate silhouette gradients. Compared to SoftRas (soft rasterization approach), our method produces sharper reconstructions with fewer artifacts. Compared to Nvdiffrast (an edge-sampling baseline), REG achieves comparable accuracy while being simpler to implement and requiring no auxiliary ray casting. Quantitatively, REG achieves a mean vertex error of 0.83mm on our benchmark, compared to 1.12mm for SoftRas and 0.87mm for Nvdiffrast.
我們在動態人頭場景重建上評估 REG,這是一個需要精確梯度以處理相機影像和分割遮罩的挑戰性應用。我們的方法利用精確的輪廓梯度,成功重建了包括髮絲、耳部幾何和臉部皺紋在內的精細幾何細節。與 SoftRas(軟光柵化方法)相比,我們的方法產生更銳利的重建結果且瑕疵更少。與 Nvdiffrast(邊緣取樣基線)相比,REG 達到了相當的精確度,同時實現更簡潔且無需輔助光線投射。在量化指標上,REG 在基準測試上達到平均頂點誤差 0.83mm,相較之下 SoftRas 為 1.12mm,Nvdiffrast 為 0.87mm。
段落功能 提供核心實證——以動態人頭重建的量化結果驗證方法效能。
邏輯角色 將方法論主張轉化為可量化的效能數據,以具體應用場景證明 REG 的優勢。
論證技巧 / 潛在漏洞 0.83mm vs 1.12mm 的改進顯著。但僅在人頭重建上驗證,對其他場景(建築、車輛等)的推廣性尚待確認。

5. Conclusion — 結論

We have presented Rasterized Edge Gradients (REG), a novel approach to computing visibility gradients in differentiable rendering that combines the simplicity and efficiency of rasterization with accurate edge-based gradient computation. Through the introduction of micro-edges, our method requires no modifications to the forward rendering pass and supports gradients for arbitrary image quantities including masks, depth, normals, and color. Our method correctly handles geometry intersections and achieves state-of-the-art reconstruction quality on dynamic human head scenes. We believe REG provides a practical and principled foundation for differentiable rendering in production pipelines.
我們提出了光柵化邊緣梯度(REG),一種可微分渲染中計算可見性梯度的新方法,結合了光柵化的簡潔高效與基於邊緣的精確梯度計算。透過引入微邊緣,我們的方法無需修改前向渲染流程,並支援任意影像量(包括遮罩、深度、法向量和色彩)的梯度。方法正確處理幾何相交,並在動態人頭場景上達到最先進的重建品質。我們認為 REG 為生產管線中的可微分渲染提供了實用且有原則性的基礎
段落功能 總結全文——重申方法特性與應用前景。
邏輯角色 以「生產管線的基礎」收束,將學術研究連結到工業應用價值。
論證技巧 / 潛在漏洞 「實用且有原則性」的定位恰如其分。作為 Honorable Mention,此研究在理論與工程間取得了優秀平衡。

論證結構總覽

問題
可見性不連續處梯度不可計算
論點
微邊緣可在光柵化中精確計算梯度
方法
微邊緣分解 + 貢獻聚合
證據
人頭重建誤差降至 0.83mm
結論
生產管線的可微分渲染基礎

核心主張

透過微邊緣概念,可在標準光柵化管線中無損地計算可見性不連續處的精確梯度,無需修改前向渲染流程。

論證最強處

方法的數學嚴謹性與工程簡潔性兼具:不僅有理論保證,且能無縫整合至現有渲染管線,降低採用門檻。

論證最弱處

實驗驗證集中於人頭重建單一場景類型,對更多樣化的場景和大規模網格的效能與可擴展性尚未充分論證。

核心論點 / Thesis
關鍵概念 / 術語
實證證據 / 資料
讓步 / 反駁處理
方法論說明