
Abstract

Large Convolutional Network models have recently demonstrated impressive classification performance on the ImageNet benchmark. However there is no clear understanding of why they perform so well, or how they might be improved. In this paper we address both issues. We introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of the classifier. Used in a diagnostic role, these visualizations allow us to find model architectures that outperform Krizhevsky et al. on the ImageNet classification benchmark.
Paragraph function: identifies the "black box" problem of deep learning and offers visualization as the remedy.
Logical role: the tension of "performs well but is not understood" motivates the research; visualization serves both as a tool for understanding and as a means of improvement.
Argumentative technique / potential weakness: by pairing interpretability with performance gains, the paper neatly satisfies both academia's demand for "understanding" and industry's demand for "performance".

1. Introduction

Since their introduction by LeCun et al. in the early 1990s, Convolutional Networks (convnets) have demonstrated excellent performance at tasks such as hand-written digit classification and face detection. In the last year, several papers have shown that they can also deliver outstanding performance on more challenging visual classification tasks. Krizhevsky et al. achieved a convincing win in the ImageNet 2012 classification challenge, with their large convnet achieving significant improvements over previous state-of-the-art.
Paragraph function: reviews the history of convnets, highlighting the AlexNet breakthrough.
Logical role: establishes the context of AlexNet's success, setting up the reversal "but we do not understand why".
Argumentative technique / potential weakness: tracing a line from LeCun to Krizhevsky gives the work historical context and a sense of scholarly lineage.
There are several factors behind this renewed interest in convnet models: (i) the availability of much larger training sets with millions of labeled examples; (ii) powerful GPU implementations that make the training of very large models practical; and (iii) better model regularization strategies, such as Dropout. Despite this progress, there is still little insight into the internal operation and behavior of these complex models, or how they achieve such good performance. Without this understanding, the development of better models is reduced to trial-and-error.
Paragraph function: lists three factors behind the deep learning revival, then points out the interpretability gap.
Logical role: the "trial and error" warning hints at the fragility of the current research paradigm, underscoring the need for visualization tools.
Argumentative technique / potential weakness: equating a lack of understanding with "trial and error" is somewhat overstated; many architectural choices rest on theoretical intuition.

2. Deconvolution Visualization

We use a multi-layered Deconvolutional Network (deconvnet) to project the feature activations back to the input pixel space. A deconvnet can be thought of as a convnet model that uses the same components (filtering, pooling) but in reverse, so instead of mapping pixels to features, it maps features to pixels. To examine a given convnet activation, we set all other activations in the layer to zero and pass the feature maps as input to the attached deconvnet layer. Then we successively (i) unpool, (ii) rectify, and (iii) filter to reconstruct the activity in the layer beneath.
Paragraph function: details the three-step procedure of deconvolution visualization.
Logical role: this method is the technical core of the paper, providing the tool for all subsequent analysis and findings.
Argumentative technique / potential weakness: the symmetric "forward/reverse" description makes the method intuitive, but the relationship between the fidelity of a deconvnet reconstruction and its explanatory power is not self-evident.
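The unpool → rectify → filter sequence can be sketched for a single 2D feature map. The following is a minimal, illustrative numpy version (toy 2×2 pooling, stride-1 filtering; the function names and shapes are our assumptions, not the paper's implementation):

```python
import numpy as np

def max_pool_with_switches(x, k=2):
    """Forward k*k max pooling that also records the 'switch' locations
    (the argmax positions), which the deconvnet needs for unpooling."""
    h, w = x.shape
    pooled = np.zeros((h // k, w // k))
    switches = np.zeros((h // k, w // k, 2), dtype=int)
    for i in range(h // k):
        for j in range(w // k):
            patch = x[i * k:(i + 1) * k, j * k:(j + 1) * k]
            r, c = np.unravel_index(np.argmax(patch), patch.shape)
            pooled[i, j] = patch[r, c]
            switches[i, j] = (i * k + r, j * k + c)
    return pooled, switches

def unpool(pooled, switches, shape):
    """(i) Unpool: place each pooled value back at its recorded switch."""
    out = np.zeros(shape)
    for i in range(pooled.shape[0]):
        for j in range(pooled.shape[1]):
            r, c = switches[i, j]
            out[r, c] = pooled[i, j]
    return out

def rectify(x):
    """(ii) Rectify: pass the reconstruction through a ReLU."""
    return np.maximum(x, 0.0)

def transposed_filter(x, f):
    """(iii) Filter: apply the transposed filter, i.e. a 'full'
    convolution with the kernel flipped in both spatial axes."""
    ff = f[::-1, ::-1]
    kh, kw = ff.shape
    padded = np.pad(x, ((kh - 1, kh - 1), (kw - 1, kw - 1)))
    out = np.zeros((x.shape[0] + kh - 1, x.shape[1] + kw - 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(padded[i:i + kh, j:j + kw] * ff)
    return out
```

To examine one activation, the paper first zeroes every other activation in the layer, then applies these three steps repeatedly until pixel space is reached.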

3. Feature Analysis

Using our visualization technique, we reveal the hierarchical nature of features in a convnet. Layers 1 and 2 respond to low-level features such as edges, corners, and color conjunctions. Layer 3 captures more complex textures and patterns. Layer 4 shows significant variation and begins to capture class-specific features (e.g., dog faces, bird legs). Layer 5 captures entire objects with significant pose variation. This progression from simple to complex features across layers confirms the intuition that deep networks learn increasingly abstract representations.
Paragraph function: presents the classic finding of a feature hierarchy through layer-by-layer analysis.
Logical role: this finding is the paper's most important contribution, providing the first experimental verification of the hypothesis that "deep learning learns hierarchical features".
Argumentative technique / potential weakness: concrete visual examples (dog faces, bird legs) make the abstract notion of a "feature hierarchy" tangible and highly memorable.
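The paper's per-layer figures are built by finding the inputs that most strongly excite a given feature channel. A tiny sketch of that ranking step (the array layout and function name are our assumptions, not the authors' code):

```python
import numpy as np

def strongest_examples(activations, channel, k=9):
    """Return indices of the k images whose peak response in `channel`
    is largest; the paper visualizes the top 9 patches per feature.

    activations: (N, C, H, W) feature maps for N images.
    """
    resp = activations[:, channel].reshape(len(activations), -1)
    peak = resp.max(axis=1)  # one peak activation per image
    return np.argsort(peak)[::-1][:k]
```

Each selected image's activation is then projected back to pixel space with the deconvnet, which is what produces the edge → texture → object progression described above.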

4. Experiments

Guided by the visualization, we modified the architecture of Krizhevsky et al. by (i) reducing the first layer filter size from 11x11 to 7x7 and (ii) reducing the stride from 4 to 2. These changes were motivated by observing that the first layer filters of the original model contained a mixture of high and low frequency information with aliasing artifacts. Our modified architecture achieves a top-5 error rate of 14.8% on the ImageNet 2012 validation set, compared to 16.4% for Krizhevsky et al.
Paragraph function: shows how the visualization directly guided architectural improvements.
Logical role: this paragraph closes the full loop of "visualize → gain insight → improve → validate", the climax of the paper's argument.
Argumentative technique / potential weakness: the 1.6 percentage point improvement is substantial, and since the motivation came directly from the visualization, the chain of argument is complete and compelling.
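A quick way to see the effect of the two changes is the first layer's output-size arithmetic: halving the stride samples the input roughly twice as densely, which reduces aliasing. A minimal sketch (padding omitted, so the numbers are illustrative rather than the paper's exact feature-map sizes):

```python
def conv_output_size(input_size, kernel, stride, pad=0):
    """Spatial size of a convolution's output feature map."""
    return (input_size + 2 * pad - kernel) // stride + 1

# First layer on a 224x224 input, ignoring padding:
alexnet_out = conv_output_size(224, 11, 4)  # 11x11 filters, stride 4
zfnet_out = conv_output_size(224, 7, 2)     # 7x7 filters, stride 2: denser sampling
```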

5. Conclusion

We have explored large convnet models using a novel visualization technique that reveals the hierarchical feature representations learned at each layer. The visualizations provide a non-trivial understanding of what a convnet learns and identify potential issues such as aliasing in the first layer. Guided by these visualizations, we constructed an architecture (ZFNet) that achieves state-of-the-art results on several benchmarks. Our work demonstrates that understanding these models is essential for building better ones.
Paragraph function: summarizes the value of the visualizations and their concrete contribution to architectural improvement.
Logical role: restates the paper's central position with the proposition "understanding is a prerequisite for improvement".
Argumentative technique / potential weakness: the concluding message is clear and forceful, but subsequent developments in deep learning show that many major advances still came from empirical exploration.

Argument structure overview

Convnet black box (performs well but uninterpretable) → deconvolution visualization (feature→pixel projection) → hierarchical feature discovery (edges → textures → objects) → architecture diagnosis and improvement (7x7 filters / stride 2) → ZFNet (top-5: 14.8%)

Core claim

Use the deconvolution visualization technique to reveal the hierarchical feature representations learned at each layer of a convnet, and use that understanding to directly guide architectural improvements.

Strongest argument

The complete loop of "visualize → identify a problem → modify the architecture → improve performance" is convincing, and the layer-by-layer feature visualizations have had a lasting influence on the field.

Weakest link

Deconvolution visualization only reveals the maximally stimulating pattern for a single activation, and does not necessarily capture the network's full decision-making process. Moreover, the causal claim that "understanding = improvement" has not fully held up in subsequent research.
