LogoAIAny

Multi-Scale Context Aggregation by Dilated Convolutions

This paper introduces a novel module for semantic segmentation using dilated convolutions, which enables exponential expansion of the receptive field without losing resolution. By aggregating multi-scale contextual information efficiently, the proposed context module significantly improves dense prediction accuracy when integrated into existing architectures. The work has had a lasting impact on dense prediction and semantic segmentation, laying the foundation for many modern segmentation models.

Introduction

State-of-the-art models for semantic segmentation are based on adaptations of convolutional networks originally designed for image classification. However, dense prediction problems like semantic segmentation differ structurally. This paper develops a new convolutional network module using dilated convolutions to systematically aggregate multi-scale contextual information without losing resolution. It supports exponential expansion of the receptive field and improves the accuracy of semantic segmentation systems. The authors also simplify classification networks adapted for dense prediction and demonstrate improved accuracy.

Information

  • Websitearxiv.org
  • AuthorsFisher Yu, Vladlen Koltun
  • Published date2015/11/23

More Items