What is: PSPNet?
Source | Pyramid Scene Parsing Network |
Year | 2016 |
Data Source | CC BY-SA - https://paperswithcode.com |
PSPNet, or Pyramid Scene Parsing Network, is a semantic segmentation model built around a pyramid pooling module, which exploits global context information through different-region-based context aggregation. The local and global clues together make the final prediction more reliable. We also propose an optimization
Given an input image, PSPNet uses a pretrained CNN with the dilated network strategy to extract the feature map. The final feature map is 1/8 the size of the input image. On top of this map, we use the pyramid pooling module to gather context information. With our 4-level pyramid, the pooling kernels cover the whole image, half of the image, and small portions of it. The pooled outputs are fused as the global prior. We then concatenate the prior with the original feature map. This is followed by a convolution layer that generates the final prediction map.
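The pooling-and-concatenation step described above can be sketched in plain Python on a toy single-channel feature map. This is only an illustration under stated assumptions, not the authors' implementation: the bin sizes (1, 2, 3, 6) follow the original paper rather than the text above, nearest-neighbour upsampling stands in for the bilinear interpolation used in the paper, and the per-level 1×1 convolution that reduces channel depth is omitted. The function names `adaptive_avg_pool`, `upsample_nearest`, and `pyramid_pooling` are hypothetical.

```python
def adaptive_avg_pool(fmap, bins):
    """Average-pool a 2D map (list of rows) into a bins x bins grid."""
    h, w = len(fmap), len(fmap[0])
    pooled = []
    for i in range(bins):
        # Region boundaries: floor for the start, ceil for the end.
        r0, r1 = (i * h) // bins, -((-(i + 1) * h) // bins)
        row = []
        for j in range(bins):
            c0, c1 = (j * w) // bins, -((-(j + 1) * w) // bins)
            cells = [fmap[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(cells) / len(cells))
        pooled.append(row)
    return pooled


def upsample_nearest(pooled, h, w):
    """Resize a bins x bins grid back to h x w (assumption: nearest neighbour)."""
    bins = len(pooled)
    return [[pooled[r * bins // h][c * bins // w] for c in range(w)]
            for r in range(h)]


def pyramid_pooling(fmap, bin_sizes=(1, 2, 3, 6)):
    """Return the original map plus one upsampled context map per pyramid level."""
    h, w = len(fmap), len(fmap[0])
    channels = [fmap]  # the original feature map is kept
    for bins in bin_sizes:
        context = adaptive_avg_pool(fmap, bins)
        channels.append(upsample_nearest(context, h, w))
    return channels  # stacking these along the channel axis is the concatenation


# Toy 6x6 feature map whose value at (r, c) is r * 6 + c.
fmap = [[float(r * 6 + c) for c in range(6)] for r in range(6)]
stacked = pyramid_pooling(fmap)
```

Here `stacked` holds five maps of the same spatial size: the input plus four context maps. The coarsest level (one bin) carries the global average at every position, while the finest level summarises small local regions. In a real network, each map would have many channels, and a convolution layer would turn the concatenated result into the prediction map.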