Aerial detection1

Aerial Object Detection
HyeongJun Kwon
2019-2

Contents
1. ClusDet
2. RoI Transformer
3. SCRDet
4. GcGAN
5. CBAM

ClusDet
Objective: detect objects in images where they are sparse, non-uniformly distributed, and tend to be highly clustered in certain regions
Problem with existing methods:
- Objects are sparse, non-uniform, and highly clustered in certain regions
Proposed Method:
- Cluster Proposal Sub-network (CPNet)
- Scale Network

ClusDet
Cluster Proposal Sub-network (CPNet)
: like an RPN, but attached to the top of the feature extractor, because cluster regions need a large receptive field

ClusDet
ICM (Iterative Cluster Merging): algorithm for merging overlapping cluster regions
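
A cluster-merging step of this kind can be sketched as a loop that repeatedly takes the highest-scoring cluster box and absorbs every box that overlaps it. This is a minimal sketch, not the paper's reference implementation: the function names `iou`, `merge_pass`, and `merge_clusters`, the `(x1, y1, x2, y2)` box format, and the default threshold and cluster limit are all assumptions made for illustration.

```python
def iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def merge_pass(boxes, scores, thresh):
    """One pass: take the top-scoring box, absorb its overlaps, repeat."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    remaining = [(boxes[i], scores[i]) for i in order]
    merged_boxes, merged_scores = [], []
    while remaining:
        top_box, top_score = remaining.pop(0)
        group, keep = [top_box], []
        for box, score in remaining:
            if iou(top_box, box) > thresh:
                group.append(box)          # absorbed into the top box
            else:
                keep.append((box, score))  # left for a later iteration
        remaining = keep
        # The merged cluster is the bounding box of the whole group.
        merged_boxes.append((min(b[0] for b in group), min(b[1] for b in group),
                             max(b[2] for b in group), max(b[3] for b in group)))
        merged_scores.append(top_score)
    return merged_boxes, merged_scores


def merge_clusters(boxes, scores, thresh=0.5, max_clusters=3):
    """Merge once, then keep merging while too many clusters remain."""
    boxes, scores = merge_pass(boxes, scores, thresh)
    while len(boxes) > max_clusters:
        new_boxes, new_scores = merge_pass(boxes, scores, thresh)
        if len(new_boxes) == len(boxes):   # nothing left to merge
            break
        boxes, scores = new_boxes, new_scores
    return boxes, scores
```

Taking the union of the group, rather than discarding the lower-scoring boxes as NMS would, keeps every object of a cluster inside the resulting region.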

ClusDet
ScaleNet & Padding and Partition (PP)
: keeps objects at extreme scales from degrading detection performance

RoI Transformer
Network Overview

RoI Transformer
Objective: detect oriented and densely packed objects
Problems with existing methods:
- Expensive computation
- Features learned are not rotation-invariant
Proposed Method:
- RRoI learner
- Rotated Position Sensitive RoI pooling (RPS RoI pooling)

RoI Transformer
RRoI learner: for computational efficiency, each RRoI is matched with a rotated ground truth (RGT) before the angle target $t_\theta^*$ is determined
RPS RoI Align: rotation + PS RoI pooling + RoI Align
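
The regression targets that a rotated-RoI learner predicts, including the angle target, can be sketched as follows. This is a sketch under assumptions: it uses the usual relative-offset parameterization expressed in the RRoI's own coordinate frame, with the angle target taken modulo 2π; the function name `encode_rroi_offsets` and the `(x, y, w, h, theta)` box format are hypothetical and not taken from the slides.

```python
import math


def encode_rroi_offsets(rroi, rgt):
    """Offsets of a rotated ground truth relative to a matched rotated RoI.

    Both boxes are (x, y, w, h, theta), with theta in radians.
    """
    xr, yr, wr, hr, tr = rroi
    xg, yg, wg, hg, tg = rgt
    dx, dy = xg - xr, yg - yr
    cos_t, sin_t = math.cos(tr), math.sin(tr)
    # Centre offsets are measured along the RRoI's own axes.
    tx = (dx * cos_t + dy * sin_t) / wr
    ty = (dy * cos_t - dx * sin_t) / hr
    # Size offsets are log ratios, as in standard box regression.
    tw = math.log(wg / wr)
    th = math.log(hg / hr)
    # Angle offset is wrapped into [0, 1).
    t_theta = ((tg - tr) % (2 * math.pi)) / (2 * math.pi)
    return tx, ty, tw, th, t_theta
```

Because the offsets are expressed in the RRoI's frame, the targets depend only on the relative pose of the two boxes, not on their absolute orientation in the image.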

RoI Transformer
• Experiments :

SCRDet
Objective: detect oriented and densely packed objects
Challenges in this detection task:
- Small objects
- Cluttered arrangement
- Arbitrary orientation
Proposed Method:
- Sampling fusion network (SFNet) to address small objects
- Multi-dimensional attention network (MDANet) to suppress background noise

SCRDet
SFNet: module combining feature fusion and finer sampling
Feature fusion: combines low-level and high-level information, as in FPN, TDM, etc.
Finer sampling: a small anchor stride achieves a higher EMO (expected max overlapping) score than a large one
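
The effect of anchor stride on overlap can be illustrated numerically. The sketch below is a simplified stand-in for an EMO-style score, not the paper's computation: it assumes a single square anchor whose side equals the object's side, places the object centre uniformly within one stride cell, and averages the IoU with the nearest anchor. The function name `expected_max_iou` and its parameters are hypothetical.

```python
import numpy as np


def expected_max_iou(box_size, stride, samples=101):
    """Average best IoU between a square object and same-size grid anchors."""
    # Offsets of the object centre from the nearest anchor centre.
    offsets = np.abs(np.linspace(-stride / 2, stride / 2, samples))
    dx, dy = np.meshgrid(offsets, offsets)
    # Two equal squares shifted by (dx, dy) overlap in a smaller rectangle.
    inter = np.clip(box_size - dx, 0, None) * np.clip(box_size - dy, 0, None)
    union = 2 * box_size ** 2 - inter
    return float(np.mean(inter / union))


# Compare two strides for the same small object size.
for stride in (8, 16):
    print(stride, round(expected_max_iou(32, stride), 3))
```

In this simplified setting the smaller stride always gives the higher average, because the object centre can never be far from an anchor centre.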

SCRDet
MDANet: suppresses noise using pixel attention + channel attention
Channel attention: uses an SE module
Pixel attention: uses an Inception module; the attention loss is computed against a binary map derived from the RGT
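
The channel-attention branch can be sketched as a squeeze-and-excitation step. This is a minimal NumPy sketch of SE-style reweighting only, with the pixel-attention branch omitted; the function name `se_channel_attention`, the weight shapes, and the reduction ratio are assumptions for illustration, and the weights are random rather than learned.

```python
import numpy as np


def se_channel_attention(features, w1, w2):
    """Reweight channels of a (C, H, W) feature map.

    w1 has shape (C // r, C) and w2 has shape (C, C // r).
    """
    squeezed = features.mean(axis=(1, 2))            # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)          # bottleneck layer with ReLU
    weights = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))   # sigmoid gate per channel
    return features * weights[:, None, None], weights


rng = np.random.default_rng(0)
feats = rng.normal(size=(8, 4, 4))                   # C=8, reduction ratio r=4
out, gate = se_channel_attention(feats, rng.normal(size=(2, 8)), rng.normal(size=(8, 2)))
```

Each channel is scaled by a single value between 0 and 1, so uninformative channels can be suppressed without changing the spatial layout.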

SCRDet
IoU-smooth L1 loss: addresses the boundary discontinuity problem
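
The idea can be written as follows for the regression term alone. This is a sketch assuming the commonly cited form of the loss, in which the smooth L1 term supplies only the gradient direction and the IoU supplies the magnitude; the symbols $v'$ (predicted offsets), $v$ (target offsets), and $L_{reg}$ (smooth L1) are notation introduced here, not taken from the slides.

```latex
L_{reg}^{IoU}(v', v) = \frac{L_{reg}(v', v)}{\left| L_{reg}(v', v) \right|} \, \left| -\log(\mathrm{IoU}) \right|
```

Near the angular boundary the predicted and target boxes can overlap almost perfectly while their raw parameter difference is large; since $-\log(\mathrm{IoU})$ is close to 0 there, the loss stays small and the sudden jump disappears.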

GcGAN
Objective: infer the mapping between the source and target domains from their marginal distributions
- Existing constraints overlook a special characteristic of images
: geometric transformations do not change semantic structure
Proposed Method:
- Geometry consistency, which makes one-sided mapping possible

GcGAN
Geometric consistency constraints
$\mathcal{L}_{geo}(G_{XY}, G_{\tilde{X}\tilde{Y}}, X, Y)$
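
A geometry-consistency term of this kind can be sketched in NumPy. The sketch assumes the transformation f is a 90-degree rotation (one example of a geometric transformation), that a second generator translates the transformed images, and that the loss is the L1 distance between the two translations after undoing or applying f. The names `geometry_consistency_loss`, `g_xy`, and `g_xy_t` are hypothetical, and the toy generators are plain functions rather than networks.

```python
import numpy as np


def rotate(img):
    """f: rotate a 2-D image by 90 degrees counter-clockwise."""
    return np.rot90(img, k=1)


def rotate_back(img):
    """Inverse of f: rotate by 90 degrees clockwise."""
    return np.rot90(img, k=-1)


def geometry_consistency_loss(g_xy, g_xy_t, x):
    """L1 disagreement between translating x and translating f(x)."""
    y = g_xy(x)              # translation of the original image
    y_t = g_xy_t(rotate(x))  # translation of the transformed image
    return float(np.mean(np.abs(y - rotate_back(y_t)))
                 + np.mean(np.abs(y_t - rotate(y))))


x = np.arange(16, dtype=float).reshape(4, 4)
pixelwise = lambda img: 2.0 * img + 1.0          # commutes with rotation
row_cumsum = lambda img: np.cumsum(img, axis=0)  # does not commute with rotation
consistent = geometry_consistency_loss(pixelwise, pixelwise, x)
inconsistent = geometry_consistency_loss(pixelwise, row_cumsum, x)
```

The loss is zero when the two generators agree up to the transformation and positive otherwise, which is what lets the constraint replace the reverse mapping used by cycle-consistency approaches.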

CBAM (Convolutional Block Attention Module)
Network Overview

CBAM (Convolutional Block Attention Module)
Experiments on MS COCO

Result
Baseline (RoI Transformer with Faster R-CNN) on DOTA 1

Method         Plane  BD     Bridge GTF    SV     LV     Ship   TC     BC     ST     SBF    RA     Harbor SP     HC     mAP
Baseline       88.52  80.13  52.45  71.01  63.16  79.63  85.17  90.68  85.50  82.37  51.82  37.22  72.09  63.28  57.89  70.73
Baseline+CBAM  88.01  78.34  52.56  71.64  61.33  79.89  83.97  90.61  85.14  83.30  50.22  37.72  67.59  62.14  62.10  70.5
