You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repository contains end-to-end trainable deep learning based framework to localize graphical objects in the document images called as Graphical Object Detection (GOD).
4
+
5
+
This repository is built on [jwyang/faster-rcnn.pytorch](https://github.com/jwyang/faster-rcnn.pytorch). This implementation has the following features:
6
+
-**It is pure Pytorch code**. Of course, there are some CUDA code.
7
+
8
+
-**It supports multi-image batch training**.
9
+
10
+
-**It supports multiple GPUs training**.
11
+
12
+
The results of GOD on different datasets is listed in the paper.
The compilation is done as instructed by [jwyang/faster-rcnn.pytorch](https://github.com/jwyang/faster-rcnn.pytorch/blob/master/README.md#compilation).
33
+
34
+
35
+
#### Dataset
36
+
This repository uses the dataset in the same format as PASCAL VOC. But other format of datasets can also be adapted as done by [jwyang/faster-rcnn.pytorch](https://github.com/jwyang/faster-rcnn.pytorch). The dataset should be prepared as per the following tree structure.
37
+
```
38
+
GODdevkit2019
39
+
├── GOD2019
40
+
├── JPEGImages
41
+
│ ├── GOD001.jpg
42
+
│ ├── GOD002.jpg
43
+
│ ├── ...
44
+
├── ImageSets
45
+
│ ├── Main
46
+
│ │ ├── train.txt
47
+
│ │ ├── val.txt
48
+
│ │ ├── test.txt
49
+
│ │ ├── ...
50
+
└── Annotations
51
+
├── GOD001.xml
52
+
├── GOD002.xml
53
+
├── ...
54
+
```
55
+
56
+
#### Pretrained Models
57
+
We used ImageNet pretrained weights (VGG16 and ResNets) from Caffe in our experiments. You can download these two models from:
Download them and put them into the ```data/pretrained_model/```.
63
+
64
+
**If you want to use pytorch pre-trained models, please remember to transpose images from BGR to RGB, and also use the same data transformer (minus mean and normalize) as used in pretrained model.**
0 commit comments