rnjtsh
diff --git a/‎.gitignore
+204 b/‎.gitignore
+204
diff --git a/‎README.md
+64 b/‎README.md
+64
diff --git a/‎_init_paths.py
+15 b/‎_init_paths.py
+15
diff --git a/‎cfgs/res101.yml
+18 b/‎cfgs/res101.yml
+18
diff --git a/‎cfgs/res101_ls.yml
+22 b/‎cfgs/res101_ls.yml
+22
diff --git a/‎cfgs/res152_ls.yml
+24 b/‎cfgs/res152_ls.yml
+24
diff --git a/‎cfgs/res50.yml
+17 b/‎cfgs/res50.yml
+17
diff --git a/‎cfgs/res50_ls.yml
+21 b/‎cfgs/res50_ls.yml
+21
@@ -0,0 +1,204 @@
+data/*
+
+# READ THIS BEFORE YOU REFACTOR ME
+#
+# setup.py uses the list of patterns in this file to decide
+# what to delete, but it's not 100% sound.  So, for example,
+# if you delete aten/build/ because it's redundant with build/,
+# aten/build/ will stop being cleaned.  So be careful when
+# refactoring this file!
+
+## PyTorch
+test_rcnn.sh
+train_rcnn.sh
+.mypy_cache
+*.pyc
+*/*.pyc
+*/*.so*
+*/**/__pycache__
+*/**/*.dylib*
+*/**/*.pyc
+*/**/*.pyd
+*/**/*.so*
+*/**/**/*.pyc
+*/**/**/**/*.pyc
+*/**/**/**/**/*.pyc
+aten/build/
+aten/src/ATen/Config.h
+aten/src/ATen/cuda/CUDAConfig.h
+build/
+dist/
+docs/src/**/*
+test/.coverage
+test/cpp/api/mnist
+test/data/gpu_tensors.pt
+test/data/legacy_modules.t7
+test/data/legacy_serialized.pt
+test/data/linear.pt
+test/htmlcov
+third_party/build/
+tools/shared/_utils_internal.py
+torch.egg-info/
+torch/csrc/autograd/generated/*
+torch/csrc/cudnn/cuDNN.cpp
+torch/csrc/generated
+torch/csrc/generic/TensorMethods.cpp
+torch/csrc/jit/generated/*
+torch/csrc/nn/THCUNN.cpp
+torch/csrc/nn/THCUNN.cwrap
+torch/csrc/nn/THNN_generic.cpp
+torch/csrc/nn/THNN_generic.cwrap
+torch/csrc/nn/THNN_generic.h
+torch/csrc/nn/THNN.cpp
+torch/csrc/nn/THNN.cwrap
+torch/lib/*.a*
+torch/lib/*.dll*
+torch/lib/*.dylib*
+torch/lib/*.h
+torch/lib/*.lib
+torch/lib/*.so*
+torch/lib/build
+torch/lib/cmake
+torch/lib/include
+torch/lib/pkgconfig
+torch/lib/protoc
+torch/lib/tmp_install
+torch/lib/torch_shm_manager
+torch/version.py
+
+# IPython notebook checkpoints
+.ipynb_checkpoints
+
+# Editor temporaries
+*.swn
+*.swo
+*.swp
+*.swm
+*~
+
+# macOS dir files
+.DS_Store
+
+# Symbolic files
+tools/shared/cwrap_common.py
+
+# Ninja files
+.ninja_deps
+.ninja_log
+compile_commands.json
+*.egg-info/
+docs/source/scripts/activation_images/
+
+## General
+
+# Compiled Object files
+*.slo
+*.lo
+*.o
+*.cuo
+*.obj
+
+# Compiled Dynamic libraries
+*.so
+*.dylib
+*.dll
+
+# Compiled Static libraries
+*.lai
+*.la
+*.a
+*.lib
+
+# Compiled protocol buffers
+*.pb.h
+*.pb.cc
+*_pb2.py
+
+# Compiled python
+*.pyc
+*.pyd
+
+# Compiled MATLAB
+*.mex*
+
+# IPython notebook checkpoints
+.ipynb_checkpoints
+
+# Editor temporaries
+*.swn
+*.swo
+*.swp
+*~
+
+# Sublime Text settings
+*.sublime-workspace
+*.sublime-project
+
+# Eclipse Project settings
+*.*project
+.settings
+
+# QtCreator files
+*.user
+
+# PyCharm files
+.idea
+
+# Visual Studio Code files
+.vscode
+.vs
+
+# OSX dir files
+.DS_Store
+
+## Caffe2
+
+# build, distribute, and bins (+ python proto bindings)
+build
+build_host_protoc
+build_android
+build_ios
+/build_*
+.build_debug/*
+.build_release/*
+distribute/*
+*.testbin
+*.bin
+cmake_build
+.cmake_build
+gen
+.setuptools-cmake-build
+.pytest_cache
+aten/build/*
+
+# Bram
+plsdontbreak
+
+# Generated documentation
+docs/_site
+docs/gathered
+_site
+doxygen
+docs/dev
+
+# LevelDB files
+*.sst
+*.ldb
+LOCK
+LOG*
+CURRENT
+MANIFEST-*
+
+# generated version file
+caffe2/version.py
+
+# setup.py intermediates
+.eggs
+caffe2.egg-info
+
+# Atom/Watchman required file
+.watchmanconfig
+
+# cython generated files
+lib/model/utils/bbox.c
+lib/pycocotools/_mask.c
@@ -0,0 +1,64 @@
+# Graphical Object Detector in document images
+
+This repository contains end-to-end trainable deep learning based framework to localize graphical objects in the document images called as Graphical Object Detection (GOD). 
+
+This repository is built on [jwyang/faster-rcnn.pytorch](https://github.com/jwyang/faster-rcnn.pytorch). This implementation has the following features:
+- **It is pure Pytorch code**. Of course, there are some CUDA code.
+
+- **It supports multi-image batch training**.
+
+- **It supports multiple GPUs training**.
+
+The results of GOD on different datasets is listed in the paper.
+
+
+### Getting Started
+Clone the repo:
+```
+    git clone https://github.com/rnjtsh/graphical-object-detector/GOD.git
+```
+Then, create a folder:
+```
+    cd GOD && mkdir data
+```
+
+#### prerequisites
+- Python 2.7 or 3.6
+- Pytorch 0.4.0
+- CUDA 8.0 or higher
+
+
+#### Compilation
+The compilation is done as instructed by [jwyang/faster-rcnn.pytorch](https://github.com/jwyang/faster-rcnn.pytorch/blob/master/README.md#compilation).
+
+
+#### Dataset
+This repository uses the dataset in the same format as PASCAL VOC. But other format of datasets can also be adapted as done by [jwyang/faster-rcnn.pytorch](https://github.com/jwyang/faster-rcnn.pytorch). The dataset should be prepared as per the following tree structure.
+```
+    GODdevkit2019
+      ├── GOD2019
+          ├── JPEGImages
+          │   ├──  GOD001.jpg
+          │   ├──  GOD002.jpg
+          │   ├──  ...
+          ├── ImageSets
+          │   ├──  Main
+          │   │    ├──  train.txt
+          │   │    ├──  val.txt
+          │   │    ├──  test.txt
+          │   │    ├──  ...
+          └── Annotations
+              ├──  GOD001.xml
+              ├──  GOD002.xml
+              ├──  ...
+```
+
+#### Pretrained Models
+We used ImageNet pretrained weights (VGG16 and ResNets) from Caffe in our experiments. You can download these two models from:
+- [VGG16](https://drive.google.com/open?id=19UphT53C0Ua9JAtICnw84PPTa3sZZ_9k)
+- [ResNet50](https://drive.google.com/open?id=1wHSvusQ1CiEMc5Nx5R8adqoHQjIDWXl1), [ResNet101](https://drive.google.com/open?id=1x2fTMqLrn63EMW0VuK4GEa2eQKzvJ_7l), [ResNet152](https://drive.google.com/open?id=1NSCycOb7pU0KzluH326zmyMFUU55JslF)
+
+
+Download them and put them into the ```data/pretrained_model/```.
+
+**If you want to use pytorch pre-trained models, please remember to transpose images from BGR to RGB, and also use the same data transformer (minus mean and normalize) as used in pretrained model.**
@@ -0,0 +1,15 @@
+import os.path as osp
+import sys
+
+def add_path(path):
+    if path not in sys.path:
+        sys.path.insert(0, path)
+
+this_dir = osp.dirname(__file__)
+
+# Add lib to PYTHONPATH
+lib_path = osp.join(this_dir, 'lib')
+add_path(lib_path)
+
+coco_path = osp.join(this_dir, 'data', 'coco', 'PythonAPI')
+add_path(coco_path)
@@ -0,0 +1,18 @@
+EXP_DIR: res101
+TRAIN:
+  HAS_RPN: True
+  BBOX_NORMALIZE_TARGETS_PRECOMPUTED: True
+  RPN_POSITIVE_OVERLAP: 0.7
+  RPN_BATCHSIZE: 256
+  PROPOSAL_METHOD: gt
+  BG_THRESH_LO: 0.0
+  DISPLAY: 20
+  BATCH_SIZE: 128
+  WEIGHT_DECAY: 0.0001
+  DOUBLE_BIAS: False
+  LEARNING_RATE: 0.001
+TEST:
+  HAS_RPN: True
+POOLING_SIZE: 7
+POOLING_MODE: align
+CROP_RESIZE_WITH_MAX_POOL: False
@@ -0,0 +1,22 @@
+EXP_DIR: res101
+TRAIN:
+  HAS_RPN: True
+  BBOX_NORMALIZE_TARGETS_PRECOMPUTED: True
+  RPN_POSITIVE_OVERLAP: 0.7
+  RPN_BATCHSIZE: 256
+  PROPOSAL_METHOD: gt
+  BG_THRESH_LO: 0.0
+  DISPLAY: 20
+  BATCH_SIZE: 128
+  WEIGHT_DECAY: 0.0001
+  SCALES: [800]
+  DOUBLE_BIAS: False
+  LEARNING_RATE: 0.001
+TEST:
+  HAS_RPN: True
+  SCALES: [800]
+  MAX_SIZE: 1200
+  RPN_POST_NMS_TOP_N: 1000
+POOLING_SIZE: 7
+POOLING_MODE: align
+CROP_RESIZE_WITH_MAX_POOL: False
@@ -0,0 +1,24 @@
+EXP_DIR: res152
+TRAIN:
+  HAS_RPN: True
+  # IMS_PER_BATCH: 1
+  BBOX_NORMALIZE_TARGETS_PRECOMPUTED: True
+  RPN_POSITIVE_OVERLAP: 0.7
+  RPN_BATCHSIZE: 256
+  PROPOSAL_METHOD: gt
+  BG_THRESH_LO: 0.0
+  DISPLAY: 20
+  BATCH_SIZE: 128
+  WEIGHT_DECAY: 0.0001
+  DOUBLE_BIAS: False
+  SNAPSHOT_PREFIX: res152_faster_rcnn
+  LEARNING_RATE: 0.001
+  SCALES: [800]
+TEST:
+  HAS_RPN: True
+  SCALES: [800]
+  MAX_SIZE: 1200
+  RPN_POST_NMS_TOP_N: 1000
+POOLING_SIZE: 7
+POOLING_MODE: align
+CROP_RESIZE_WITH_MAX_POOL: False
@@ -0,0 +1,17 @@
+EXP_DIR: res50
+TRAIN:
+  HAS_RPN: True
+  # IMS_PER_BATCH: 1
+  BBOX_NORMALIZE_TARGETS_PRECOMPUTED: True
+  RPN_POSITIVE_OVERLAP: 0.7
+  RPN_BATCHSIZE: 256
+  PROPOSAL_METHOD: gt
+  BG_THRESH_LO: 0.0
+  DISPLAY: 20
+  BATCH_SIZE: 256
+  WEIGHT_DECAY: 0.0001
+  DOUBLE_BIAS: False
+  SNAPSHOT_PREFIX: res50_faster_rcnn
+TEST:
+  HAS_RPN: True
+POOLING_MODE: crop
@@ -0,0 +1,21 @@
+EXP_DIR: res50
+TRAIN:
+  HAS_RPN: True
+  # IMS_PER_BATCH: 1
+  BBOX_NORMALIZE_TARGETS_PRECOMPUTED: True
+  RPN_POSITIVE_OVERLAP: 0.7
+  RPN_BATCHSIZE: 256
+  PROPOSAL_METHOD: gt
+  BG_THRESH_LO: 0.0
+  DISPLAY: 20
+  BATCH_SIZE: 256
+  SCALES: [800]
+  WEIGHT_DECAY: 0.0001
+  DOUBLE_BIAS: False
+  SNAPSHOT_PREFIX: res50_faster_rcnn
+TEST:
+  HAS_RPN: True
+  SCALES: [800]
+  MAX_SIZE: 1200
+  RPN_POST_NMS_TOP_N: 1000
+POOLING_MODE: crop