You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -264,7 +264,7 @@ This params are for dual Titan RTX cards with NVIDIA Apex installed:
264
264
265
265
### SE-ResNeXt-26-D and SE-ResNeXt-26-T
266
266
These hparams (or similar) work well for a wide range of ResNet architecture, generally a good idea to increase the epoch # as the model size increases... ie approx 180-200 for ResNe(X)t50, and 220+ for larger. Increase batch size and LR proportionally for better GPUs or with AMP enabled. These params were for 2 1080Ti cards:
### EfficientNet-B3 with RandAugment - 81.5 top-1, 95.7 top-5
@@ -279,13 +279,14 @@ The training of this model started with the same command line as EfficientNet-B2
279
279
280
280
All development and testing has been done in Conda Python 3 environments on Linux x86-64 systems, specifically Python 3.6.x and 3.7.x. Little to no care has been taken to be Python 2.x friendly and I don't plan to support it. If you run into any challenges running on Windows, or other OS, I'm definitely open to looking into those issues so long as it's in a reproducible (read Conda) environment.
281
281
282
-
PyTorch versions 1.0 and 1.1 have been tested with this code.
282
+
PyTorch versions 1.2 and 1.3.1 have been tested with this code.
283
283
284
284
I've tried to keep the dependencies minimal, the setup is as per the PyTorch default install instructions for Conda:
@@ -331,5 +332,4 @@ To run inference from a checkpoint:
331
332
## TODO
332
333
A number of additions planned in the future for various projects, incl
333
334
* Do a model performance (speed + accuracy) benchmarking across all models (make runable as script)
334
-
* Add usage examples to comments, good hyper params for training
335
-
* Comments, cleanup and the usual things that get pushed back
335
+
* Complete feature map extraction across all model types and build obj detection/segmentation models and scripts (or integrate backbones with mmdetection, detectron2)
0 commit comments