SlideShare a Scribd company logo
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Introduction to GluonCV
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Why GluonCV?
• What is the biggest challenge you have ever encountered with deep
learning?
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Why GluonCV?
• What is the biggest challenge you have ever encountered with deep
learning?
• “reproducing the best claimed results from latest papers”
SOTA
state-of-the-art
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Real-world Stories #1
• Back to a period in 2016, the same ImageNet models trained by MXNet
achieved on average 1% worse accuracy compared to Torch.
• Tried almost everything to debug, even developed a plugin to run Torch
code inside MXNet to make it easier to compare results.
=> Transcoding training images using 95 JPEG quality rather than 85 solved
the problem.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Real-world Stories #2
• Using another open source DL framework: trained model accuracies
cannot match previous internal version.
• Spent months to figure out why, with no clue.
=> The order of data augmentation is different from previous version.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
• I will write clean and reusable code
when I’m prototyping this time.
• Variant:
• - I will write clean and reusable code
next time.
Common myth 1
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Common myth 2
• My code will still run next year.
• Sometimes, it’s not our fault.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Common myth3
• I will finish setting up the
baseline model this afternoon.
• Though it may not be our fault
again.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Starting from scratch can be hard
• Even the most talented researchers will get blocked by trivial things.
• Experience and instincts can be your enemies in certain circumstances.
• Training is time-consuming, initialization and augmentation is
randomized, and many implementation details need to be taken care of.
=> Debugging deep learning models is extremely difficult.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
It’s not easy to embrace open-source implementations
• Often the quality of open-source implementations vary.
• Languages, code styles, project structures, DL frameworks are mixed.
• Personal projects tend to focusing on a specific task with specific
datasets. It requires significant engineering efforts to adapt to your use
case.
• Community projects can be abandoned frequently.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
What does GluonCV provide
• Reproduction of important papers in recent years
• Model zoo with 80+ pre-trained models
• Training scripts (as well as tuned hyper-parameters) to
reproduce the results
• Full training script + Dataset download script
• Logs of training run
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
What does GluonCV provide
• Considerate APIs and modules that are easy to follow and
understand
• Avoid re-writing the same utilities again and again
• Pre-set data augmentation and transforms, visualization and
training utilities
• Community support, feel free to ask and discuss
• User forum
• Github community and open roadmap
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Image Classification
• More than 50+ pre-trained ImageNet models(ResNet, MobileNet…)
• We achieved the best accuracy using some of the most popular
models (e.g., ResNet), compared with other frameworks
• Used as backbone in many downstream tasks => better accuracy
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Semantic Segmentation
• FCN
• PSPNet
• Mask-RCNN
• DeepLab
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Object Detection
• SSD and YOLOv3: fastest
solution
• Faster-RCNN, RFCN and FPN:
slower but more accurate,
especially for tiny objects
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Instance Segmentation
• Mask R-CNN
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Key Point Estimation
• SimplePose
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Style Transfer
MSGNet
GANs
CycleGAN
SRGAN
WGAN
Re-identification
Market1501
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Coming Soon: Depth Estimation
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Like GluonCV? Go build!
https://gluon-cv.mxnet.io
https://github.com/dmlc/gluon-cv

More Related Content

PPTX
Image Segmentation: Approaches and Challenges
PPTX
Introduction to object tracking with Deep Learning
PDF
AI Services and Serverless Workshop
PDF
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
PPTX
Generative Adversarial Networks (GANs) using Apache MXNet
PPTX
Adding Image and Video Analysis to your Applications (May 2018)
PDF
AWS AI Services 101
PDF
Voice Tech TO #1
Image Segmentation: Approaches and Challenges
Introduction to object tracking with Deep Learning
AI Services and Serverless Workshop
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Generative Adversarial Networks (GANs) using Apache MXNet
Adding Image and Video Analysis to your Applications (May 2018)
AWS AI Services 101
Voice Tech TO #1

Similar to Introduction to GluonCV (14)

PPTX
AWS Toronto Summit 2019 - AIM302 - Build, train, and deploy ML models with Am...
PPTX
Grokking Techtalk #40: AWS’s philosophy on designing MLOps platform
PPTX
GluonCV
PDF
A Gentle Intro to Deep Learning
PPTX
WhereML a Serverless ML Powered Location Guessing Twitter Bot
PDF
Build Your Recommendation Engine on AWS Today!
PPTX
Introduction to GluonNLP
PPTX
MXNet Paris Workshop - Intro To MXNet
PDF
Integrate Machine Learning into Your Spring Application in Less than an Hour
PPTX
Build, train and deploy ML models with SageMaker (October 2019)
PPTX
Where ml ai_heavy
PPTX
Apache MXNet and Gluon
PDF
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
PPTX
AWS re:Invent 2018 - ENT321 - SageMaker Workshop
AWS Toronto Summit 2019 - AIM302 - Build, train, and deploy ML models with Am...
Grokking Techtalk #40: AWS’s philosophy on designing MLOps platform
GluonCV
A Gentle Intro to Deep Learning
WhereML a Serverless ML Powered Location Guessing Twitter Bot
Build Your Recommendation Engine on AWS Today!
Introduction to GluonNLP
MXNet Paris Workshop - Intro To MXNet
Integrate Machine Learning into Your Spring Application in Less than an Hour
Build, train and deploy ML models with SageMaker (October 2019)
Where ml ai_heavy
Apache MXNet and Gluon
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
AWS re:Invent 2018 - ENT321 - SageMaker Workshop
Ad

More from Apache MXNet (20)

PPTX
Recent Advances in Natural Language Processing
PPTX
Fine-tuning BERT for Question Answering
PPTX
Introduction to Computer Vision
PPTX
Introduction to Deep face detection and recognition
PPTX
Deep Learning With Apache MXNet On Video by Ben Taylor @ ziff.ai
PDF
Using Java to deploy Deep Learning models with MXNet
PPTX
AI powered emotion recognition: From Inception to Production - Global AI Conf...
PDF
Apache MXNet ODSC West 2018
PDF
DeepLearning001&ApacheMXNetWithSparkForInference-ACNA2018
PDF
Apache MXNet EcoSystem - ACNA2018
PDF
ONNX and Edge Deployments
PDF
Distributed Inference with MXNet and Spark
PDF
Multivariate Time Series
PDF
AI On the Edge: Model Compression
PDF
Building Content Recommendation Systems using MXNet Gluon
PDF
Game Playing RL Agent
PDF
Building Applications with Apache MXNet
PDF
Neural Machine Translation with Sockeye
PPTX
Debugging and Performance tricks for MXNet Gluon
PPTX
Visual Search Engine with MXNet Gluon
Recent Advances in Natural Language Processing
Fine-tuning BERT for Question Answering
Introduction to Computer Vision
Introduction to Deep face detection and recognition
Deep Learning With Apache MXNet On Video by Ben Taylor @ ziff.ai
Using Java to deploy Deep Learning models with MXNet
AI powered emotion recognition: From Inception to Production - Global AI Conf...
Apache MXNet ODSC West 2018
DeepLearning001&ApacheMXNetWithSparkForInference-ACNA2018
Apache MXNet EcoSystem - ACNA2018
ONNX and Edge Deployments
Distributed Inference with MXNet and Spark
Multivariate Time Series
AI On the Edge: Model Compression
Building Content Recommendation Systems using MXNet Gluon
Game Playing RL Agent
Building Applications with Apache MXNet
Neural Machine Translation with Sockeye
Debugging and Performance tricks for MXNet Gluon
Visual Search Engine with MXNet Gluon
Ad

Recently uploaded (20)

PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PDF
Sciences of Europe No 170 (2025)
PPTX
2. Earth - The Living Planet Module 2ELS
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
Microbiology with diagram medical studies .pptx
PDF
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
PPTX
microscope-Lecturecjchchchchcuvuvhc.pptx
PPTX
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
PPTX
2. Earth - The Living Planet earth and life
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PPT
protein biochemistry.ppt for university classes
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
. Radiology Case Scenariosssssssssssssss
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
7. General Toxicologyfor clinical phrmacy.pptx
Sciences of Europe No 170 (2025)
2. Earth - The Living Planet Module 2ELS
HPLC-PPT.docx high performance liquid chromatography
Microbiology with diagram medical studies .pptx
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
microscope-Lecturecjchchchchcuvuvhc.pptx
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
2. Earth - The Living Planet earth and life
Taita Taveta Laboratory Technician Workshop Presentation.pptx
AlphaEarth Foundations and the Satellite Embedding dataset
protein biochemistry.ppt for university classes
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
Derivatives of integument scales, beaks, horns,.pptx
INTRODUCTION TO EVS | Concept of sustainability
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...

Introduction to GluonCV

  • 1. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Introduction to GluonCV
  • 2. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Why GluonCV? • What is the biggest challenge you have ever encountered with deep learning?
  • 3. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Why GluonCV? • What is the biggest challenge you have ever encountered with deep learning? • “reproducing the best claimed results from latest papers” SOTA state-of-the-art
  • 4. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Real-world Stories #1 • Back to a period in 2016, the same ImageNet models trained by MXNet achieved on average 1% worse accuracy compared to Torch. • Tried almost everything to debug, even developed a plugin to run Torch code inside MXNet to make it easier to compare results. => Transcoding training images using 95 JPEG quality rather than 85 solved the problem.
  • 5. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Real-world Stories #2 • Using another open source DL framework: trained model accuracies cannot match previous internal version. • Spent months to figure out why, with no clue. => The order of data augmentation is different from previous version.
  • 6. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark • I will write clean and reusable code when I’m prototyping this time. • Variant: • - I will write clean and reusable code next time. Common myth 1
  • 7. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Common myth 2 • My code will still run next year. • Sometimes, it’s not our fault.
  • 8. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Common myth3 • I will finish setting up the baseline model this afternoon. • Though it may not be our fault again.
  • 9. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Starting from scratch can be hard • Even the most talented researchers will get blocked by trivial things. • Experience and instincts can be your enemies in certain circumstances. • Training is time-consuming, initialization and augmentation is randomized, and many implementation details need to be taken care of. => Debugging deep learning models is extremely difficult.
  • 10. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
  • 11. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark It’s not easy to embrace open-source implementations • Often the quality of open-source implementations vary. • Languages, code styles, project structures, DL frameworks are mixed. • Personal projects tend to focusing on a specific task with specific datasets. It requires significant engineering efforts to adapt to your use case. • Community projects can be abandoned frequently.
  • 12. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark What does GluonCV provide • Reproduction of important papers in recent years • Model zoo with 80+ pre-trained models • Training scripts (as well as tuned hyper-parameters) to reproduce the results • Full training script + Dataset download script • Logs of training run
  • 13. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark What does GluonCV provide • Considerate APIs and modules that are easy to follow and understand • Avoid re-writing the same utilities again and again • Pre-set data augmentation and transforms, visualization and training utilities • Community support, feel free to ask and discuss • User forum • Github community and open roadmap
  • 14. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
  • 15. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Image Classification • More than 50+ pre-trained ImageNet models(ResNet, MobileNet…) • We achieved the best accuracy using some of the most popular models (e.g., ResNet), compared with other frameworks • Used as backbone in many downstream tasks => better accuracy
  • 16. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Semantic Segmentation • FCN • PSPNet • Mask-RCNN • DeepLab
  • 17. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Object Detection • SSD and YOLOv3: fastest solution • Faster-RCNN, RFCN and FPN: slower but more accurate, especially for tiny objects
  • 18. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
  • 19. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Instance Segmentation • Mask R-CNN
  • 20. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Key Point Estimation • SimplePose
  • 21. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Style Transfer MSGNet GANs CycleGAN SRGAN WGAN Re-identification Market1501
  • 22. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Coming Soon: Depth Estimation
  • 23. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Like GluonCV? Go build! https://gluon-cv.mxnet.io https://github.com/dmlc/gluon-cv

Editor's Notes

  • #2: First call deck for a high level introduction to Apache MXNet.