Visual Question-Answering
▪ Team Members:
▪ Abdalla Shaaban Elsayed
▪ Rabah Jamal Mohammed Ali
▪ Abdullah Abdelkader Roshdy
▪ Abdullah Mahmoud Abdullah
▪ Supervisor:
▪ Dr Sally Saad
▪ TA Ahmed Salah
Outline
▪ Introduction
▪ Motivation
▪ Problem definition
▪ Objective
▪ Working Phases
▪ Time Plan
▪ Tools
Visual Question-Answering
Predict the answer to a given question related to an image.
Motivation
▪ Performing complex activities.
▪ Merging two or more sub-problems.
▪ Understanding:
- Convolutional neural networks
- Natural language processing
- Recurrent neural networks
▪ Obtaining high accuracy from a complex model.
Types of Visual Question-Answering
Problem definition
How can we build a model that extracts the features of an image that are relevant to a given question?
Objectives
▪ Build a visual question answering system using the hierarchical co-attention technique.
▪ Aim to slightly improve on existing results: the system takes a question and an image as input
and outputs an answer based on how the RCNN understands the question.
Phases Diagram
Data Preprocessing → Model Building → Model Testing and Validation → Model Interface
Phases overview | Data preprocessing
▪ Gathering datasets:
• VQA Dataset
• COCO-QA Dataset
▪ Preparing the dataset:
• Cleaning the dataset using NLTK.
• Text representation using word embeddings.
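The preparation step can be illustrated with a minimal sketch. It assumes a plain regular expression in place of an NLTK tokenizer so that it runs with no downloads, and the helper names (`clean_question`, `build_vocab`, `encode`) and the `<pad>`/`<unk>` tokens are hypothetical. The integer ids it produces are what an embedding layer would typically look up to obtain word vectors.

```python
import re
from collections import Counter

def clean_question(text):
    """Lowercase a question and split it into word tokens."""
    # Assumption: a regex stands in for an NLTK tokenizer here.
    return re.findall(r"[a-z0-9']+", text.lower())

def build_vocab(questions, min_count=1):
    """Map every sufficiently frequent token to an integer id."""
    counts = Counter(tok for q in questions for tok in clean_question(q))
    vocab = {"<pad>": 0, "<unk>": 1}  # hypothetical special tokens
    for token, count in sorted(counts.items()):
        if count >= min_count:
            vocab[token] = len(vocab)
    return vocab

def encode(question, vocab, max_len=8):
    """Turn a question into a fixed-length list of token ids."""
    ids = [vocab.get(tok, vocab["<unk>"]) for tok in clean_question(question)]
    ids = ids[:max_len]
    return ids + [vocab["<pad>"]] * (max_len - len(ids))

questions = ["What color is the cat?", "Is the cat sleeping?"]
vocab = build_vocab(questions)
# Words never seen while building the vocabulary map to <unk>.
encoded = encode("What is the dog doing?", vocab)
```

Padding every question to the same length keeps batches rectangular, which is what most model-building libraries expect.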
Phases overview | Model Building
▪ Image Feature Extraction
▪ Question Hierarchy
▪ Co-Attention
▪ Encoding for Predicting Answers
Phases overview | Model Building
▪ The model will extract word-level, phrase-level, and question-level embeddings of the question.
At each level, it applies co-attention to both the image and the question. The final
answer prediction is based on the co-attended image and question features from all levels.
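A rough, dependency-free sketch of the idea follows. It is not the model from the slides: the learned weight matrices are omitted, attention scores are simply the maximum of a tanh affinity over the other modality, and the per-level results are fused by concatenation. All feature values and the function names (`co_attend`, `weighted_sum`) are made up for illustration.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def weighted_sum(weights, vectors):
    dim = len(vectors[0])
    return [sum(w * vec[i] for w, vec in zip(weights, vectors))
            for i in range(dim)]

def co_attend(question_feats, image_feats):
    """Attend over question positions and image regions at the same time."""
    # affinity[t][n]: similarity of question position t and image region n.
    affinity = [[math.tanh(dot(q, v)) for v in image_feats]
                for q in question_feats]
    # Assumption: score each item by its best match in the other modality.
    q_weights = softmax([max(row) for row in affinity])
    v_weights = softmax([max(col) for col in zip(*affinity)])
    return (weighted_sum(q_weights, question_feats),
            weighted_sum(v_weights, image_feats))

# Toy features: three image regions and a two-token question at three levels.
image_regions = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
levels = {
    "word": [[0.9, 0.1], [0.2, 0.8]],
    "phrase": [[0.7, 0.3], [0.4, 0.6]],
    "question": [[0.6, 0.4], [0.5, 0.5]],
}

fused = []
for name, question_feats in levels.items():
    attended_q, attended_v = co_attend(question_feats, image_regions)
    # Assumption: fuse the levels by concatenating the attended vectors.
    fused.extend(attended_q + attended_v)
```

In a trained system, `fused` would feed a classifier over candidate answers; here it only shows that every level contributes both an attended question vector and an attended image vector.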
Phases overview | Model Testing
▪ Measuring the system’s accuracy and the level of correctness of the predicted answers.
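One simple way to measure accuracy is exact-match scoring, sketched below. The slides do not say which metric is used, so this is an assumption; the function names and the sample answers are hypothetical.

```python
def normalize(answer):
    """Lowercase an answer and collapse extra whitespace."""
    return " ".join(answer.lower().strip().split())

def exact_match_accuracy(predictions, references):
    """Fraction of predictions that equal the reference after normalization."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    correct = sum(normalize(p) == normalize(r)
                  for p, r in zip(predictions, references))
    return correct / len(references)

# Made-up answers for illustration only.
predicted = ["Black", "yes", "2"]
expected = ["black", "no", "2"]
accuracy = exact_match_accuracy(predicted, expected)  # 2 of 3 match
```

Normalizing before comparison avoids counting an answer as wrong purely because of casing or stray spaces.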
Phases overview | Interface
▪ Build a user interface for the system, which allows the user to interact with the system.
▪ Use a Python web framework with CSS and JavaScript (optional).
Time Plan
Tools
▪ Languages:
- Python for preprocessing the datasets.
- JavaScript for the UI (optional).
▪ Libraries and Frameworks:
- NLTK and Pillow (a fork of the Python Imaging Library) for preprocessing the dataset.
- TensorFlow to build the model.
Questions
References
▪ Chenyue Meng and Yixin Wang, “Image-Question-Linguistic Co-Attention for Visual Question Answering”, 2016.
▪ Alisha Rege and Payal Bajaj C, “From Vision to NLP: A Merge”, 2017.
▪ Ronghang Hu, Jacob Andreas, and Marcus Rohrbach, “Learning to Reason: End-to-End Module Networks for Visual Question Answering”, 2017.
▪ Jiasen Lu, Jianwei Yang, and Dhruv Batra, “Hierarchical Question-Image Co-Attention for Visual Question Answering”, 2017.
Thank You!
