SlideShare a Scribd company logo
Video-based Big Data Analytics
in Cyberlearning
Shuangbao (Paul) Wang, Ph.D.
Professor, Director
Center for Security Studies
1913
1920 * 1080 * 50 * 60 -- 3000 * 2M -- 6G
2016
Videos in Cyberlearning – big data
• Video use is growing rapidly in education (and elsewhere)
• MOOCs (ex. EdX, Coursera, Udacity) rely on videos
• Huge repositories (ex. RBDIL, NSDL) contain
extraordinary amounts of valuable video data
• Videos are big data, unstructured
– Hardly being analyzed by current data analytics tools
• Cyberlearning requires more interactions.
RBDIL – Rutgers University
NSDL – National Science Digital Library
Paul Wang SOED 2016
Paul Wang SOED 2016
Using Videos Effectively in Learning
• Interactive? (DoD)
• Assessments ? (adaptive)
• Easy for instructors to use? (course development)
• Accessibility? (Mac, mobile)
• Track students’ growth? (over multiple years)
• Long videos vs. short ones? (crop)
• Recording methodology? (noises and echo)
inVideo - A Novel Big Data Analytics
Tool for Video Data Analytics
• Analyzing Video by Keywords
• Content Based Image Retrieval (CBIR)
• Pattern Recognition (PR)
• Multiple Languages
inVideo: Analyzing Video by Keywords
• Audio is stripped and used to generate a transcript
• Transcript is indexed back to original media
• Video is now searchable/mineable by keyword
Result shows that 7 video clips from three videos were retrieved for keyword “online”
inVideo: Content Based Image Recognition (CBIR)
• Provide a picture reference
• Search video content (frames) that contains the reference picture
• Return the video clips
Result shows that the match is at 0.05th sec. in video named “student”
inVideo: Pattern Recognition (PR)
• Provide a keyword reference
• Search video content (frames) that contains the object described as
“keyword”
The results shows three videos were retrieved that contain objects look like the keyword “credit card”
inVideo: Analyzing Different Languages
• Input keywords in other languages
• Search transcript for keywords in that language
• Retrieve video clips that match
The results shows two video clips in one video contain the keyword “学生”
(the word “student”in Chinese).
Paul Wang SOED 2016
Clip#0
Introduction
Clip#1
Pwd cracking
Clip#2
Port scan
Clip#3
Encryption
Clip#4
Forensics
Clip#5
Cyber
weapon
Video: Linear – to -- Interactive
Before: A 46-minute long video
After: 2-3 minute video clips with assessments in between and at the end
Assessments
• Learning objects
composed of short
video clips
• Assessment of
learning outcomes
of studying video
content
• Teachers: selecting
a video segment and
assign Q&As
Dragthe stage bar and click “From”button; continue draggingand then click “To”
button. Add a question and answers.
Define “Learning Objects” (Instructors)
Learning and Assessments
• View the whole
video, and take
a quiz
• Review the
video clip
corresponding
to the question
Click “Review” button to review the video clip; click the speak icon to speak out
the question; click “Confirm”to check your answer
Case Study: Cybersecurity Program
Student Engagement for the 24 Classrooms
inVideo: Turn videos into interactive learning contents
Low Accuracy
Video1: 45 Video Clips
Video1: 29 Video Clips
Video3: 29 Video Clips
Video1: Individual
Video 2: Small Class
Video 3: Full Classroom
Accuracy of transcripts of 9 video clips from three
original videos
SDLC
Accuracy comparison:
• “hits rate” before –
standard parameters;
• “hits rate” after –
revised parameters
No improvement!
Correlations?
low accuracy vs. recording methods
• Low accuracy
– 10% or less
– Individual22,(45+31+29 video clips)
• Medium accuracy
– 40 to 60%
– EdX_EDM, EdX_ajax, (20 video clips)
• High accuracy
– 90% or higher
– Phone.p2, online_shopping, (30 video clips)
Online Shopping
A=90%
Individual 22
A=10%
rfeb07
A=10%
phone.p2
A=90%
Phone.p2
A=90%
r002
A=10%
edX. ajax
A=60%
edX.EDM
A=50%
Voice-over re-Recording
• Re-recorded voices on
videos
• Merge audio track with
original videos
• Signal analyzing while
recording
• Accuracy significantly
improved!
Correlations
• Low accuracy is expressed in high quefrency
– A measurement of ambient noises
– echo
• Recording methods
– One microphone (per person)
– Used condenser microphone instead of dynamic one
• Recording setting could affect the audio quality (for digital processing)
– Experiments
– Guide to digital recording
Paul Wang SOED 2016
Paul Wang SOED 2016
Transcript Time-stamping System (TTS)
• Adding timestamps for already transcribed videos
• Fuzzy Search 
Further Discussions
• Web API
• Search progression (over the years)
• Voice cancellation/reduction
• Automatic Time-stamping
• Curriculum Development
• Build community - collaborations
Publications
Shuangbao (Paul) Wang, Ph.D.
paul.wang@computer.org
William Kelly, Metonymy Corporation
Xiaolong Cheng, Doctoral Candidate, George Washington University

More Related Content

PDF
Lumos StepUp ELA for ISTEP+
PDF
Lumos StepUp for ISTEP+
PDF
Ost math
PDF
Ost ela
PDF
TNReady ELA IG - www.lumoslearning.com
PDF
TNReady Math IG - www.lumoslearning.com
PDF
Sakai10 ken romeohandout
PPT
The Real China Tutorial Report
Lumos StepUp ELA for ISTEP+
Lumos StepUp for ISTEP+
Ost math
Ost ela
TNReady ELA IG - www.lumoslearning.com
TNReady Math IG - www.lumoslearning.com
Sakai10 ken romeohandout
The Real China Tutorial Report

Similar to Paul Wang SOED 2016 (20)

PDF
CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...
PDF
Interactive E-Lecture Using Video Annotation in Learning Groups
PPT
An interactive video based Environment supporting learning analytics
PDF
Profile based Video segmentation system to support E-learning
PPTX
Show me what you can! - Digital video in support of practice-based education:...
PPTX
Processing digital ideas
PPTX
Video research in education
PPTX
軒銘Icalt.pptx
DOCX
Analysing student behaviour when learning from video-based learning resources
PDF
Students’ perceptions and perceived value of lecture capture in teacher educa...
PPTX
Video Analysis Instructional Design Presentation
PPT
11 06 28_dublin_video
DOCX
Andrea boehmecimt543spring2012video
PDF
5009a270
PDF
V Code And V Data Illustrating A New Framework For Supporting The Video Annot...
PPT
Content Classification And Context Based Retrieval System For E Learning
PPT
Content Classification And Context Based Retrieval System For E Learning
PPT
Content Classification And Context Based Retrieval System For E Learning
PDF
Video content analysis and retrieval system using video storytelling and inde...
PPT
Visual Learning Lab
CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...
Interactive E-Lecture Using Video Annotation in Learning Groups
An interactive video based Environment supporting learning analytics
Profile based Video segmentation system to support E-learning
Show me what you can! - Digital video in support of practice-based education:...
Processing digital ideas
Video research in education
軒銘Icalt.pptx
Analysing student behaviour when learning from video-based learning resources
Students’ perceptions and perceived value of lecture capture in teacher educa...
Video Analysis Instructional Design Presentation
11 06 28_dublin_video
Andrea boehmecimt543spring2012video
5009a270
V Code And V Data Illustrating A New Framework For Supporting The Video Annot...
Content Classification And Context Based Retrieval System For E Learning
Content Classification And Context Based Retrieval System For E Learning
Content Classification And Context Based Retrieval System For E Learning
Video content analysis and retrieval system using video storytelling and inde...
Visual Learning Lab
Ad

More from Colleen Ganley (6)

PDF
Dragan Gasevic SOED 2016
PDF
Matti Pauna SOED 2016
PDF
Michael Gage SOED 2016
PDF
Yahya Almalki SOED 2016
PPTX
Hart & Ganley SOED 2016
PPTX
Christopher Brooks SOED 2016
Dragan Gasevic SOED 2016
Matti Pauna SOED 2016
Michael Gage SOED 2016
Yahya Almalki SOED 2016
Hart & Ganley SOED 2016
Christopher Brooks SOED 2016
Ad

Recently uploaded (20)

PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
Cell Types and Its function , kingdom of life
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Insiders guide to clinical Medicine.pdf
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
Cell Structure & Organelles in detailed.
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
Pharma ospi slides which help in ospi learning
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
VCE English Exam - Section C Student Revision Booklet
Cell Types and Its function , kingdom of life
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
O7-L3 Supply Chain Operations - ICLT Program
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Renaissance Architecture: A Journey from Faith to Humanism
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Insiders guide to clinical Medicine.pdf
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
TR - Agricultural Crops Production NC III.pdf
Cell Structure & Organelles in detailed.
Module 4: Burden of Disease Tutorial Slides S2 2025
Abdominal Access Techniques with Prof. Dr. R K Mishra
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Anesthesia in Laparoscopic Surgery in India
Pharma ospi slides which help in ospi learning

Paul Wang SOED 2016

  • 1. Video-based Big Data Analytics in Cyberlearning Shuangbao (Paul) Wang, Ph.D. Professor, Director Center for Security Studies
  • 2. 1913 1920 * 1080 * 50 * 60 -- 3000 * 2M -- 6G 2016
  • 3. Videos in Cyberlearning – big data • Video use is growing rapidly in education (and elsewhere) • MOOCs (ex. EdX, Coursera, Udacity) rely on videos • Huge repositories (ex. RBDIL, NSDL) contain extraordinary amounts of valuable video data • Videos are big data, unstructured – Hardly being analyzed by current data analytics tools • Cyberlearning requires more interactions.
  • 4. RBDIL – Rutgers University
  • 5. NSDL – National Science Digital Library
  • 8. Using Videos Effectively in Learning • Interactive? (DoD) • Assessments ? (adaptive) • Easy for instructors to use? (course development) • Accessibility? (Mac, mobile) • Track students’ growth? (over multiple years) • Long videos vs. short ones? (crop) • Recording methodology? (noises and echo)
  • 9. inVideo - A Novel Big Data Analytics Tool for Video Data Analytics • Analyzing Video by Keywords • Content Based Image Retrieval (CBIR) • Pattern Recognition (PR) • Multiple Languages
  • 10. inVideo: Analyzing Video by Keywords • Audio is stripped and used to generate a transcript • Transcript is indexed back to original media • Video is now searchable/mineable by keyword Result shows that 7 video clips from three videos were retrieved for keyword “online”
  • 11. inVideo: Content Based Image Recognition (CBIR) • Provide a picture reference • Search video content (frames) that contains the reference picture • Return the video clips Result shows that the match is at 0.05th sec. in video named “student”
  • 12. inVideo: Pattern Recognition (PR) • Provide a keyword reference • Search video content (frames) that contains the object described as “keyword” The results shows three videos were retrieved that contain objects look like the keyword “credit card”
  • 13. inVideo: Analyzing Different Languages • Input keywords in other languages • Search transcript for keywords in that language • Retrieve video clips that match The results shows two video clips in one video contain the keyword “学生” (the word “student”in Chinese).
  • 15. Clip#0 Introduction Clip#1 Pwd cracking Clip#2 Port scan Clip#3 Encryption Clip#4 Forensics Clip#5 Cyber weapon Video: Linear – to -- Interactive Before: A 46-minute long video After: 2-3 minute video clips with assessments in between and at the end Assessments
  • 16. • Learning objects composed of short video clips • Assessment of learning outcomes of studying video content • Teachers: selecting a video segment and assign Q&As Dragthe stage bar and click “From”button; continue draggingand then click “To” button. Add a question and answers. Define “Learning Objects” (Instructors)
  • 17. Learning and Assessments • View the whole video, and take a quiz • Review the video clip corresponding to the question Click “Review” button to review the video clip; click the speak icon to speak out the question; click “Confirm”to check your answer
  • 18. Case Study: Cybersecurity Program Student Engagement for the 24 Classrooms inVideo: Turn videos into interactive learning contents
  • 19. Low Accuracy Video1: 45 Video Clips Video1: 29 Video Clips Video3: 29 Video Clips Video1: Individual Video 2: Small Class Video 3: Full Classroom
  • 20. Accuracy of transcripts of 9 video clips from three original videos
  • 21. SDLC Accuracy comparison: • “hits rate” before – standard parameters; • “hits rate” after – revised parameters No improvement!
  • 22. Correlations? low accuracy vs. recording methods • Low accuracy – 10% or less – Individual22,(45+31+29 video clips) • Medium accuracy – 40 to 60% – EdX_EDM, EdX_ajax, (20 video clips) • High accuracy – 90% or higher – Phone.p2, online_shopping, (30 video clips)
  • 25. Voice-over re-Recording • Re-recorded voices on videos • Merge audio track with original videos • Signal analyzing while recording • Accuracy significantly improved!
  • 26. Correlations • Low accuracy is expressed in high quefrency – A measurement of ambient noises – echo • Recording methods – One microphone (per person) – Used condenser microphone instead of dynamic one • Recording setting could affect the audio quality (for digital processing) – Experiments – Guide to digital recording
  • 29. Transcript Time-stamping System (TTS) • Adding timestamps for already transcribed videos • Fuzzy Search 
  • 30. Further Discussions • Web API • Search progression (over the years) • Voice cancellation/reduction • Automatic Time-stamping • Curriculum Development • Build community - collaborations
  • 32. Shuangbao (Paul) Wang, Ph.D. paul.wang@computer.org William Kelly, Metonymy Corporation Xiaolong Cheng, Doctoral Candidate, George Washington University