SlideShare a Scribd company logo
RELATIONS BETWEEN LEARNING ANALYTICS AND
DATA PRIVACY IN MOOCs
Malinka Ivanova
Technical University of Sofia, Bulgaria
Carmen Holotescu
„Ioan Slavici” University of Timisoara, Romania
Gabriela Grosseck
West University of Timisoara, Romania
Cătălin Iapă
University Politehnica Timisoara, Romania
eLearning and Software for Education
Conference - eLSE 2016, Workshop on Open
Educational Resources and MOOC,
Bucharest, 21-22 April 2016
AIM
To summarize and analyse the current projects and
practices, best learning cases and research
standpoints in the area of MOOCs data use for
learning analytics purposes and at this base a model
presenting a possibility to ensure data privacy in
MOOCs to be developed
OUTLINE
Introduction and problem definition
Research method
Collected data by MOOCs platform
Big data and MOOCs analytics
CIC data privacy model in MOOCs
Conclusions
understanding and
optimizing learning
and the
environments in
which it occurs
Learning
Analytics
-measurement,
-collection,
-analysis and
-reporting of
data about
learners and
their contexts
is defined as for purposes of
Siemens, Long, 2011
-LMS
-Social media platforms
-MOOCs
-Online
-Blended-learning
courses
Data about learners
Learning support
Records of students’ activity in LMSs
•logging, posting and commenting messages, accessing
materials, posting assignments
Results in previous courses
Preferences
Learning optimization and
understanding
Prediction and improvement
students’ academic performance
Helping students „at risk” with
prompt feedback
Learning
Analytics
METHOD
Review of
privacy policy of five
MOOCs platforms
Review of
started projects and
published scientific
papers
Research questions
What kind of students’ private data are
shared and why?
What are the privacy models provided
by MOOCs?
What data are needed for Learning
Analytics modules?
Are there any third party actors
processing the students’ data and why?
Which are the relations between
Learning Analytics and Data Privacy?
Is it possible the learners to be
protected?
Development
of a model reflecting
the data privacy in
MOOCs
COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
The term:
• Personal data is legally recognized and it is defined as “any information relating
to an identified or identifiable natural person” (Directive 95/46/EC)
• Private data is not used by juridical science, but it is often utilized in other fields
of science – for example in computer science
• Sensitive data - special category of data in legal documents related to “racial or
ethnic origin, political opinions, religious or philosophical beliefs, trade-union
membership, and the processing of data concerning health or sex life” (Directive
95/46/EC)
• Confidential data - used only between two parties, keeping it in secret; it is not
available for public use
Personal data Non-personal data
Private data
Sensitive data
Confidential data
Continuum of collected data for research and educational purposes
Factors/
MOOCs
Platforms
edX Coursera FutureLearn CanvasNetwork Open2Study
Collected
information
by
platforms
Personal
information about:
-sign up;
-participation in online
courses;
-registration for a paid
certificate;
-at sending email
messages;
-participation in public
forums;
-student learning
performance;
-which, when, where
pages are visited;
-which hyperlinks and
other user interface
controls are used.
A.Personal
information:
-at registration; -for
user account update;
-to purchase products or
services;
-to complete surveys;
-to sign-up for email
updates;
-to participate in
forums;
-to send email;
-to participate in online
courses.
B.Non-personal
information:
- which, when, where
pages are visited;
-which hyperlinks are
visited;
-URLs from which
Cursera is visited;
-log of IP address, OS
Personal information:
-for access to website;
-to use online courses and
content;
-to register or post notes,
assignments or other
material;
-information when user
report a problem;
-location, gender and
educational history or
qualifications;
-other information for
delivering the best service;
-results of assessments;
–at purchase a paid service -
credit card or other payment
details;
- a record of email
correspondence;
-data about usage of
content;
-IP address.
Personal information:
-data for login;
-messages and stored files;
-email content;
-survey data;
- how user interacts with
applications;
- encountered errors by users
when use platform;
- device identifiers;
-how often users visit the
site, what pages are visited,
what other sites are used for
coming to the site;
-IP address, operating system,
browser type, domain name.
A.Personal information:
-at registration;
-educational qualifications;
-academic results;
-banking and payment details;
-tax file number;
- responses to surveys;
-how, when and why services
are used.
B. Sensitive information
-language background;
-citizenship;
-disability;
-health information.
C.Non-personal
information
- website visit - date, time and
duration, search terms,
viewed pages;
-IP address, type of device,
browser and OS.
COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
Collected
financial
information
By third party
payment site
-A. By third party
payment site;
-B. By Coursera:
credit card
information
By FutureLearn: data
about credit card or
other payment
details at purchasing
a paid service
- By Open2Study that
uses EFTPOS and
online technologies
for payment
Factors/
MOOCs
Platforms
edX Coursera FutureLearn CanvasNetwork Open2Study
COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
Usage of
collected
information
by
platforms
-to personalize
the course
material;
-to improve
learning
process.
A.Personal
information for:
-delivering specific
courses and/or
services;
-sending updates
about online
courses and
events.
B.Non-Personal
information for:
-improvement of
services;
-other business
purposes.
- to improve,
maintain and protect
the quality of web
site, online courses
and content;
-to personalize
browsing;
-for announcements
and email
notifications;
-for opinion gathering
- to create and
maintain user
account;
-to provide better
services;
- to personalize and
improve learner
experience;
- to send
administrative e-mail
and announcement;
-to send surveys
- for user
identification and
verification;
-for communication;
- for delivering of
educational services;
- for improvement of
web site services
Factors/
MOOCs
Platforms
edX Coursera FutureLearn CanvasNetwork Open2Study
COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
Usage of
collected
information
from third
parties
-by educational
institutions;
-by other
providers of
courses in edX
- by service
providers,
vendors and
contractors;
-by university
partners and
other business
partners;
- by government
authorities
- by FutureLearn
partners;
-by course and
content providers;
-other third parties
- third party service
providers;
- for merger, financing,
acquisition,
bankruptcy,
dissolution,
transaction, sale
purposes
-by Australian
government
- by organizations
administrating the
business;
- by financial
institutions;
-by service
providers and
research agencies,
mailing houses,
postal, freight and
courier service
providers, printers
and distributors of
direct marketing
material, external
business advisers
Factors/
MOOCs
Platforms
edX Coursera FutureLearn CanvasNetwork Open2Study
COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
Security -software
programme for
data protection
through
administrative,
physical, and
technical
safeguards
-industry
standard: physical,
technical and
administrative
security measures
- not sharing
information with
third parties,
except before
mentioned
-technical and
organizational
security measures
-secure web site
- protect against
unauthorized access,
information use, or
disclosure
-any posted content
using CanvasNetwork
services is at user risk
-ICT security;
-secure office
access;
-personnel security
and training;
-workplace policies
Factors/
MOOCs
Platforms
edX Coursera FutureLearn CanvasNetwork Open2Study
COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
Usage of
cookies
cookies: for
collecting IP
address,
operating
system,
and browser
information
cookies and/or
web beacons:
-to identify users;
-to personalize
experience;
-to identify repeat
visitors;
-to determine the
type of content
and spending time
cookies:
- to enhance learner
experience;
-for better
understand how
learners use the
website;
- cookies from third
party social media
websites
-cookies and web
beacons:
for information about
user visit and
searched/ viewed
content
- flash cookies: to
store user preferences
and personalize visits
cookies:
-to hold anonymous
session;
- to personalize
website visits
Factors/
MOOCs
Platforms
edX Coursera FutureLearn CanvasNetwork Open2Study
COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
Privacy
policy
-Own edX
privacy policy;
-Family
Educational
Rights and
Privacy (FERPA)
for education
records
Own Cursera
privacy policy
Own FutureLearn
privacy policy
Own CanvasNetwork
privacy policy
-Own complying
with:
the Privacy Act
1988, State and
Territory health
privacy legislation,
the Spam Act 2003,
the Do Not Call
Register Act 2006
Factors/
MOOCs
Platforms
edX Coursera FutureLearn CanvasNetwork Open2Study
COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
BIG DATA AND MOOCs ANALYTICS
• Report about new technology approaches and challenges for
performance of MOOCs analytics (O’Reilly, Veeramachaneni, 2014)
• Emerging of web platforms facilitating huge groups to contribute in
development of analytics
MOOCviz – platform for collaborative
MOOC visualizations
FeatureFactory – interactive platform
for presenting prediction problems
LabelMe-Text – platform designed in
support of annotating in forum posts
• Report of LASyM learning analytics system for MOOCs:
 mines “big data”
 analyzes learning outcomes and assessment of learners
 delivers information that could be used for design of optimized MOOCs
• The aim of LASyM is to minimize “at-risk” MOOC students through
their identification at the earliest possible stage
Tabaa, Medouri, 2013
BIG DATA AND MOOCs ANALYTICS
• Ascertainment about the growing interest among researchers and
educators to open analytical systems like edX Insights and Tin Can
• Their usage could improve effectiveness of course design in different
subjects:
 different learning scenarios could be developed for studying computer
science and art history
 improvement of effectiveness of peer ranking and improvement of course
assignments
Godwin-Jones, 2014
BIG DATA AND MOOCs ANALYTICS
• European project Multiple MOOC Aggregator (EMMA) reports
development of a MOOC platform that aggregates existing European
MOOC courses with possibility for learners to design their own
personal learning environment
• EMMA platform possesses analytical application for ensuring
realization of personal learning paths according to individual learning
needs
• The learning process is monitored, achievements are controlled and
development of a competence like “learning to learn” is improved
Brouns, Tammets, Padrón-Nápoles, 2014
BIG DATA AND MOOCs ANALYTICS
• European project EDSA (European Data Science Academy) - a data
mining process in MOOCs on Coursera is applied with aim the
learning process to be analyzed
• An experiment with students from Eindhoven University of
Technology registered for participation in MOOCs on Coursera is
performed
 their behavior is analyzed
 a learning model of successful and failed students is created
EDSA project, http://edsa-project.eu/resources/learning-analytics
BIG DATA AND MOOCs ANALYTICS
CIC DATA PRIVACY MODEL IN MOOCs
• Importance of applying learning
analytics to “big data” mining for
improvement of MOOCs learning
scenarios
• A huge massive of information
concerning any learner is utilized
and personal data are not kept
private
• The model CIC (Classification-
Information-Choice) describes six
measures for ensuring data
privacy in scenarios of large scale
learning
Classification. Appropriate
classification of collected personal data/
non-personal data in groups and
classification of third parties
Information. Preparation of
informative tools informing learners
about type of collected data,
mechanisms for collecting and storing,
data usage and possibilities for choices
Choice. Choices regarding data sharing
to MOOCs platform, data giving to third
parties, for movement personal and non-
personal data from one classification
group to other
CIC DATA
PRIVACY
MODEL IN
MOOCs
CONCLUSIONS
• Intensive work on data privacy in different applicable areas in
practical and theoretical aspect, including in MOOCs learning
• Anyway, this topic is still in its immature form and it is a need for
further development of policies, mechanisms and tools
• In the existing documents the privacy of personal data is well
discussed, but nothing is mentioned about the privacy of non-
personal data
• As it is seen a connection between personal and non-personal data
exists and this connection should be taken into consideration when a
data privacy is forming
CONCLUSIONS
• The created CIC model reflects on the current situation:
 there are privacy policies written for a given MOOCs platform, but they are
not complete;
 existing learning analytics tools are utilized with minimum care for data
privacy;
 “big data” is collected and stored in unsafe digital repositories
• It reveals the big picture for undertaking measures for improvement
of data privacy in MOOCs
• It could be used as a recommendation tool guiding developers of
MOOCs in realization of more complete data privacy
Thank you!
• Picture is taken from:
http://cdn.ttgtmedia.com/visuals/ComputerWeekly/Hero%20Images/data-
privacy-fotolia.jpg

More Related Content

PPTX
Researching Data Privacy in eLearning
PPTX
Researching Data Privacy in eLearning
PPT
Incommon overview
PDF
COMBI: A NEW COMBINATION OF WEB MINING TECHNIQUES FOR E- LEARNING PERSONALIZA...
PDF
PDF
Privacy-Enhanced Personalization
PDF
Li- Rethinking information literacy competency standards in the 21st century
PDF
E-safety advocates - working with young people to raise awareness by Emma Had...
Researching Data Privacy in eLearning
Researching Data Privacy in eLearning
Incommon overview
COMBI: A NEW COMBINATION OF WEB MINING TECHNIQUES FOR E- LEARNING PERSONALIZA...
Privacy-Enhanced Personalization
Li- Rethinking information literacy competency standards in the 21st century
E-safety advocates - working with young people to raise awareness by Emma Had...

Viewers also liked (7)

PDF
Date statistice - suport pentru politici publice. Studiu de caz: Aspecte demo...
PPTX
Early Experiences of the Course Conversion to OER Process at Open University...
PPTX
2016 MOOC to Blended; pedagogies & personalised learning
PDF
The Power of MOOCs
PPTX
Learning Analytics in MOOCs: Can Data Improve Students Retention and Learning?
PPTX
Anca Colibaba, Ovidiu Ursa: Rethinking academic management for creative mater...
PDF
How To Tell Inspiring Stories
Date statistice - suport pentru politici publice. Studiu de caz: Aspecte demo...
Early Experiences of the Course Conversion to OER Process at Open University...
2016 MOOC to Blended; pedagogies & personalised learning
The Power of MOOCs
Learning Analytics in MOOCs: Can Data Improve Students Retention and Learning?
Anca Colibaba, Ovidiu Ursa: Rethinking academic management for creative mater...
How To Tell Inspiring Stories
Ad

Similar to RELATIONS BETWEEN LEARNING ANALYTICS AND DATA PRIVACY IN MOOCs (20)

PPTX
Student privacy self-management: Implications for learning analytics
PPTX
Learning analytics at the intersections of student trust, disclosure and benefit
PDF
Big Data and Student Retention
PPTX
Addressing the wicked problem of learning data privacy though principle and p...
PPTX
The big story of small data.
PPTX
The unbearable lightness of consent: Mapping MOOC providers’ response to consent
PDF
Lynema Data Collection and Privacy: Library and Learning Management Systems
PDF
Learning Analytics in action: ethics and privacy issues in the classroom
PPTX
Balancing Innovation and Student Data Privacy in EdTech
PPTX
Munassir etec647 e presentation
PDF
Griffiths lace workshop-eden-2016
PDF
Ethics privacy washington
PPTX
Personalisation and the Transmedial Learner
PDF
Ethics and Privacy in the Application of Learning Analytics (#EP4LA)
PPTX
Critical issues in the collection, analysis and use of student (digital) data
PPTX
Data security issues, ethical issues and challenges to privacy in knowledge-i...
PDF
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid. ...
PPTX
Learning and Educational Analytics
PPTX
Data aCT PRIVACY/................................................
PPTX
LAK16 privacy and analytics (2016)
Student privacy self-management: Implications for learning analytics
Learning analytics at the intersections of student trust, disclosure and benefit
Big Data and Student Retention
Addressing the wicked problem of learning data privacy though principle and p...
The big story of small data.
The unbearable lightness of consent: Mapping MOOC providers’ response to consent
Lynema Data Collection and Privacy: Library and Learning Management Systems
Learning Analytics in action: ethics and privacy issues in the classroom
Balancing Innovation and Student Data Privacy in EdTech
Munassir etec647 e presentation
Griffiths lace workshop-eden-2016
Ethics privacy washington
Personalisation and the Transmedial Learner
Ethics and Privacy in the Application of Learning Analytics (#EP4LA)
Critical issues in the collection, analysis and use of student (digital) data
Data security issues, ethical issues and challenges to privacy in knowledge-i...
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid. ...
Learning and Educational Analytics
Data aCT PRIVACY/................................................
LAK16 privacy and analytics (2016)
Ad

More from Malinka Ivanova (20)

PPTX
144 presentation iee_tel2021
PPTX
Imcl2021 paper1
PPT
Relationship between Students’ Creative Skill and Learning Performance
PPTX
Analysis and Modelling of CMOS Gm-C Filters through Machine Learning
PPTX
Cooling Water Control System Fuzzy Logic
PPTX
Presentation Learning Analytics Open Educational Resources
PPT
Evaluation of e-assessment
PPTX
Predictive Modeling Concerning Mobile Learning Advance
PPTX
Fuzzy Set Theory and Fuzzy Logic for Activities Automation in Engineering Edu...
PDF
Development of the Personalized Recommender System COsys for Career Orientation
PPTX
CASE-BASED WORKFLOW MODELING IN SUPPORT OF AUTOMATION THE TEACHERS’ PERSONAL ...
PPTX
INTELLIGENT TUTORING ININFORMAL SETTINGS:EMPIRICAL STUDY
PPTX
EXPLORATION ON THE AFFECTIVE STATES AND LEARNING DURING AN AUGMENTED REALITY ...
PPTX
Analysis of Personal Learning Networks in Support of Teachers Presence Optimi...
PPTX
Actualization of a Course Library through Influential Twitter Knowledge
PPTX
Дългата опашка и електронното обучение
PPTX
Researching Emerging Technologies and Environments in Support of New Learni...
PPTX
Competences Mapping for Personal Learning Environment Management
PPT
Google Wave Platform: Exploring the Settings for Personalized Learning
PPTX
Facilitating Active Learning Utilizing the Online Environment of Nfomedia
144 presentation iee_tel2021
Imcl2021 paper1
Relationship between Students’ Creative Skill and Learning Performance
Analysis and Modelling of CMOS Gm-C Filters through Machine Learning
Cooling Water Control System Fuzzy Logic
Presentation Learning Analytics Open Educational Resources
Evaluation of e-assessment
Predictive Modeling Concerning Mobile Learning Advance
Fuzzy Set Theory and Fuzzy Logic for Activities Automation in Engineering Edu...
Development of the Personalized Recommender System COsys for Career Orientation
CASE-BASED WORKFLOW MODELING IN SUPPORT OF AUTOMATION THE TEACHERS’ PERSONAL ...
INTELLIGENT TUTORING ININFORMAL SETTINGS:EMPIRICAL STUDY
EXPLORATION ON THE AFFECTIVE STATES AND LEARNING DURING AN AUGMENTED REALITY ...
Analysis of Personal Learning Networks in Support of Teachers Presence Optimi...
Actualization of a Course Library through Influential Twitter Knowledge
Дългата опашка и електронното обучение
Researching Emerging Technologies and Environments in Support of New Learni...
Competences Mapping for Personal Learning Environment Management
Google Wave Platform: Exploring the Settings for Personalized Learning
Facilitating Active Learning Utilizing the Online Environment of Nfomedia

Recently uploaded (20)

PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPT
Quality review (1)_presentation of this 21
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
annual-report-2024-2025 original latest.
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Database Infoormation System (DBIS).pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Lecture1 pattern recognition............
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Qualitative Qantitative and Mixed Methods.pptx
Foundation of Data Science unit number two notes
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Quality review (1)_presentation of this 21
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Supervised vs unsupervised machine learning algorithms
annual-report-2024-2025 original latest.
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Introduction to Knowledge Engineering Part 1
Database Infoormation System (DBIS).pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Lecture1 pattern recognition............
Introduction-to-Cloud-ComputingFinal.pptx

RELATIONS BETWEEN LEARNING ANALYTICS AND DATA PRIVACY IN MOOCs

  • 1. RELATIONS BETWEEN LEARNING ANALYTICS AND DATA PRIVACY IN MOOCs Malinka Ivanova Technical University of Sofia, Bulgaria Carmen Holotescu „Ioan Slavici” University of Timisoara, Romania Gabriela Grosseck West University of Timisoara, Romania Cătălin Iapă University Politehnica Timisoara, Romania eLearning and Software for Education Conference - eLSE 2016, Workshop on Open Educational Resources and MOOC, Bucharest, 21-22 April 2016
  • 2. AIM To summarize and analyse the current projects and practices, best learning cases and research standpoints in the area of MOOCs data use for learning analytics purposes and at this base a model presenting a possibility to ensure data privacy in MOOCs to be developed
  • 3. OUTLINE Introduction and problem definition Research method Collected data by MOOCs platform Big data and MOOCs analytics CIC data privacy model in MOOCs Conclusions
  • 4. understanding and optimizing learning and the environments in which it occurs Learning Analytics -measurement, -collection, -analysis and -reporting of data about learners and their contexts is defined as for purposes of Siemens, Long, 2011 -LMS -Social media platforms -MOOCs -Online -Blended-learning courses
  • 5. Data about learners Learning support Records of students’ activity in LMSs •logging, posting and commenting messages, accessing materials, posting assignments Results in previous courses Preferences Learning optimization and understanding Prediction and improvement students’ academic performance Helping students „at risk” with prompt feedback Learning Analytics
  • 6. METHOD Review of privacy policy of five MOOCs platforms Review of started projects and published scientific papers Research questions What kind of students’ private data are shared and why? What are the privacy models provided by MOOCs? What data are needed for Learning Analytics modules? Are there any third party actors processing the students’ data and why? Which are the relations between Learning Analytics and Data Privacy? Is it possible the learners to be protected? Development of a model reflecting the data privacy in MOOCs
  • 7. COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES The term: • Personal data is legally recognized and it is defined as “any information relating to an identified or identifiable natural person” (Directive 95/46/EC) • Private data is not used by juridical science, but it is often utilized in other fields of science – for example in computer science • Sensitive data - special category of data in legal documents related to “racial or ethnic origin, political opinions, religious or philosophical beliefs, trade-union membership, and the processing of data concerning health or sex life” (Directive 95/46/EC) • Confidential data - used only between two parties, keeping it in secret; it is not available for public use Personal data Non-personal data Private data Sensitive data Confidential data Continuum of collected data for research and educational purposes
  • 8. Factors/ MOOCs Platforms edX Coursera FutureLearn CanvasNetwork Open2Study Collected information by platforms Personal information about: -sign up; -participation in online courses; -registration for a paid certificate; -at sending email messages; -participation in public forums; -student learning performance; -which, when, where pages are visited; -which hyperlinks and other user interface controls are used. A.Personal information: -at registration; -for user account update; -to purchase products or services; -to complete surveys; -to sign-up for email updates; -to participate in forums; -to send email; -to participate in online courses. B.Non-personal information: - which, when, where pages are visited; -which hyperlinks are visited; -URLs from which Cursera is visited; -log of IP address, OS Personal information: -for access to website; -to use online courses and content; -to register or post notes, assignments or other material; -information when user report a problem; -location, gender and educational history or qualifications; -other information for delivering the best service; -results of assessments; –at purchase a paid service - credit card or other payment details; - a record of email correspondence; -data about usage of content; -IP address. Personal information: -data for login; -messages and stored files; -email content; -survey data; - how user interacts with applications; - encountered errors by users when use platform; - device identifiers; -how often users visit the site, what pages are visited, what other sites are used for coming to the site; -IP address, operating system, browser type, domain name. A.Personal information: -at registration; -educational qualifications; -academic results; -banking and payment details; -tax file number; - responses to surveys; -how, when and why services are used. B. Sensitive information -language background; -citizenship; -disability; -health information. C.Non-personal information - website visit - date, time and duration, search terms, viewed pages; -IP address, type of device, browser and OS. COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
  • 9. Collected financial information By third party payment site -A. By third party payment site; -B. By Coursera: credit card information By FutureLearn: data about credit card or other payment details at purchasing a paid service - By Open2Study that uses EFTPOS and online technologies for payment Factors/ MOOCs Platforms edX Coursera FutureLearn CanvasNetwork Open2Study COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
  • 10. Usage of collected information by platforms -to personalize the course material; -to improve learning process. A.Personal information for: -delivering specific courses and/or services; -sending updates about online courses and events. B.Non-Personal information for: -improvement of services; -other business purposes. - to improve, maintain and protect the quality of web site, online courses and content; -to personalize browsing; -for announcements and email notifications; -for opinion gathering - to create and maintain user account; -to provide better services; - to personalize and improve learner experience; - to send administrative e-mail and announcement; -to send surveys - for user identification and verification; -for communication; - for delivering of educational services; - for improvement of web site services Factors/ MOOCs Platforms edX Coursera FutureLearn CanvasNetwork Open2Study COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
  • 11. Usage of collected information from third parties -by educational institutions; -by other providers of courses in edX - by service providers, vendors and contractors; -by university partners and other business partners; - by government authorities - by FutureLearn partners; -by course and content providers; -other third parties - third party service providers; - for merger, financing, acquisition, bankruptcy, dissolution, transaction, sale purposes -by Australian government - by organizations administrating the business; - by financial institutions; -by service providers and research agencies, mailing houses, postal, freight and courier service providers, printers and distributors of direct marketing material, external business advisers Factors/ MOOCs Platforms edX Coursera FutureLearn CanvasNetwork Open2Study COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
  • 12. Security -software programme for data protection through administrative, physical, and technical safeguards -industry standard: physical, technical and administrative security measures - not sharing information with third parties, except before mentioned -technical and organizational security measures -secure web site - protect against unauthorized access, information use, or disclosure -any posted content using CanvasNetwork services is at user risk -ICT security; -secure office access; -personnel security and training; -workplace policies Factors/ MOOCs Platforms edX Coursera FutureLearn CanvasNetwork Open2Study COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
  • 13. Usage of cookies cookies: for collecting IP address, operating system, and browser information cookies and/or web beacons: -to identify users; -to personalize experience; -to identify repeat visitors; -to determine the type of content and spending time cookies: - to enhance learner experience; -for better understand how learners use the website; - cookies from third party social media websites -cookies and web beacons: for information about user visit and searched/ viewed content - flash cookies: to store user preferences and personalize visits cookies: -to hold anonymous session; - to personalize website visits Factors/ MOOCs Platforms edX Coursera FutureLearn CanvasNetwork Open2Study COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
  • 14. Privacy policy -Own edX privacy policy; -Family Educational Rights and Privacy (FERPA) for education records Own Cursera privacy policy Own FutureLearn privacy policy Own CanvasNetwork privacy policy -Own complying with: the Privacy Act 1988, State and Territory health privacy legislation, the Spam Act 2003, the Do Not Call Register Act 2006 Factors/ MOOCs Platforms edX Coursera FutureLearn CanvasNetwork Open2Study COLLECTED DATA BY MOOCs PLATFORMS AND THIRD PARTIES
  • 15. BIG DATA AND MOOCs ANALYTICS • Report about new technology approaches and challenges for performance of MOOCs analytics (O’Reilly, Veeramachaneni, 2014) • Emerging of web platforms facilitating huge groups to contribute in development of analytics MOOCviz – platform for collaborative MOOC visualizations FeatureFactory – interactive platform for presenting prediction problems LabelMe-Text – platform designed in support of annotating in forum posts
  • 16. • Report of LASyM learning analytics system for MOOCs:  mines “big data”  analyzes learning outcomes and assessment of learners  delivers information that could be used for design of optimized MOOCs • The aim of LASyM is to minimize “at-risk” MOOC students through their identification at the earliest possible stage Tabaa, Medouri, 2013 BIG DATA AND MOOCs ANALYTICS
  • 17. • Ascertainment about the growing interest among researchers and educators to open analytical systems like edX Insights and Tin Can • Their usage could improve effectiveness of course design in different subjects:  different learning scenarios could be developed for studying computer science and art history  improvement of effectiveness of peer ranking and improvement of course assignments Godwin-Jones, 2014 BIG DATA AND MOOCs ANALYTICS
  • 18. • European project Multiple MOOC Aggregator (EMMA) reports development of a MOOC platform that aggregates existing European MOOC courses with possibility for learners to design their own personal learning environment • EMMA platform possesses analytical application for ensuring realization of personal learning paths according to individual learning needs • The learning process is monitored, achievements are controlled and development of a competence like “learning to learn” is improved Brouns, Tammets, Padrón-Nápoles, 2014 BIG DATA AND MOOCs ANALYTICS
  • 19. • European project EDSA (European Data Science Academy) - a data mining process in MOOCs on Coursera is applied with aim the learning process to be analyzed • An experiment with students from Eindhoven University of Technology registered for participation in MOOCs on Coursera is performed  their behavior is analyzed  a learning model of successful and failed students is created EDSA project, http://edsa-project.eu/resources/learning-analytics BIG DATA AND MOOCs ANALYTICS
  • 20. CIC DATA PRIVACY MODEL IN MOOCs • Importance of applying learning analytics to “big data” mining for improvement of MOOCs learning scenarios • A huge massive of information concerning any learner is utilized and personal data are not kept private • The model CIC (Classification- Information-Choice) describes six measures for ensuring data privacy in scenarios of large scale learning Classification. Appropriate classification of collected personal data/ non-personal data in groups and classification of third parties Information. Preparation of informative tools informing learners about type of collected data, mechanisms for collecting and storing, data usage and possibilities for choices Choice. Choices regarding data sharing to MOOCs platform, data giving to third parties, for movement personal and non- personal data from one classification group to other
  • 22. CONCLUSIONS • Intensive work on data privacy in different applicable areas in practical and theoretical aspect, including in MOOCs learning • Anyway, this topic is still in its immature form and it is a need for further development of policies, mechanisms and tools • In the existing documents the privacy of personal data is well discussed, but nothing is mentioned about the privacy of non- personal data • As it is seen a connection between personal and non-personal data exists and this connection should be taken into consideration when a data privacy is forming
  • 23. CONCLUSIONS • The created CIC model reflects on the current situation:  there are privacy policies written for a given MOOCs platform, but they are not complete;  existing learning analytics tools are utilized with minimum care for data privacy;  “big data” is collected and stored in unsafe digital repositories • It reveals the big picture for undertaking measures for improvement of data privacy in MOOCs • It could be used as a recommendation tool guiding developers of MOOCs in realization of more complete data privacy
  • 24. Thank you! • Picture is taken from: http://cdn.ttgtmedia.com/visuals/ComputerWeekly/Hero%20Images/data- privacy-fotolia.jpg