May 2, 2013
Shared Data:
What it Means for the Future of
Libraries
Peter Murray,
LYRASIS Digital Technology Services
Robin Fay,
Head, DBM/Cataloging
University of Georgia Libraries
Using this software
Agenda
• Overview of big data
• What is big data? What is shared data?
• Implications and challenges
• Discussion
How did our data get big?
• Technology that has unforeseen consequences
• Technology changes
• We leave digital trails wherever we go
• Think: internet browsing history, email, medical
records, bank transactions, buying history at
shopping sites, Amazon reviews, Facebook
photos, comments on websites, and much more
How did our data get big?
• “Collectively the data
that we leave behind
is Big Data”
• And of course, there
is the data that others
(people and
machines) create
about us
• Big Data is about us
and has far-reaching
consequences
What is Big Data?
• It is not a technology –
it is a shift in how we
view and use information
• Taking large amounts of
information spread
across many different
resources in different
formats and making them
explorable
• It doesn’t have to be
“that big, just bigger than
what you can go through
by hand”
3 attributes of Big Data
• Large (volume)
• Fast (velocity; manual
processing takes too much time)
• Unstructured (variety;
formats differ)
= the 3 Vs of Big Data
Big Data
• Relational (relationships) database – our ILSs are often
relational databases
• Mathematical database – computations
• Big Data is the intersection of the two
• Health – analyzing health records to identify allergies, sickness, etc.
• Philanthropy (DataKind) – analyzing the behavior of farmers and
knowledge workers to evaluate the impact (ROI) of philanthropic
work
• Think about potential for library use: we have patron data,
bibliographic data and more!
Concerns: Big Data
• Privacy – erodes privacy, potentially leaking private
information
• Justify stereotypes (data can be misused or used in a
negative way) and polarize social groups
• Facebook Open Graph search – pulling together information
from diverse sources to get lists of seemingly innocent
things, such as movie-watching habits or music, can be used in
negative ways to reinforce stereotypes or draw conclusions
about people
• “Personalization can look like prejudice”
• We live in grey areas
• Computers do not understand that
Which side of the fence?
• Big Data is going to change our lives!
• Are you
• A semantic idealist? If we can “taxonomize” and
organize it, we can make sense of it
– Wolfram Alpha – we can ask it and it will reason
(mathematical)
• A chaotic nihilist? Algorithms will handle it – correct
data will bubble up given enough information
– Watson – doesn’t know answers but will analyze to
interpret an answer
So, how would you file a cup of coffee?
• Depends upon how you will use the
information!
• Understandings do not take
advantage of digital information,
which slows semantic idealism –
much information is not organized, so
we have to rely on algorithms (for now),
but that approach is vulnerable.
• Tagging is often done by machines
– even in libraries we batch load,
harvest, update data globally.
Humans and technology
• Our reasoning can be flawed – we make decisions
in evolutionary ways – we look at simple correlations and
patterns (false positives)
• If comments after a post are highly negative,
responders are more likely to take polarizing
viewpoints
• Even when math is good, data can be wrong
Shared data
• We are a mosaic of data from other resources
• Unified digital history – a record of all of our data that could
aggregate health information and share it with doctors – just
one example
• Veracity (can verify) and Value (how we can make sense of
our data)
• Shared data: connecting networks will collect data;
algorithms will tag and assign metadata, but it will be up to
humans to add value – this can then be shared in ways that
are useful
Linked data makes it possible
• Linked data keeps us from having to re-enter or
copy information
It makes data:
• reusable
• easy to correct (correct one record instead of
multiples)
• efficient
• and potentially useful to others
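The "correct one record instead of multiples" point can be illustrated with a minimal Python sketch. The identifiers, field names, and the display helper are hypothetical and not taken from any real catalog system; the only idea shown is that records link to a shared entry by identifier rather than copying its text.

```python
# Hypothetical identifiers and records, for illustration only.
authors = {
    "author:1": {"name": "Twain, Mark"},
}

# Each bibliographic record links to the author by identifier
# instead of copying the name string into the record.
records = [
    {"title": "Adventures of Huckleberry Finn", "author": "author:1"},
    {"title": "The Adventures of Tom Sawyer", "author": "author:1"},
]


def display(record):
    """Resolve the author link at display time."""
    return f'{record["title"]} / {authors[record["author"]]["name"]}'


# Correct the shared author entry once; every linked record reflects it.
authors["author:1"]["name"] = "Twain, Mark, 1835-1910"
lines = [display(r) for r in records]
```

Had each record stored its own copy of the name, the same correction would have needed one edit per record.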
Linked data makes it possible
• It can build relationships in different ways -
allowing us to create temporary collections (a user
could organize their search results in a way that
makes sense to them) or more permanent
(collocating ALL works by a particular author more
easily; pulling together photographs more easily)
• It can help make sense of Big Data and facilitate
sharing data
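As a rough sketch of how links can build relationships in different ways, the Python below stores statements as (subject, predicate, object) triples, loosely in the spirit of linked data. All identifiers and predicate names are made up for illustration, and a real system would more likely use an RDF library and shared vocabularies.

```python
from collections import defaultdict

# Hypothetical (subject, predicate, object) statements.
triples = [
    ("work:1", "creator", "person:austen"),
    ("work:2", "creator", "person:austen"),
    ("work:3", "creator", "person:dickens"),
    ("photo:1", "depicts", "person:austen"),
]


def collocate(statements, predicate):
    """Group subjects by the object they share for one predicate."""
    groups = defaultdict(list)
    for subject, pred, obj in statements:
        if pred == predicate:
            groups[obj].append(subject)
    return dict(groups)


# A more permanent grouping: all works by a particular author.
works_by_creator = collocate(triples, "creator")
austen_works = works_by_creator["person:austen"]

# A temporary collection: everything linked to one person,
# whatever the relationship (works and photographs together).
related = [s for s, p, o in triples if o == "person:austen"]
```

The same statements serve both groupings; nothing is copied or re-entered to produce the second view.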
Thinking of data in the library environment
• Automation and new technologies
• The web has changed
• Large scale bibliographic databases
• User expectations and needs
• Patron data
• Cooperative cataloging
• Greater variety of media in library collections (electronic!)
• FRBR is our data model – semantic web friendly!
Discussion points
• Obviously, WorldCat is a shared data resource we
have all been using for years. What are some other
examples of big data, shared data, or linked data
that libraries use now?
• What are some examples of data that libraries
could share that we aren't sharing already?
• What are some of the pitfalls of data sharing on a
massive scale?
Thank you!
• Our speakers
• You!
• Questions?
• russell.palmer@lyrasis.org
