Hands-on-Workshop
Big (Twitter) Data
Damian Trilling
d.c.trilling@uva.nl
@damian0604
www.damiantrilling.net
Department of Communication Science
University of Amsterdam

30 January 2014
13.15
#bigdata

In this session (3/4):

What we’ll do
1. A bunch of exercises
2. If you want to, the opportunity to develop your own script

Björn and I will help you.

Analyzing social media with Python and other tools (3/4)
I’ll now show you some example scripts that you can use for the
exercises and as inspiration for your own project. You can find everything
you need at http://beehub.nl/bigdata-cw/workshop,
or in the future at https://github.com/uvacw/py-examples.

RE exercise 1: Automated coding

See example from this morning

RE exercise 2: Frequencies

netvizz ⇒ engeltjes.tab ⇒ engeltjes.py ⇒ screen output +
engeltjes_count.csv

Something new: the nltk package and the removal of stopwords
www.nltk.org
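
A minimal sketch of what a frequency script along these lines might look like. It is not the actual engeltjes.py: the sample rows, the column index, and the tiny stopword set are made up for illustration. With nltk installed and its stopwords corpus downloaded, nltk.corpus.stopwords.words("dutch") could replace the hand-written set.

```python
import csv
import io
import re
from collections import Counter

# Hypothetical stand-in for a real stopword list (see the note above).
STOPWORDS = {"de", "het", "een", "en", "van", "is", "ik", "dat", "op"}


def count_words(rows, text_column, stopwords=STOPWORDS):
    """Count lowercase word frequencies in one column, skipping stopwords."""
    counts = Counter()
    for row in rows:
        if len(row) <= text_column:
            continue  # skip short rows instead of raising an IndexError
        words = re.findall(r"\w+", row[text_column].lower())
        counts.update(w for w in words if w not in stopwords)
    return counts


# Tiny made-up sample in place of the real tab-separated export.
sample = io.StringIO(
    "1\tDe engeltjes zijn lief\n"
    "2\tIk vind de engeltjes mooi\n"
)
rows = list(csv.reader(sample, delimiter="\t"))
counts = count_words(rows, text_column=1)

# Screen output, most frequent words first.
for word, n in counts.most_common(3):
    print(word, n)

# CSV output, written to a string buffer here rather than to a file on disk.
out = io.StringIO()
csv.writer(out).writerows(counts.most_common())
```

To write a real file instead, the buffer can be swapped for open("engeltjes_count.csv", "w", newline="").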

RE exercise 3: Sentiment analysis

The pattern module
pattern.nl | en | es | de | fr | it | nl
http://www.clips.ua.ac.be/pages/pattern
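
To show the general idea behind lexicon-based sentiment scoring, here is a toy sketch that does not use pattern itself. The mini-lexicon and the example texts are invented for illustration. With pattern installed, its sentiment() function (for example from pattern.en or pattern.nl) returns a (polarity, subjectivity) pair and would take the place of the polarity() helper below.

```python
import re

# Hypothetical mini-lexicon: word -> polarity between -1 and 1.
# Real lexicons, such as the ones shipped with pattern, are far larger.
LEXICON = {"good": 0.7, "great": 0.8, "bad": -0.7, "terrible": -1.0}


def polarity(text, lexicon=LEXICON):
    """Average polarity of the lexicon words in text; 0.0 if none occur."""
    words = re.findall(r"\w+", text.lower())
    scores = [lexicon[w] for w in words if w in lexicon]
    return sum(scores) / len(scores) if scores else 0.0


# Made-up example texts.
tweets = [
    "What a great workshop",
    "Terrible weather, bad coffee",
    "It is Thursday",
]
for tweet in tweets:
    print(round(polarity(tweet), 2), tweet)
```

Averaging over matched words only is a deliberate simplification: it ignores negation ("not good") and intensifiers ("very good"), which a full sentiment tool would need to handle.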

RE exercise 4: Your own ideas

1. Have a look at the examples on BeeHub or GitHub.
2. Ask Google.
3. Ask us for advice.

Before you start

Common errors
indentation error: Pay attention to TABs and SPACEs.
error in line YYY: Have a close look at line YYY in your editor.
index out of range: Maybe you are trying to read column 5 from a table with 4 columns?
Try your script on a small dataset first!
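
The last two points can be sketched as follows. The table contents and the get_column helper are made up for illustration; the sketch only shows how an index out of range error arises and how a test run can be limited to the first few rows.

```python
import csv
import io
from itertools import islice

# Made-up tab-separated table: 4 columns, so valid indices are 0 to 3.
table = io.StringIO(
    "id\tuser\tdate\ttext\n"
    "1\tanna\t2014-01-30\thello\n"
    "2\tbob\t2014-01-30\n"  # a short row with only 3 columns
    "3\tcarl\t2014-01-30\tbye\n"
)


def get_column(row, index, default=""):
    """Return row[index], or default if the row has too few columns."""
    return row[index] if index < len(row) else default


# Try the script on a small dataset first: read only the first 3 rows.
rows = list(islice(csv.reader(table, delimiter="\t"), 3))

for row in rows:
    try:
        print(row[4])  # "column 5" does not exist in a 4-column table
    except IndexError:
        print("index out of range, falling back to:", repr(get_column(row, 4)))
```

Remember that Python counts from 0, so the fifth column is row[4].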

Questions or comments?

Damian Trilling
d.c.trilling@uva.nl
@damian0604
www.damiantrilling.net


