Jaakko Lappalainen
Computer Science department
University of Alcalá, Spain
Overview
•
•
•
•
•
•
•
•

The problem
Proposed approach
The method
Results
Conclusions
Strengths and weaknesses
Future work
Questions
The problem
• Researchers focus on a particular time
frame and scope for testing their
hypotheses.
• But the conclusions of the research are
projected to the future.
• Paradox: the work that predicts things for
tomorrow, becomes a snapshot of what
happened until today.
Proposed approach
• New data relevant to some hypotheses
gets continuously aggregated as time
passes.
• With common semantics, it can be
combined or related to other datasets.
• Represent the hypothesis as programs
that are executed repeatedly.
The method
• The case of study
– Lenten, L. J., & Moosa, I. A. (2003). An
empirical investigation into long-term climate
change in Australia. Environmental Modelling
& Software, 18(1), 59-70.

• The authors claim that the temperature
series has some a trend feature.
The method (II)
• Let’s find some data sources.
– ACORN-SAT, from the Australian Bureau of
Meteorology. This uses LD!!
– NOAA weather data, not in LD but easy to
parse…

• Periodically ingest data (e.g., into a
relational database)
• An R script checks if the trend on the data
has changed…
• Ingested data is semantically tagged…
Results
• We are checking for Lenten & Moosa’s
hypothesis every week.
– More extensive time scope.
– Wider geographical scope, to all data
available for Australia.

• The snapshot becomes a movie.
• Executable paper
Conclusions
• The tools we already have allows us to
use large-scale computation
infrastructures easily to support science.
– The agINFRA project

• Massive data ingestion.
• Data integration and interlinking.
• User-tailored service execution.
Strengths
• Data availability
– The data is ingested (from LD sources, but
not only) and published.

• Data interoperability
– The data is not stored by itself.

• Actionable data
– Ready to be addressed, used and generate
new actionable data.
Weaknesses
• Represent ‘science inquiry’ as a data
model is not trivial.
• CPU-consuming tasks are even more
consuming.
Future work
• Further dataset interlinking
– More plural value for physical parameters.
– Dataset value error detection.

• Advance in hypothesis representation
– Machine readable research processes.
Questions?
Thank you very much!

jkk.lapp@uah.es

More Related Content

PPTX
Tim Menzies, directions in Data Science
PDF
MUMS: Transition & SPUQ Workshop - Uncertainty Quantification in Materials Wo...
PPTX
Welcome to PROMISE 2011
PDF
Celebrating 30 years of ISSRE
PDF
Celebrating 30 years of ISSRE
PPTX
Fault Tree Analysis
PDF
Getting your paper accepted (at ISSTA 2008)
PDF
eric_fitzgerald_resume
Tim Menzies, directions in Data Science
MUMS: Transition & SPUQ Workshop - Uncertainty Quantification in Materials Wo...
Welcome to PROMISE 2011
Celebrating 30 years of ISSRE
Celebrating 30 years of ISSRE
Fault Tree Analysis
Getting your paper accepted (at ISSTA 2008)
eric_fitzgerald_resume

What's hot (20)

PPTX
Roberti esa 2014_workshop_slides
PPTX
The contribution of authors: A study of the relationship between the size and...
PDF
JoshHess_Resume
PDF
Optique presentation
PPT
Replicating FLOSS Research as eResearch
PDF
Towards the impact of design flaws on resources used by an application
PDF
Peter (Yun-shao) Sung's Resume 2016III
PDF
Big Data and Tangibles - TEI 13
KEY
Li101 ecosystem
ODP
Glued Ecology
PPTX
PhD Pre-Thesis
PDF
[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Drew Hanover Nomination Letter
DOCX
Lucas_Taylor_Resume_Gen_Su16
DOCX
Ryan Goode Resume all new(2page)
DOC
Katherine M Smarkel Resume
DOCX
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Multi illuminant estimation with c...
PPTX
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
PDF
Mary Jo Galbraith Resume (updated)
Roberti esa 2014_workshop_slides
The contribution of authors: A study of the relationship between the size and...
JoshHess_Resume
Optique presentation
Replicating FLOSS Research as eResearch
Towards the impact of design flaws on resources used by an application
Peter (Yun-shao) Sung's Resume 2016III
Big Data and Tangibles - TEI 13
Li101 ecosystem
Glued Ecology
PhD Pre-Thesis
[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Drew Hanover Nomination Letter
Lucas_Taylor_Resume_Gen_Su16
Ryan Goode Resume all new(2page)
Katherine M Smarkel Resume
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Multi illuminant estimation with c...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Mary Jo Galbraith Resume (updated)
Ad

Viewers also liked (15)

PDF
Gregas - Chinelos - Infatil
PDF
Chinelos Gregas - Infantil
PPTX
Ag infra pilot_programmatic_access_jkklapp
PDF
Gregas Rasteiras Novas
PDF
Chinelos GREGAS - Feminino Adulto
PPTX
Captûre Business Overview 2013
PDF
Masculinos
PPTX
Collona final presentation
PPTX
Ismail dilber assignment3
PDF
Introducing myself
PDF
Chinelos - Gregas - Infantil
PPTX
Randy peays visual resume
PDF
16f628a
PPTX
Wireless communication and networking
PPT
Balducci carol visual_resumestoryboard
Gregas - Chinelos - Infatil
Chinelos Gregas - Infantil
Ag infra pilot_programmatic_access_jkklapp
Gregas Rasteiras Novas
Chinelos GREGAS - Feminino Adulto
Captûre Business Overview 2013
Masculinos
Collona final presentation
Ismail dilber assignment3
Introducing myself
Chinelos - Gregas - Infantil
Randy peays visual resume
16f628a
Wireless communication and networking
Balducci carol visual_resumestoryboard
Ad

Similar to KREAM@ICCS2013 (20)

PPTX
1. Intro DS.pptx
PPTX
Accelerating Data-driven Discovery in Energy Science
PDF
Using parallel hierarchical clustering to
PPTX
Continuous modeling - automating model building on high-performance e-Infrast...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PPTX
Summary of 3DPAS
PDF
Discovering new functional materials for clean energy and beyond using high-t...
PDF
E research attachment survey
PPTX
Science Engagement: A Non-Technical Approach to the Technical Divide
PDF
Working with Instrument Data (GlobusWorld Tour - UMich)
PDF
Open-source tools for generating and analyzing large materials data sets
PPTX
Data analytics in computer networking
PPTX
Semantic Similarity and Selection of Resources Published According to Linked ...
PPTX
Research.pptx
PDF
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
PDF
Software tools to facilitate materials science research
PPTX
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
PDF
Accelerating New Materials Design with Supercomputing and Machine Learning
PPT
Small Is Beautiful: Summarizing Scientific Workflows Using Semantic Annotat...
PPTX
Learning Systems for Science
1. Intro DS.pptx
Accelerating Data-driven Discovery in Energy Science
Using parallel hierarchical clustering to
Continuous modeling - automating model building on high-performance e-Infrast...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Summary of 3DPAS
Discovering new functional materials for clean energy and beyond using high-t...
E research attachment survey
Science Engagement: A Non-Technical Approach to the Technical Divide
Working with Instrument Data (GlobusWorld Tour - UMich)
Open-source tools for generating and analyzing large materials data sets
Data analytics in computer networking
Semantic Similarity and Selection of Resources Published According to Linked ...
Research.pptx
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
Software tools to facilitate materials science research
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Accelerating New Materials Design with Supercomputing and Machine Learning
Small Is Beautiful: Summarizing Scientific Workflows Using Semantic Annotat...
Learning Systems for Science

Recently uploaded (20)

PPTX
Build Your First AI Agent with UiPath.pptx
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
CloudStack 4.21: First Look Webinar slides
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PPT
What is a Computer? Input Devices /output devices
PDF
Consumable AI The What, Why & How for Small Teams.pdf
PPT
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
PPT
Geologic Time for studying geology for geologist
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PDF
Flame analysis and combustion estimation using large language and vision assi...
PDF
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PPTX
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
DOCX
search engine optimization ppt fir known well about this
PPTX
TEXTILE technology diploma scope and career opportunities
PDF
Five Habits of High-Impact Board Members
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Architecture types and enterprise applications.pdf
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
Build Your First AI Agent with UiPath.pptx
Zenith AI: Advanced Artificial Intelligence
CloudStack 4.21: First Look Webinar slides
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
What is a Computer? Input Devices /output devices
Consumable AI The What, Why & How for Small Teams.pdf
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
Geologic Time for studying geology for geologist
sustainability-14-14877-v2.pddhzftheheeeee
Flame analysis and combustion estimation using large language and vision assi...
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
Improvisation in detection of pomegranate leaf disease using transfer learni...
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
search engine optimization ppt fir known well about this
TEXTILE technology diploma scope and career opportunities
Five Habits of High-Impact Board Members
NewMind AI Weekly Chronicles – August ’25 Week III
Architecture types and enterprise applications.pdf
Taming the Chaos: How to Turn Unstructured Data into Decisions

KREAM@ICCS2013

  • 1. Jaakko Lappalainen Computer Science department University of Alcalá, Spain
  • 2. Overview • • • • • • • • The problem Proposed approach The method Results Conclusions Strengths and weaknesses Future work Questions
  • 3. The problem • Researchers focus on a particular time frame and scope for testing their hypotheses. • But the conclusions of the research are projected to the future. • Paradox: the work that predicts things for tomorrow, becomes a snapshot of what happened until today.
  • 4. Proposed approach • New data relevant to some hypotheses gets continuously aggregated as time passes. • With common semantics, it can be combined or related to other datasets. • Represent the hypothesis as programs that are executed repeatedly.
  • 5. The method • The case of study – Lenten, L. J., & Moosa, I. A. (2003). An empirical investigation into long-term climate change in Australia. Environmental Modelling & Software, 18(1), 59-70. • The authors claim that the temperature series has some a trend feature.
  • 6. The method (II) • Let’s find some data sources. – ACORN-SAT, from the Australian Bureau of Meteorology. This uses LD!! – NOAA weather data, not in LD but easy to parse… • Periodically ingest data (e.g., into a relational database) • An R script checks if the trend on the data has changed… • Ingested data is semantically tagged…
  • 7. Results • We are checking for Lenten & Moosa’s hypothesis every week. – More extensive time scope. – Wider geographical scope, to all data available for Australia. • The snapshot becomes a movie. • Executable paper
  • 8. Conclusions • The tools we already have allows us to use large-scale computation infrastructures easily to support science. – The agINFRA project • Massive data ingestion. • Data integration and interlinking. • User-tailored service execution.
  • 9. Strengths • Data availability – The data is ingested (from LD sources, but not only) and published. • Data interoperability – The data is not stored by itself. • Actionable data – Ready to be addressed, used and generate new actionable data.
  • 10. Weaknesses • Represent ‘science inquiry’ as a data model is not trivial. • CPU-consuming tasks are even more consuming.
  • 11. Future work • Further dataset interlinking – More plural value for physical parameters. – Dataset value error detection. • Advance in hypothesis representation – Machine readable research processes.
  • 13. Thank you very much! jkk.lapp@uah.es