Towards Parallel Nonmonotonic Reasoning with Billions of Facts

Authors:
Ilias Tachmazidis, Grigoris Antoniou,
Giorgos Flouris, Spyros Kotoulas
Partially funded by PlanetData

 Huge data set coming from
◦ the Web, government authorities, scientific
databases, sensors and more
 Defeasible logic
◦ is suitable for encoding commonsense knowledge
and reasoning
◦ avoids triviality of inference due to low-quality data
 Defeasible logic has low complexity
◦ The consequences of a defeasible theory D can be
computed in O(N) time, where N is the number of
symbols in D

 Reasoning is performed in the presence of
defeasible rules
 Defeasible logic has been implemented for
in-memory reasoning, however, it is not
applicable for huge data set
 Solution: scalability/parallelization using the
MapReduce framework
 Our approach is restricted to single-
argument reasoning

 A multi-argument implementation has been
accepted in ECAI 2012 (to appear)
 Ilias Tachmazidis, Grigoris Antoniou, Giorgos
Flouris, Spyros Kotoulas, and Lee
McCluskey, ‘Large-scale Parallel Stratified
Defeasible Reasoning’, in ECAI, (2012)

 Facts
◦ e.g. bird(eagle)
 Strict Rules
◦ e.g. bird(X)  animal(X)
 Defeasible Rules
◦ e.g. bird(X)  flies(X)

• Priority Relation (acyclic relation on the set of
rules)
–e.g. r: bird(X)  flies(X)
r’: brokenWing(X)  ¬ flies(X)
r’ > r

 Inspired by similar primitives in LISP and
other functional languages
 Operates exclusively on <key, value> pairs
 Input and Output types of a MapReduce job:
◦ Input : <k1, v1>
◦ Map(k1,v1) → list(k2,v2)
◦ Reduce(k2, list (v2)) → list(k3,v3)
◦ Output : list(k3,v3)

 Provides an infrastructure that takes care of
◦ distribution of data
◦ management of fault tolerance
◦ results collection
 For a specific problem
◦ developer writes a few routines which are following
the general interface

 Rule decomposition
◦ the computation of each rule is assigned to a
computer in the cloud
◦ difficult to achieve balanced work distribution
 Data decomposition
◦ a subset of data is assigned to each computer in
the cloud
◦ provides more fine-grained partitioning
◦ our solution is based on data decomposition

 Rule set:
◦ r1 : bird(X)  animal(X)
◦ r2 : bird(X)  flies(X)
◦ r3 : brokenWing(X)  ¬ flies(X)
◦ r3 > r2
 Consider bird(eagle) and brokenWing(owl) as
facts
 Note that flies(eagle) and ¬ flies(owl) are not
conflicting with each other!
 Reasoning is performed, in isolation, for each
unique argument value

INPUT MAP phase Input
Facts in multiple files <position in file, fact>

File01
-------------------- <0, bird(eagle)>
bird(eagle)
<11, bird(owl)>
bird(owl)
<0, bird(pigeon)>
File02
<12, brokenWing(eagle)>
-------------------
bird(pigeon) <29, brokenWing(owl)>
brokenWing(eagle)
brokenWing(owl)

MAP phase Output Reduce phase Input

Grouping/Sorting
<argument,predicate> <argument, list(predicates)>

<eagle, bird>
<eagle, <bird, brokenWing>>
<owl, bird>
<owl, <bird, brokenWing>>
<pigeon, bird>

<eagle, brokenWing> <pigeon, <bird>>

<owl, brokenWing>

Reduce phase Output
(Final Output)
<Conclusions after reasoning>

animal(eagle)
¬ flies(eagle)

animal(owl)
¬ flies(owl)

animal(pigeon)
flies(pigeon)

Towards Parallel Nonmonotonic Reasoning with Billions of Facts

 This work is the first to explore the feasibility
of nonmonotonic reasoning over huge data
sets
 We considered nonmonotonic reasoning in
the form of defeasible logic and adapted the
MapReduce framework for parallelization
 Our experimental results demonstrate that
◦ defeasible reasoning with billions of data is
performant
◦ our approach has the potential to scale to trillions
of facts.

Towards Parallel Nonmonotonic Reasoning with Billions of Facts

More Related Content

What's hot (20)

Similar to Towards Parallel Nonmonotonic Reasoning with Billions of Facts (20)

More from PlanetData Network of Excellence (20)

Recently uploaded (20)

Towards Parallel Nonmonotonic Reasoning with Billions of Facts