SlideShare a Scribd company logo
ExperTwin: An Alter Ego in
Cyberspace for
Knowledge Workers
C. Toxtli, C. Flores-Saviaga, M. Maurier, A. Ribot, T. Bankole, A. Entrekin, M.
Cantley, S. Singh, S. Reddy, R. Reddy
Problem statement
Knowledge workers (i.e. news writers, researchers) are benefited from having
the right information (i.e. in context), in time (i.e. auto suggestions) and in
place (i.e. in their workspace).
Querying and filtering multi-domain knowledge bases (i.e. Google) are time
consuming tasks. The collected information is usually moved to the workspace
and the friction of switching contexts cause interruptions and add up to
reduced productivity and increased stress (Czerwinski 2000, Iqbal 2007, Mark
2008)
Example
Imagine that you are writing an article about the relation of the United States
government to the North Korea government.
Maybe you need to know:
● What are the last actions from North Korea (query focused in North Korea)
● What is the United States government expecting (query focused in U.S.)
● How previous agreements had evolved (query ordered by time)
Then you collect, organize and cite the found information.
Solution - ExperTwin
In order to empower knowledge workers to be able to get opportune in-context
information in their workspace, we present ExperTwin, a Knowledge Advantage
Machine (KAM) capable to manage personal semantic networks.
Goal
The purpose of this research is to envision how a knowledge worker workspace
can be enhanced by applying Knowledge Advantage Machine frameworks
such as Vijjana (Makineni 2015).
Terminology
Knowledge Advantage (KA): Just as Mechanical Advantage played a key role in
the industrial era, the concept of Knowledge Advantage could be applied to deal
with the information explosion problem, and it is defined as the ratio of time it
takes to accomplish a knowledge based task to amount of time it takes to
search for the relevant knowledge.
Knowledge Advantage Machine (KAM): Any machine (or an app) that increases
the KA may be thought of as a KAM.
Knowledge Unit (KU): referred in this paper as JANs. Knowledge Object that
contains all the metadata of each content.
ExperTwin - Components
● Knowledge Discovery
● Learning Agent
● Visualization
Knowledge Discovery
ExperTwin indexes the knowledge
from web sources, local sources,
web feeds and email.
ExperTwin crawlers constantly
updates the Knowledge Base from
these sources.
Knowledge Discovery - Multiple sources
Learning Agent - Natural Language Processing
Purpose: Keyword extraction will, with a degree of accuracy, tell what the
purpose of many articles are. From aiding in determining relevance to user
preferences.
Keyword
Extraction
1. Text to obtain
keywords from
2. Number of keywords
wanted
3. Title of text if
obtainable
Dictionary of
Keywords with
weights.
Perform NLP with NLTK
and RAKE_NLTK libraries
Learning Agent - Machine Learning
According to the user preference of a content over different contexts, the
classifier give an extra weight to each content.
Preprocessing
1. Run through the database
2. Generate keywords for
every JAN in database
3. Define user defined
keywords
4. Label article as class 1/class
2 based on the results of
step 3
5. Collect master document
Tensorflo
w
Learning Agent - Machine Learning
1. CPU based tensorflow®
2. Learn vocabulary and
term document matrix
with scikit learn
3. relU + sigmoid activation
functions wt 50% dropout
4. Train with 70% of data
5. 87% test accuracy
Training
https://goo.gl/aRXEbp
Tensorflo
w
Learning Agent - Machine Learning
1. Load saved neural
network architecture
2. Query the database for
unclassified JANs
3. Retrieve content &
transform to document
term matrix
4. Make predictions
5. Update database
Testing/Processing
https://goo.gl/9q5azK
1. CPU based tensorflow®
2. Learn vocabulary and
term document matrix
with scikit learn
3. relU + sigmoid activation
functions wt 50% dropout
4. Train with 70% of data
5. 87% test accuracy
TrainingPreprocessing
1. Run through the database
2. Generate keywords for
every JAN in database
3. Define user defined
keywords
4. Label article as class 1/class
2 based on the results of
step 3
5. Collect master document
Learning Agent - GraphDB
The semantic network is stored in a
graph database by linking the
keywords to the JANs and assigning
different weights.
● Each twin has a meta-knowledge base
● Stores its biases and reasoning for relating data
● Self-representing (see image)
● Allows us to rank articles by relevance in real time
● Searchable
Architecture
Visualization
● Work area
● Content suggestions
● Content explorer
○ 2D & 3D visualizations
○ Graph representation
Visualization - Work Area
Visualization - Work area
● Need login (through Google Sign-In with a gmail address)
● Many users can use the interface at the same time
● Users need to set up interest keywords (add/delete)
● Keywords associated with user listed
● Users can pick keywords in dropdown or search to start
browsing
Visualization - Work area
● Context choice: Research / Professional / Study / Social / Others
● Will help in the choice/ranking of the articles
● Drag and Drop: to add files or folder to the database
● Help:
○ To send articles (url) to database through an email inbox@aiwvu.ml
○ To download Chrome Extension to add articles to database
Visualization - Content suggestions
From user search, get ten best ranked articles
● Thumbnail (if any)
● Title of article
● Date of publication
● Article clickable for a preview
Visualization - Content suggestions
Each article listed can be open in preview:
● Title
● Date of publication
● Source
● Full content
● User rating
Visualization - Content explorer
7
8
7.- Switch to Graph view
8.- Graph view
2D & 3D
visualizations
A search -> list of articles
4 types of 3D representations
available:
● Table
● Sphere
● Helix
● Grid
Graph visualizations
Articles and their relationship available in Graph 3D representation
Populated by a user search
Each article = node
Link = keyword shared by nodes
Virtual Reality visualization
By using a VR Headset, ExperTwin let users get immerse into the content.
● This work only focuses in how a Knowledge Advantage Machine
frameworks can be applied to implement an enhanced workspace for
knowledge workers.
● Evaluations should be performed to determine how much this tool can help
information workers to improve their work by being assisted by ExperTwin.
Limitations
Conclusions
We propose ExperTwin a Knowledge Advantage Machine that enhances the
knowledge worker workspace by adding in-context information retrieval
capabilities and information analysis visualizations to improve knowledge based
tasks.
Thanks
Carlos Toxtli-Hernandez
@ctoxtli
carlos.toxtli@mail.wvu.edu
https://github.com/aiteamwvu

More Related Content

PDF
Search: Probabilistic Information Retrieval
PPTX
Probabilistic retrieval model
PPTX
Introduction to Text Mining
PPTX
Query expansion_Team42_IRE2k14
PDF
Deep Recommender Systems - PAPIs.io LATAM 2018
PPTX
Indexing Automated Vs Automatic Galvan1
PPT
4.4 text mining
PPTX
Search: Probabilistic Information Retrieval
Probabilistic retrieval model
Introduction to Text Mining
Query expansion_Team42_IRE2k14
Deep Recommender Systems - PAPIs.io LATAM 2018
Indexing Automated Vs Automatic Galvan1
4.4 text mining

What's hot (12)

PPT
Textmining Introduction
PDF
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
PPT
Natural language search using Neo4j
PPT
Probablistic information retrieval
PDF
Approaches for Keyword Query Routing
PDF
[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...
PDF
Semantic Annotation of Documents
PDF
Paper id 37201536
PDF
Crowdsourced query augmentation through the semantic discovery of domain spec...
PDF
Named Entity Recognition from Online News
PDF
Semantic tagging for documents using 'short text' information
PPTX
Using the search engine as recommendation engine
Textmining Introduction
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Natural language search using Neo4j
Probablistic information retrieval
Approaches for Keyword Query Routing
[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...
Semantic Annotation of Documents
Paper id 37201536
Crowdsourced query augmentation through the semantic discovery of domain spec...
Named Entity Recognition from Online News
Semantic tagging for documents using 'short text' information
Using the search engine as recommendation engine
Ad

Similar to ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers (20)

PPTX
SOFTWARE ENGINEERING PROJECT FOR AI AND APPLICATION
PDF
Information retrieval-systems notes
PDF
Discovering Emerging Tech through Graph Analysis - Henry Hwangbo @ GraphConne...
PPTX
Utilizing the natural langauage toolkit for keyword research
DOC
IEEE 2014 DOTNET CLOUD COMPUTING PROJECTS A scientometric analysis of cloud c...
PDF
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
PDF
Search Solutions 2011: Successful Enterprise Search By Design
PDF
SDSC18 and DSATL Meetup March 2018
PDF
Deprecating the state machine: building conversational AI with the Rasa stack...
PDF
Deprecating the state machine: building conversational AI with the Rasa stack
PPTX
Building a Semantic search Engine in a library
DOCX
Term Paper VirtualizationDue Week 10 and worth 210 pointsThis.docx
PDF
IRJET- Natural Language Query Processing
PPTX
Networking chapter jkl; dfghyubLec 1.pptx
PDF
Text Document Classification System
PDF
Conversational AI with Rasa - PyData Workshop
PDF
Object oriented software engineering concepts
PDF
Information_Retrieval_Models_Nfaoui_El_Habib
PDF
[IJET V2I3P7] Authors: Muthe Sandhya, Shitole Sarika, Sinha Anukriti, Aghav S...
PDF
PoolParty Semantic Classifier
SOFTWARE ENGINEERING PROJECT FOR AI AND APPLICATION
Information retrieval-systems notes
Discovering Emerging Tech through Graph Analysis - Henry Hwangbo @ GraphConne...
Utilizing the natural langauage toolkit for keyword research
IEEE 2014 DOTNET CLOUD COMPUTING PROJECTS A scientometric analysis of cloud c...
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
Search Solutions 2011: Successful Enterprise Search By Design
SDSC18 and DSATL Meetup March 2018
Deprecating the state machine: building conversational AI with the Rasa stack...
Deprecating the state machine: building conversational AI with the Rasa stack
Building a Semantic search Engine in a library
Term Paper VirtualizationDue Week 10 and worth 210 pointsThis.docx
IRJET- Natural Language Query Processing
Networking chapter jkl; dfghyubLec 1.pptx
Text Document Classification System
Conversational AI with Rasa - PyData Workshop
Object oriented software engineering concepts
Information_Retrieval_Models_Nfaoui_El_Habib
[IJET V2I3P7] Authors: Muthe Sandhya, Shitole Sarika, Sinha Anukriti, Aghav S...
PoolParty Semantic Classifier
Ad

More from Carlos Toxtli (20)

PPTX
Reproducibility in artificial intelligence
PPTX
Autom editor video blooper recognition and localization for automatic monolo...
PPTX
Artificial intelligence and open source
PPTX
Bots in robotic process automation
PPTX
How to implement artificial intelligence solutions
PPTX
Multimodal emotion recognition at utterance level with spatio-temporal featur...
PPTX
Changing paradigms in ai prototyping
PPTX
Inteligencia Artificial From Zero to Hero
PDF
Bots for Crowds
PPTX
Enabling Expert Critique with Chatbots and Micro-Guidance - Ci 2018
PPTX
Cómo vivir de la inteligencia artificial
PDF
Education 3.0 - Megatendencias
PPTX
Understanding Political Manipulation and Botnets - RightsCon
PPTX
Understanding Chatbot-Mediated Task Management
PDF
Single sign on spanish - guía completa
PDF
Los empleos del futuro en Latinoamérica
PDF
Empleos que ya están siendo reemplazados por bots y el futuro del RPA (Roboti...
PDF
RPA (Robotic Process Automation)
PDF
Chatbots + rpa (robotic process automation)
PPTX
Estrategias tecnológicas de crecimiento acelerado para startups
Reproducibility in artificial intelligence
Autom editor video blooper recognition and localization for automatic monolo...
Artificial intelligence and open source
Bots in robotic process automation
How to implement artificial intelligence solutions
Multimodal emotion recognition at utterance level with spatio-temporal featur...
Changing paradigms in ai prototyping
Inteligencia Artificial From Zero to Hero
Bots for Crowds
Enabling Expert Critique with Chatbots and Micro-Guidance - Ci 2018
Cómo vivir de la inteligencia artificial
Education 3.0 - Megatendencias
Understanding Political Manipulation and Botnets - RightsCon
Understanding Chatbot-Mediated Task Management
Single sign on spanish - guía completa
Los empleos del futuro en Latinoamérica
Empleos que ya están siendo reemplazados por bots y el futuro del RPA (Roboti...
RPA (Robotic Process Automation)
Chatbots + rpa (robotic process automation)
Estrategias tecnológicas de crecimiento acelerado para startups

Recently uploaded (20)

PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
A Presentation on Touch Screen Technology
PPTX
A Presentation on Artificial Intelligence
PDF
Hybrid model detection and classification of lung cancer
PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Encapsulation theory and applications.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
1 - Historical Antecedents, Social Consideration.pdf
PPTX
cloud_computing_Infrastucture_as_cloud_p
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
Heart disease approach using modified random forest and particle swarm optimi...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Programs and apps: productivity, graphics, security and other tools
OMC Textile Division Presentation 2021.pptx
DP Operators-handbook-extract for the Mautical Institute
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
A comparative analysis of optical character recognition models for extracting...
A Presentation on Touch Screen Technology
A Presentation on Artificial Intelligence
Hybrid model detection and classification of lung cancer
Hindi spoken digit analysis for native and non-native speakers
TLE Review Electricity (Electricity).pptx
Encapsulation theory and applications.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
WOOl fibre morphology and structure.pdf for textiles
1 - Historical Antecedents, Social Consideration.pdf
cloud_computing_Infrastucture_as_cloud_p
Digital-Transformation-Roadmap-for-Companies.pptx
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Enhancing emotion recognition model for a student engagement use case through...

ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers

  • 1. ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers C. Toxtli, C. Flores-Saviaga, M. Maurier, A. Ribot, T. Bankole, A. Entrekin, M. Cantley, S. Singh, S. Reddy, R. Reddy
  • 2. Problem statement Knowledge workers (i.e. news writers, researchers) are benefited from having the right information (i.e. in context), in time (i.e. auto suggestions) and in place (i.e. in their workspace). Querying and filtering multi-domain knowledge bases (i.e. Google) are time consuming tasks. The collected information is usually moved to the workspace and the friction of switching contexts cause interruptions and add up to reduced productivity and increased stress (Czerwinski 2000, Iqbal 2007, Mark 2008)
  • 3. Example Imagine that you are writing an article about the relation of the United States government to the North Korea government. Maybe you need to know: ● What are the last actions from North Korea (query focused in North Korea) ● What is the United States government expecting (query focused in U.S.) ● How previous agreements had evolved (query ordered by time) Then you collect, organize and cite the found information.
  • 4. Solution - ExperTwin In order to empower knowledge workers to be able to get opportune in-context information in their workspace, we present ExperTwin, a Knowledge Advantage Machine (KAM) capable to manage personal semantic networks.
  • 5. Goal The purpose of this research is to envision how a knowledge worker workspace can be enhanced by applying Knowledge Advantage Machine frameworks such as Vijjana (Makineni 2015).
  • 6. Terminology Knowledge Advantage (KA): Just as Mechanical Advantage played a key role in the industrial era, the concept of Knowledge Advantage could be applied to deal with the information explosion problem, and it is defined as the ratio of time it takes to accomplish a knowledge based task to amount of time it takes to search for the relevant knowledge. Knowledge Advantage Machine (KAM): Any machine (or an app) that increases the KA may be thought of as a KAM. Knowledge Unit (KU): referred in this paper as JANs. Knowledge Object that contains all the metadata of each content.
  • 7. ExperTwin - Components ● Knowledge Discovery ● Learning Agent ● Visualization
  • 8. Knowledge Discovery ExperTwin indexes the knowledge from web sources, local sources, web feeds and email. ExperTwin crawlers constantly updates the Knowledge Base from these sources.
  • 9. Knowledge Discovery - Multiple sources
  • 10. Learning Agent - Natural Language Processing Purpose: Keyword extraction will, with a degree of accuracy, tell what the purpose of many articles are. From aiding in determining relevance to user preferences. Keyword Extraction 1. Text to obtain keywords from 2. Number of keywords wanted 3. Title of text if obtainable Dictionary of Keywords with weights. Perform NLP with NLTK and RAKE_NLTK libraries
  • 11. Learning Agent - Machine Learning According to the user preference of a content over different contexts, the classifier give an extra weight to each content. Preprocessing 1. Run through the database 2. Generate keywords for every JAN in database 3. Define user defined keywords 4. Label article as class 1/class 2 based on the results of step 3 5. Collect master document Tensorflo w
  • 12. Learning Agent - Machine Learning 1. CPU based tensorflow® 2. Learn vocabulary and term document matrix with scikit learn 3. relU + sigmoid activation functions wt 50% dropout 4. Train with 70% of data 5. 87% test accuracy Training https://goo.gl/aRXEbp Tensorflo w
  • 13. Learning Agent - Machine Learning 1. Load saved neural network architecture 2. Query the database for unclassified JANs 3. Retrieve content & transform to document term matrix 4. Make predictions 5. Update database Testing/Processing https://goo.gl/9q5azK 1. CPU based tensorflow® 2. Learn vocabulary and term document matrix with scikit learn 3. relU + sigmoid activation functions wt 50% dropout 4. Train with 70% of data 5. 87% test accuracy TrainingPreprocessing 1. Run through the database 2. Generate keywords for every JAN in database 3. Define user defined keywords 4. Label article as class 1/class 2 based on the results of step 3 5. Collect master document
  • 14. Learning Agent - GraphDB The semantic network is stored in a graph database by linking the keywords to the JANs and assigning different weights. ● Each twin has a meta-knowledge base ● Stores its biases and reasoning for relating data ● Self-representing (see image) ● Allows us to rank articles by relevance in real time ● Searchable
  • 16. Visualization ● Work area ● Content suggestions ● Content explorer ○ 2D & 3D visualizations ○ Graph representation
  • 18. Visualization - Work area ● Need login (through Google Sign-In with a gmail address) ● Many users can use the interface at the same time ● Users need to set up interest keywords (add/delete) ● Keywords associated with user listed ● Users can pick keywords in dropdown or search to start browsing
  • 19. Visualization - Work area ● Context choice: Research / Professional / Study / Social / Others ● Will help in the choice/ranking of the articles ● Drag and Drop: to add files or folder to the database ● Help: ○ To send articles (url) to database through an email inbox@aiwvu.ml ○ To download Chrome Extension to add articles to database
  • 20. Visualization - Content suggestions From user search, get ten best ranked articles ● Thumbnail (if any) ● Title of article ● Date of publication ● Article clickable for a preview
  • 21. Visualization - Content suggestions Each article listed can be open in preview: ● Title ● Date of publication ● Source ● Full content ● User rating
  • 22. Visualization - Content explorer 7 8 7.- Switch to Graph view 8.- Graph view
  • 23. 2D & 3D visualizations A search -> list of articles 4 types of 3D representations available: ● Table ● Sphere ● Helix ● Grid
  • 24. Graph visualizations Articles and their relationship available in Graph 3D representation Populated by a user search Each article = node Link = keyword shared by nodes
  • 25. Virtual Reality visualization By using a VR Headset, ExperTwin let users get immerse into the content.
  • 26. ● This work only focuses in how a Knowledge Advantage Machine frameworks can be applied to implement an enhanced workspace for knowledge workers. ● Evaluations should be performed to determine how much this tool can help information workers to improve their work by being assisted by ExperTwin. Limitations
  • 27. Conclusions We propose ExperTwin a Knowledge Advantage Machine that enhances the knowledge worker workspace by adding in-context information retrieval capabilities and information analysis visualizations to improve knowledge based tasks.