SlideShare a Scribd company logo
A Data-Driven Approach
to Measure Web Site
Navigability
Speaker:Scott
Date:6/13/14 (Fri)
Xiao Fang
Paul Jen-Hwa Hu
Michael Chau
Han-fen Hu
Zhuo Yang
Olivia R. Liu Sheng
Journal of Management Information Systems
Introduction
• A well-designed website is beneficial to visitors.
• Navigation and search
• Structure of hyperlinks
• Definition of website navigability
• Aside from perceptual measurements, a data-driven approach is also
can be utilized to evaluate navigability of websites
• Limitations of navigability processed by other scholars in the past
• The objectives of the paper
• Three metrics : power, efficiency, directness
Literature Review
• Website navigation and navigability
 Critical influence of navigation
 Navigation systems, important means
 Nuance between navigation and navigability
• Measuring navigability with web data
 Broad classification
 Web content mining
 Web structure mining
 Web usage mining
Theoretical Foundations
• Information foraging theory
 It extends the optimal foraging theory
 Very likely to modify browsing strategies
• Information-processing theory
 People process information via many aspects
• Visitors make judgments about their traversing paths
• What they care doesn’t merely contain the likelihood of locating target
information.
Method and Metrics for
Measuring Navigability
A Web Mining–Based
Method for Measuring
Navigability
Steps
1. Web log preprocessing:Cleaning, session identification, session
completion.
2. Web site parsing:Parsing focal sites
3. Web page classification:Content pages and index pages
4. Access pattern mining:Frequently accessed sequences of content
pages as proxies for information-seeking targets
5. Hyperlink Structure representation:A distance matrix
Method and Metrics for
Measuring Navigability
Data-Driven Metrics for
Measuring Navigability
𝑝 𝑘 =
𝛽
2𝜋𝑘3 exp
−𝛽 𝑘−𝛼 2
2𝛼2 𝑘
𝑘 = 1,2, … ,
𝐺 𝐼 = ∀𝑘≥𝐼 𝑝(𝑘) 𝐼 = 1,2, …
𝐺 𝐼 =
1 if 𝐼 = 1
G 𝐼 − 1 − 𝑝 𝐼 − 1 if𝐼 > 1
Method and Metrics for
Measuring Navigability
Data-Driven Metrics for
Measuring Navigability
Power
• 𝑈 = 𝑢𝑖 , 𝑖 = 1,2, … , 𝑛
• 𝑢𝑖 =< 𝑝𝑖,1, 𝑝𝑖,2, … , 𝑝𝑖,𝑚 >, where 𝑝𝑖,𝑗 is the jth content page in 𝑢𝑖, 𝑗 =
1,2, … , 𝑚
• 𝑅 𝑢𝑖|𝑝𝑠 = 𝐺 𝑑 𝑝𝑠, 𝑝𝑖,1 𝑗=2
𝑚
𝐺 𝑑 𝑝𝑖,𝑗−1, 𝑝𝑖,𝑗 if 𝑝𝑠 ≠ 𝑝𝑖,1,
otherwise 𝑅 𝑢𝑖|𝑝𝑠 = 𝑗=2
𝑚
𝐺 𝑑 𝑝𝑖,𝑗−1, 𝑝𝑖,𝑗
Method and Metrics for
Measuring Navigability
Data-Driven Metrics for
Measuring Navigability
Power
• 𝑅 𝑢𝑖 = ∀𝑝 𝑆
𝑃(start of seeking for 𝑢𝑖 = 𝑝𝑠)𝑅 𝑢𝑖|𝑝𝑠
• Introducing weight
 𝑤 𝑢𝑖 =
𝑣(𝑢 𝑖)
∀𝑢∈𝑈 𝑣(𝑢)
 𝑅 𝑈 = 𝑖=1
𝑛
𝑤 𝑢𝑖 𝑅(𝑢𝑖)
Method and Metrics for
Measuring Navigability
Data-Driven Metrics for
Measuring Navigability
Efficiency
• 𝑄 𝑢𝑖|𝑝𝑠 =
𝑚𝛾−min(𝑑 𝑝 𝑠, 𝑝 𝑖,1 + 𝑗=2
𝑚
𝑑 𝑝 𝑖,𝑗−1, 𝑝 𝑖,𝑗 , 𝑚𝛾)
𝑚(𝛾−1)
• 𝑄 𝑢𝑖|𝑝𝑠 =
(𝑚−1)𝛾−min( 𝑗=2
𝑚
𝑑 𝑝 𝑖,𝑗−1, 𝑝 𝑖,𝑗 , (𝑚−1)𝛾)
𝑚(𝛾−1)
, if 𝑝𝑠 = 𝑝𝑖,1
• 𝑄 𝑢𝑖 = ∀𝑝 𝑠
𝑃(start of seeking for 𝑢𝑖 = 𝑝𝑠) 𝑄 𝑢𝑖|𝑝𝑠
• 𝑄 𝑈 = 𝑖=1
𝑛
𝑤 𝑢𝑖 𝑄(𝑢𝑖)
Method and Metrics for
Measuring Navigability
Data-Driven Metrics for
Measuring Navigability
Directness
• 𝐿 𝑢𝑖|𝑝𝑠 =
𝑚𝛿−min(𝑁 𝑝 𝑠,𝑝 𝑖,1 + 𝑗=2
𝑚
𝑑 𝑝 𝑖,𝑗−1, 𝑝 𝑖,𝑗 , 𝑚𝛿)
𝑚(𝛿−1)
if 𝑝𝑠 ≠ 𝑝𝑖,1
• 𝐿 𝑢𝑖|𝑝𝑠 =
(𝑚−1)𝛿−min( 𝑗=2
𝑚
𝑑 𝑝 𝑖,𝑗−1, 𝑝 𝑖,𝑗 , (𝑚−1)𝛿)
(𝑚−1)(𝛿−1)
if 𝑝𝑠 = 𝑝𝑖,1
• 𝐿 𝑢𝑖 = ∀𝑝 𝑠
𝑃(start of seeking for 𝑢𝑖 = 𝑝𝑠) 𝐿 𝑢𝑖|𝑝𝑠
• 𝐿 𝑈 = 𝑖=1
𝑛
𝑤 𝑢𝑖 𝐿(𝑢𝑖)
Implementation and Illustrations
• An archetype system was established.
• SpidersRUs was used to parse a website.
• Two sites
 A
 3840 content pages
 437 index pages
• Web logs were gleaned over four weeks.
 A:35,966,494 records; 732,321 sessions
 B:32,170,062 records; 555,299 sessions
 B
 3738 content pages
 380 index pages
Implementation and Illustrations
•
• The threshold was at first set at 0.05%, then its value was increased
with 0.025% in the range from 0.05% to 0.175%.
•
Implementation and Illustrations
• The distances of power and efficiency on B is great on A.
• The directness distances between A and B are smaller than that of
power and efficiency.
• According to the proposed metrics, A has higher navigability than B
• The assessment of the proposed metrics and the prevalent metrics
Evaluation Study and Data Collection
Study design
• A group of people were recruited.
• The significance of users’ familiarity was addressed.
• Four experimental conditions were created
Tasks
• A pretest was conducted.
 Content pages are more likely to constitute information-seeking
targets.
 Key access sequences identified from Web logs are consistent with
users’ common information-seeking needs, desires, and interests.
Evaluation Study and Data Collection
Participants
• Business undergraduate students enrolled in similar information
systems or operations classes in both universities.
• Each participant received $10 for his or her time and efforts.
Measurements
• Three measures: task success rate, task time, and the number of clicks.
• Participants had up to 4 minutes to complete each task.
• Cognitive-processing load
Data collection
• A quite formal way
Data Analyses and Results
• A pilot study, 39 undergraduate students
• An evaluation study with 248 participants
• Comparison of user performance and assessments between A and B
• Comparison of user performance by separating tasks related to
complexity
• Performance of the participants from each university
• An ex post facto comparison
• Further examination of the proposed metrics
Extensions to Proposed Metrics
• A scale factor can be added while evaluating a larger website.
• The metrics can be extended with the combined use of search engine.
• Integration of three metrics as a holistic measure
3
1
𝑅(𝑈)
+
1
Q(𝑈)
+
1
𝐿(𝑈)
=
3𝑅 𝑈 Q 𝑈 𝐿 𝑈
Q 𝑈 𝐿 𝑈 + 𝑅 𝑈 𝐿 𝑈 + 𝑅 𝑈 Q 𝑈
= 𝑂(𝑈)
Discussion
• Three data-driven metrics and a viable method were presented.
• The method can be used continuously for supervising a website’s
navigability
• A method by Liu et al. is suggested for gleaning data (Web log).
• It helps improve hyperlink structure designs of websites
• Limitations
• Different structures of websites may not fit to the results
• Spiders and page parsers‘ utilities are limited.
• Test of different scenarios
• More factors can be introduced to perfect the method
Conclusion
• Three data-driven metrics were presented.
• By integrating appropriate Web mining techniques, a method
cooperated the metrics was created.
• The verification of the metrics and method.
• Users’ perception corresponds to navigability measured using the
methods established by the authors
Comment
• The article clearly and laconically expresses the idea and concept with
the existing theories.
• Vivid examples following many statements which we as post-graduate
students can look upon.
• A host of demonstrations below on many pages provide necessary
assistance for lay people
• I think navigability won’t be only one factor that may affect a website
access ratio.
The End

More Related Content

PDF
New indicators for measuring research impact
PDF
Analysis as KM
PDF
Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
PPTX
Harper Analytics Beyond Usage Numbers
PDF
Improving Research Visibility Part 7: Measuring Research Impact
PPTX
Effectively Applying Usage Statistics in E-Resource Collection Development
PDF
LITERATURE REVIEWING WITH RESEARCH TOOLS, Part 2: Finding proper articles
PPT
We've Got the Data - Now What Do We Do About It? Applying Quality Standard to...
New indicators for measuring research impact
Analysis as KM
Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
Harper Analytics Beyond Usage Numbers
Improving Research Visibility Part 7: Measuring Research Impact
Effectively Applying Usage Statistics in E-Resource Collection Development
LITERATURE REVIEWING WITH RESEARCH TOOLS, Part 2: Finding proper articles
We've Got the Data - Now What Do We Do About It? Applying Quality Standard to...

What's hot (20)

PPTX
Semantometrics: Towards Fulltext-based Research Evaluation
PDF
A practical guide to do primary research on meta analysis methodology - Pubrica
PDF
Enhancing Information Retrieval by Personalization Techniques
PPT
Cochrane for Librarians: An update on searching and specialised registers
PPT
Proposing a Scientific Paper Retrieval and Recommender Framework
PPT
More Than Just Black and White: A Case for Grey Literature References in Scie...
PDF
Personalized Information Retrieval system using Computational Intelligence Te...
PPTX
Comparison of Techniques for Measuring Research Coverage of Scientific Papers...
PDF
GKumarAICS
PPTX
Quantitative Process
PPTX
Methods for making the best use of admin data
PPTX
Promoting Open Access and Open Educational Resources to Faculty
PDF
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
PPTX
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
PPT
What papers should I cite from my reading list? User evaluation of a manuscri...
PPTX
Added Value of Conceptual Modeling in Geosciences
PDF
Galambos_SlidesNEAIR2015
PDF
Recommending Scientific Papers: Investigating the User Curriculum
PPTX
Puce cutting eeg_paris2018
PPTX
Beyond Journal Impact and Usage Statistics: Using Citation Analysis for Colle...
Semantometrics: Towards Fulltext-based Research Evaluation
A practical guide to do primary research on meta analysis methodology - Pubrica
Enhancing Information Retrieval by Personalization Techniques
Cochrane for Librarians: An update on searching and specialised registers
Proposing a Scientific Paper Retrieval and Recommender Framework
More Than Just Black and White: A Case for Grey Literature References in Scie...
Personalized Information Retrieval system using Computational Intelligence Te...
Comparison of Techniques for Measuring Research Coverage of Scientific Papers...
GKumarAICS
Quantitative Process
Methods for making the best use of admin data
Promoting Open Access and Open Educational Resources to Faculty
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
What papers should I cite from my reading list? User evaluation of a manuscri...
Added Value of Conceptual Modeling in Geosciences
Galambos_SlidesNEAIR2015
Recommending Scientific Papers: Investigating the User Curriculum
Puce cutting eeg_paris2018
Beyond Journal Impact and Usage Statistics: Using Citation Analysis for Colle...
Ad

Similar to A data driven approach to measure web site navigability (20)

DOCX
Facilitating effective user navigation through
PDF
Data Mining to Facilitate Effective User Navigation and Improve Structure of ...
PPT
BAQMaR - Conference DM
PDF
Iaetsd effective user navigability through website structure
PPTX
Useful interactions
PDF
Predicting the user navigation pattern from web logs using weighted support ...
PPSX
Efficient approach for website design ppt
DOCX
IMPROVING WEB NAVIGATION USABILITY BY COMPARING ACTUAL AND ANTICIPATED USAGE
PPTX
Data: Digging Deeper and Displaying
PDF
Data Informed Design - Good Tech Test - May 2018
KEY
072012 tc day03
PDF
Farthest first clustering in links reorganization
PPT
Web Analytics and Usability Testing
PPTX
Safety in numbers: A framework for benchmarking the user experience
PDF
GDD Webinar InboundLabs & Hubspot
PDF
Growth Driven Design (GDD) - Webinar by InboundLabs & HubSpot
PDF
Ed25793795
PDF
Research Operations at Scale (Christian Rohrer at DesignOps Summit 2017)
PDF
Marketing and Advertising: web design, web information architecture, usabilit...
PPTX
user behaviour analysis
Facilitating effective user navigation through
Data Mining to Facilitate Effective User Navigation and Improve Structure of ...
BAQMaR - Conference DM
Iaetsd effective user navigability through website structure
Useful interactions
Predicting the user navigation pattern from web logs using weighted support ...
Efficient approach for website design ppt
IMPROVING WEB NAVIGATION USABILITY BY COMPARING ACTUAL AND ANTICIPATED USAGE
Data: Digging Deeper and Displaying
Data Informed Design - Good Tech Test - May 2018
072012 tc day03
Farthest first clustering in links reorganization
Web Analytics and Usability Testing
Safety in numbers: A framework for benchmarking the user experience
GDD Webinar InboundLabs & Hubspot
Growth Driven Design (GDD) - Webinar by InboundLabs & HubSpot
Ed25793795
Research Operations at Scale (Christian Rohrer at DesignOps Summit 2017)
Marketing and Advertising: web design, web information architecture, usabilit...
user behaviour analysis
Ad

More from Shu-Jeng Hsieh (12)

PDF
Big Data Tools in AWS
PDF
Serverless Data Platform
PDF
A New Chapter of Data Processing with CDK
PPTX
The Exabyte Journey and DataBrew with CICD
PPTX
Serverless ETL and Optimization on ML pipeline
PPTX
How cdk and projen benefit to A team
PPTX
Optimization technique for SAS
PPTX
PPTX
Demo of our thoughts
PPTX
Trial plan with capitation payment of the national healthcare insurance in ta...
PPTX
Prediction for consuming behavior of noncontractual customers
PPTX
Scheduling advertisements on a web page to maximize revenue
Big Data Tools in AWS
Serverless Data Platform
A New Chapter of Data Processing with CDK
The Exabyte Journey and DataBrew with CICD
Serverless ETL and Optimization on ML pipeline
How cdk and projen benefit to A team
Optimization technique for SAS
Demo of our thoughts
Trial plan with capitation payment of the national healthcare insurance in ta...
Prediction for consuming behavior of noncontractual customers
Scheduling advertisements on a web page to maximize revenue

Recently uploaded (20)

PPTX
modul_python (1).pptx for professional and student
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPT
Quality review (1)_presentation of this 21
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
Lecture1 pattern recognition............
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Mega Projects Data Mega Projects Data
PDF
Transcultural that can help you someday.
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
modul_python (1).pptx for professional and student
STERILIZATION AND DISINFECTION-1.ppthhhbx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Quality review (1)_presentation of this 21
Data_Analytics_and_PowerBI_Presentation.pptx
Supervised vs unsupervised machine learning algorithms
Lecture1 pattern recognition............
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction to Knowledge Engineering Part 1
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Mega Projects Data Mega Projects Data
Transcultural that can help you someday.
Acceptance and paychological effects of mandatory extra coach I classes.pptx
climate analysis of Dhaka ,Banglades.pptx
IB Computer Science - Internal Assessment.pptx
Clinical guidelines as a resource for EBP(1).pdf

A data driven approach to measure web site navigability

  • 1. A Data-Driven Approach to Measure Web Site Navigability Speaker:Scott Date:6/13/14 (Fri) Xiao Fang Paul Jen-Hwa Hu Michael Chau Han-fen Hu Zhuo Yang Olivia R. Liu Sheng Journal of Management Information Systems
  • 2. Introduction • A well-designed website is beneficial to visitors. • Navigation and search • Structure of hyperlinks • Definition of website navigability • Aside from perceptual measurements, a data-driven approach is also can be utilized to evaluate navigability of websites • Limitations of navigability processed by other scholars in the past • The objectives of the paper • Three metrics : power, efficiency, directness
  • 3. Literature Review • Website navigation and navigability  Critical influence of navigation  Navigation systems, important means  Nuance between navigation and navigability • Measuring navigability with web data  Broad classification  Web content mining  Web structure mining  Web usage mining
  • 4. Theoretical Foundations • Information foraging theory  It extends the optimal foraging theory  Very likely to modify browsing strategies • Information-processing theory  People process information via many aspects • Visitors make judgments about their traversing paths • What they care doesn’t merely contain the likelihood of locating target information.
  • 5. Method and Metrics for Measuring Navigability A Web Mining–Based Method for Measuring Navigability Steps 1. Web log preprocessing:Cleaning, session identification, session completion. 2. Web site parsing:Parsing focal sites 3. Web page classification:Content pages and index pages 4. Access pattern mining:Frequently accessed sequences of content pages as proxies for information-seeking targets 5. Hyperlink Structure representation:A distance matrix
  • 6. Method and Metrics for Measuring Navigability Data-Driven Metrics for Measuring Navigability 𝑝 𝑘 = 𝛽 2𝜋𝑘3 exp −𝛽 𝑘−𝛼 2 2𝛼2 𝑘 𝑘 = 1,2, … , 𝐺 𝐼 = ∀𝑘≥𝐼 𝑝(𝑘) 𝐼 = 1,2, … 𝐺 𝐼 = 1 if 𝐼 = 1 G 𝐼 − 1 − 𝑝 𝐼 − 1 if𝐼 > 1
  • 7. Method and Metrics for Measuring Navigability Data-Driven Metrics for Measuring Navigability Power • 𝑈 = 𝑢𝑖 , 𝑖 = 1,2, … , 𝑛 • 𝑢𝑖 =< 𝑝𝑖,1, 𝑝𝑖,2, … , 𝑝𝑖,𝑚 >, where 𝑝𝑖,𝑗 is the jth content page in 𝑢𝑖, 𝑗 = 1,2, … , 𝑚 • 𝑅 𝑢𝑖|𝑝𝑠 = 𝐺 𝑑 𝑝𝑠, 𝑝𝑖,1 𝑗=2 𝑚 𝐺 𝑑 𝑝𝑖,𝑗−1, 𝑝𝑖,𝑗 if 𝑝𝑠 ≠ 𝑝𝑖,1, otherwise 𝑅 𝑢𝑖|𝑝𝑠 = 𝑗=2 𝑚 𝐺 𝑑 𝑝𝑖,𝑗−1, 𝑝𝑖,𝑗
  • 8. Method and Metrics for Measuring Navigability Data-Driven Metrics for Measuring Navigability Power • 𝑅 𝑢𝑖 = ∀𝑝 𝑆 𝑃(start of seeking for 𝑢𝑖 = 𝑝𝑠)𝑅 𝑢𝑖|𝑝𝑠 • Introducing weight  𝑤 𝑢𝑖 = 𝑣(𝑢 𝑖) ∀𝑢∈𝑈 𝑣(𝑢)  𝑅 𝑈 = 𝑖=1 𝑛 𝑤 𝑢𝑖 𝑅(𝑢𝑖)
  • 9. Method and Metrics for Measuring Navigability Data-Driven Metrics for Measuring Navigability Efficiency • 𝑄 𝑢𝑖|𝑝𝑠 = 𝑚𝛾−min(𝑑 𝑝 𝑠, 𝑝 𝑖,1 + 𝑗=2 𝑚 𝑑 𝑝 𝑖,𝑗−1, 𝑝 𝑖,𝑗 , 𝑚𝛾) 𝑚(𝛾−1) • 𝑄 𝑢𝑖|𝑝𝑠 = (𝑚−1)𝛾−min( 𝑗=2 𝑚 𝑑 𝑝 𝑖,𝑗−1, 𝑝 𝑖,𝑗 , (𝑚−1)𝛾) 𝑚(𝛾−1) , if 𝑝𝑠 = 𝑝𝑖,1 • 𝑄 𝑢𝑖 = ∀𝑝 𝑠 𝑃(start of seeking for 𝑢𝑖 = 𝑝𝑠) 𝑄 𝑢𝑖|𝑝𝑠 • 𝑄 𝑈 = 𝑖=1 𝑛 𝑤 𝑢𝑖 𝑄(𝑢𝑖)
  • 10. Method and Metrics for Measuring Navigability Data-Driven Metrics for Measuring Navigability Directness • 𝐿 𝑢𝑖|𝑝𝑠 = 𝑚𝛿−min(𝑁 𝑝 𝑠,𝑝 𝑖,1 + 𝑗=2 𝑚 𝑑 𝑝 𝑖,𝑗−1, 𝑝 𝑖,𝑗 , 𝑚𝛿) 𝑚(𝛿−1) if 𝑝𝑠 ≠ 𝑝𝑖,1 • 𝐿 𝑢𝑖|𝑝𝑠 = (𝑚−1)𝛿−min( 𝑗=2 𝑚 𝑑 𝑝 𝑖,𝑗−1, 𝑝 𝑖,𝑗 , (𝑚−1)𝛿) (𝑚−1)(𝛿−1) if 𝑝𝑠 = 𝑝𝑖,1 • 𝐿 𝑢𝑖 = ∀𝑝 𝑠 𝑃(start of seeking for 𝑢𝑖 = 𝑝𝑠) 𝐿 𝑢𝑖|𝑝𝑠 • 𝐿 𝑈 = 𝑖=1 𝑛 𝑤 𝑢𝑖 𝐿(𝑢𝑖)
  • 11. Implementation and Illustrations • An archetype system was established. • SpidersRUs was used to parse a website. • Two sites  A  3840 content pages  437 index pages • Web logs were gleaned over four weeks.  A:35,966,494 records; 732,321 sessions  B:32,170,062 records; 555,299 sessions  B  3738 content pages  380 index pages
  • 12. Implementation and Illustrations • • The threshold was at first set at 0.05%, then its value was increased with 0.025% in the range from 0.05% to 0.175%. •
  • 13. Implementation and Illustrations • The distances of power and efficiency on B is great on A. • The directness distances between A and B are smaller than that of power and efficiency. • According to the proposed metrics, A has higher navigability than B • The assessment of the proposed metrics and the prevalent metrics
  • 14. Evaluation Study and Data Collection Study design • A group of people were recruited. • The significance of users’ familiarity was addressed. • Four experimental conditions were created Tasks • A pretest was conducted.  Content pages are more likely to constitute information-seeking targets.  Key access sequences identified from Web logs are consistent with users’ common information-seeking needs, desires, and interests.
  • 15. Evaluation Study and Data Collection Participants • Business undergraduate students enrolled in similar information systems or operations classes in both universities. • Each participant received $10 for his or her time and efforts. Measurements • Three measures: task success rate, task time, and the number of clicks. • Participants had up to 4 minutes to complete each task. • Cognitive-processing load Data collection • A quite formal way
  • 16. Data Analyses and Results • A pilot study, 39 undergraduate students • An evaluation study with 248 participants • Comparison of user performance and assessments between A and B • Comparison of user performance by separating tasks related to complexity • Performance of the participants from each university • An ex post facto comparison • Further examination of the proposed metrics
  • 17. Extensions to Proposed Metrics • A scale factor can be added while evaluating a larger website. • The metrics can be extended with the combined use of search engine. • Integration of three metrics as a holistic measure 3 1 𝑅(𝑈) + 1 Q(𝑈) + 1 𝐿(𝑈) = 3𝑅 𝑈 Q 𝑈 𝐿 𝑈 Q 𝑈 𝐿 𝑈 + 𝑅 𝑈 𝐿 𝑈 + 𝑅 𝑈 Q 𝑈 = 𝑂(𝑈)
  • 18. Discussion • Three data-driven metrics and a viable method were presented. • The method can be used continuously for supervising a website’s navigability • A method by Liu et al. is suggested for gleaning data (Web log). • It helps improve hyperlink structure designs of websites • Limitations • Different structures of websites may not fit to the results • Spiders and page parsers‘ utilities are limited. • Test of different scenarios • More factors can be introduced to perfect the method
  • 19. Conclusion • Three data-driven metrics were presented. • By integrating appropriate Web mining techniques, a method cooperated the metrics was created. • The verification of the metrics and method. • Users’ perception corresponds to navigability measured using the methods established by the authors
  • 20. Comment • The article clearly and laconically expresses the idea and concept with the existing theories. • Vivid examples following many statements which we as post-graduate students can look upon. • A host of demonstrations below on many pages provide necessary assistance for lay people • I think navigability won’t be only one factor that may affect a website access ratio.

Editor's Notes

  • #3: 第三點,可很大影響使用者經驗和滿意度;但有多數人仍然飽受缺乏專業設計網頁之苦 第四點,一名造訪者能夠根據網站的超連結結構有效率和輕鬆地成功找到資訊的程度稱之。 第七點(objectives),提出資料導向、考量更全面,包含網頁內容、結構和資料使用,的可導航度量度、發展一套可行的網路探勘方法、執行扎實、精實的評估 量度發展由資訊找尋理論和資訊處理理論所導引。
  • #4: Broad classification,內容、結構、使用資料等 內容-網頁上的文字、結構資料-描述連結不同網頁的超連結結構、使用資料-使用者在一網站上的瀏覽資料 Web content mining,使用文字探勘技術去分析網頁內容資料並且能夠支援像是情感分析、相稱分析、偽造網站偵測等應用。 Web structure mining,從超連結結構中找尋模式 Web usage mining,揭露從造訪者使用資料中的瀏覽模式 Botafogo,提出compactness和stratum去評估超連結文字系統的連結度和結構組織 他提出的架構也被其他學者拿來擴充過 本作提出的方法,第一,三種量度;第二,考量更全面的資料;第三,嶄新的方法
  • #5: Many aspects,不同思考、刺激分析、情況修正、障礙評估等 簡單來說,這些度量發展鎖定下列事物:一個人有多可能在一個網站上成功找到資訊? 所需的點擊數、以及從目前網頁所有連結中選擇的難易度 由這些理論導引,三種度量方式被發展
  • #6: Web site parsing:重要特點,外部練結、內部連結、大小、字數、錨點文字;至同一頁面的超連結不會重複計算 Page classification:可分為內容頁面和首頁;使用SVM去自動分類;200張隨機頁面作為訓練樣本 Hyperlink structure representation:頁作為頂點、超連結作為邊;有向圖;超連結點擊數
  • #9: 三個可被定義的度良:效力、效率和直接性 在一個session當中瀏覽k個超連結的機率可以被表示成第一個式子,α是機率分布的平均數,β範圍參數,這兩者可被位在網路記錄檔中記錄的造訪資料所評估。 G(I)代表在一個session期間至少造訪過I個超連結的機率
  • #10: 使用在網頁紀錄檔中所發現的關鍵存取序列去估測造訪者想尋找的資訊目標。 U是從紀錄檔中所發現的n的關鍵存取序列。 Ui代表一個關鍵存取序列。 影響力R(ui)可以在ui中照順序排列的所有內容網頁的機率來測量,從在ui中的第一個內容網頁到第m個內容網頁
  • #11: 至於R(ui)算法呢,就是所有可能去找出存取序列的起始頁面點開機率乘上前面所剛看過的存取序列的機率 那又不是所有序列都是一樣重要,所以權重的概念孕育而生,所以所有存取序列的效力就會如簡報所示 也就是說,R(U)的效力坐落於0到1之間,可反應出一個造訪者藉由導航網站超連結架構達到想尋找的資訊目標的機率 數值越大,代表一個網站的超連結架構越強大
  • #12: m是在ui中,內容網頁的數量;γ>1,為常數;一個網頁最有效的是一個點擊就接數到,等於γ或大於它代表越來越沒效率。 0代表最沒效率;1代表最有效率
  • #14: SpidersRUs設定首頁URL為種子,然後在存取頁面時,它下載頁面並取出所有在頁面上的超連結;然後SpidersRUs會再下載所有被超連結所指向的頁面和其所存有的超連結 A網站有4277個頁面、B有4118個頁面 兩個網站來自同要的領域,在大小上可以比較,功能和使用人口也相近
  • #15: 對A,在機率式子和session大小實際分布之間的適合度是統計上顯著的,p<0.0001,R^2=0.92 對B,一樣也是統計上顯著,p<0.0001,R^2=0.97 平均而言,網站A在三種度量上取得較高分數,效力上-19.6%、效率上-12.8%、直接性-11.7% 為了取得一個合適可詮釋網站間差異的協助,額外18個可比較的大學以0.05%的閾值在網站A、B和這些網站間被做了鑑定 結果是效力介於0.50~0.77、效率介於0.75~0.89、直接性介於0.2~0.6 透過這些數據,A和B之間的差異似乎是大的
  • #16: 第一點:一個內容頁面出現在前十個最常被訪問的主要存取序列在網站A是1.7次點擊,B是2.3次點擊 第三點:也就是預期人們會在網站A中更為可能地找到想要的資訊 第四點:根據事後回溯研究法,又稱解釋觀察研究
  • #17: Pretest,網站A有256位參與者、網站B有165位參與者;所有參與者從他們自己大學的網站檢視20個頁面(10個首頁、10個內容頁面),皆為從網站中隨機選取的頁面。接著以李克特量表讓受測者評估是否被呈現的網頁有他們常找尋的資訊。前測受測者在年齡、學年齡、性別組成和自我報備對自己大學的熟悉度上,都是可比較的 接著,再從各大學調查20個受測者隨機樣本來驗證常被尋找的存取序列。
  • #19: 第一點,Cronbach’s alpha值為0.95,超過普遍0.7的閾值 第二點,群組間差異不大;受測者對他們自己大學網站的熟悉度高於其它網站,差異是顯著的。使用7分李克特量表 第三點,匯集和每個網站有關的資料,包含受測者和任務,然後使用它們執行一系列的配對t檢定。 第四點,任務成功率為A高於B、時間需求為A少於B、點擊數為A少於B、處理認知負荷為A少於B;任務複雜度高低仍不重大影響結果,也就是說有差異但還是網站A的表現優勝 第五點:A > B
  • #20: 第一點:n1個頁面、n2個頁面,範疇因素可被訂為n1/n2;在比較可導航度時,將每項量度乘上範疇因素;相似的採用可能會需要使用到,因為範疇會跟著領域不同而有變動 譬如說,一間大學的網站中之頁數可能比e政府網站還多,造訪者行為也可能因為領域不同而有不同表現;