Data Science of the
KDD ‘14 Review Process
Jure Leskovec (Stanford) and
Wei Wang (UCLA)
Joint work with
Jason Hirshman and David Zeng (Stanford)
KDD 2014 Research Track
Statistics
KDD 2014 Program
Largest KDD program ever:
• 151 research papers (20% growth over KDD’13)
• 43 industry & govt. papers (30% growth)
• 26 workshops (75% growth)
• 11 tutorials (83% growth)
Program highlights:
• Paper spotlights early morning (8:15am)
• Oral presentations (Mon-Wed)
• Posters at the reception (Tue night)
KDD 2014 Research Track
• 1036 submissions from 2600 authors
– 42% increase over KDD ’13
• 151 papers:
– Acceptance rate
14.6%
0
200
400
600
800
1000
1200
2000 2005 2010 2015
KDD year
Numberofsubmissions
KDD Reviewing Process
46 Senior PC members + 340 PC members
• 2971 reviews in total
(Rough) Acceptance rule:
• Raw review score AND Standardized review score AND Raw
meta-review AND Standardized meta-review score ≥ Weak
Accept
• 110 papers matched (immediate accepts)
• Remaining papers were discussed with meta-reviewers and
final decisions were made
Submissions per Country
Acceptance Rate per Country
Acceptance by Subject Area
Predicting Paper Acceptance
Features Used Accuracy
Random Guessing 0.50
Paper Abstract 0.57
Author Status (Past paper counts) 0.64
Author Status (DBLP graph connectivity) 0.61
Author Status (Counts + Graph) 0.65
Reviewer (Similarity, Graph distance to authors) 0.60
All (Abstract, Author Status, and Reviewer) 0.65
Predicting Paper Acceptance
from the Review Text
Features Used
Paper:
Accepted?
Review:
Score > 0?
Random Guessing 0.50 0.50
Review Text 0.68 0.72
Review Text + Numeric Score
(Novelty, Presentation)
0.77 0.77
Human Reading of Review Text 0.88 0.73
I’m submitting a paper:
What correlates with acceptance?
Academia + Industry Papers do Better
Submissions per Author: 5 is best!
No benefit in submitting >5 papers!
Having more authors (seems to) help
It is the most experienced author
that matters!
What insights can we gain on
the review process?
Most reviews are Weak Rejects
More granularity is needed at the
Weak Reject / Weak Accept level
Reviewagreeswiththefinaloutcome
Review length is a good determinant
of a review’s influence/quality
Reviewagreeswiththefinaloutcome
Shorter reviews are used for
clear accepts and rejects
Never review co-author’s papers
The Curse of the Review
Submission Deadline
Over 50% reviews submitted in the last 5 days
Over 20% reviews submitted in the last 24 hours
10% of reviews
submitted late
Ratings increase near the deadline
Weak Rejects
increase while
Rejects decrease
Reviews submitted late are less likely
to agree with final outcome
Late reviews are shorter
Review quality drops: Accuracy of
predicting score from review text
Conclusions
• To get your papers accepted to KDD:
– Collaborate in multidisciplinary teams
– Have a senior author on board
– Do not submit more than 5 papers
• To improve KDD community standards:
– Avoid Weak Reject/Weak Accept scores
– Write longer and clearer reviews
– Submit reviews early!

More Related Content

PDF
Clinical Outcome Assessments Breakout Report
PDF
Writing Good ECRP Proposals
PPTX
Project ECHO QI: Communicating and Advocating Using Data June 29, 2016
PPTX
Project ECHO QI: Gaining Buy In June,8 2016
PPT
Adam Etkin's Flash Presentation from STM Spring 2014
PPTX
Maximizing the Value of On-Farm Research: An Industry Expert Roundtable
PPTX
IGNITE! MagIQ on improving quality
PPTX
What are journal editors looking for (lern)
Clinical Outcome Assessments Breakout Report
Writing Good ECRP Proposals
Project ECHO QI: Communicating and Advocating Using Data June 29, 2016
Project ECHO QI: Gaining Buy In June,8 2016
Adam Etkin's Flash Presentation from STM Spring 2014
Maximizing the Value of On-Farm Research: An Industry Expert Roundtable
IGNITE! MagIQ on improving quality
What are journal editors looking for (lern)

Viewers also liked (11)

PPTX
Big Data per Madee 7 at Digital Accademia
PDF
KDD HARVEST - COMPANY PRESENTATION-1-2
PDF
KDD Analytics 2014 - Experts in Marketing Analytics
PPTX
Random walk on Graphs
PDF
Health Benefits of Pomegranate
PPTX
Aseptic packaging
PPTX
marketing plan of nestle cerelac
PDF
How-To Align Marketing & Sales to Boost Revenue (Infographic)
PDF
Advertising Plan of Nestle Milk Pack (Relaunch)
PPTX
Marketing plan of nestle
PDF
How to Make Money Selling Physical Products - Steve Chou
Big Data per Madee 7 at Digital Accademia
KDD HARVEST - COMPANY PRESENTATION-1-2
KDD Analytics 2014 - Experts in Marketing Analytics
Random walk on Graphs
Health Benefits of Pomegranate
Aseptic packaging
marketing plan of nestle cerelac
How-To Align Marketing & Sales to Boost Revenue (Infographic)
Advertising Plan of Nestle Milk Pack (Relaunch)
Marketing plan of nestle
How to Make Money Selling Physical Products - Steve Chou
Ad

Recently uploaded (20)

PPTX
MBA JAPAN: 2025 the University of Waseda
PDF
Microsoft 365 products and services descrption
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
PPTX
ai agent creaction with langgraph_presentation_
PDF
Session 11 - Data Visualization Storytelling (2).pdf
PPTX
New ISO 27001_2022 standard and the changes
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPT
statistic analysis for study - data collection
PPTX
Business_Capability_Map_Collection__pptx
PPTX
Machine Learning and working of machine Learning
PPTX
CHAPTER-2-THE-ACCOUNTING-PROCESS-2-4.pptx
PPT
Image processing and pattern recognition 2.ppt
PPTX
Tapan_20220802057_Researchinternship_final_stage.pptx
PPT
PROJECT CYCLE MANAGEMENT FRAMEWORK (PCM).ppt
PPT
expt-design-lecture-12 hghhgfggjhjd (1).ppt
PDF
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
PDF
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PPTX
SET 1 Compulsory MNH machine learning intro
MBA JAPAN: 2025 the University of Waseda
Microsoft 365 products and services descrption
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
ai agent creaction with langgraph_presentation_
Session 11 - Data Visualization Storytelling (2).pdf
New ISO 27001_2022 standard and the changes
Topic 5 Presentation 5 Lesson 5 Corporate Fin
statistic analysis for study - data collection
Business_Capability_Map_Collection__pptx
Machine Learning and working of machine Learning
CHAPTER-2-THE-ACCOUNTING-PROCESS-2-4.pptx
Image processing and pattern recognition 2.ppt
Tapan_20220802057_Researchinternship_final_stage.pptx
PROJECT CYCLE MANAGEMENT FRAMEWORK (PCM).ppt
expt-design-lecture-12 hghhgfggjhjd (1).ppt
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
retention in jsjsksksksnbsndjddjdnFPD.pptx
SET 1 Compulsory MNH machine learning intro
Ad

Data Science view of the KDD 2014

  • 1. Data Science of the KDD ‘14 Review Process Jure Leskovec (Stanford) and Wei Wang (UCLA) Joint work with Jason Hirshman and David Zeng (Stanford)
  • 2. KDD 2014 Research Track Statistics
  • 3. KDD 2014 Program Largest KDD program ever: • 151 research papers (20% growth over KDD’13) • 43 industry & govt. papers (30% growth) • 26 workshops (75% growth) • 11 tutorials (83% growth) Program highlights: • Paper spotlights early morning (8:15am) • Oral presentations (Mon-Wed) • Posters at the reception (Tue night)
  • 4. KDD 2014 Research Track • 1036 submissions from 2600 authors – 42% increase over KDD ’13 • 151 papers: – Acceptance rate 14.6% 0 200 400 600 800 1000 1200 2000 2005 2010 2015 KDD year Numberofsubmissions
  • 5. KDD Reviewing Process 46 Senior PC members + 340 PC members • 2971 reviews in total (Rough) Acceptance rule: • Raw review score AND Standardized review score AND Raw meta-review AND Standardized meta-review score ≥ Weak Accept • 110 papers matched (immediate accepts) • Remaining papers were discussed with meta-reviewers and final decisions were made
  • 9. Predicting Paper Acceptance Features Used Accuracy Random Guessing 0.50 Paper Abstract 0.57 Author Status (Past paper counts) 0.64 Author Status (DBLP graph connectivity) 0.61 Author Status (Counts + Graph) 0.65 Reviewer (Similarity, Graph distance to authors) 0.60 All (Abstract, Author Status, and Reviewer) 0.65
  • 10. Predicting Paper Acceptance from the Review Text Features Used Paper: Accepted? Review: Score > 0? Random Guessing 0.50 0.50 Review Text 0.68 0.72 Review Text + Numeric Score (Novelty, Presentation) 0.77 0.77 Human Reading of Review Text 0.88 0.73
  • 11. I’m submitting a paper: What correlates with acceptance?
  • 12. Academia + Industry Papers do Better
  • 14. No benefit in submitting >5 papers!
  • 15. Having more authors (seems to) help
  • 16. It is the most experienced author that matters!
  • 17. What insights can we gain on the review process?
  • 18. Most reviews are Weak Rejects
  • 19. More granularity is needed at the Weak Reject / Weak Accept level Reviewagreeswiththefinaloutcome
  • 20. Review length is a good determinant of a review’s influence/quality Reviewagreeswiththefinaloutcome
  • 21. Shorter reviews are used for clear accepts and rejects
  • 23. The Curse of the Review Submission Deadline
  • 24. Over 50% reviews submitted in the last 5 days Over 20% reviews submitted in the last 24 hours 10% of reviews submitted late
  • 25. Ratings increase near the deadline Weak Rejects increase while Rejects decrease
  • 26. Reviews submitted late are less likely to agree with final outcome
  • 27. Late reviews are shorter
  • 28. Review quality drops: Accuracy of predicting score from review text
  • 29. Conclusions • To get your papers accepted to KDD: – Collaborate in multidisciplinary teams – Have a senior author on board – Do not submit more than 5 papers • To improve KDD community standards: – Avoid Weak Reject/Weak Accept scores – Write longer and clearer reviews – Submit reviews early!

Editor's Notes

  • #8: Country of the paper is given by the mode author nationality. Only countries with more than 10 submissions are shown (except South Korea, which had 0 acceptance)
  • #9: Subject areas were based on a field that authors tagged their papers with. Only subject areas with more than 50 submissions are shown.
  • #10: On balanced dataset (we subsampled negative examples)
  • #11: On balanced dataset (we subsampled negative examples)
  • #14: Academic: All authors of paper are affiliated with a university Industry: All authors of paper are affiliated with industry Mixed: Paper has authors affiliated with both universities and industry.