SlideShare a Scribd company logo
SoylentA Word Processor with a Crowd InsideMichael Bernsteinmsbernst@csail.mit.eduGreg Little, Rob Miller, David Karger, David Crowell, Katrina Panovichmit csailBjörn Hartmann	Mark Ackermanucberkeley			university of michiganmit human-computer interaction
Shortening A Paper to Ten Pages1) Do it yourself2) Use an AI3) Ask colleagues
Shortening A Paper to Ten Pages4) Recruit a crowd
Shortening A Paper to Ten Pages4) Recruit a crowd
Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks.
Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks.Soylent is people.Amazon Mechanical Turk Soylent’s core algorithms are human-powered.Find Unnecessary TextRequester:Matt C.Reward:$0.01Tasks Available: 7Shorten Rambling TextRequester:Gordon L.Reward:$0.04Tasks Available: 12
Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks.Find-Fix-Verify:Crowd control design patternFind a problemFix each problemVerify quality of each fixSoylent, a prototype...Soylent, a prototype...Soylent, a prototype...Soylent, a prototype...
Embed paid crowd workers in user interfaces to support cognition and manipulation tasks on demand
demo
Word processors don’t:- parse semantics well judge qualityPaid crowds don’t:- guarantee quality- control costsState ofthe Art[Kittur ’08]
Paid crowds can:- parse semantics well judge qualitySoylent can:- guarantee quality- control costsWord witha Crowd Inside
Challenges in Programming CrowdsThis project has interacted with~9000 Turkers on ~2000 different tasksKey Problem: crowd workers often produce poor output on open-ended tasks30% Rule: ~30% of the resultsfrom open-ended tasks will be unsatisfactoryIntroductionDemoFind-Fix-VerifyEvaluationDiscussionSoylent
Two Personas: An ExampleProofread and correct the following paragraph:The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme ispursuit of dreams. Two Personas: An ExampleProofread and correct the following paragraph:
The Lazy TurkerDoes as little work as necessary to be paidThe theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme ispursuit of dreams.
The Lazy TurkerDoes as little work as necessary to be paidThe theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradeship. But in my opinion there is only one theme that is present from beginning to end, this theme ispursuit of dreams.
The Eager BeaverGo beyond task requirements to be helpful, but introduce errors in the processThe theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme ispursuit of dreams.
Go beyond task requirements to be helpful, but introduce errors in the processThe theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections of this story. \nThis theme occurs during many circumstances but is not present from start to finish. \nIn my mind, for a theme to be pervasive it must be present during every element of the story. \nThere are many themes that are present most of the way through such as sacrifice, friendship and comradeship.\nBut in my opinion there is only one theme that is present from beginning to end: this theme is pursuit of dreams. The Eager Beaver
The Eager BeaverGo beyond task requirements to be helpful, but introduce errors in the process
Find-Fix-VerifyProgramming crowds today is haphazard,similar to UI technology before design patternslike Model-View-ControllerFind-Fix-Verify is a design pattern for programming crowds to complete open-ended tasks
Find“Identify at least one area that can be shortened without changing the meaning of the paragraph.”Independent agreement to identify patchesFix“Edit the highlighted section to shorten its length without changing the meaning of the paragraph.”Soylent, a prototype...Randomize order of suggestionsVerify“Choose at least one rewrite that has style errors, and at least one rewrite that changes the meaning of the sentence.”
Keep suggestions that do not get voted outVerify“Choose at least one rewrite that has style errors, and at least one rewrite that changes the meaning of the sentence.”
Fix-Fix-Verify DiscussionWhy split Find and Fix?	Force Lazy Turkers to work on a problem of our choice	Allows us to merge work completed in parallelWhy add Verify?	Quality rises when we place Turkers in productive tension	Allows us to trade off lag time with quality
Evaluation GoalsIs Soylent’s crowdsourced user interfaceapproach feasible?	How high is the quality?		How long is the delay?		How much does it cost?123IntroductionDemoFind-Fix-VerifyEvaluationDiscussionSoylent
BlogPrint publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaperThe metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper introduced the metaDESK along with two companion platforms, the transBOARD and ambientROOM.ShortnDraft uistPaperIn this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.  We present WenSo, a tool that uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical WritingFigure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key (the index bits) and the next 15 low order bits (the key fragment).Rambling E-mailA previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups.  Check out our new page: […]
Blog – 83%Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87%The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.Cut 15% of original paragraph length on average.ShortnDraft uistPaper – 90%In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.  We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical Writing – 82%Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78%A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups.  Check out our new page: […]
Blog – 83%					3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87%	7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.ShortnDraft uistPaper – 90%			5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.  We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical Writing – 82%	3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78%			6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups.  Check out our new page: […]
Blog – 83%					3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87%	7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.Focus on unnecessarily wordy phrasesBut in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.ShortnDraft uistPaper – 90%			5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.  We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical Writing – 82%	3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78%			6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups.  Check out our new page: […]
Blog – 83%					3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87%	7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.ShortnDraft uistPaper – 90%			5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.  We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Merged sentences when patches crossed sentence boundariesThe metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, whichintroduced the metaDESKalong withandtwo companion platforms, the transBOARD and ambientROOM.Technical Writing – 82%	3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78%			6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups.  Check out our new page: […]
Blog – 83%					3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87%	7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.ShortnDraft uistPaper – 90%			5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.  We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Workers introduced style errors when not part of the community of practiceTechnical Writing – 82%	3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).In this paper we argue thatit is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.Rambling E-mail – 78%			6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups.  Check out our new page: […]
Blog – 83%					3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Parallelism can result in inconsistent changesClassic uistPaper– 87%	7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).ShortnDraft uistPaper – 90%			5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.  We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical Writing – 82%	3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78%			6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups.  Check out our new page: […]
Results: LagSoylent Posts TaskMost time is spent waiting: Median 18.5 minutesSummed medians across Find, Fix and Verifyq1=8.3 minutes, q3=41.6 minutesWorkerAccepts TaskActual work time is very short:Median 2.0 minutesSummed medians across Find, Fix and Verifyq1=60 seconds, q3=3.6 minutesWorkerSubmits Task
ESL: English as a Second LanguageHowever, while GUI made using computers be more intuitive and easier to learn, it didn’t let people be able to control computers efficiently. Masses only can use the software developed by software companies.Passes Word’s Grammar CheckerMarketing are bad for brand big and small.  You Know What I am Saying. It is no wondering that advertisings are bad for company in America, Chicago and Germany. WikipediaDanduMonara (Flying Peacock, Wooden Peacock), The Flying machine able to fly. The King Ravana (Sri Lanka) built it. Accorinding to hindu believes in Ramayanaya King Ravana used "DanduMonara" for abduct queen Seetha from Rama. According to believes "DanduMonara" landed at Werangatota.CrowdproofNotesBlah blah blah—argument about whether there should be a standard “nosqlsto- rage” API to protect developers storing their stuff in proprietary services in the cloud. Probably unrealistic.UIST DraftMany of these problems vanish if we turn to a much older recording technology---text. When we enter text, each (pen or key) stroke is being used to record the actual information we care about---; none is wasted on application navigation or configuration.
ESL: English as a Second LanguageHowever, while GUI made using computers be more intuitive and easier to learn, it didn’t allow people to let people be able to control computers efficiently. Masses only canThe masses can only use the software developed by software companies, unless they know how to write programs.Crowdproof alone found 67% of errors.CrowdproofCrowdproof fixed 88%of the errors it found.Crowdproof and Word togetherfound 82% of errors.
Find BibTeX:“Hi, please find the bibtex references for the 3 papers in brackets. You can located these by Google Scholar searches and clicking on bibtex.’’Find Creative Commons Figures:“Pick out keywords from the paragrah like Yosemite, rock, half dome, park. Go to a site which hsa CC licensed images […]’’Human MacroBlog Feedback: “Please tell me how to make this paragraph communicate better. Say what's wrong, and what I can improve. Thanks!’’Tense Change:“Please change text in document from past tense to present tense”Find and Format Addresses: “Please complete the addresses below to include all informtion needed as in example below. [...]”
Find BibTeX:“Hi, please find the bibtex references for the 3 papers in brackets. You can located these by Google Scholar searches and clicking on bibtex.’’Duncan and Watts [Duncan and watts HCOMP 09 anchoring] found that Turkers will do more work, but quality is no higher.@conference {      title={{Financial incentives […]}},      author={Mason, W. and Watts, D.J.}, booktitle={HCOMP ‘09},      […]}Human MacroThe Human Macro executedrequests perfectly 71%of the time.
IntroductionDemoFind-Fix-VerifyEvaluationDiscussionSoylent
Wizard of Turk: The New Wizard of OzWizard of Oz prototyping is a tried-and-true technique in HCI and AIPut a human behind the curtain	…until we understand how to engineer itIt’s now possible to wire a wizard permanently into an interactive systemFully deployable from day one	AI vs. Turk is a cost / performance optimization	Crowd contributions can provide training data Wizard of OzInterfaceWizard of Turk
SoylentA Word Processor with a Crowd InsideA new class of crowd-powered interfacesThe Find-Fix-Verify design patternCrowd Personas: The Lazy Turker and the Eager Beaver
Soylentsoylent@csail.mit.edu
Soylent: A Word Processor with a Crowd Inside
Effect of Price on Wait TimePaying more had no effect on early arrivals,but sped up the latecomers
Privacy, Legality, EthicsUnknown third parties can see your documentOne solution: develop long-term relationshipsOr: enterprises hire wage workers under NDAWho owns the edits to your document?Work-for-hire contract means the author retains rights	Is this the correct model? Taylorism and the Turker as API callEmbed human-human contract ethics into your system	Adjusting to minimum wage
Related WorkPowering novel interactions with the wisdom of crowds[ChaCha; Sala et al., Pervasive 2007; Bigham et al., UIST 2010]Improving quality on Mechanical Turk[Kittur et al., CHI 2008; Heer et al., CHI 2010]Artificial intelligence techniques for word processing	Automatic proofreading [Kukich, CSUR 1992]	Sentence compression [Clarke and Lapata, ACL 2006]	AI for EUP [Cypher 1993]
Results: Cost$0.08 per Find, $0.05 per Fix, and $0.04 per VerifyAverage paragraph cost $1.41 to Shortn:		$0.55 to Find an average of two patches	$0.48 to Fix each patch	$0.38 to Verify each patchLower bound with $0.01 per task:	$0.30 per paragraph
Soylent: A Word Processor with a Crowd Inside
Soylent: A Word Processor with a Crowd Inside
Soylent: A Word Processor with a Crowd Inside
Soylent: A Word Processor with a Crowd Inside

More Related Content

PPTX
Scriptwriting powerpoint
PDF
Crowdsourcing for Search Evaluation and Social-Algorithmic Search
PPTX
HarambeeNet: Data by the people, for the people
PDF
Online Graphic Organizer An Essay Map From Read
PDF
Project: Fudgie Bears
PDF
I Suck At Writing Essays RMemes. Online assignment writing service.
PDF
Essay On National Unity Day. Online assignment writing service.
PDF
Fulltext
Scriptwriting powerpoint
Crowdsourcing for Search Evaluation and Social-Algorithmic Search
HarambeeNet: Data by the people, for the people
Online Graphic Organizer An Essay Map From Read
Project: Fudgie Bears
I Suck At Writing Essays RMemes. Online assignment writing service.
Essay On National Unity Day. Online assignment writing service.
Fulltext

More from Michael Bernstein (10)

PDF
Quantifying the Invisible Audience in Social Networks
PDF
The Future of Crowd Work
PDF
Direct Answers for Search Queries in the Long Tail
PDF
Analytic Methods for Optimizing Realtime Crowdsourcing
PPTX
4chan and /b/: An Analysis of Anonymity and Ephemerality in a Large Online Co...
PPTX
RepliCHI: Graduate Student Perspectives
PPTX
RepliCHI: Graduate Student Perspectives
PPTX
The Trouble with Social Computing Systems Research
PPTX
Eddi: Interactive Topic-Based Browsing of Social Status Streams
PPTX
FeedMe: Enhancing Directed Content Sharing on the Web
Quantifying the Invisible Audience in Social Networks
The Future of Crowd Work
Direct Answers for Search Queries in the Long Tail
Analytic Methods for Optimizing Realtime Crowdsourcing
4chan and /b/: An Analysis of Anonymity and Ephemerality in a Large Online Co...
RepliCHI: Graduate Student Perspectives
RepliCHI: Graduate Student Perspectives
The Trouble with Social Computing Systems Research
Eddi: Interactive Topic-Based Browsing of Social Status Streams
FeedMe: Enhancing Directed Content Sharing on the Web
Ad

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Empathic Computing: Creating Shared Understanding
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Programs and apps: productivity, graphics, security and other tools
PPT
Teaching material agriculture food technology
PPTX
Machine Learning_overview_presentation.pptx
PPTX
Big Data Technologies - Introduction.pptx
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
A Presentation on Artificial Intelligence
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
The AUB Centre for AI in Media Proposal.docx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
Dropbox Q2 2025 Financial Results & Investor Presentation
The Rise and Fall of 3GPP – Time for a Sabbatical?
Reach Out and Touch Someone: Haptics and Empathic Computing
Review of recent advances in non-invasive hemoglobin estimation
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Empathic Computing: Creating Shared Understanding
20250228 LYD VKU AI Blended-Learning.pptx
Encapsulation_ Review paper, used for researhc scholars
Programs and apps: productivity, graphics, security and other tools
Teaching material agriculture food technology
Machine Learning_overview_presentation.pptx
Big Data Technologies - Introduction.pptx
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
A Presentation on Artificial Intelligence
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Ad

Soylent: A Word Processor with a Crowd Inside

  • 1. SoylentA Word Processor with a Crowd InsideMichael Bernsteinmsbernst@csail.mit.eduGreg Little, Rob Miller, David Karger, David Crowell, Katrina Panovichmit csailBjörn Hartmann Mark Ackermanucberkeley university of michiganmit human-computer interaction
  • 2. Shortening A Paper to Ten Pages1) Do it yourself2) Use an AI3) Ask colleagues
  • 3. Shortening A Paper to Ten Pages4) Recruit a crowd
  • 4. Shortening A Paper to Ten Pages4) Recruit a crowd
  • 5. Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks.
  • 6. Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks.Soylent is people.Amazon Mechanical Turk Soylent’s core algorithms are human-powered.Find Unnecessary TextRequester:Matt C.Reward:$0.01Tasks Available: 7Shorten Rambling TextRequester:Gordon L.Reward:$0.04Tasks Available: 12
  • 7. Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks.Find-Fix-Verify:Crowd control design patternFind a problemFix each problemVerify quality of each fixSoylent, a prototype...Soylent, a prototype...Soylent, a prototype...Soylent, a prototype...
  • 8. Embed paid crowd workers in user interfaces to support cognition and manipulation tasks on demand
  • 10. Word processors don’t:- parse semantics well judge qualityPaid crowds don’t:- guarantee quality- control costsState ofthe Art[Kittur ’08]
  • 11. Paid crowds can:- parse semantics well judge qualitySoylent can:- guarantee quality- control costsWord witha Crowd Inside
  • 12. Challenges in Programming CrowdsThis project has interacted with~9000 Turkers on ~2000 different tasksKey Problem: crowd workers often produce poor output on open-ended tasks30% Rule: ~30% of the resultsfrom open-ended tasks will be unsatisfactoryIntroductionDemoFind-Fix-VerifyEvaluationDiscussionSoylent
  • 13. Two Personas: An ExampleProofread and correct the following paragraph:The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
  • 14. The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme ispursuit of dreams. Two Personas: An ExampleProofread and correct the following paragraph:
  • 15. The Lazy TurkerDoes as little work as necessary to be paidThe theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme ispursuit of dreams.
  • 16. The Lazy TurkerDoes as little work as necessary to be paidThe theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradeship. But in my opinion there is only one theme that is present from beginning to end, this theme ispursuit of dreams.
  • 17. The Eager BeaverGo beyond task requirements to be helpful, but introduce errors in the processThe theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme ispursuit of dreams.
  • 18. Go beyond task requirements to be helpful, but introduce errors in the processThe theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections of this story. \nThis theme occurs during many circumstances but is not present from start to finish. \nIn my mind, for a theme to be pervasive it must be present during every element of the story. \nThere are many themes that are present most of the way through such as sacrifice, friendship and comradeship.\nBut in my opinion there is only one theme that is present from beginning to end: this theme is pursuit of dreams. The Eager Beaver
  • 19. The Eager BeaverGo beyond task requirements to be helpful, but introduce errors in the process
  • 20. Find-Fix-VerifyProgramming crowds today is haphazard,similar to UI technology before design patternslike Model-View-ControllerFind-Fix-Verify is a design pattern for programming crowds to complete open-ended tasks
  • 21. Find“Identify at least one area that can be shortened without changing the meaning of the paragraph.”Independent agreement to identify patchesFix“Edit the highlighted section to shorten its length without changing the meaning of the paragraph.”Soylent, a prototype...Randomize order of suggestionsVerify“Choose at least one rewrite that has style errors, and at least one rewrite that changes the meaning of the sentence.”
  • 22. Keep suggestions that do not get voted outVerify“Choose at least one rewrite that has style errors, and at least one rewrite that changes the meaning of the sentence.”
  • 23. Fix-Fix-Verify DiscussionWhy split Find and Fix? Force Lazy Turkers to work on a problem of our choice Allows us to merge work completed in parallelWhy add Verify? Quality rises when we place Turkers in productive tension Allows us to trade off lag time with quality
  • 24. Evaluation GoalsIs Soylent’s crowdsourced user interfaceapproach feasible? How high is the quality? How long is the delay? How much does it cost?123IntroductionDemoFind-Fix-VerifyEvaluationDiscussionSoylent
  • 25. BlogPrint publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaperThe metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper introduced the metaDESK along with two companion platforms, the transBOARD and ambientROOM.ShortnDraft uistPaperIn this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool that uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical WritingFigure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key (the index bits) and the next 15 low order bits (the key fragment).Rambling E-mailA previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 26. Blog – 83%Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87%The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.Cut 15% of original paragraph length on average.ShortnDraft uistPaper – 90%In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical Writing – 82%Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78%A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 27. Blog – 83% 3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87% 7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.ShortnDraft uistPaper – 90% 5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical Writing – 82% 3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78% 6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 28. Blog – 83% 3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87% 7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.Focus on unnecessarily wordy phrasesBut in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.ShortnDraft uistPaper – 90% 5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical Writing – 82% 3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78% 6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 29. Blog – 83% 3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87% 7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.ShortnDraft uistPaper – 90% 5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Merged sentences when patches crossed sentence boundariesThe metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, whichintroduced the metaDESKalong withandtwo companion platforms, the transBOARD and ambientROOM.Technical Writing – 82% 3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78% 6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 30. Blog – 83% 3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Classic uistPaper– 87% 7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.ShortnDraft uistPaper – 90% 5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Workers introduced style errors when not part of the community of practiceTechnical Writing – 82% 3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).In this paper we argue thatit is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.Rambling E-mail – 78% 6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 31. Blog – 83% 3 para., 158 people, $4.57Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.Parallelism can result in inconsistent changesClassic uistPaper– 87% 7 para., 264 people, $7.45The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESKalong withand two companion platforms, the transBOARD and ambientROOM.FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).ShortnDraft uistPaper – 90% 5 para., 284 people, $7.47In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool thatwhich uses lightweight text input to capture richly structured information for later retrieval and navigation.Technical Writing – 82% 3 para., 188 people, $4.84Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key(the index bits) and the next 15 low order bits (the key fragment).Rambling E-mail – 78% 6 para., 362 people, $9.72A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 32. Results: LagSoylent Posts TaskMost time is spent waiting: Median 18.5 minutesSummed medians across Find, Fix and Verifyq1=8.3 minutes, q3=41.6 minutesWorkerAccepts TaskActual work time is very short:Median 2.0 minutesSummed medians across Find, Fix and Verifyq1=60 seconds, q3=3.6 minutesWorkerSubmits Task
  • 33. ESL: English as a Second LanguageHowever, while GUI made using computers be more intuitive and easier to learn, it didn’t let people be able to control computers efficiently. Masses only can use the software developed by software companies.Passes Word’s Grammar CheckerMarketing are bad for brand big and small. You Know What I am Saying. It is no wondering that advertisings are bad for company in America, Chicago and Germany. WikipediaDanduMonara (Flying Peacock, Wooden Peacock), The Flying machine able to fly. The King Ravana (Sri Lanka) built it. Accorinding to hindu believes in Ramayanaya King Ravana used "DanduMonara" for abduct queen Seetha from Rama. According to believes "DanduMonara" landed at Werangatota.CrowdproofNotesBlah blah blah—argument about whether there should be a standard “nosqlsto- rage” API to protect developers storing their stuff in proprietary services in the cloud. Probably unrealistic.UIST DraftMany of these problems vanish if we turn to a much older recording technology---text. When we enter text, each (pen or key) stroke is being used to record the actual information we care about---; none is wasted on application navigation or configuration.
  • 34. ESL: English as a Second LanguageHowever, while GUI made using computers be more intuitive and easier to learn, it didn’t allow people to let people be able to control computers efficiently. Masses only canThe masses can only use the software developed by software companies, unless they know how to write programs.Crowdproof alone found 67% of errors.CrowdproofCrowdproof fixed 88%of the errors it found.Crowdproof and Word togetherfound 82% of errors.
  • 35. Find BibTeX:“Hi, please find the bibtex references for the 3 papers in brackets. You can located these by Google Scholar searches and clicking on bibtex.’’Find Creative Commons Figures:“Pick out keywords from the paragrah like Yosemite, rock, half dome, park. Go to a site which hsa CC licensed images […]’’Human MacroBlog Feedback: “Please tell me how to make this paragraph communicate better. Say what's wrong, and what I can improve. Thanks!’’Tense Change:“Please change text in document from past tense to present tense”Find and Format Addresses: “Please complete the addresses below to include all informtion needed as in example below. [...]”
  • 36. Find BibTeX:“Hi, please find the bibtex references for the 3 papers in brackets. You can located these by Google Scholar searches and clicking on bibtex.’’Duncan and Watts [Duncan and watts HCOMP 09 anchoring] found that Turkers will do more work, but quality is no higher.@conference { title={{Financial incentives […]}}, author={Mason, W. and Watts, D.J.}, booktitle={HCOMP ‘09}, […]}Human MacroThe Human Macro executedrequests perfectly 71%of the time.
  • 38. Wizard of Turk: The New Wizard of OzWizard of Oz prototyping is a tried-and-true technique in HCI and AIPut a human behind the curtain …until we understand how to engineer itIt’s now possible to wire a wizard permanently into an interactive systemFully deployable from day one AI vs. Turk is a cost / performance optimization Crowd contributions can provide training data Wizard of OzInterfaceWizard of Turk
  • 39. SoylentA Word Processor with a Crowd InsideA new class of crowd-powered interfacesThe Find-Fix-Verify design patternCrowd Personas: The Lazy Turker and the Eager Beaver
  • 42. Effect of Price on Wait TimePaying more had no effect on early arrivals,but sped up the latecomers
  • 43. Privacy, Legality, EthicsUnknown third parties can see your documentOne solution: develop long-term relationshipsOr: enterprises hire wage workers under NDAWho owns the edits to your document?Work-for-hire contract means the author retains rights Is this the correct model? Taylorism and the Turker as API callEmbed human-human contract ethics into your system Adjusting to minimum wage
  • 44. Related WorkPowering novel interactions with the wisdom of crowds[ChaCha; Sala et al., Pervasive 2007; Bigham et al., UIST 2010]Improving quality on Mechanical Turk[Kittur et al., CHI 2008; Heer et al., CHI 2010]Artificial intelligence techniques for word processing Automatic proofreading [Kukich, CSUR 1992] Sentence compression [Clarke and Lapata, ACL 2006] AI for EUP [Cypher 1993]
  • 45. Results: Cost$0.08 per Find, $0.05 per Fix, and $0.04 per VerifyAverage paragraph cost $1.41 to Shortn: $0.55 to Find an average of two patches $0.48 to Fix each patch $0.38 to Verify each patchLower bound with $0.01 per task: $0.30 per paragraph

Editor's Notes

  • #3: let's start with a scenario many of us are familiar with. we have a pretty strict page limit on a paper, and somewhere in this 7500 words we are just a few lines overlength. We've already spent time editing this paper. It's a pretty painful scenario.move quickly through this or you’ll go overtimeask colleagues, "but they're doing CHI papers too!"
  • #4: We're introducing a fourth option with this work: a crowd. We want to recruit an entire crowd of helpers to support your writing process, to get 100 people to help you cut down your text. And what we're going to do is build it into the interface. (picture)
  • #5: We're introducing a fourth option with this work: a crowd. We want to recruit an entire crowd of helpers to support your writing process, to get 100 people to help you cut down your text. And what we're going to do is build it into the interface. (picture)
  • #6: In this talk I'm introducing Soylent, a word processor with a crowd inside. Soylent is a new kind of word processing interface that is powered by crowd contributions. Three major features: ...
  • #7: Uses TurKit
  • #8: This is hard because the crowd pulls in different directions and doesn’t always do the work you want it to do. We control it with a new design pattern we call Find-Fix-Verify.
  • #9: You hit a button, and some dude goes off and does something
  • #10: "we're just a few references overlength" --> "we're just a few lines overlength”Shortn: authors are bad at hacking their own text. Nobody hacks it in tiny bits. Say out loud that the user is specifying how "tall" the text should be.Rob's reason why the authors missed the error -- we're tired by page 7. Fresh eyeballs looking at a small bit of text.
  • #14: Go FAST
  • #16: You can reject their work
  • #24: They’re working on the same sentence, trying to fix the same underlying problem
  • #25: go faster through this. “We evaluated soylent, and we wanted to ask three questions: quality, delay, and cost.”
  • #27: Cut more than a page from a CHI paper without touching figures or refs, and just removing fat from the writing.
  • #28: Cost works out to $1.40 per paragraph
  • #29: Cost works out to $1.40 per paragraph
  • #30: Cost works out to $1.40 per paragraph
  • #31: Cost works out to $1.40 per paragraph
  • #32: Cost works out to $1.40 per paragraph
  • #33: "our next step is to find ways to reduce wait time"
  • #36: note that they are misspelled and still worked
  • #39: We think that Soylent is an example of a new class of user interfaces. (not, “and here’s the discussion!”)"There are AIs that do sentence compression like Shortn, but their biggest problem right now is lack of data. We can feed it that data, and eventually save the user money."
  • #43: Task: crowdproof, input was ESL first paragraph, separated Find from Fix+Verify so that Fix always had the same patchesAll tasks had at least 3 workers in the first 10 minutes. After about 15 minutes, arrival time starts looking exponential.So, you can get away with paying very little for interaction
  • #44: snooping on CHI papers right before the deadlineadd 9000 Turkers to the author list