SlideShare a Scribd company logo
Vimeo on EC21
Vimeo?2
Why AWS?3
The Workflow4
On-demand, Reserved5
Upload/Transcoding Infrastructure6
Our Bidding StrategyKeep it simpleOne time spot requests; one instance per request;across all availability zonesSpot requests expire in 10 minutesNever bid more than threshold. It is currently set to 80% of on-demand priceNot more than 10 open spot requests at any timeBid 10% more than the average price over last hour Never bid more than threshold. It is currently set to 80% of on-demand price7
Reserved Instance Utilization8on-demand% busy• 96.23% utilization with 54 instances on low-traffic day• High priority jobs: Buy reserve instance capacity to meet non-peak hour loads
Ondemandvs Spots9Happy Place#Ondemand#On-demand#spots#Spots
Pro TipsUse Spots for your low priority and less time critical jobsNever kill spots. Let Amazon do it.Have more retries for jobs running on spots.Watch out for open spot requests. Add expiry to your requests.Long running jobs, bid higher or use on-demandFailover to on-demand when spot market is saturated10
ResultSome months we save 50% over pure on-demand instances11

More Related Content

PPTX
Where's my T-Shirt? Supply chain forecasting in fashion
PDF
Spending Money to Save Money... and Getting Your Boss to Sign Off On It
PPTX
speed up shopify
PPTX
AWS Melbourne CO Interest Group meetup - Lightning talks - 20 Nov 2017
PPTX
AWS Melbourne CO Meetup - Introduction - 20 Nov 2017
PPTX
Serverless architecture: focus on code, performance and costs
PPTX
Side effects of header bidding, Hot Topic: Header Bidding, Feb 2017
PDF
Frappé / ERPNext Open Day December 2015
Where's my T-Shirt? Supply chain forecasting in fashion
Spending Money to Save Money... and Getting Your Boss to Sign Off On It
speed up shopify
AWS Melbourne CO Interest Group meetup - Lightning talks - 20 Nov 2017
AWS Melbourne CO Meetup - Introduction - 20 Nov 2017
Serverless architecture: focus on code, performance and costs
Side effects of header bidding, Hot Topic: Header Bidding, Feb 2017
Frappé / ERPNext Open Day December 2015

What's hot (15)

PPTX
Webinar: AWS Partner Strategies For Success
PDF
Botmetric- Automate your AWS Cloud
PDF
Handling customer fears
PPTX
BeyondAutomation
PPTX
Datacomm VMWare Hybrid Cloud
PDF
ERPNext Open Day - August 2015
PDF
Kanban for Game Studios
PPTX
Facilitating the Elephant carpaccio exercise
PPTX
AWS Well Architected Framework
PDF
Queueing at the Checkout
PDF
Paving The Way To The Hybrid Cloud
PPTX
AWS account migration for BBC iPlayer Radio
PDF
International software testing conference 2017 fergal hynes
PDF
Effect of Batch Size on Work Flow
PPTX
Shop.org 2017 Tech talk website speed for ecommerce why it matters and how to...
Webinar: AWS Partner Strategies For Success
Botmetric- Automate your AWS Cloud
Handling customer fears
BeyondAutomation
Datacomm VMWare Hybrid Cloud
ERPNext Open Day - August 2015
Kanban for Game Studios
Facilitating the Elephant carpaccio exercise
AWS Well Architected Framework
Queueing at the Checkout
Paving The Way To The Hybrid Cloud
AWS account migration for BBC iPlayer Radio
International software testing conference 2017 fergal hynes
Effect of Batch Size on Work Flow
Shop.org 2017 Tech talk website speed for ecommerce why it matters and how to...
Ad

Similar to Vimeo EC2 transcoding (11)

PDF
Cloudreach Voices EC2 Making Sense of the Cost Options
PDF
Cut AWS Costs: Using Spot Instances for More Than Batch
PPTX
5 Ways to Control your AWS Spending (or, How to Make Your CFO Happy)
PPTX
AWS Meetup - Exploring ways to buy EC2 capacity
PDF
Hidden Costs of Chasing the Mythical 'Five Nines'
PPTX
AWS Cost Optimization
PDF
AWS Cloud cost optimization
PDF
Bootcamp Recap: EC2 Reserved Instances
PDF
RightScale Webinar: Decoding AWS Reserved Instances (RIs) What It Means for C...
PPTX
Controle e otimização de custos na AWS
PPTX
Order book dynamics in high frequency trading
Cloudreach Voices EC2 Making Sense of the Cost Options
Cut AWS Costs: Using Spot Instances for More Than Batch
5 Ways to Control your AWS Spending (or, How to Make Your CFO Happy)
AWS Meetup - Exploring ways to buy EC2 capacity
Hidden Costs of Chasing the Mythical 'Five Nines'
AWS Cost Optimization
AWS Cloud cost optimization
Bootcamp Recap: EC2 Reserved Instances
RightScale Webinar: Decoding AWS Reserved Instances (RIs) What It Means for C...
Controle e otimização de custos na AWS
Order book dynamics in high frequency trading
Ad

Recently uploaded (20)

PPTX
1. Introduction to Computer Programming.pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Hybrid model detection and classification of lung cancer
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Encapsulation theory and applications.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PPTX
TLE Review Electricity (Electricity).pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
project resource management chapter-09.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Approach and Philosophy of On baking technology
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
August Patch Tuesday
1. Introduction to Computer Programming.pptx
Group 1 Presentation -Planning and Decision Making .pptx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Hybrid model detection and classification of lung cancer
Unlocking AI with Model Context Protocol (MCP)
Encapsulation theory and applications.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
TLE Review Electricity (Electricity).pptx
MIND Revenue Release Quarter 2 2025 Press Release
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
project resource management chapter-09.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Approach and Philosophy of On baking technology
A novel scalable deep ensemble learning framework for big data classification...
SOPHOS-XG Firewall Administrator PPT.pptx
August Patch Tuesday

Vimeo EC2 transcoding

Editor's Notes

  • #3: High quality video, first to do HD, work hard on supporting every format and getting the most out of the video.Video player, close second with HTML5 player, we want videos on vimeo to be playable everywhere. iOS, desktop, TV.Friendliest and most supportive community. Lots of positive people who like making videos.Good tools for sharing and privacy. You don’t have to share your videos with the whole world.Two types of users – free and plus.
  • #4: It was thereManaged hosting, expensive storage in 2007. We moved to source file storage to S3 because we could do it really easily. No contracts. It was cheaper than what we had. It had more features than what we had. We needed something now with low commitment.Our first auto scaling EC2 transcoders went up in 2008. Our encoding load at peak was 3-4x higher than non-peak. Our users don't care whether it's a peak time or not, and we prefer not to pay for transcoding machines the five days a week we aren't using them. This workload is perfect for EC2.Since then our workload has normalized a bit, but that was the situation at the time.
  • #5: Upload machine (Long connections, doesn't scale well, high IO)S3Transcoding machine (Jobs take minutes to transcode, open source toolkit, as tightly coupled as can be -- controlled from our datacenter via ssh. Works great if you are small. Thousands of ssh connections though... don't recommend this)
  • #6: We have had a pretty good auto scaling system in place since April 2009. We constantly improve it. We're refreshing it now to make it stateless.At peak few hundred c1.xlarges, dozens of m1.large for uploads. We're experimenting with GPU and cluster compute instances.We buy reserved instances to bring our costs down. In the past we've bought enough to keep them at 100% utilization. If our lowest utilization on a weekday was 50 instances, we bought that number.We've also buy them to guarantee capacity for our plus members. Our plus members shouldn't wait, even when Amazon is low on on-demand instances.The availability guarantee is important. We have had trouble getting capacity at times, for as long as a few days.Now we are buying them to get to 75% utilization. You save money if you use 55% or more.
  • #7: Don't manage spots manually:) Amazon has awesome api support. Leverage on it. We found python boto to be very stable and easy to use. We likeaws-lib for Node.js. We use it for SQS."Thinking of spots for your web servers and database machines. DON'T DO IT”
  • #8: We Keep it simple: Onetime spot request with one instance per request with an expiry of 10 minutes across all availability zonesWe get average price over the last 1hr and Bid 10% more than that.  It is little more complex than that.Watch how many spots you are launching. We have no more than 10 spot requests "open" at any time.We never exceed our bids more than 80% of the on demand price. 80 came from 80-20 rule. When in doubt pick 80. let us just say we just keep tweaking threshold and currently it is set to 80%
  • #9: However Amazon recommends to bid at the  max price you are comfortable with. We don't do that primarily because we never kill spots.. That is not completely true. When we need to scale down, we terminate spots only when we have no more on demand instances to kill.Why we never have to kill spots? We carefully estimate how many machines we need to keep running all the time and buy reserve instance capacity to meet that demand.
  • #10: Spots can be saturated during peak hours.  Deal with it.
  • #11: We terminate machines that have no jobs running on them or have least impact. Also we terminate machines 5 minutes before start of next billing hour.jobs running on spots are retried twice as many times the jobs running on reserve/on demand. If a job failed far too many times and/or is delayed beyond acceptable wait time, it gets to run on an on demand machine.Use spots for your low priority and less time critical jobs.If you have long running jobs bid higher or just use on demand instances.Understand your workload and tweak the spot algorithm to suit your needs.. the single most important thing you can take from this talk.