SlideShare a Scribd company logo
API Scrapping.
How to protect your API against something that is not
necessarily an attack
Artem Demchenkov
Illustration © Caterina Carraro/Billie
2
Artem Demchenkov
● CTO & Co-Founder at Billie
● ardemchenkov@gmail.com
● https://www.linkedin.com/in/artem-demchenkov-76b69934/
Billie
We are an innovative financial platform, focused on working capital financing.
Some numbers:
● Start: 01.12.2016
● Closed Beta: 01.05.2017
● Public Launch: 01.06.2017
● Team: 100+
● Engineering Crew: 35
Agenda
4
● API scraping may not look like an attack
● Why should we worry then?
● KYCB
● How to recognize that you’re being scrapped
● How to stop API scraping. It is even possible?
● Bonus
5
Let’s begin with numbers
Some numbers
6
● 46% of web traffic are scraping bots (Distil Networks, 2016)
● 2% of online revenue is lost due to web scraping (Distil Networks, 2016)
● Global e-commerce sales are 3.53 trillion dollars (Statista, 2019)
● 2% of it is >70 billion dollars
7
API Attacks
API Attacks
8
● DDoS
● Injections
● Data Exposure
● Authentication Hijacking
● “Man in the Middle”
● Unencrypted Communication
● Application Abuse
● Parameter Tampering
}Unexpected API usage
9
API Scraping is different
API Scraping
10
● Is a pretty traditional and expected user behaviour
● It doesn’t try to hijack or break anything
● Content - is the main point of interest
● That’s why it is not always easy to identify that you are being scraped
Traditional communication flow
11
USER UI API DATA SOURCE
Traditional communication flow
12
USER UI API DATA SOURCE
Scraping communication flow
13
USER UI API DATA SOURCE
14
Scraping is not allowed
Google Maps Terms of Service
15
3.2.3 Restrictions Against Misusing the Services.
(a) No Scraping. Customer will not export, extract, or otherwise scrape Google Maps
Content for use outside the Services. For example, Customer will not: (i) pre-fetch, index,
store, reshare, or rehost Google Maps Content outside the services; (ii) bulk download
Google Maps tiles, Street View images, geocodes, directions, distance matrix results, roads
information, places information, elevation values, and time zone details; (iii) copy and save
business names, addresses, or user reviews; or (iv) use Google Maps Content with
text-to-speech services.
16
But not everyone is Google, isn’t it?
Scraping communication flow
17
USER UI API DATA SOURCEBOT
Scraping communication flow
18
USER UI API EXTERNAL APIBOT
What makes API Scraping dangerous for API Owners
19
● Unpredictable expenses
● Income losses
● Loss of intellectual property ownership
● Loss of competitive advantage
● Risk of being reverse engineered
20
KYC
Know Your Customers
KYCB
Know Your Customers’ Behavior
How to recognize that your API is being scrapped?
21
Logging Thresholds Monitoring alerts
How to set up a proper Data Monitoring
Smart Data-driven Alerts with Prometheus and Grafana
https://youtu.be/GbwyF6xZwwc
How to prevent API Scraping
23
HTTP-Request limits Per user, Per time period
Basic Firewall rules Headers, Content-Size, IPs
More extended rules Geography, Pattern Detection
IP databases https://www.abuseipdb.com/
Real-time ML based bot detection Specific services (e.g. Cloudflare)
Data thresholds Num or registrations, num of API calls...
24
They say:
no one can block API scraping completely
25
We say:
it is a race and one who stops first loses
26
Bonus Track
Scraping communication flow
27
USER UI API DATA SOURCEBOTHUMAN
This one leaves traces
28
Prior to building a bot,
a human needs to explore your API
Thank you
● ardemchenkov@gmail.com
● https://www.linkedin.com/in/artem-demchenkov-76b69934/
Check it out. There’s interesting
More in our “Engineering Corner” on Medium
30
https://medium.com/billie-finanzratgeber

More Related Content

PDF
apidays LIVE LONDON - Open API Economy: Managing Security and Compliance Risk...
PDF
apidays LIVE LONDON - The State of Banking APIs 2020 by Mark Boyd
PPTX
apidays LIVE Paris - Microservices, up and running by Irakli Nadareishvili
PDF
apidays LIVE Paris - Drawing the right lines: DDD, APIs and Microservices by ...
PDF
apidays LIVE Paris - Driving innovation through External APIs without putting...
PPTX
apidays LIVE JAKARTA - Take control of your microservices with App Mesh by Ak...
PPTX
apidays LIVE LONDON - Toward certifying Financial-grade API profile with Keyc...
PPTX
apidays LIVE LONDON - Transformation of APIs in payments by Neil Munro & Rich...
apidays LIVE LONDON - Open API Economy: Managing Security and Compliance Risk...
apidays LIVE LONDON - The State of Banking APIs 2020 by Mark Boyd
apidays LIVE Paris - Microservices, up and running by Irakli Nadareishvili
apidays LIVE Paris - Drawing the right lines: DDD, APIs and Microservices by ...
apidays LIVE Paris - Driving innovation through External APIs without putting...
apidays LIVE JAKARTA - Take control of your microservices with App Mesh by Ak...
apidays LIVE LONDON - Toward certifying Financial-grade API profile with Keyc...
apidays LIVE LONDON - Transformation of APIs in payments by Neil Munro & Rich...

What's hot (20)

PPTX
apidays LIVE LONDON - Exploring the business value of APIs – from insight to ...
PDF
[WSO2 Integration Summit London 2019] The API-driven World
PDF
[WSO2 Integration Summit Bern 2019] Identity and Access Management in an API-...
PDF
apidays LIVE Paris - The State of SaaS Integration by Gertjan De Wilde
PDF
apidays LIVE LONDON - API platform strategy and operating models by Kiran Nadgir
PPTX
WSO2 - Yenlo Integration Summit Stuttgart May 15 2019 - Open Banking APIs and...
PDF
apidays LIVE LONDON - The Service Management Ecosystem: unification of techno...
PDF
apidays LIVE Australia 2020 - API Design in Fintech: Challenges and Opportuni...
PDF
Banking and Mobile Identity
PDF
apidays LIVE Australia 2021 - Leveraging Async APIs to deliver Cross Domain A...
PPTX
apidays LIVE Hong Kong - The Future of Legacy - How to leverage legacy and on...
PDF
[WSO2 Summit Sydney 2019] Open Banking from the Trenches: How European Banks ...
PDF
Iamwire Investment Pitch
PDF
APIdays London 2019 - Open Banking: An Opportunity, not (just) a Mandate by R...
PDF
APIdays Zurich 2019 - The Three Pillars of API Strategy Erik Wilde, GoodAPI
PDF
[WSO2 Summit Americas 2020 ] Fintech Ecosystems & Consumer Experiences: The N...
PDF
[WSO2 Integration Summit London 2019] Identity and Access Management in an AP...
PDF
apidays LIVE Hong Kong - The Business of APIs by Jed Ng
PDF
apidays LIVE LONDON - Differentiating your Developer Program: Is Speed "A" Di...
PDF
Value Networks in Open Banking
apidays LIVE LONDON - Exploring the business value of APIs – from insight to ...
[WSO2 Integration Summit London 2019] The API-driven World
[WSO2 Integration Summit Bern 2019] Identity and Access Management in an API-...
apidays LIVE Paris - The State of SaaS Integration by Gertjan De Wilde
apidays LIVE LONDON - API platform strategy and operating models by Kiran Nadgir
WSO2 - Yenlo Integration Summit Stuttgart May 15 2019 - Open Banking APIs and...
apidays LIVE LONDON - The Service Management Ecosystem: unification of techno...
apidays LIVE Australia 2020 - API Design in Fintech: Challenges and Opportuni...
Banking and Mobile Identity
apidays LIVE Australia 2021 - Leveraging Async APIs to deliver Cross Domain A...
apidays LIVE Hong Kong - The Future of Legacy - How to leverage legacy and on...
[WSO2 Summit Sydney 2019] Open Banking from the Trenches: How European Banks ...
Iamwire Investment Pitch
APIdays London 2019 - Open Banking: An Opportunity, not (just) a Mandate by R...
APIdays Zurich 2019 - The Three Pillars of API Strategy Erik Wilde, GoodAPI
[WSO2 Summit Americas 2020 ] Fintech Ecosystems & Consumer Experiences: The N...
[WSO2 Integration Summit London 2019] Identity and Access Management in an AP...
apidays LIVE Hong Kong - The Business of APIs by Jed Ng
apidays LIVE LONDON - Differentiating your Developer Program: Is Speed "A" Di...
Value Networks in Open Banking
Ad

Similar to apidays LIVE LONDON - API scrapping: how to protect your API against something which is not necessarily an attack by Artem Demchenkov (20)

PDF
API Scraping. How to protect your API against something that is not necessar...
PPTX
What is Scraping API and How Does It Works?
PDF
how to extract api from websites learn more
PDF
5 step plan to securing your APIs
PPTX
APIs: The New Security Layer
PPTX
I'm an API Hacker, Here's How to Go from Making APIs to Breaking Them - Katie...
PDF
Takeaways from API Security Breaches Webinar
PDF
INTERFACE by apidays 2023 - Everything you need to know about API security, T...
PPTX
Appdome & OWASP Keynote Presentation | API World 2019
PDF
CIS14: Best Practices You Must Apply to Secure Your APIs
PDF
Api security-present
PDF
LF_APIStrat17_OWASP’s Latest Category: API Underprotection
PDF
APIsecure 2023 - Understanding and Identifying Threats Against APIs, Shannon ...
PDF
APIs: The Building Blocks of the Future
PDF
Hacking and Defending APIs - Red and Blue make Purple.pdf
PDF
The Dev, Sec and Ops of API Security - API World
PDF
A Look At API Economy Trends In 2024 - by Bill Doerrfeld, Nordic APIs
PDF
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
PDF
Akamai_ API Security Best Practices - Real-world attacks and breaches
PPTX
Using & Abusing APIs: An Examination of the API Attack Surface
API Scraping. How to protect your API against something that is not necessar...
What is Scraping API and How Does It Works?
how to extract api from websites learn more
5 step plan to securing your APIs
APIs: The New Security Layer
I'm an API Hacker, Here's How to Go from Making APIs to Breaking Them - Katie...
Takeaways from API Security Breaches Webinar
INTERFACE by apidays 2023 - Everything you need to know about API security, T...
Appdome & OWASP Keynote Presentation | API World 2019
CIS14: Best Practices You Must Apply to Secure Your APIs
Api security-present
LF_APIStrat17_OWASP’s Latest Category: API Underprotection
APIsecure 2023 - Understanding and Identifying Threats Against APIs, Shannon ...
APIs: The Building Blocks of the Future
Hacking and Defending APIs - Red and Blue make Purple.pdf
The Dev, Sec and Ops of API Security - API World
A Look At API Economy Trends In 2024 - by Bill Doerrfeld, Nordic APIs
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
Akamai_ API Security Best Practices - Real-world attacks and breaches
Using & Abusing APIs: An Examination of the API Attack Surface
Ad

More from apidays (20)

PDF
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
PDF
apidays Munich 2025 - Developer Portals, API Catalogs, and Marketplaces, Miri...
PDF
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
PDF
apidays Munich 2025 - Integrate Your APIs into the New AI Marketplace, Senthi...
PDF
apidays Munich 2025 - The Double Life of the API Product Manager, Emmanuel Pa...
PDF
apidays Munich 2025 - Let’s build, debug and test a magic MCP server in Postm...
PDF
apidays Munich 2025 - The life-changing magic of great API docs, Jens Fischer...
PDF
apidays Munich 2025 - Automating Operations Without Reinventing the Wheel, Ma...
PDF
apidays Munich 2025 - Geospatial Artificial Intelligence (GeoAI) with OGC API...
PPTX
apidays Munich 2025 - GraphQL 101: I won't REST, until you GraphQL, Surbhi Si...
PPTX
apidays Munich 2025 - Effectively incorporating API Security into the overall...
PPTX
apidays Munich 2025 - Federated API Management and Governance, Vince Baker (D...
PPTX
apidays Munich 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (Aavista Oy)
PPTX
apidays Munich 2025 - Streamline & Secure LLM Traffic with APISIX AI Gateway ...
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
PDF
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
PDF
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays Munich 2025 - Developer Portals, API Catalogs, and Marketplaces, Miri...
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays Munich 2025 - Integrate Your APIs into the New AI Marketplace, Senthi...
apidays Munich 2025 - The Double Life of the API Product Manager, Emmanuel Pa...
apidays Munich 2025 - Let’s build, debug and test a magic MCP server in Postm...
apidays Munich 2025 - The life-changing magic of great API docs, Jens Fischer...
apidays Munich 2025 - Automating Operations Without Reinventing the Wheel, Ma...
apidays Munich 2025 - Geospatial Artificial Intelligence (GeoAI) with OGC API...
apidays Munich 2025 - GraphQL 101: I won't REST, until you GraphQL, Surbhi Si...
apidays Munich 2025 - Effectively incorporating API Security into the overall...
apidays Munich 2025 - Federated API Management and Governance, Vince Baker (D...
apidays Munich 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (Aavista Oy)
apidays Munich 2025 - Streamline & Secure LLM Traffic with APISIX AI Gateway ...
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
1. Introduction to Computer Programming.pptx
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPT
Teaching material agriculture food technology
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Encapsulation theory and applications.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPTX
cloud_computing_Infrastucture_as_cloud_p
PPTX
Tartificialntelligence_presentation.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
August Patch Tuesday
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Mobile App Security Testing_ A Comprehensive Guide.pdf
1. Introduction to Computer Programming.pptx
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Teaching material agriculture food technology
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Encapsulation theory and applications.pdf
MIND Revenue Release Quarter 2 2025 Press Release
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Encapsulation_ Review paper, used for researhc scholars
NewMind AI Weekly Chronicles - August'25-Week II
cloud_computing_Infrastucture_as_cloud_p
Tartificialntelligence_presentation.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Spectral efficient network and resource selection model in 5G networks
Network Security Unit 5.pdf for BCA BBA.
August Patch Tuesday
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

apidays LIVE LONDON - API scrapping: how to protect your API against something which is not necessarily an attack by Artem Demchenkov

  • 1. API Scrapping. How to protect your API against something that is not necessarily an attack Artem Demchenkov Illustration © Caterina Carraro/Billie
  • 2. 2 Artem Demchenkov ● CTO & Co-Founder at Billie ● ardemchenkov@gmail.com ● https://www.linkedin.com/in/artem-demchenkov-76b69934/
  • 3. Billie We are an innovative financial platform, focused on working capital financing. Some numbers: ● Start: 01.12.2016 ● Closed Beta: 01.05.2017 ● Public Launch: 01.06.2017 ● Team: 100+ ● Engineering Crew: 35
  • 4. Agenda 4 ● API scraping may not look like an attack ● Why should we worry then? ● KYCB ● How to recognize that you’re being scrapped ● How to stop API scraping. It is even possible? ● Bonus
  • 6. Some numbers 6 ● 46% of web traffic are scraping bots (Distil Networks, 2016) ● 2% of online revenue is lost due to web scraping (Distil Networks, 2016) ● Global e-commerce sales are 3.53 trillion dollars (Statista, 2019) ● 2% of it is >70 billion dollars
  • 8. API Attacks 8 ● DDoS ● Injections ● Data Exposure ● Authentication Hijacking ● “Man in the Middle” ● Unencrypted Communication ● Application Abuse ● Parameter Tampering }Unexpected API usage
  • 9. 9 API Scraping is different
  • 10. API Scraping 10 ● Is a pretty traditional and expected user behaviour ● It doesn’t try to hijack or break anything ● Content - is the main point of interest ● That’s why it is not always easy to identify that you are being scraped
  • 15. Google Maps Terms of Service 15 3.2.3 Restrictions Against Misusing the Services. (a) No Scraping. Customer will not export, extract, or otherwise scrape Google Maps Content for use outside the Services. For example, Customer will not: (i) pre-fetch, index, store, reshare, or rehost Google Maps Content outside the services; (ii) bulk download Google Maps tiles, Street View images, geocodes, directions, distance matrix results, roads information, places information, elevation values, and time zone details; (iii) copy and save business names, addresses, or user reviews; or (iv) use Google Maps Content with text-to-speech services.
  • 16. 16 But not everyone is Google, isn’t it?
  • 17. Scraping communication flow 17 USER UI API DATA SOURCEBOT
  • 18. Scraping communication flow 18 USER UI API EXTERNAL APIBOT
  • 19. What makes API Scraping dangerous for API Owners 19 ● Unpredictable expenses ● Income losses ● Loss of intellectual property ownership ● Loss of competitive advantage ● Risk of being reverse engineered
  • 20. 20 KYC Know Your Customers KYCB Know Your Customers’ Behavior
  • 21. How to recognize that your API is being scrapped? 21 Logging Thresholds Monitoring alerts
  • 22. How to set up a proper Data Monitoring Smart Data-driven Alerts with Prometheus and Grafana https://youtu.be/GbwyF6xZwwc
  • 23. How to prevent API Scraping 23 HTTP-Request limits Per user, Per time period Basic Firewall rules Headers, Content-Size, IPs More extended rules Geography, Pattern Detection IP databases https://www.abuseipdb.com/ Real-time ML based bot detection Specific services (e.g. Cloudflare) Data thresholds Num or registrations, num of API calls...
  • 24. 24 They say: no one can block API scraping completely
  • 25. 25 We say: it is a race and one who stops first loses
  • 27. Scraping communication flow 27 USER UI API DATA SOURCEBOTHUMAN This one leaves traces
  • 28. 28 Prior to building a bot, a human needs to explore your API
  • 29. Thank you ● ardemchenkov@gmail.com ● https://www.linkedin.com/in/artem-demchenkov-76b69934/
  • 30. Check it out. There’s interesting More in our “Engineering Corner” on Medium 30 https://medium.com/billie-finanzratgeber