Scrapy is a fast, open-source Python framework for crawling websites and extracting structured data from their pages. It scales to large crawling projects, handles requests asynchronously so that many pages can be fetched concurrently, and can adjust its crawling speed automatically through the AutoThrottle extension. Scrapy has built-in support for selecting data with XPath or CSS expressions, and it can export the scraped items in formats such as JSON, CSV, and XML.
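As a rough illustration of the auto-throttling and export features, a project's settings.py might contain something like the fragment below. The setting names are Scrapy's own (the FEEDS setting assumes Scrapy 2.1 or later); the delay values and the output file paths are illustrative assumptions, not recommendations.

```python
# settings.py (fragment): a minimal sketch, not a complete Scrapy configuration.

# Enable the AutoThrottle extension, which adapts the download delay
# to the latency observed from the site being crawled.
AUTOTHROTTLE_ENABLED = True
AUTOTHROTTLE_START_DELAY = 5.0          # initial download delay in seconds (assumed value)
AUTOTHROTTLE_MAX_DELAY = 60.0           # upper bound on the delay in seconds (assumed value)
AUTOTHROTTLE_TARGET_CONCURRENCY = 1.0   # average parallel requests per remote site (assumed value)

# Export scraped items to several formats at once.
# The file paths are hypothetical; each key is an output URI.
FEEDS = {
    "output/items.json": {"format": "json", "encoding": "utf8"},
    "output/items.csv": {"format": "csv"},
    "output/items.xml": {"format": "xml"},
}
```

With settings along these lines, running a spider from the project writes the same items to all three files, and the delay between requests is tuned at run time instead of being fixed.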