Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.
Check the Scrapy homepage at https://scrapy.org for more information, including a list of features.
- IMDB in multiple domains
- Most popular movies.
- Lowest rated movies.
- Most popular TV shows.
- Top rated TV shows.
- Top rated movies.
- All restaurants in gandhinagar (Gujarat) from zomato.
- Basic scraping of quotes.scrape.com
- Python
- SQL
- Python 3.6+
- Works on Linux, Windows, macOS, BSD
The quick way::
pip install scrapy
See the install section in the documentation at https://docs.scrapy.org/en/latest/intro/install.html for more details.
Documentation is available online at https://docs.scrapy.org/ and in the docs
directory.
You can check https://docs.scrapy.org/en/latest/news.html for the release notes.