Web Scraping Tutorial #1 | Complete Scrapy Project with Infinite Scroll | How To Scrape "Load More"

Published: 01 January 1970
on channel: Python 360
1,017
16

We are very excited to bring you our latest Scrapy project / Web Scraping Tutorial.
Web Scraping a large site with "infinite scroll" aka "AJAX load more".
(This is based on a real world project from Upwork).

** PART 1 of 7 **

This covers the analysis of the site, as we identify the relevant elements and sources for the selectors that we will put into our Scrapy spider.



If you just want to skip to a section:
2:29 Looking for the full recipe/ingredients source
3:14 Checking AJAX / XHR in Network tab in Developer Tools in Firefox
5:25 Confirming the URL increments during scroll
7:06 Looking for "json" in the source
8:26 Main Page Thumbnails - Links to details pages - test in Scrapy shell
9:27 response.xpath("//script[@type='application/ld+json']/text()").get();
11:04 Locating the FULL list of ingredients
14:21 Locating the FULL recipe
17:51 testing the increment of "pn=" to use in our spider for our "next page"
23:59 Displaying the full list of ingredients



The json we find is extremely useful, so during the video you will see how we plan to use it and pick out the "links" that we will work on more during part 2.

Please comment, subscribe, like 👍, as this is taking a lot of effort and the more comments/feedback the better I can tailor future videos to you.

GitHub Code
https://github.com/RGGH/Scrapy6/blob/...

Cheers,
Dr Pi.

#scrapy #infinitescroll #loadmore


Watch video Web Scraping Tutorial #1 | Complete Scrapy Project with Infinite Scroll | How To Scrape "Load More" online, duration hours minute second in high quality that is uploaded to the channel Python 360 01 January 1970. Share the link to the video on social media so that your subscribers and friends will also watch this video. This video clip has been viewed 1,017 times and liked it 16 visitors.