Saturday 23 March 2019

Need Tips: Advanced Scraping Methods

Hi, I'm looking for articles, videos, or other resources on more advanced scraping methods. My real question is: how do apps like Pocket always grab the body or headline of an article without knowing the selectors on the page? I imagine they have some kind of algorithm that traverses the DOM and decides whether a div holds the relevant content. I'm just wondering what those algorithms might look like. Or maybe they pull the content from a sitemap or RSS feed? Any advice helps. Thanks!
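To make the question concrete, here is a rough sketch of the kind of DOM-traversal heuristic I have in mind. It is only a guess, not how Pocket actually works (I don't know that), and it is similar in spirit to open-source readability-style extractors: every reasonably long paragraph awards points to its parent and half as many to its grandparent, text inside links is ignored, and the highest-scoring element is taken as the article body. The names (`guess_article`, `TreeBuilder`, `Node`), the tag sets, and the thresholds (25 characters, one point per comma, up to 3 points for length) are all assumptions for illustration.

```python
from html.parser import HTMLParser

SKIP = {"script", "style", "nav", "footer", "aside", "form"}  # assumed "chrome" tags
VOID = {"br", "img", "hr", "input", "meta", "link"}            # tags with no end tag


class Node:
    def __init__(self, tag, parent=None):
        self.tag, self.parent = tag, parent
        self.children = []  # mix of str (text) and Node, in document order
        self.score = 0.0


class TreeBuilder(HTMLParser):
    """Builds a minimal element tree; tolerant of stray end tags."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.cur = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.cur)
        self.cur.children.append(node)
        if tag not in VOID:
            self.cur = node

    def handle_endtag(self, tag):
        # Walk up to the nearest open element with this tag, if any.
        node = self.cur
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.cur = node.parent

    def handle_data(self, data):
        self.cur.children.append(data)


def text_of(node, skip_links=False):
    """Whitespace-normalised text of a subtree, optionally ignoring <a> text."""
    if node.tag in SKIP or (skip_links and node.tag == "a"):
        return ""
    parts = [c if isinstance(c, str) else text_of(c, skip_links)
             for c in node.children]
    return " ".join("".join(parts).split())


def walk(node):
    """Yield every element in document order, not descending into SKIP tags."""
    if node.tag in SKIP:
        return
    yield node
    for child in node.children:
        if isinstance(child, Node):
            yield from walk(child)


def guess_article(markup):
    """Return the paragraphs of the element that looks most like an article."""
    builder = TreeBuilder()
    builder.feed(markup)
    builder.close()
    for p in walk(builder.root):
        if p.tag != "p":
            continue
        text = text_of(p, skip_links=True)
        if len(text) < 25:  # too short, or mostly link text
            continue
        points = 1 + text.count(",") + min(len(text) // 100, 3)
        p.parent.score += points
        if p.parent.parent is not None:
            p.parent.parent.score += points / 2
    candidates = [n for n in walk(builder.root) if n.score > 0]
    if not candidates:
        return []
    best = max(candidates, key=lambda n: n.score)
    return [text_of(p) for p in walk(best) if p.tag == "p"]
```

Calling `guess_article(page_source)` on a page's HTML returns a list of paragraph strings from the best-scoring container. It uses only the standard library and does not handle unclosed `<p>` tags or pages that don't use `<p>` at all, so treat it as a starting point for discussion rather than something production-ready.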

Submitted March 23, 2019 at 05:48PM by maximusprime2328
