Saturday 31 August 2019

Different approaches for building a website that aggregates news from multiple sources, how to go about this?

I am trying to build a website that aggregates news from multiple sources (Node.js)After doing lots of research, it seems that there are multiple approaches to do this sort of thingSimplest Way:Subscribe to something like Google News rss for the interested categories of news and simply relayThe simplest way mentioned above is ILLEGAL if you are doing this for commercial purposes which is exactly my planModerately Complex Way:Get a bunch of RSS urls from each websiteScan them every minute or twoCheck if there are any changesProceed to emit changed itemsAfter hours of searching on GitHb I found THISThis library was updated 4 years ago thoughIs something wrong with this method?Does it not work for 2019Extremely Complex Way:Have AI load each pageExtract contentFilter tags and other anomaliesProcess and storeQuestionsHow do I go about this, I dont plan to involve AI on day 1Is RSS the way to go in 2019?Is anyone aware of any techniques apart from the 3 I listed to build such a website?Thanks

Submitted August 31, 2019 at 05:23PM by amazeguy

No comments:

Post a Comment