You are here:  » Parsing feeds for search engine


Parsing feeds for search engine

Submitted by GeXus on Tue, 2006-08-01 15:32 in

Is it posible to use Magic Parser to parse thousands of RSS feeds and search them based on keywords? Would it be more efficient to parse them and save them to a rational DB, or to parse the actual feeds? There could be hundreds of thousands even. And can Magic Parser determine last update?

Submitted by support on Tue, 2006-08-01 16:13

Hi,

It would almost certainly not be practical to search thousands of RSS feeds on a real time basis - that job is best left to a database. This is because parsing is a serial process and involves reading the entire file.

Therefore, you would write code to import the feeds into a database and make sure that you have setup appropriate indexes on the fields you wish to search.

Cheers,
David.

Submitted by GeXus on Tue, 2006-08-01 16:59

Gotcha! so would magic parser be efficient at doing the initial parsing? such as speed and compatibility with most RSS feeds?

Submitted by support on Tue, 2006-08-01 17:02

Absolutely - it should work with any RSS feed and will give you access to all the elements and attributes - and of course it will work with Atom format syndication feeds as well.

Cheers,
David.

Submitted by GeXus on Tue, 2006-08-01 17:11

Great, thank you!