Linux, Programming and Me
http://www.feed43.com
December 3, 2011I have been looking at a good website to create feeds. I tried a few like page2rss, feedyes etc. However, feed43 gives the maximum flexibility and best out put for your efforts.
How this works is very interesting. It tells me that I can setup one such utility site using wget, grep and a bit of php coding. This is how it works.
- Enter the url of the page that you want to convert to rss
- The site will retrieve the content and show you the html code
- Recognize the part of the page that you want to use. This is very useful as most pages will have some static and some dynamic content. This is done by identifying the start and end text and entering start_text{%}end_text
- Then, in a similar way, identify the start text and end text patterns for each news item within the above segment.
- Thats it, you are ready to go. However, the site creates a preview of items recognized using your inputs and allows you to optionally choose title, link and body of the rss feed.
The site has good help documents and you can get them by clicking on the ? next to each section. I created feed for MEA (Ministry of External Affairs, India) using this site. The details are below for those who want to try it
URL: http://www.mea.gov.in/mystart.php?id=1900
Global Search pattern: <div class=”newsListing”>{%}<div class=”footer”>
Item Search pattern: <span>{%}</span>{%}<a href=”{%}” class=”more”>{%}</a>
Global Search pattern: <div class=”newsListing”>{%}<div class=”footer”>
Item Search pattern: <span>{%}</span>{%}<a href=”{%}” class=”more”>{%}</a>
Posted in: Civil Service, Feeds, MEA, RSS, Uncategorized