Feedalizer: Easy Webpage to RSS Conversion from Ruby
Feedalizer is a Ruby library that lets you easily scrape Web pages and convert them into RSS feeds. Some demo code that generates a feed:
require "feedalizer" require "time" url = "http://sydsvenskan.se/serier/nemi/article101047.ece?context=serie" feedalize(url) do feed.title = "Nemi" feed.about = "..." feed.description = "Daily Nemi strip scraped from Sydsvenskan" scrape_items("option") do |rss_item, html_element| rss_item.link = html_element.attributes["value"] rss_item.date = Time.parse(html_element.innerHTML) rss_item.title = rss_item.date.strftime("%Y-%m-%d") # This grabs the page for a particular strip and extracts the relevant img element rss_item.description = grab_page(rss_item.link).search("//img[@width=748]") end output! end