Consider the list of weblogs at http://static.userland.com/weblogMonitor/logs.xml
<?xml version="1.0" encoding="ISO-8859-1" ?> <!DOCTYPE foo SYSTEM "http://msdn.microsoft.com/xml/general/htmlentities.dtd"> <weblogs> <log> <name>MozillaZine</name> <url>http://www.mozillazine.org</url> <changesUrl>http://www.mozillazine.org/contents.rdf</changesUrl> <ownerName>Jason Kersey</ownerName> <ownerEmail>kerz@en.com</ownerEmail> <description>THE source for news on the Mozilla Organization. DevChats, Reviews, Chats, Builds, Demos, Screenshots, and more.</description> <imageUrl></imageUrl> <adImageUrl>http://static.userland.com/weblogMonitor/ads/kerz@en.com.gif</adImageUrl> </log> <log> <name>SalonHerringWiredFool</name> <url>http://www.salonherringwiredfool.com/</url> <ownerName>Some Random Herring</ownerName> <ownerEmail>salonfool@wiredherring.com</ownerEmail> <description></description> </log> <log> <name>SlashDot.Org</name> <url>http://www.slashdot.org/</url> <ownerName>Simply a friend</ownerName> <ownerEmail>afriendofweblogs@weblogs.com</ownerEmail> <description>News for Nerds, Stuff that Matters.</description> </log> </weblogs>
The changesUrl
element points to a document like
this:
<?xml version="1.0"?> <!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN" "http://my.netscape.com/publish/formats/rss-0.91.dtd"> <rss version="0.91"> <channel> <title>MozillaZine</title> <link>http://www.mozillazine.org/</link> <language>en-us</language> <description>Your source for Mozilla news, advocacy, interviews, builds, and more!</description> <copyright>Copyright 1998-2002, The MozillaZine Organization</copyright> <managingEditor>jason@mozillazine.org</managingEditor> <webMaster>jason@mozillazine.org</webMaster> <image> <title>MozillaZine</title> <url>http://www.mozillazine.org/image/mynetscape88.gif</url> <description>Your source for Mozilla news, advocacy, interviews, builds, and more!</description> <link>http://www.mozillazine.org/</link> </image> <item> <title>BugDays Are Back!</title> <link>http://www.mozillazine.org/talkback.html?article=2151</link> </item> <item> <title>Independent Status Reports</title> <link>http://www.mozillazine.org/talkback.html?article=2150</link> </item> </channel> </rss>
We want to process all the item
elements from each weblog.