<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
xmlns:rawvoice="http://www.rawvoice.com/rawvoiceRssModule/"
	>
<channel>
	<title>Comments on: Extracting Content From Websites Using BeautifulSoup</title>
	<atom:link href="http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/</link>
	<description>Programming, Writing, and The Internet</description>
	<lastBuildDate>Sun, 05 Feb 2012 02:43:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: joshaidan</title>
		<link>http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/comment-page-1/#comment-25891</link>
		<dc:creator>joshaidan</dc:creator>
		<pubDate>Thu, 14 Apr 2011 15:39:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/#comment-25891</guid>
		<description>yeah, too much ruby programming. :)</description>
		<content:encoded><![CDATA[<p>yeah, too much ruby programming. <img src='http://www.brianjones.ca/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bsdwiz</title>
		<link>http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/comment-page-1/#comment-25890</link>
		<dc:creator>Bsdwiz</dc:creator>
		<pubDate>Thu, 14 Apr 2011 15:17:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/#comment-25890</guid>
		<description>array? i think you mean list.</description>
		<content:encoded><![CDATA[<p>array? i think you mean list.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steven Pigeon</title>
		<link>http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/comment-page-1/#comment-25543</link>
		<dc:creator>Steven Pigeon</dc:creator>
		<pubDate>Wed, 22 Oct 2008 19:45:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/#comment-25543</guid>
		<description>I&#039;m also learning to use BeautifulSoup for that kind of manipulation, and I&#039;d suggest a (small) improvement to your code: not all &lt;a&gt; tags have an href attribute (as I discovered the hard way). Use&lt;br&gt;&lt;br&gt;url = a.get(&#039;href&#039;) (which may return None)&lt;br&gt;&lt;br&gt;or&lt;br&gt;&lt;br&gt;url = a.get(&#039;href&#039;, &#039;&#039;) if you don&#039;t want None.</description>
		<content:encoded><![CDATA[<p>I&#39;m also learning to use BeautifulSoup for that kind of manipulation, and I&#39;d suggest a (small) improvement to your code: not all &lt;a&gt; tags have an href attribute (as I discovered the hard way). Use</p>
<p>url = a.get(&#39;href&#39;) (which may return None)</p>
<p>or</p>
<p>url = a.get(&#39;href&#39;, &#39;&#39;) if you don&#39;t want None.</p>
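A minimal sketch of the two calls suggested above. It assumes the current bs4 package (not the BeautifulSoup 3 module in use when this comment was written) and a made-up HTML snippet whose second anchor has no href; the variable names are illustrative only.

```python
from bs4 import BeautifulSoup  # assumption: modern bs4, not BeautifulSoup 3

# Hypothetical sample markup: the second anchor has no href attribute.
html = '<a href="http://www.python.org">Python</a><a name="top">Top</a>'

soup = BeautifulSoup(html, "html.parser")

# a["href"] would raise KeyError on the second anchor; a.get() does not.
# Without a default, a missing attribute comes back as None.
maybe_urls = [a.get("href") for a in soup.find_all("a")]

# With a default of '', every entry is a string and None never appears.
urls = [a.get("href", "") for a in soup.find_all("a")]

print(maybe_urls)
print(urls)
```

Which form to prefer likely depends on what happens next: None makes a missing href easy to filter out, while the empty-string default keeps later string operations from failing.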
]]></content:encoded>
	</item>
	<item>
		<title>By: Brian</title>
		<link>http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/comment-page-1/#comment-25127</link>
		<dc:creator>Brian</dc:creator>
		<pubDate>Tue, 15 Apr 2008 14:06:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/#comment-25127</guid>
		<description>Tawa,

You&#039;re not going to want to start at the command line.  Start with a programming language such as Python or Ruby.  First you will need to build a script to extract content from the website.

http://www.python.org
http://www.ruby-lang.org

Brian.</description>
		<content:encoded><![CDATA[<p>Tawa,</p>
<p>You&#8217;re not going to want to start at the command line.  Start with a programming language such as Python or Ruby.  First you will need to build a script to extract content from the website.</p>
<p><a href="http://www.python.org" rel="nofollow">http://www.python.org</a><br />
<a href="http://www.ruby-lang.org" rel="nofollow">http://www.ruby-lang.org</a></p>
<p>Brian.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: tawa</title>
		<link>http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/comment-page-1/#comment-25126</link>
		<dc:creator>tawa</dc:creator>
		<pubDate>Tue, 15 Apr 2008 11:28:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.brianjones.ca/archives/2008/01/27/extracting-content-from-websites-using-beautifulsoup/#comment-25126</guid>
		<description>This is good. It certainly looks like what I am looking for. However, I am very new to this. To implement the above, say I want to parse a website, what should I type at the command line? In short, I need to know how to start. Thanks.</description>
		<content:encoded><![CDATA[<p>This is good. It certainly looks like what I am looking for. However, I am very new to this. To implement the above, say I want to parse a website, what should I type at the command line? In short, I need to know how to start. Thanks.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

