<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>David Lowry &#187; ruby</title>
	<atom:link href="http://davidlowry.co.uk/category/ruby/feed/" rel="self" type="application/rss+xml" />
	<link>http://davidlowry.co.uk</link>
	<description>Setting out freelance with Web and AV Solutions</description>
	<lastBuildDate>Wed, 11 Aug 2010 13:03:18 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Blog Move 1: Getting WordPress data to Ruby using XML</title>
		<link>http://davidlowry.co.uk/export-wordpress-to-ruby-rails/</link>
		<comments>http://davidlowry.co.uk/export-wordpress-to-ruby-rails/#comments</comments>
		<pubDate>Sat, 05 Jun 2010 22:44:50 +0000</pubDate>
		<dc:creator>David</dc:creator>
				<category><![CDATA[Code]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[hpricot]]></category>
		<category><![CDATA[wordpress]]></category>
		<category><![CDATA[xml]]></category>

		<guid isPermaLink="false">http://davidlowry.co.uk/?p=341</guid>
		<description><![CDATA[Step 1 in the &#8220;Moving my blog&#8221; process is &#8220;Extract the current site’s data into a manageable format&#8221; Frankly, that&#8217;s easy! WordPress has a functionality to export the site&#8217;s content to a single XML file containing all the published Categories, &#8230; <a href="http://davidlowry.co.uk/export-wordpress-to-ruby-rails/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Step 1 in the &#8220;Moving my blog&#8221; process is &#8220;Extract the current site’s data into a manageable format&#8221;</p>
<p>Frankly, that&#8217;s easy! WordPress has a functionality to export the site&#8217;s content to a single XML file containing all the published Categories, Tags, Posts, Pages and Comments. To do this (WordPress v2.9.2) click Tools &gt; Export and save the file. In previous versions of the software I believe it&#8217;s under the Manage menu.</p>
<p><script src="http://gist.github.com/427052.js?file=gistfile4.xml"></script><br />
 <noscript><a href="http://davidlowry.co.uk/wordpress/wp-content/uploads/2010/06/Screen-shot-2010-06-05-at-22.43.57.png"><img class="alignnone size-full wp-image-342" title="Wordpress XML Code" src="http://davidlowry.co.uk/wordpress/wp-content/uploads/2010/06/Screen-shot-2010-06-05-at-22.43.57.png" alt="Wordpress XML Code: viewed in Dashcode, if you care." width="613" height="323" /></a></noscript></p>
<p>I&#8217;m aware I could import the data directly from the WordPress database (to wherever it goes in the end) but let&#8217;s imagine we can&#8217;t. Anyway, database access would be tediously slow and inefficient to test against and implement.</p>
<p>A quick google for <a href="http://www.google.co.uk/search?q=%22import+wordpress+xml+ruby%22">&#8220;import wordpress xml ruby&#8221;</a> threw up nothing helpful so I turned to the Ruby XML libraries. <a href="http://railstips.org/">John Nunemaker</a> &#8220;feverishly posts everything he learns&#8221; at <a href="http://railstips.org/">railstips.org</a> and has two articles of use here:</p>
<ul>
<li><a href="http://railstips.org/blog/archives/2006/12/09/parsing-xml-with-hpricot/">Parsing XML with hpricot</a></li>
<li><a href="http://railstips.org/blog/archives/2008/08/11/parsing-xml-with-ruby/">Parsing XML with Ruby</a></li>
</ul>
<p>The latter deals with three different ruby xml libraries and compares their speed, ease of use and how nice their names are to say. He puts <a href="http://www.germane-software.com/software/rexml/">REXML</a>, <a href="http://hpricot.com/">hpricot</a> and <a href="http://libxml.rubyforge.org/">libxml-ruby</a>. I&#8217;ll save you the pleasure of reading the article (if you like) and ccv John&#8217;s summary:</p>
<blockquote><p>
&#8220;Libxml is blisteringly fast, [but] Hpricot has cooler name, REXML and Hpricot both feel easier to use out of the box&#8221;
</p></blockquote>
<p>And there you go. Hpricot it is!</p>
<p>Now to get the data into Ruby. After a quick glance at the rubytips article and <a href="http://rdoc.info/projects/hpricot/hpricot">The RDocs</a> I put together this code as a starting point:<br />
 <script src="http://gist.github.com/427052.js?file=gistfile2.builder"></script><br />
<noscript></p>
<pre><code>
cats_hierarchy={}
(doc/"wp:category").each do |category|
    cat_name = category.at("wp:category_nicename").innerHTML
    cat_parent = category.at("wp:category_parent").innerHTML

    if cats_hierarchy.include? cat_parent
        cats_hierarchy[cat_parent] = cat_name
    else
        cats_hierarchy[cat_name] = []
    end
end

cats = cats_hierarchy.to_a.flatten
</code></pre>
<p></noscript></p>
<p>That gives me two each to use Ruby objects each containing all of my category data: a hash which preserves the hierarchy of the structure and all the names in a linear array.<br />
<script src="http://gist.github.com/427052.js?file=gistfile3.builder"></script><br />
<noscript></p>
<pre><code>
?> cats = cats_hierarchy.to_a.flatten.uniq
=> ["route66", nil, "rails", "american-2008", "reciprocal-affection", "hope-for-the-future", "code", "blog", "review-blog", "rant", "brands", "projects", "yab_shop", "textpattern", "meaningful-labor", "giants", "accessibility", "root", "charity-project", "apple", "xhtml", "america-2006-route-66", "ruby", "learning", "america-2007", "uncategorized", "iphone", "america-2008"]

?> cats_hierarchy
=> {"route66"=>nil, "rails"=>nil, "american-2008"=>nil, "reciprocal-affection"=>nil, "hope-for-the-future"=>nil, "code"=>nil, "blog"=>"review-blog", "rant"=>nil, "brands"=>nil, "projects"=>nil, "yab_shop"=>nil, "textpattern"=>nil, "meaningful-labor"=>nil, "giants"=>nil, "accessibility"=>nil, "root"=>nil, "charity-project"=>nil, "apple"=>nil, "xhtml"=>nil, "america-2006-route-66"=>nil, "ruby"=>nil, "learning"=>nil, "america-2007"=>nil, "uncategorized"=>nil, "iphone"=>nil, "america-2008"=>nil}
</code></pre>
<p></noscript></p>
<p>And so we have the starting point to getting this WordPress exported XML data into a Ruby application.</p>
<p>More soon.</p>
]]></content:encoded>
			<wfw:commentRss>http://davidlowry.co.uk/export-wordpress-to-ruby-rails/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Simple Thinking Sphinx on Dreamhost</title>
		<link>http://davidlowry.co.uk/simple-thinking-sphinx-on-dreamhost/</link>
		<comments>http://davidlowry.co.uk/simple-thinking-sphinx-on-dreamhost/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 13:10:16 +0000</pubDate>
		<dc:creator>David</dc:creator>
				<category><![CDATA[learning]]></category>
		<category><![CDATA[meaningful labor]]></category>
		<category><![CDATA[rails]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[dreamhost]]></category>
		<category><![CDATA[railscasts]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[sphinx]]></category>
		<category><![CDATA[thinking sphinx]]></category>
		<category><![CDATA[tutorial]]></category>

		<guid isPermaLink="false">http://davidlowry.co.uk/?p=279</guid>
		<description><![CDATA[*** Please note &#8211; this will probably not work (at all) (for more than a day of light use) without Cron use. And isn&#8217;t at all authorised by Dreamhost!! *** For a recent client project I&#8217;ve used a Dreamhost unlimited &#8230; <a href="http://davidlowry.co.uk/simple-thinking-sphinx-on-dreamhost/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>*** Please note &#8211; this will probably not work (at all) (for more than a day of light use) without Cron use. And isn&#8217;t at all authorised by Dreamhost!! *** </p>
<p>For a recent client project I&#8217;ve used a Dreamhost unlimited account, which for value compared with the resources available and the fact that you don&#8217;t have to do any building or setting up of the server environment makes it an easy win for a site that&#8217;s not going to have a huge amount of traffic or a large amount of processing.</p>
<p>Post-launch I got to work putting together a basic search engine and here&#8217;s a quick run through of the steps it took to get a very simple Sphinx instance working on Dreamhost, and a few hurdles thrown in the way by various googled articles.</p>
<h3>Development Environment</h3>
<p>Using the <a href="http://freelancing-gods.com/posts/a_concise_guide_to_using_thinking_sphinx">guide from FG</a> install Sphinx locally:</p>
<pre><code>curl -O http://sphinxsearch.com/downloads/sphinx-0.9.8-rc2.tar.gz
tar zxvf sphinx-0.9.8-rc2.tar.gz
cd sphinx-0.9.8-rc2
./configure
make
sudo make install</code></pre>
<p>then install the TS plugin into your application</p>
<pre><code>script/plugin install git://github.com/freelancing-god/thinking-sphinx.git
</code></pre>
<p>Any problems with that, check out the <a href="http://freelancing-gods.com/posts/a_concise_guide_to_using_thinking_sphinx">FG page</a> linked.</p>
<h3>Getting a basic search going</h3>
<p>Following tutorials such as the <a href="http://railscasts.com/episodes/120-thinking-sphinx">Sphinx Railscast</a> will get you there pretty quick.</p>
<p>In your searchable model you need to define an index</p>
<pre><code>
class Page < ActiveRecord::Base
  define_index do
    indexes :title
    indexes :long
    indexes :short
  end ...</code></pre>
<p>Run the indexer and start the Sphinx instance:</p>
<pre><code>
rake thinking_sphinx:index
rake thinking_sphinx:start</code></pre>
<p>After this you'll be able to search on your object. So using script/console</p>
<pre><code>@searched_pages = Page.search("query")</code></pre>
<p>will return what you're looking for!</p>
<h3>Setting up Dreamhost</h3>
<p>First things first you need to install Sphinx in your local area, as posted <a href="http://hughevans.net/2009/03/10/thinking-sphinx-dreamhost">by Hugh Evans</a>:</p>
<pre><code>cd ~/
mkdir -p local
wget http://sphinxsearch.com/downloads/sphinx-0.9.8.1.tar.gz
tar -xzf sphinx-0.9.8.1.tar.gz
cd sphinx-0.9.8.1/
./configure --prefix=$HOME/local/ --exec-prefix=$HOME/local/
make
make install</code></pre>
<p>then set up the PATHs</p>
<pre><code>echo "export PATH=\"$PATH:~/local/bin\"" >> ~/.bash_profile
source ~/.bash_profile</code></pre>
<p>You can choose to set up a CRON task at this point too, but I'm not going into that.</p>
<p>Also at this point in the there's talk of using Sphinx being anti TOS in DH's eyes... but we'll see does the process get killed or not!</p>
<h3>Configuring Sphinx for DH</h3>
<p>Create a file called sphinx.yml in the RAILS_ROOT/config/ folder.</p>
<p>Because Dreamhost uses an externally referenced MySQL server instead of localhost you need to set up the sql_* parameters:</p>
<pre><code>
  sql_host: "mysql.YOURDOMAIN"
  sql_port: 3306
  sql_user: "USER"
  sql_password: "PASSWORD"
  sql_database: "DATABASE"
</code></pre>
<p>And because you installed Sphinx in your local area:</p>
<pre><code>
  bin_path: '/home/YOURUSERNAME/local/bin'
</code></pre>
<p>Finally, after setting whatever memory/fine tuning settings you wish/require set up the locations for the Sphinx files:</p>
<pre><code>
  config_file: "/home/YOURUSERNAME/DOMAIN.co.uk/shared/production.sphinx.conf"
  searchd_log_file: "/home/YOURUSERNAME/DOMAIN.co.uk/shared/log/searchd.log"
  query_log_file: "/home/YOURUSERNAME/DOMAIN.co.uk/shared/log/searchd.query.log"
  pid_file: "/home/YOURUSERNAME/DOMAIN.co.uk/shared/log/searchd.production.pid"
  searchd_file_path: "/home/YOURUSERNAME/DOMAIN.co.uk/shared/db/sphinx"
</code></pre>
<p>That should be you ready to start deploying.</p>
<h3>Deploying</h3>
<p>Using Git + Capistrano for deployment (and Passenger for the http server) my deploy.rb's namespace area looks like this:</p>
<pre><code>
namespace :deploy do
  task :restart do
    after_symlink
    restart_sphinx
    run "touch #{deploy_to}/current/tmp/restart.txt"
  end

  task :start do
    # nothing  (this avoids the 'spin' script issue)
  end

  desc "Re-establish symlinks"
  task :after_symlink do
    run <<-CMD
      rm -fr #{release_path}/db/sphinx &#038;&#038;
      ln -nfs #{shared_path}/db/sphinx #{release_path}/db/sphinx
    CMD
  end

  desc "Stop the sphinx server"
  task :stop_sphinx , :roles => :app do
    run "cd #{current_path} &#038;&#038; rake thinking_sphinx:stop RAILS_ENV=production"
  end

  desc "Start the sphinx server"
  task :start_sphinx, :roles => :app do
    run "cd #{current_path} &#038;&#038; rake thinking_sphinx:configure RAILS_ENV=production &#038;&#038; rake thinking_sphinx:index RAILS_ENV=production &#038;&#038; rake thinking_sphinx:start RAILS_ENV=production"
  end

  desc "Restart the sphinx server"
  task :restart_sphinx, :roles => :app do
    stop_sphinx
    start_sphinx
  end

end
</code></pre>
<p>There's probably a neater way to do this, but basically this makes sure Sphinx's indexes and conf files live in the shared deployment folder.</p>
<p>I recommend you try all this in a staging area first, obviously... and you can use Dreamhost's control panel to set up a staging subdomain with a new database in whatever fashion you prefer.</p>
<p>Any problems with this script flag them up, please! This is as much for my future reference as you googlies out there.</p>
]]></content:encoded>
			<wfw:commentRss>http://davidlowry.co.uk/simple-thinking-sphinx-on-dreamhost/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Rails / LAMP / WordPress Setup Guide</title>
		<link>http://davidlowry.co.uk/rails-lamp-wordpress-setup-guide/</link>
		<comments>http://davidlowry.co.uk/rails-lamp-wordpress-setup-guide/#comments</comments>
		<pubDate>Thu, 07 May 2009 19:47:27 +0000</pubDate>
		<dc:creator>David</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[meaningful labor]]></category>
		<category><![CDATA[rails]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[environment]]></category>
		<category><![CDATA[guide]]></category>
		<category><![CDATA[lamp]]></category>
		<category><![CDATA[setup]]></category>
		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://davidlowry.co.uk/rails-lamp-wordpress-setup-guide/</guid>
		<description><![CDATA[I&#8217;ve had huge amounts of pain getting WordPress set up locally over the last few weeks on my old machine and when it struck again with my new development environment it definitely time to write down the required steps to &#8230; <a href="http://davidlowry.co.uk/rails-lamp-wordpress-setup-guide/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve had huge amounts of pain getting WordPress set up locally over the last few weeks on my old machine and when it struck again with my new development environment it definitely time to write down the required steps to get a great LAMP/Rails setup prepared (for me)!</p>
<p>Leopard has PHP and Apache fairly up to date, and probably MySQL as well, but I decided to get MySQL up to scratch using the Universal Binary along with a few command line instructions available <a href="http://developer.apple.com/internet/opensource/osdb.html">here (guide and links to downloads)</a>.</p>
<p>I didn&#8217;t use this script myself, and at a year old it may have a few imperfections, but <a href="http://hivelogic.com/articles/view/ruby-rails-leopard">HiveLogic have a fairly well rounded Ruby/Rails install going on.</a></p>
<p>Finally, WordPress to go on your LAMP stack will be a lot less painful using <a href="http://www.tech-recipes.com/rx/2757/leopard_how_to_install_wordpress/">this guide for Tech Recipes</a>.</p>
<p>Any suggestions additions or replacements for this list? Drop a comment!</p>
]]></content:encoded>
			<wfw:commentRss>http://davidlowry.co.uk/rails-lamp-wordpress-setup-guide/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Git Pro Tips!</title>
		<link>http://davidlowry.co.uk/git-pro-tips/</link>
		<comments>http://davidlowry.co.uk/git-pro-tips/#comments</comments>
		<pubDate>Sun, 22 Feb 2009 17:51:10 +0000</pubDate>
		<dc:creator>David</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[meaningful labor]]></category>
		<category><![CDATA[rails]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[git]]></category>
		<category><![CDATA[repository]]></category>

		<guid isPermaLink="false">http://davidlowry.co.uk/?p=230</guid>
		<description><![CDATA[Stumbled across this fella on Twitter today, noticed he worked at Slide (who make successful/annoying Facebook Apps). None the less (I joke) he writes a blog at http://www.unethicalblogger.com/ and has recently been writing a series of &#8216;ProTips&#8217; on using Git &#8230; <a href="http://davidlowry.co.uk/git-pro-tips/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Stumbled across this fella on Twitter today, noticed he worked at Slide (who make successful/annoying Facebook Apps). None the less (I joke) he writes a blog at <a href="http://www.unethicalblogger.com/">http://www.unethicalblogger.com/</a> and has recently been writing a series of &#8216;ProTips&#8217; on using Git Repos for his work colleagues.</p>
<p>I think it might prove useful: <a href="http://www.unethicalblogger.com/blog_categories/git">http://www.unethicalblogger.com/blog_categories/git</a> will be taking a read in my spare time.</p>
]]></content:encoded>
			<wfw:commentRss>http://davidlowry.co.uk/git-pro-tips/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
