
Justin Hartman

Posted on April 20, 2009 - by Justin Hartman

Grabble laid to rest

Grabble Open Source Projects Search Technology


In 2006 I was fascinated with search, so much so that I learnt all I could about search engines and SEO, driven largely by the fact that I was in complete awe of Google at the time. All of this inspired me to start a South African search engine, and during 2006 I launched Grabble.

Old Grabble

Because I only understood the theory of search engine technology, starting my own wasn’t as easy as I had thought, and implementing the technology was an extremely complicated experience. My very first version was powered by Lucene, but as it ran on Java (and Tomcat) I had great difficulty getting it to work as expected.

I then ran a version of mnoGoSearch, which ran as a Perl command line utility. While this one seemed to give me the best results, I had resource issues which hampered my efforts. I soon figured out why Google has a few hundred thousand servers powering their search engine: after about a week of indexing South African websites I ran out of disk space. With about 8 million records indexed I had used up my 80GB hard drive.
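
To put those numbers in perspective, here is a rough back-of-envelope sketch in Python. It assumes the whole 80GB disk went to the index and that a gigabyte is 10^9 bytes, neither of which is exact, and the function names and the "ten times as many records" target are purely illustrative.

```python
# Back-of-envelope estimate of index storage, using the figures from the post.
RECORDS_INDEXED = 8_000_000   # "about 8 million records"
DISK_USED_GB = 80             # "80GB hard drive" (assumed to be all index data)
BYTES_PER_GB = 10**9          # assumption: decimal gigabytes


def bytes_per_record(disk_gb: float, records: int) -> float:
    """Average on-disk footprint per indexed record, in bytes."""
    return disk_gb * BYTES_PER_GB / records


def disk_needed_gb(records: int, per_record_bytes: float) -> float:
    """Disk space in GB needed for `records` at the given footprint."""
    return records * per_record_bytes / BYTES_PER_GB


per_record = bytes_per_record(DISK_USED_GB, RECORDS_INDEXED)
print(f"{per_record / 1000:.0f} kB per record")

# Hypothetical target: ten times as many records at the same footprint.
target = 10 * RECORDS_INDEXED
print(f"{disk_needed_gb(target, per_record):.0f} GB for {target:,} records")
```

At roughly 10 kB per record, every extra million pages costs about another 10GB, which is why a single hard drive fills up so quickly.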

So, in the end, Grabble went PHP, and while I retained some indexing capabilities from mnoGoSearch I also tapped into Yahoo to help out. At the time Yahoo didn’t have a search API, so I had to do a lot of hacking to get results returned from Yahoo. In particular, Yahoo provided all the image, news, video, blog, sport and forum search results.

While being in awe of Google, I also felt that total dominance by only a few players in the market wasn’t a good thing. I believed, esoterically at the time, that local search needed to be dominated by a local company. I still hold these beliefs close to heart, and I feel that it’s critical that competitors exist. There is something very uncomfortable in knowing that one or two companies hold the majority of the world’s data, and I was convinced that Grabble would be a winner.

Almost three years later, I’ve been debating what to do with Grabble. For the last two months the site hasn’t even been operational (well, web search anyway), and while I didn’t want to close it down completely, I also had to face the reality that Grabble was never going to dominate and that I simply did not have the time to maintain it any longer.

Then along came 4hoursearch, a website powered by the Yahoo BOSS search API and running on Google App Engine. I stumbled across this site by chance and realised that they had released their source code under the GPL. I have since moved Grabble onto this Python software, and my final version of Grabble has now been hatched.

Final version of Grabble

What this final version does is allow me to keep the site up for those who still want to use it, and at the same time it gives me an opportunity to say goodbye. Running through the Yahoo BOSS API means that I don’t have to worry about indexes any longer, and living on Google App Engine means the technology is now taken care of.

I have learnt so much about search through my efforts with Grabble and I hold it very close to my heart. While it never took off, it did give me an opportunity to expand my knowledge and experience in an area I felt passionate about, and the learning curve I’ve gone through has certainly helped define where I am today.


5 Comments




  1. verone said (April 20, 2009):

    Nice(& great) post(or making of…) !
    so your first version running on nutch ?
    me too i used nutch, and aspeek for my search engines projects… but Yahoo ! Boss… is so cute !
    Good luck



  2. verone said (April 20, 2009):

    so i think, i wanna make my own search engine too with 4hoursearch… i will send you URL



  3. arthur charles van wyk said (April 20, 2009):

    I had no idea that you actually owned Grabble. I thought it was just something you worked on as one of your professional endeavours.

    Since broad search is out of the question, why not try niche search. Perhaps a search engine on all things pertaining to mobile technology.. something like that.. or position it to only search RSS feeds..

    isn’t that where technology is always going.. niche..

    then again.. unlike Azazel.. time is not on your side



  4. Justin Hartman said (April 20, 2009):

    @verone best of luck man, let’s see what you come up with!

    @arthur yes, if time was on my side and I felt that it could be something worth pursuing then I would focus more energy on it.

    Initially Grabble was SA only results and I thought that angle would do it but alas it didn’t.

    To be honest, I have to now focus my time on things that actually do work and Afrigator is that thing. I think my time is better spent trying to improve and grow that ;)



  5. Yaarik said (May 4, 2009):

    This is (surprisingly for me) the first time I have heard of this site and I am not very familiar with you or your work either (I know about Afrigator though)! I must have been under a cuckoo’s nest or something because you seem like a very busy guy in the “SA internet” world :)

    I have scoured your site now and I think I am a bit more knowledgeable than yesterday. I will be keeping myself up to date with your happenings. Thanks Justin.

    Dries.



  • About me

    I am a seasoned entrepreneur and currently the CEO of Afrigator, a startup which I co-founded in April 2007 and in which I later sold a stake to MIH Internet Africa. This blog is mostly about my life, my experience running a startup and my thoughts on the technology that I encounter along the way.