Category Archives: Open Source Software

and everything to do with open source software

Google’s Open Source OCR

Written by Omar Kattan. Filed under Open Source Software. No comments.
google ocr

OCR -Optical character recognition- is a type of software designed to translate images of handwritten or typewritten text (usually captured by a scanner) into machine-editable text. OCR also has the capability to translate pictures of characters into a standard encoding scheme representing them (e.g. ASCII or Unicode). Why am I blogging about OCR?  Well because Google has [...]

Nutch Search Engine Finally Working!

Written by Omar Kattan. Filed under Open Source Software. 1 Comment.
my nutch engine

If you recall a few weeks back I posted about building your own google, well I finally did it, my very own google search engine is finally up and running.

Building your own Google

Written by Omar Kattan. Filed under Open Source Software. No comments.

I’ve recently (two days ago) started experimenting with building my own search engine.  The inspiration came from my experimenting with Google’s Co-op Custom Search Engine contraption. The software I’m using to build my very own Google is Nutch which is an open source web-search software.  Nutch builds on Lucene Java, adding web-specifics, such as a crawler, [...]