UPDATED 18:49 EDT / JANUARY 27 2012

NEWS

What Can You Use Hadoop For? How About a Scalable Vertical Search Engine?

What if your company wanted to build its own custom search engine? It could be a public-facing vertical search engine like Indeed.com, or it could be some sort of internal search engine that helps you search your company’s own documentation or customer information. Either way, you’d probably turn to some of the usual suspects like Lucene.

But what do you do if that information starts to get really large, really fast?

A presentation from Ivan de Prado of the services firm Datasalt explains why you might want to use Apache Hadoop along with Lucene to build a scalable search engine. According to Datasalt, this approach can create a more scalable, bug-tolerant and flexible solution.

Wikibon analyst Jeff Kelly has written about the lack of Hadoop applications at ServicesAngle:

Let’s say, for example, you’re a business analyst at a pharmaceutical maker and you’ve come up with an idea to correlate sales data with demographic data with social media data to identify new revenue opportunities. You present your idea to the CEO, who gives you the green light. “Get it done,” she says.

Fantastic. You talk to IT, spin up an inexpensive Hadoop cluster, then collect, process and store the needed data. Next you take a look at the Hadoop application market and … and you quickly realize you’re out of luck. You discover there are no compelling applications on the market to suit your innovative use case. Developing the application internally maybe isn’t an option. Your great idea for leveraging Hadoop is DOA.

A highly scalable search engine based on Hadoop and Lucene is exactly the sort of ready-made Hadoop application many enterprises will likely want. It’s the sort of thing that could easily be delivered as a virtual appliance or as a hosted service, and since it’s based on open source software, it could be built by many different service providers.

