Tag: hadoop

  • MySQL to Hadoop Step-By-Step

    We had a great webinar on Thursday about replicating from MySQL to Hadoop (watch the whole thing). It was great, but one of the questions at the end was ‘is there an easy way to test’. Sadly we can’t go giving out convenient ready-to-run downloads of these things because of licensing and and other complexities,…

  • Getting Data into Hadoop in real-time

    Moving data between databases is hard. Without ever intending it, I seem to have spent a lifetime working on solutions for getting data into and out of databases, but more frequently between. In fact, my first job out of university was migrating data from BRS/Text, a free-text database (probably what we would call a NoSQL)…

  • Moving to MapReduce 2 with YARN

    Moving to MapReduce 2 with YARN I have a guest blog post over on the Safari Books Online blog looking at the changes in the YARN system for managing Hadoop jobs.

  • Building flexible apps from big data sources

    My article on how to build flexible apps on top of the BigInsights platform has been published. This demonstrates a cool way to combine some client-end JavaScript and existing technologies to build a Big Data query interface without developing a specialised application for the purpose. It’s no secret that a significant proportion of the needs…

  • Process big data with Big SQL in InfoSphere BigInsights

    The ability to write an SQL statement against your Big Data stored in Hadoop provides some much needed flexibility. Sure, using Hive or HBase you can perform some of those operations, but there are other alternatives that may suit your needs better, such as the Big SQL utility. My latest article on this tool is…

  • SQL to Hadoop and back again, Part 3: Direct transfer and live data exchange

    The third, and final article in my series on migrating data to and from Hadoop and SQL databases is now available: Big data is a term that has been used regularly now for almost a decade, and it — along with technologies like NoSQL — are seen as the replacements for the long-successful RDBMS solutions…

  • SQL to Hadoop and back again, Part 1: Basic data interchange techniques

    I’ve got a new article, which is part of a new three-part series, on moving data between SQL and Hadoop, both the export to Hadoop and importing processed content back into an SQL store. In this first one, we look at the basic mechanics and considerations before you start the migration of data, such as…

  • Couchbase and Hadoop Article in Portuguese

    There is nothing cooler than finding that one of your articles has been translated into another language, and I just found out recently that my Using Hadoop with Couchbase article has been translated into Portuguese here: Usando o Hadoop com Couch…

  • Using Hadoop and Couchbase

    My new article on using Hadoop with Couchbase is available now on the IBM developerWorks site. The article tells you how to integrate the massive map/reduce functionality offered by Hadoop with the query functionality offered in Couchbase. With th…