Category: Articles

  • SQL to Hadoop and back again, Part 3: Direct transfer and live data exchange

    The third, and final article in my series on migrating data to and from Hadoop and SQL databases is now available: Big data is a term that has been used regularly now for almost a decade, and it — along with technologies like NoSQL — are seen as the replacements for the long-successful RDBMS solutions…

  • SQL to Hadoop and back again, Part 2: Leveraging HBase and Hive

    The second article in a series covering Big Data and SQL interaction is available now: “Big data” is a term that has been used regularly now for almost a decade, and it — along with technologies like NoSQL — are seen as the replacements for the long-successful RDBMS solutions that use SQL. Today, DB2®, Oracle,…

  • SQL to Hadoop and back again, Part 1: Basic data interchange techniques

    I’ve got a new article, which is part of a new three-part series, on moving data between SQL and Hadoop, both the export to Hadoop and importing processed content back into an SQL store. In this first one, we look at the basic mechanics and considerations before you start the migration of data, such as…

  • Developing Applications for use with Continuent Tungsten and Tungsten Replicator in SDJ

    I’ve just had a new article published with the Software Developers Journal talking about how you can write applications to take full advantage of Continuent Tungsten and Tungsten Replicator. As a developer of an application there really isn’t a problem better than finding that you have to scale up the application and the database that…

  • Data Mining in a Document World

    As databases evolve, learning how to get the best out of the different solutions out there is the key to understanding and extracting the data in the way you need from your required data store. Document databases, like MongoDB, CouchDB, Couchbase Server and many others provide a completely different model and set of problems for…

  • Moving from MySQL to Couchbase Server 2.0: Part 2

    To continue from where my last blog left off, I’ve written a second piece that tries to cover some of the more complex solutions to the problems of querying and extracting data using the Views system within Couchbase Server. Read: How to Move from MySQL to Couchbase Server 2.0: Part 2  

  • Data Mining Techniques

    I have a new article on the basics of data mining techniques so that you can better understand some of the key principles behind the different methods and principles of data mining. From the abstract: Many different data mining, query model, proce…

  • Couchbase and Hadoop Article in Portuguese

    There is nothing cooler than finding that one of your articles has been translated into another language, and I just found out recently that my Using Hadoop with Couchbase article has been translated into Portuguese here: Usando o Hadoop com Couch…

  • Document Databases in Predictive Modeling

    My latest article on performing predictive modeling using document databases is now available on IBM developerWorks. The abstract: Predictive analytics relies on processing, analyzing data from many different sources, collating, and then processin…

  • Using Hadoop and Couchbase

    My new article on using Hadoop with Couchbase is available now on the IBM developerWorks site. The article tells you how to integrate the massive map/reduce functionality offered by Hadoop with the query functionality offered in Couchbase. With th…