Tag: big data

  • Hadoop BoF Session at OSCON

    I have a BoF session next week at OSCON next week: Migrating Data from MySQL and Oracle into Hadoop The session is at 7pm Tuesday night – look for rooms D135 and/or D137/138. Correction: We are now in  E144 on Tuesday with the Hadoop get together first at 7pm, and the Data Migration to follow at…

  • Making Real-Time Analytics a Reality — TDWI -The Data Warehousing Institute

    My article on how to make the real-time processing of information from traditional transactional stores into Hadoop a reality has been published over at TDWI: Making Real-Time Analytics a Reality — TDWI -The Data Warehousing Institute.

  • Continuent at Hadoop Summit

    I’m pleased to say that Continuent will be at the Hadoop Summit in San Jose next week (3-5 June). Sadly I will not be attending as I’m taking an exam next week, but my colleagues Robert Hodges, Eero Teerikorpi and Petri Versunen will be there to answer any questions you have about Continuent products, and, of…

  • Real-Time Data Movement: The Key to Enabling Live Analytics With Hadoop

    An article about moving data into Hadoop in real-time has just been published over at DBTA, written by me and my CEO Robert Hodges. In the article I talk about one of the major issues for all people deploying databases in the modern heterogenous world – how do we move and migrate data effectively between entirely…

  • Cross your Fingers for Tech14, see you at OSCON

    So I’ve submitted my talks for the Tech14 UK Oracle User Group conference which is in Liverpool this year. I’m not going to give away the topics, but you can imagine they are going to be about data translation and movement and how to get your various databases talking together. I can also say, after…

  • Harvest machine data using Hadoop and Hive

    A new article on has been published on IBM developerWorks, looking at the basics of processing machine data using Hadoop, from extracting the core data, storing it, and then determining the baselines and trigger points required to identifying worrying trends and points. From the intro: Machine data can come in many different formats and quantities.…

  • Tungsten Replicator 3.0 is Cloudera Enterprise 5 Certified

    One of the key platforms I’ve been testing on for the MySQL to Hadoop replication has been Cloudera, largely driven by customer requirements, but it’s also one of the easiest way to get started with Hadoop. What I’m even more pleased about is the fact that we are proud to announce that Tungsten Replicator 3.0 is…

  • Continuent Replication to Hadoop – Now in Stereo!

    Hopefully by now you have already seen that we are working on Hadoop replication. I’m happy to say that it is going really well. I’ve managed to push a few terabytes of data and different data sets through into Hadoop on Cloudera, HortonWorks, and Amazon’s Elastic MapReduce (EMR). For those who have been following my…

  • Real-Time Data Loading from MySQL to Hadoop using Tungsten Replicator 3.0 Webinar

    To follow-up and describe some of the methods and techniques behind replicating into Hadoop from MySQL in real-time, and how this can be combined into your data workflow, Continuent are running a webinar with me presenting that will go over the details and provide a demo of the data replication process. Real-Time Data Loading from…

  • Real-Time Replication from MySQL to Cassandra

    Earlier this month I blogged about our new Hadoop applier, I published the docs for that this week (http://docs.continuent.com/tungsten-replicator-3.0/deployment-hadoop.html) as part of the Tungsten Replicator 3.0 documentation (http://docs.continuent.com/tungsten-replicator-3.0/index.html). It contains some additional interesting nuggets that will appear in future blog posts. The main part of that functionality that performs the actual applier for Hadoop is…