Tag: mysql

  • Tungsten Replicator 3.0 is Cloudera Enterprise 5 Certified

    One of the key platforms I’ve been testing on for the MySQL to Hadoop replication has been Cloudera, largely driven by customer requirements, but it’s also one of the easiest way to get started with Hadoop. What I’m even more pleased about is the fact that we are proud to announce that Tungsten Replicator 3.0 is…

  • Continuent Replication to Hadoop – Now in Stereo!

    Hopefully by now you have already seen that we are working on Hadoop replication. I’m happy to say that it is going really well. I’ve managed to push a few terabytes of data and different data sets through into Hadoop on Cloudera, HortonWorks, and Amazon’s Elastic MapReduce (EMR). For those who have been following my…

  • Real-Time Data Loading from MySQL to Hadoop using Tungsten Replicator 3.0 Webinar

    To follow-up and describe some of the methods and techniques behind replicating into Hadoop from MySQL in real-time, and how this can be combined into your data workflow, Continuent are running a webinar with me presenting that will go over the details and provide a demo of the data replication process. Real-Time Data Loading from…

  • Parallel Extractor for Provisioning

    Coming up as a new feature in Tungsten Replicator (and written by our replicator expert Stephane Giron) is the ability to provision a new database by using data from an existing database. This new feature comes in the form of a tool called the Parallel Extractor. The principles are very simple. On the master side: Start…

  • MC at Percona Live San Francisco 2014

    Now I’m back in the MySQL fold, I’ve got the opportunity to speak at Percona Live again. I’ve always enjoyed speaking at this conference (back when it was known by another name…), although I need to up my game and do the 6 talks I did back in 2009. On the Tuesday afternoon, tutorials day,…

  • MySQL to Hadoop Step-By-Step

    We had a great webinar on Thursday about replicating from MySQL to Hadoop (watch the whole thing). It was great, but one of the questions at the end was ‘is there an easy way to test’. Sadly we can’t go giving out convenient ready-to-run downloads of these things because of licensing and and other complexities,…

  • Real-Time Replication from MySQL to Cassandra

    Earlier this month I blogged about our new Hadoop applier, I published the docs for that this week (http://docs.continuent.com/tungsten-replicator-3.0/deployment-hadoop.html) as part of the Tungsten Replicator 3.0 documentation (http://docs.continuent.com/tungsten-replicator-3.0/index.html). It contains some additional interesting nuggets that will appear in future blog posts. The main part of that functionality that performs the actual applier for Hadoop is…

  • Getting Data into Hadoop in real-time

    Moving data between databases is hard. Without ever intending it, I seem to have spent a lifetime working on solutions for getting data into and out of databases, but more frequently between. In fact, my first job out of university was migrating data from BRS/Text, a free-text database (probably what we would call a NoSQL)…

  • Anonymizing Data During Replication

    If you happen to work with personal data, chances are you are subject to SOX (Sarbanes-Oxley) whether you like it or not. One of the worst aspects of this is that if you want to be able to analyse your data and you replicate out to another host, you have to find a way of…

  • SQL to Hadoop and back again, Part 3: Direct transfer and live data exchange

    The third, and final article in my series on migrating data to and from Hadoop and SQL databases is now available: Big data is a term that has been used regularly now for almost a decade, and it — along with technologies like NoSQL — are seen as the replacements for the long-successful RDBMS solutions…