Spring-Batch for a massive nightly / hourly Hive / MySQL data processing


If you want to stay within the Hadoop ecosystem, I'd highly recommend checking out Oozie to automate your workflow. We (Cloudera) provide a packaged version of Oozie that you can use to get started. See our recent blog post for more details.


Why not use JasperETL or Talend? Seems like the right tool for the job.


I've used Cascading quite a bit and found it to be quite impressive:

Cascading

It is a MapReduce (M/R) abstraction layer that runs on Hadoop.