Visit our blog

News & Events

SpringOne 2GX Session Highlight: How to build Big Data Pipelines for Hadoop using OSS

Costin Leau

In this SpringOne 2GX 2012 session Costin Leau will discuss how Hadoop is a great foundation for big data initatives. To deliver a complete Big Data solution, however, a data pipeline needs to be developed that incorporates and orchestrates many diverse technologies. A Hadoop focused data pipeline not only needs to coordinate the running of multiple Hadoop jobs (MapReduce, Hive, Pig or Cascading), but also encompass real-time data acquisition and the analysis of reduced data sets extracted into relational/NoSQL databases or dedicated analytical engines.

Using an example of real-time event processing (Twitter), in this session we will demonstrate how to build manageable and robust pipeline solutions around BigData using Open Source software such as Apache Hadoop, Cascading, Spring Batch & Integration and Redis.

Check out the session on SpringOne2GX.com for more details and register today!

 

Newsletter Subscription

Our monthly newsletter is packed full of techniques, tutorials, tips and tricks to get you on your way to Spring nirvana. View Archive

Upcoming Training