Spring Data - Apache Hadoop

Spring for Apache Hadoop simplifies developing Apache Hadoop applications by providing a unified configuration model and easy-to-use APIs for HDFS, MapReduce, Pig, and Hive. It also integrates with other Spring ecosystem projects, such as Spring Integration and Spring Batch, so you can build solutions for big data ingest/export and Hadoop workflow orchestration.

Check out the new book from O'Reilly Media, Spring Data: Modern Data Access for Enterprise Java, which contains several chapters on using Spring for Apache Hadoop. Sample code for the book is also available on GitHub.

Features

  • Support for creating Hadoop applications that are configured using Dependency Injection and run as standard Java applications rather than through Hadoop command-line utilities.
  • Create and configure applications that use Java MapReduce, Streaming, Hive, Pig, Cascading, or HBase.
  • Extensions to Spring Batch that support creating Hadoop-based workflows for any type of Hadoop Job or HDFS operation.
  • Script HDFS operations using any JVM-based scripting language.
  • DAO support (Template & Callbacks) for HBase.
  • Cascading Taps for Spring & Spring Integration.
  • Support for Hadoop Security.
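
As a hedged illustration of the dependency-injection style described in the list above, a Spring application context might declare the Hadoop configuration, a MapReduce job, and a runner for it through the project's hdp XML namespace. This is a minimal sketch and is not taken from this page: the file system URI, the input and output paths, the bean ids, and the com.example mapper and reducer classes are hypothetical placeholders, and the element and attribute names should be checked against the reference documentation for the version in use.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<beans xmlns="http://www.springframework.org/schema/beans"
       xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
       xmlns:hdp="http://www.springframework.org/schema/hadoop"
       xsi:schemaLocation="
         http://www.springframework.org/schema/beans http://www.springframework.org/schema/beans/spring-beans.xsd
         http://www.springframework.org/schema/hadoop http://www.springframework.org/schema/hadoop/spring-hadoop.xsd">

  <!-- Hadoop configuration; the HDFS address is an assumed local cluster -->
  <hdp:configuration>
    fs.default.name=hdfs://localhost:9000
  </hdp:configuration>

  <!-- A MapReduce job; paths and classes are hypothetical placeholders -->
  <hdp:job id="wordCountJob"
           input-path="/example/input"
           output-path="/example/output"
           mapper="com.example.WordCountMapper"
           reducer="com.example.WordCountReducer"/>

  <!-- Runs the job when the application context starts -->
  <hdp:job-runner id="wordCountRunner"
                  job-ref="wordCountJob"
                  run-at-startup="true"/>
</beans>
```

Under these assumptions, loading the context from a plain Java main method (for example with ClassPathXmlApplicationContext) would submit the job without going through the Hadoop command line utilities.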

Latest News

  • Spring for Apache Hadoop 1.0.0 goes GA
  • Spring for Apache Hadoop 1.0.0.RC2 released
  • "Introducing Spring for Apache Hadoop" webinar announced for November 8th
  • Spring for Apache Hadoop featured in GigaOM's "A programmer's guide to big data: 12 tools to know"
  • Spring for Apache Hadoop 1.0.0.RC1 released
  • Project Serengeti announced
  • Spring for Apache Hadoop 1.0.0.M2 released
  • Spring for Apache Hadoop Talk at Strata 2012 - Download Presentation
  • Spring for Apache Hadoop 1.0.0.M1 released
  • Introducing Spring for Apache Hadoop

    Resources

    Reference Documentation HTML PDF
    Javadocs HTML
    Issue Tracking JIRA
    Source Control GitHub
    Source Repository Browser Fisheye
    Build Status CI
    Forum Forum

    Development snapshot

    Reference Documentation HTML
    Javadocs HTML

     

    Maven Artifacts

    Maven Release Repository
    <repository>
    <!-- Release -->
    <id>spring-release</id>
    <name>Spring Maven Release Repository</name>
    <url>http://repo.springframework.org/release</url>
    </repository>
    Maven Release Dependency
    <dependency>
    <groupId>org.springframework.data</groupId>
    <artifactId>spring-data-hadoop</artifactId>
    <version>1.0.0.RELEASE</version>
    </dependency>
    Maven Milestone Repository
    <repository>
    <!-- Milestone/RC -->
    <id>spring-milestone</id>
    <name>Spring Maven Milestone Repository</name>
    <url>http://repo.springframework.org/milestone</url>
    </repository>
    Maven Milestone Dependency
    <dependency>
    <groupId>org.springframework.data</groupId>
    <artifactId>spring-data-hadoop</artifactId>
    <version>1.0.0.RC2</version>
    </dependency>
    Maven Snapshot Repository
    <repository>
    <!-- Snapshots -->
    <id>spring-snapshot</id>
    <name>Spring Maven SNAPSHOT Repository</name>
    <url>http://repo.springframework.org/snapshot</url>
    </repository>
    Maven Snapshot Dependency
    <dependency>
    <groupId>org.springframework.data</groupId>
    <artifactId>spring-data-hadoop</artifactId>
    <version>1.0.0.BUILD-SNAPSHOT</version>
    </dependency>
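
To show where the fragments above would sit, here is a minimal sketch of a complete pom.xml that uses the release repository and the release dependency. The project coordinates (com.example, hadoop-demo, 0.1.0) are hypothetical and stand in for your own; the repository and dependency entries are copied from the snippets above. For a milestone or snapshot build, the corresponding repository and version would be swapped in.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
  <modelVersion>4.0.0</modelVersion>

  <!-- Hypothetical coordinates for your own project -->
  <groupId>com.example</groupId>
  <artifactId>hadoop-demo</artifactId>
  <version>0.1.0</version>

  <repositories>
    <!-- Release repository, as listed above -->
    <repository>
      <id>spring-release</id>
      <name>Spring Maven Release Repository</name>
      <url>http://repo.springframework.org/release</url>
    </repository>
  </repositories>

  <dependencies>
    <!-- Release dependency, as listed above -->
    <dependency>
      <groupId>org.springframework.data</groupId>
      <artifactId>spring-data-hadoop</artifactId>
      <version>1.0.0.RELEASE</version>
    </dependency>
  </dependencies>
</project>
```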

     

    Latest GA release - 1.0.0.RELEASE
