hadoop

Cassandra and Hadoop Case Studies

This presentation shows developers how to model data in Cassandra, how to integrate Cassandra with Hadoop, and how to build big data platforms that support both batch and real-time processing while keeping response times low enough for web applications.

Querying and scripting in Hadoop

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.

From Big Data to Big Information

The Big Data landscape is at a crossroads. It is currently dominated by Hadoop and NoSQL databases, but alternatives to Hadoop and next-generation datastores are emerging.

Introduction to Hadoop Big Data

This session assumes no prior knowledge of Apache Hadoop and provides a complete introduction to all the major projects and tools in the Hadoop ecosystem.

High Speed Continuous Reliable Data Ingest into Hadoop

10M events per second into HDFS, sub-second queries per 20 GB of HDFS data… All of this and more will be demonstrated live during this presentation, which explores real-time data ingest into Hadoop.