Real-time data stream processing for big data applications that aggregates, normalizes, and analyzes data for Hadoop.