Processing Kinesis Data Streams with Spark Streaming

Solution Overview : In this blog, we are going to build a real time anomaly detection solution using Spark Streaming. Kinesis Data Streams would act as the input streaming source and the anomalous records would be written as Data Streams in DynamoDB. Amazon Kinesis Data Streams (KDS) is a massively scalable and durable real-time data streaming... » read more

Linear regression using Apache Spark MLlib

What is linear Regression? Wikipedia states – In statistics, linear regression is a linear approach to modeling the relationship between  dependent variable and one or more  independent variables. Linear regression is a basic and commonly used type of predictive analysis. Back to school math, every straight line can be represented by the equation: y = mx + b, where y is dependent variable... » read more