Apache Mahout
Apache Mahout(TM) is a distributed linear algebra framework and mathematically expressive Scala DSL designed to let mathematicians, statisticians, and data scientists quickly implement their own algorithms. Apache Spark is the recommended out-of-the-box distributed back end, and Mahout can be extended to other distributed back ends.

Apache Mahout
Apache Mahout is a project of the Apache Software Foundation to produce free implementations of distributed or otherwise scalable machine learning algorithms, focused primarily on collaborative filtering, clustering, and classification. Many of the implementations use the Apache Hadoop platform. Mahout also provides Java libraries for common maths operations focused on linear ...

Apache Mahout EMR
Note: Only Mahout version and later are compatible with Spark version .x in EMR release version and later.

Apache Hadoop
The Apache Hadoop project develops open source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.
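
The Hadoop snippet above does not name its "simple programming models"; the best known of them is MapReduce. The following is a minimal single-process sketch of that idea in plain Python, assuming a word-count task. It does not use any Hadoop API, and the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical, chosen only for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Emit a (word, 1) pair for every whitespace-separated word in one line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, the step a framework performs between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Collapse all values collected for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on an in-memory list of lines."""
    pairs = chain.from_iterable(map_phase(line) for line in lines)
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["Apache Hadoop", "Apache Mahout uses Hadoop"]))
```

In a real cluster the map and reduce functions would run on different machines and the shuffle would move data over the network; here everything runs in one process so the three phases are easy to see.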
Mahout Machine Learning (Tutorials Point)
Apache Mahout is a highly scalable machine learning library that enables developers to use optimized algorithms. Mahout implements popular machine learning techniques such as recommendation, classification, and clustering. Therefore, it is prudent to have a brief section on machine learning before ...

Apache NiFi Zero Master Clustering (Hortonworks)
Originally posted in HCC (Hortonworks Community Connection). One of the most highly anticipated features of Apache NiFi is the introduction of Zero Master Clustering. Previous versions of NiFi relied upon a single Master Node (formally known as the NiFi Cluster Manager) to show the User Interface. If this node was lost, data continued to ...

Anomaly Detection Using K-Means Clustering
Detect anomalies by grouping common patterns together using k-means clustering. This way, unusual patterns can be categorised as anomalous.

Apache ServiceMix
Apache ServiceMix is an enterprise-class, open source, distributed enterprise service bus (ESB) based on the service-oriented architecture (SOA) model. It is a project of the Apache Software Foundation and was built on the semantics and application programming interfaces of the Java Business Integration (JBI) specification JSR. The software is distributed under the Apache License.

Generate movie recommendations by using Apache Mahout with ...
Learn how to use the Apache Mahout machine learning library with Azure HDInsight to generate movie recommendations. Mahout is a machine learning library for Apache Hadoop. Mahout contains algorithms for processing data, such as filtering, classification, and clustering. In this article, you use a ...

Apache Solr Features
Solr is a standalone enterprise search server with a REST-like API. You put documents in it (called indexing) via JSON, XML, CSV, or binary over HTTP.
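
As an illustration of indexing over HTTP with JSON, the sketch below builds (but does not send) a POST request to Solr's update handler using only the Python standard library. The host, port, collection name "films", and the helper name build_index_request are assumptions made for this example, not details taken from the snippet above.

```python
import json
import urllib.request


def build_index_request(docs, base_url="http://localhost:8983/solr", collection="films"):
    """Build a POST request that would index `docs` (a list of dicts) in Solr.

    The base URL and collection name are placeholders; adjust them to your setup.
    """
    # commit=true asks Solr to make the documents searchable right after the update.
    url = f"{base_url}/{collection}/update?commit=true"
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    request = build_index_request([{"id": "1", "title": "Example document"}])
    print(request.get_method(), request.full_url)
```

To actually send the request against a running Solr instance, pass the returned object to urllib.request.urlopen. The request is built separately from sending so that the payload can be inspected without a server.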