Spark has strong advantages in machine learning and is especially well suited to algorithms that require many iterations, because it can keep working data in memory between passes. Spark also has excellent fault tolerance and scheduling mechanisms that keep the system running stably. Spark's current development approach is to integrate SQL, machine learning, graph computing, stream processing, and other functions into a single project built on one computing framework, which makes it very easy to use.
At present, Spark has built its own complete big data processing ecosystem, covering stream processing, graph computing, machine learning, NoSQL queries, and more, and it is a top-level Apache project. It can be predicted that in the second half of 2014 there will be explosive growth in both community and commercial adoption.
In China, companies such as Taobao and Youku Tudou already use Spark in their own commercial production systems, and its use at home and abroad is becoming more and more widespread. Some large Internet companies abroad have deployed Spark; even Yahoo, the main contributor to early Hadoop, has deployed Spark in many projects. In China, we have also deployed Spark in traditional industries such as telecom operators and e-commerce.