What is the big data major? What do you learn?
The full name of the big data major is "Big Data Collection and Management".

The Big Data Collection and Management major systematically helps enterprises master solutions to typical problems in big data applications, covering data management, system development, and massive data analysis and mining.

1. Industry status: More and more industries are optimistic about big data applications. Using big data or related data analysis solutions has become standard practice at Internet companies such as Baidu, Tencent, Taobao, and Sina. In traditional industries such as telecommunications, finance, and energy, a growing number of users are trying, or considering how, to use big data solutions to improve their business.

2. Curriculum: The program systematically helps enterprises master solutions to typical problems in big data applications from three main aspects (data management, system development, and massive data analysis and mining). Coursework includes implementing and analyzing a collaborative filtering algorithm, running and studying classification algorithms, building and benchmarking a distributed Hadoop cluster, building and benchmarking a distributed HBase cluster, implementing a parallel algorithm based on MapReduce, and deploying and using Hive.
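As an illustration of the first curriculum topic, the following is a minimal sketch of user-based collaborative filtering in plain Python. The ratings, user names, and the function names `cosine_similarity` and `predict` are made up for this example and are not taken from the course; cosine similarity is only one of several similarity measures that could be used here.

```python
import math

# Hypothetical ratings: user -> {item: rating}. Invented for illustration.
ratings = {
    "alice": {"item_a": 5, "item_b": 3, "item_c": 4},
    "bob": {"item_a": 4, "item_b": 2, "item_c": 5, "item_d": 4},
    "carol": {"item_a": 1, "item_b": 5, "item_d": 2},
}


def cosine_similarity(u, v):
    """Cosine similarity computed over the items both users have rated."""
    shared = set(u) & set(v)
    if not shared:
        return 0.0
    dot = sum(u[i] * v[i] for i in shared)
    norm_u = math.sqrt(sum(u[i] ** 2 for i in shared))
    norm_v = math.sqrt(sum(v[i] ** 2 for i in shared))
    return dot / (norm_u * norm_v)


def predict(ratings, user, item):
    """Predict a rating as the similarity-weighted average of the
    ratings that other users gave to the same item."""
    numerator = 0.0
    denominator = 0.0
    for other, their_ratings in ratings.items():
        if other == user or item not in their_ratings:
            continue
        sim = cosine_similarity(ratings[user], their_ratings)
        numerator += sim * their_ratings[item]
        denominator += abs(sim)
    # None signals that no other user has rated the item.
    return numerator / denominator if denominator else None


print(predict(ratings, "alice", "item_d"))
```

In this toy data, alice's ratings resemble bob's more than carol's, so the prediction for `item_d` lands closer to bob's rating than to carol's.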

3. Core technologies:

(1) Big data and the Hadoop ecosystem. The principles and applications of the distributed file system HDFS, the cluster file system ClusterFS, and NoSQL database technology are introduced and analyzed in detail, along with MapReduce, the distributed database HBase, and the Hive data warehouse.

(2) Relational database technology. The principles of relational databases are introduced in detail, and students learn to build, manage, develop for, and apply typical enterprise-level databases.

(3) Distributed data processing. The principles and applications of the MapReduce computing model and of Hadoop MapReduce are introduced and analyzed in detail.
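The MapReduce computing model can be sketched in a few lines of plain Python. This is a single-process illustration of the map, shuffle, and reduce phases using the classic word count task, not Hadoop code. The function names and sample documents are assumptions made for this example.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run the three phases in sequence within a single process."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


docs = ["big data needs big clusters", "data mining on big data"]
counts = word_count(docs)
print(counts["big"], counts["data"])  # prints "3 3"
```

In a real cluster, the map and reduce calls run in parallel on different machines and the framework performs the shuffle across the network. The sketch keeps only the data flow.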

(4) Massive data analysis and data mining. Data mining techniques and algorithms are introduced in detail, including MinHash, Jaccard and cosine similarity, TF-IDF, and clustering algorithms, along with specific applications of data mining in industry.
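The similarity measures named in item (4) can be illustrated with a short Python sketch. The function names, the sample token lists, and the choice of MD5 as the MinHash hash family are assumptions made for this example. The TF-IDF weighting shown (term frequency times log(N / document frequency)) is one common variant among several.

```python
import hashlib
import math
from collections import Counter


def jaccard(a, b):
    """Jaccard similarity: size of the intersection over size of the union."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def tf_idf(docs):
    """Return one sparse {term: weight} vector per tokenized document."""
    n = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        term_freq = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n / doc_freq[term])
            for term, count in term_freq.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def minhash_signature(tokens, num_hashes=64):
    """MinHash signature of a non-empty token collection: for each seeded
    hash function, keep the minimum hash value over the token set."""
    unique = set(tokens)
    return [
        min(int(hashlib.md5(f"{seed}:{t}".encode()).hexdigest(), 16)
            for t in unique)
        for seed in range(num_hashes)
    ]


def minhash_similarity(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching positions."""
    return sum(x == y for x, y in zip(sig_a, sig_b)) / len(sig_a)


docs = [
    ["big", "data", "hadoop"],
    ["big", "data", "hive"],
    ["remote", "sensing", "images"],
]
vectors = tf_idf(docs)
print(jaccard(docs[0], docs[1]))  # prints 0.5
print(cosine(vectors[0], vectors[1]))
print(minhash_similarity(minhash_signature(docs[0]), minhash_signature(docs[1])))
```

The MinHash estimate approximates the exact Jaccard value and becomes more accurate as `num_hashes` grows. Its advantage is that fixed-length signatures can be compared without keeping the full token sets, which matters at massive scale.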

(5) Internet of Things and big data. Applications of big data in the Internet of Things are introduced in detail, including the automatic interpretation of remote sensing images and the querying, analysis, and mining of time series data.

(6) File system (HDFS). The deployment of HDFS is introduced in detail. HDFS is designed to provide high-throughput data access.

(7) NoSQL. The principles, structure, and typical applications of NoSQL non-relational database systems are introduced in detail.