Traditional Culture Encyclopedia - Traditional stories - What's the difference between big data, data analysis and data mining?
What's the difference between big data, data analysis and data mining?
The concept of data mining: Data mining is based on database theory, machine learning, artificial intelligence and modern statistics, and has been applied in many fields. It involves many algorithms, such as neural network and decision tree derived from machine learning, support vector machine based on statistical learning theory, classification regression tree, correlation analysis and so on. The definition of data mining is to discover meaningful patterns or knowledge from massive data. ?
Big data needs to be mapped into small units for calculation, and then all the results are integrated, which is the so-called map-reduce algorithm framework. Some data mining techniques are still needed for computing on a single computer. The difference is that some original data mining techniques may not be easily embedded in the map-reduce framework, and some algorithms need to be adjusted. ?
The similarity or correlation between big data and data mining lies in that the future of data mining is no longer aimed at a small amount of accurate data or sampling and randomization, but a large number of mixed big data. Data analysis refers to the process of analyzing a large number of collected data with appropriate statistical analysis methods, extracting useful information and forming conclusions, and then studying and summarizing the data in detail. This process is also the supporting process of quality management system. In practice, data analysis can help people make judgments. ?
Big data refers to a collection of data that cannot be captured, managed and processed by conventional software tools within a certain time range. It is a massive, high-growth and diversified information asset, which needs a new processing mode to have stronger decision-making, insight and discovery, and process optimization ability. ?
In The Age of Big Data, co-authored by Victor Meyer-Schoenberg and Kenneth Cookeye, big data means that all data are used for analysis and processing, and there is no shortcut to random analysis (sampling survey). 5V characteristics of big data (proposed by IBM): volume (mass), speed (high speed), diversity (diversity), value (low value density) and authenticity.
- Previous article:Question: What oil should I put in the cake?
- Next article:What is lion dancing and picking green?
- Related articles
- Who painted horses better than Li in Song Dynasty and Zhao Mengfu in Yuan Dynasty?
- Confucian view of benevolence and filial piety: filial piety is the fundamental difference between human beings and animals. What do you think of this?
- Professor Pipa from Liu Huanzhang
- Characteristics of Zhejiang Folk Houses in Homework Help
- Can ethylenediamine catalyze the reaction between mercaptan and isocyanate?
- Essay on loving the motherland and hometown
- Auspicious and auspicious WeChat pictures, WeChat avatars that can bring good luck and luck.
- Application introduction of coating additives
- Sapling paper cutting how to cut
- Is the Chengdu Tianfu New Area Administrative Committee not under Chengdu's control after it is abolished?