Traditional Culture Encyclopedia - Tourist attractions - What are the algorithms for big data mining?
What are the algorithms for big data mining?
1. Naive Bayes, super simple, just like doing some counting work. If the conditional independence hypothesis holds, NB will converge faster than the discriminant model, so you only need a small amount of training data. Even if the hypothesis of conditional independence is not established, NB still performs surprisingly well in practice.
2. Logistic regression, LR has many methods to regularize the model. Compared with NB's conditional independence hypothesis, LR does not need to consider whether the samples are relevant. Different from decision tree and support vector machine, NB has good probability interpretation ability, and it is easy to update the model with new training data. If you want some probability information or want to update and improve the model conveniently when there is more data in the future, LR is worth using.
3. Decision tree, DT is easy to understand and explain. DT is nonparametric, so there is no need to worry about whether outliers (or outliers) and data are linearly separable. The main disadvantage of DT is that it is easy to over-fit, which is also the reason why an ensemble learning algorithm such as random forest is proposed.
4. Support vector machine has high classification accuracy, which has a good theoretical guarantee for over-fitting. Facing the problem of inseparability of feature linearity, it can also perform well by choosing appropriate kernel function. SVM is very popular in high-dimensional text classification.
If you want more detailed information, I suggest you take the CDA data analysis course. Big data analysts now have professional international certification. CDA, namely "CDA Data Analyst", is a professional authoritative international qualification certification for the whole industry under the background of digital economy and artificial intelligence era, aiming at improving the digital skills of the whole people, helping enterprises to transform digitally and promoting the digital development of the industry. "CDA data analyst" refers to a new type of data analyst who specializes in data collection, cleaning, processing and analysis, and can make business reports and provide decisions in the Internet, finance, retail, consulting, telecommunications, medical care, tourism and other industries. Click to make an appointment for a free audition class.
- Related articles
- A letter from Shanxi Provincial Department of Culture and Tourism to tourists.
- Will Changbai Mountain go to Ling Xue to wear snow pants on June 5438+ 10?
- Is it corny to wear a silver bracelet?
- Lin Heng's hosting experience:
- How to distinguish the grades of scenic spots? How many levels are there in the tourist area?
- How can I send a circle of friends if I can't buy a ticket
- What to prepare for traveling to Xiamen in November
- Huangshan's composition
- We are in Yunnan! I want to see Kunming, Lijiang, Shangri-La, Baoshan, Pu 'er, Xishuangbanna and the border, and I want to find the fastest tourist route, so I don't go.
- What necessities should I bring with me when I go on a two-day trip?