Machine Learning
- Supervised learning
- Classification
- Regression
- Unsupervised learning
- Clustering
- Association
- Reinforcement learning
Bayes’s Rule
E: Evidence
B: Belief
P(B): Prior probability
P(E): Marginal probability
P(E|B): Likelihood
P(B|E): Posterior probability
Multiple Beliefs
If we have beliefs
Multiple Evidences
If we have evidences, i.e.
Naive Bayes
Assume each evidence makes an independent and equal contribution to the belief, therefore
Zero Frequency Problem and α-Laplace Smoothing
is the number of possible values of
Continuous Variables
Gaussian Distribution
Underflow Preventation
Applications
- Real-time predications (fast)
- Multi-class prediction
- Text classification (sentiment analysis, spam filtering)
- Recommendation system
Pros
- Relatively easy to implement
- Computational efficiency
- Works on large dataset
- Predict multiple class
- NLP text classification
Cons
- Assumed features are independent
- Precision low when dataset is small
- Doing regression will provide biased results (probability instead of classification)
- Zero Frequency Problem and α-Laplace Smoothing