Skip to content
#

bernoulli-distribution

Here are 26 public repositories matching this topic...

Data distribution is a function that lists out all possible values the Data can take. It can be a continuous or discrete Data distribution. Several known standard Probability Distribution functions provide probabilities of occurrence of different possible outcomes in an experiment.

  • Updated Jan 23, 2022
  • Jupyter Notebook

Used data of emails being spam or non-spam for performing text classification using different probability distributions. Used NLTK library to remove stop words, non-alphabetic characters, and for tokenizing the text. Calculated mean and variance and other params for each word based on the label(spam or ham).

  • Updated Dec 5, 2023
  • Jupyter Notebook

Used data of emails being spam or non-spam for performing text classification using different probability distributions. Used NLTK library to remove stop words, non-alphabetic characters, and for tokenizing the text. Calculated mean and variance and other params for each word based on the label(spam or ham).

  • Updated Dec 5, 2023
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the bernoulli-distribution topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the bernoulli-distribution topic, visit your repo's landing page and select "manage topics."

Learn more