Add Gaussian Naive Bayes classifier in machine_learning/ by PRERITARYA · Pull Request #14853 · TheAlgorithms/Python

PRERITARYA · 2026-06-24T03:20:31Z

Describe your change:

Add Gaussian Naive Bayes classifier implemented from scratch without any
external ML libraries (no sklearn).

Implements the full pipeline:

separate_by_class: splits training data by class label
compute_mean_variance: computes per-feature Gaussian statistics
train: fits priors and per-class feature summaries
gaussian_log_probability: evaluates the Gaussian PDF in log space
predict / predict_single: classifies new samples
accuracy: evaluates classifier performance

Add an algorithm?
Fix a bug or typo in an existing algorithm?
Add or change doctests?
Documentation change?

Checklist:

for more information, see https://pre-commit.ci

algorithms-keeper

Click here to look at the relevant links ⬇️

🔗 Relevant Links

Repository:

Contributing guidelines

Project Euler solution guidelines

Python:

Formatted string literals (f-strings)

Type hints

doctest

unittest

pytest

Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.

algorithms-keeper commands and options

algorithms-keeper actions can be triggered by commenting on this PR:

@algorithms-keeper review to trigger the checks for only added pull request files

@algorithms-keeper review-all to trigger the checks for all the pull request files, including the modified files. As we cannot post review comments on lines not part of the diff, this command will post all the messages in one comment.

NOTE: Commands are in beta and so this feature is restricted only to a member or owner of the organization.

algorithms-keeper · 2026-06-24T03:21:20Z

+
+    return priors, summaries
+
+


Please provide descriptive name for the parameter: x

Copilot

Pull request overview

Adds a from-scratch Gaussian Naive Bayes classifier implementation under machine_learning/, intended to provide a lightweight probabilistic classifier without external ML dependencies.

Changes:

Introduces training helpers to compute per-class priors and per-feature Gaussian summaries (mean/variance).
Implements log-space Gaussian likelihood scoring for stable prediction.
Adds doctests for core helpers and an executable doctest.testmod() entrypoint.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+    n_samples = len(data)
+    separated = separate_by_class(data, labels)
+
+    priors: dict[int, float] = {}
+    summaries: dict[int, list[tuple[float, float]]] = {}
+
+    for class_label, class_samples in separated.items():
+        priors[class_label] = math.log(len(class_samples) / n_samples)
+        # transpose to get per-feature lists
+        features_by_column = [
+            [row[col] for row in class_samples] for col in range(len(class_samples[0]))
+        ]
+        summaries[class_label] = [
+            compute_mean_variance(column) for column in features_by_column
+        ]


+    for class_label, feature_summaries in summaries.items():
+        score = priors[class_label]
+        for feature_value, (mean, variance) in zip(feature_vector, feature_summaries):
+            score += gaussian_log_probability(feature_value, mean, variance)


+    if not predictions:
+        raise ValueError("Inputs must not be empty.")
+    if len(predictions) != len(actual):
+        raise ValueError("Predictions and actual labels must have the same length.")


+Time Complexity:  O(n * k * d) for training, O(k * d) for prediction
+                  where n = samples, k = classes, d = features


Add Gaussian Naive Bayes classifier in machine_learning/

d5ee46e

Copilot AI review requested due to automatic review settings June 24, 2026 03:20

[pre-commit.ci] auto fixes from pre-commit.com hooks

37325bb

for more information, see https://pre-commit.ci

Copilot started reviewing on behalf of PRERITARYA June 24, 2026 03:20 View session

algorithms-keeper Bot added awaiting reviews This PR is ready to be reviewed require descriptive names This PR needs descriptive function and/or variable names labels Jun 24, 2026

algorithms-keeper Bot reviewed Jun 24, 2026

View reviewed changes

Copilot AI reviewed Jun 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Gaussian Naive Bayes classifier in machine_learning/#14853

Add Gaussian Naive Bayes classifier in machine_learning/#14853
PRERITARYA wants to merge 2 commits into
TheAlgorithms:masterfrom
PRERITARYA:add/gaussian-naive-bayes

PRERITARYA commented Jun 24, 2026

Uh oh!

algorithms-keeper Bot left a comment

Uh oh!

algorithms-keeper Bot Jun 24, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		Time Complexity: O(n * k * d) for training, O(k * d) for prediction
		where n = samples, k = classes, d = features

Uh oh!

Conversation

PRERITARYA commented Jun 24, 2026

Describe your change:

Checklist:

Uh oh!

algorithms-keeper Bot left a comment

Choose a reason for hiding this comment

🔗 Relevant Links

Repository:

Python:

Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.

algorithms-keeper actions can be triggered by commenting on this PR:

Uh oh!

algorithms-keeper Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants