Calculate True Positive Rate (Sensitivity)
Analyze your classification model's performance by calculating the True Positive Rate (TPR) from a confusion matrix.
What is True Positive Rate (Sensitivity)?
The True Positive Rate (TPR), commonly referred to as Sensitivity or Recall, is a crucial performance metric in classification tasks. It quantifies how well a model identifies all the relevant cases within a dataset. In simpler terms, it tells you: "Out of all the actual positive instances, how many did the model correctly predict as positive?"
This metric is particularly vital in scenarios where missing a positive case has significant consequences, such as in medical diagnoses (failing to detect a disease), fraud detection (failing to flag a fraudulent transaction), or critical system alerts. A high TPR indicates that your model is good at capturing positive instances, minimizing false negatives.
Understanding the True Positive Rate is essential for anyone evaluating machine learning models, especially in fields like data science, bioinformatics, and cybersecurity. It helps in diagnosing model weaknesses and making informed decisions about model selection and improvement. Misinterpreting TPR can lead to overlooking critical errors in a model's predictions.
Who Should Use This Calculator?
- Data Scientists and Machine Learning Engineers: To evaluate and compare classification models.
- Researchers: To assess the performance of diagnostic or detection algorithms.
- Business Analysts: To understand the effectiveness of models used for identifying customers, detecting anomalies, or flagging risks.
- Students and Educators: To learn and demonstrate key machine learning evaluation metrics.
Common Misunderstandings
- Confusing TPR with Accuracy: Accuracy considers all predictions (true positives, true negatives, false positives, and false negatives), while TPR focuses only on the actual positive cases. A model can have high accuracy but low TPR if it misses many positive cases while correctly classifying the negatives (see the sketch after this list).
- Ignoring the Context: The importance of TPR varies. In some applications, minimizing false positives might be more critical than maximizing TPR.
- Unit Confusion: Although TPR is usually expressed as a percentage, the raw inputs (TP, FN) are counts. The calculation itself is unitless and then converted to a percentage.
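To make the first point concrete, here is a minimal sketch with made-up counts for a heavily imbalanced test set: the model classifies every negative correctly and misses every positive, so accuracy looks strong while TPR is zero.

```python
# Hypothetical counts for an imbalanced test set: 950 actual negatives, 50 actual positives.
tp, fn = 0, 50      # the model misses every actual positive
tn, fp = 950, 0     # but classifies every actual negative correctly

accuracy = (tp + tn) / (tp + tn + fp + fn)          # 950 / 1000 = 0.95
tpr = tp / (tp + fn)                                # 0 / 50 = 0.0

print(f"Accuracy: {accuracy:.0%}  TPR: {tpr:.0%}")  # Accuracy: 95%  TPR: 0%
```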
True Positive Rate (Sensitivity) Formula and Explanation
The True Positive Rate (TPR) is calculated using the counts of True Positives (TP) and False Negatives (FN) from a confusion matrix. The sum of TP and FN represents the total number of actual positive instances in the dataset.
The formula is straightforward:
TPR = True Positives / (True Positives + False Negatives)
This value is typically expressed as a percentage.
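As a quick sanity check, the formula can be written as a small helper function. This is a sketch for illustration, not the calculator's own implementation; the guard for TP + FN = 0 is an assumption about how to handle the undefined case.

```python
def true_positive_rate(tp: int, fn: int) -> float:
    """Return TPR = TP / (TP + FN) as a fraction between 0 and 1.

    Returns float('nan') when TP + FN == 0, because the ratio is
    undefined if there are no actual positive instances.
    """
    actual_positives = tp + fn
    if actual_positives == 0:
        return float("nan")
    return tp / actual_positives

# Format as a percentage to match the calculator's output.
print(f"{true_positive_rate(8, 2):.0%}")  # 80%
```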
Variables Explained:
- True Positives (TP): The number of instances that were correctly predicted as positive by the model. These are cases where the model's prediction matches the actual positive label.
- False Negatives (FN): The number of instances that were actually positive but were incorrectly predicted as negative by the model. These are often called "Type II errors."
- Actual Positives (P): The total count of all instances that are actually positive in the dataset. This is calculated as TP + FN.
Confusion Matrix Variables Table
| Variable | Meaning | Unit | Typical Range |
|---|---|---|---|
| TP (True Positives) | Correctly predicted positive instances | Count (Unitless) | ≥ 0 |
| FN (False Negatives) | Actual positive instances predicted as negative | Count (Unitless) | ≥ 0 |
| P (Actual Positives) | Total number of actual positive instances (TP + FN) | Count (Unitless) | ≥ 0 |
| TPR (True Positive Rate) | Proportion of actual positives correctly identified | Percentage (%) | 0% to 100% |
Practical Examples
Example 1: Medical Diagnosis Model
A hospital uses a machine learning model to detect a specific disease from patient data. The model analyzes 150 patients known to have the disease.
- Inputs:
- True Positives (TP): 120 (patients correctly identified as having the disease)
- False Negatives (FN): 30 (patients who actually had the disease but were incorrectly identified as not having it)
Calculation:
Actual Positives (P) = TP + FN = 120 + 30 = 150
TPR = 120 / 150 = 0.80
Result: The True Positive Rate is 80%. This means the model correctly identifies 80% of patients who actually have the disease. The hospital might consider improving the model to reduce the 30 missed diagnoses (false negatives).
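The same arithmetic, verified in a short Python snippet using the counts from Example 1:

```python
tp, fn = 120, 30                 # counts from Example 1
actual_positives = tp + fn       # 150 patients who truly have the disease
tpr = tp / actual_positives      # 0.80
print(f"TPR = {tpr:.0%}")        # TPR = 80%
```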
Example 2: Spam Email Detection
An email service provider tests a new spam filter on a batch of 1000 emails: 100 are known to be spam and 900 are legitimate. Because the filter's job is to detect spam, the positive class here is "Spam."
- Inputs:
- Actual Spam emails: 100
- Actual Legitimate emails: 900
- Model predictions:
- True Positives (TP): 70 (spam emails correctly identified as spam)
- False Negatives (FN): 30 (spam emails incorrectly identified as legitimate)
- True Negatives (TN): 880 (legitimate emails correctly identified as legitimate)
- False Positives (FP): 20 (legitimate emails incorrectly flagged as spam)
Calculation:
Actual Positives (P) = TP + FN = 70 + 30 = 100
TPR = 70 / 100 = 0.70
Result: The True Positive Rate (Sensitivity) for detecting spam is 70%. The filter catches 70% of the actual spam emails; the remaining 30% are missed (False Negatives). Whether 70% is acceptable depends on the application's tolerance for missed spam. Note that how well the filter keeps legitimate emails out of the spam folder is measured by the True Negative Rate (Specificity), not the TPR.
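A short sketch computing both rates from the spam example's confusion matrix makes the TPR/TNR distinction explicit:

```python
# Counts from the spam-detection scenario above (positive class = Spam).
tp, fn = 70, 30     # actual spam: caught vs. missed
tn, fp = 880, 20    # actual legitimate: kept vs. wrongly flagged

tpr = tp / (tp + fn)    # sensitivity to spam: 70 / 100 = 0.70
tnr = tn / (tn + fp)    # specificity: 880 / 900 ≈ 0.978

print(f"TPR (spam caught): {tpr:.1%}")        # 70.0%
print(f"TNR (legitimate kept): {tnr:.1%}")    # 97.8%
```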
How to Use This True Positive Rate Calculator
- Identify Your Confusion Matrix Values: First, you need the counts for True Positives (TP) and False Negatives (FN) from your classification model's confusion matrix (a programmatic sketch for extracting these counts follows this list).
- Input TP: Enter the number of True Positives into the "True Positives (TP)" field.
- Input FN: Enter the number of False Negatives into the "False Negatives (FN)" field.
- Calculate: Click the "Calculate True Positive Rate" button.
- Interpret Results: The calculator will display:
- The calculated True Positive Rate (TPR) as a percentage.
- The intermediate values used in the calculation (TP, FN, and total Actual Positives P).
- A brief explanation of the formula.
- Units: The inputs (TP, FN) are counts and are unitless. The output is a percentage (%), representing a ratio.
- Reset: Use the "Reset" button to clear all input fields and results, allowing you to perform a new calculation.
- Copy: Click "Copy Results" to copy the calculated TPR, its unit, and the intermediate values to your clipboard.
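If your TP and FN counts come from code rather than a report, a library such as scikit-learn can produce them. A minimal sketch, assuming binary labels encoded as 0/1 with 1 as the positive class and hypothetical prediction data:

```python
from sklearn.metrics import confusion_matrix  # assumes scikit-learn is installed

# Hypothetical true labels and model predictions (1 = positive class, 0 = negative).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]

# For binary labels {0, 1}, confusion_matrix(...).ravel() unpacks as TN, FP, FN, TP.
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()

tpr = tp / (tp + fn)
print(f"TP={tp}, FN={fn}, TPR={tpr:.0%}")  # TP=3, FN=1, TPR=75%
```

The extracted TP and FN values can then be entered directly into the calculator fields above.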
Key Factors That Affect True Positive Rate
- Class Imbalance: If the dataset has significantly more negative instances than positive ones, a model might become biased towards predicting negative, potentially lowering the TPR. While TPR calculation itself doesn't change, the model's ability to achieve a high TPR can be challenged by imbalance.
- Model Complexity: Overly simple models might not capture the patterns needed to identify all positive cases (underfitting), leading to a low TPR. Conversely, overly complex models (overfitting) might perform well on training data but generalize poorly, though this often impacts other metrics more directly than TPR specifically.
- Feature Engineering and Selection: The quality and relevance of the input features significantly impact a model's ability to distinguish between positive and negative cases. Poor features can mask the signals of positive instances.
- Threshold Selection: For models that output probabilities (such as logistic regression or neural networks), the decision threshold used to classify an instance as positive or negative directly affects the TP and FN counts. Lowering the threshold generally increases TPR but also increases False Positives (see the sketch after this list).
- Data Quality: Errors, noise, or missing values in the dataset can mislead the model, making it harder to correctly identify true positives.
- Algorithm Choice: Different classification algorithms have varying strengths and weaknesses. Some algorithms might be inherently better suited for capturing positive patterns in specific types of data than others, influencing the achievable TPR.
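To illustrate the threshold point, here is a small sketch with made-up probabilities and labels. Lowering the threshold converts a missed positive into a caught one, raising TPR (it would also raise False Positives, which this sketch does not track).

```python
# Hypothetical predicted probabilities and true labels (1 = positive class).
probs  = [0.95, 0.80, 0.60, 0.40, 0.30, 0.20]
labels = [1,    1,    0,    1,    0,    0]

def counts_at_threshold(threshold):
    """Return (TP, FN) when instances with prob >= threshold are predicted positive."""
    preds = [1 if p >= threshold else 0 for p in probs]
    tp = sum(1 for pred, y in zip(preds, labels) if pred == 1 and y == 1)
    fn = sum(1 for pred, y in zip(preds, labels) if pred == 0 and y == 1)
    return tp, fn

for t in (0.5, 0.3):
    tp, fn = counts_at_threshold(t)
    print(f"threshold={t}: TP={tp}, FN={fn}, TPR={tp / (tp + fn):.0%}")
# threshold=0.5: TP=2, FN=1, TPR=67%
# threshold=0.3: TP=3, FN=0, TPR=100%
```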
Related Tools and Resources
Explore these related calculators and guides to deepen your understanding of model evaluation:
- Calculate Precision (Positive Predictive Value)
- Calculate Recall (Sensitivity/TPR) – *You are here!*
- Calculate F1 Score
- Calculate Specificity (True Negative Rate)
- Understanding Confusion Matrices
- Guide to ROC Curves and AUC