AI Models

Machine learning models powering CAREN fraud detection

2 Models Active
99.94% Ensemble Accuracy
Production

Random Forest

99.94%accuracy

Ensemble of decision trees with bagging. Primary production model with highest accuracy across all metrics. Handles class imbalance through SMOTE oversampling.

Precision
94.1%
Recall
81.6%
Production

XGBoost

99.92%accuracy

Gradient boosted trees with regularization. Secondary production model providing ensemble diversity. Excels at capturing complex non-linear fraud patterns.

Precision
92.3%
Recall
79.6%
Baseline

Logistic Regression

97.41%accuracy

Linear classification baseline model. Provides interpretable probability estimates and serves as the performance baseline for all other models in the pipeline.

Precision
85.7%
Recall
61.2%
Standby

K-Nearest Neighbors

99.65%accuracy

Instance-based lazy learner using proximity voting. On standby for production failover scenarios. Strong performance on localized fraud clusters in feature space.

Precision
89.5%
Recall
69.4%
Evaluation

Decision Tree (AdaBoost)

99.87%accuracy

Adaptive boosting ensemble of shallow decision trees. Currently under evaluation for potential production deployment. Shows promising results on recent fraud patterns.

Precision
91.2%
Recall
77.5%

Performance Comparison

Side-by-side metrics for all trained models

ModelAccuracyPrecisionRecallF1 ScoreAUC-ROCStatus
Random Forest
Best
99.94%94.12%81.63%87.43%98.21%
Production
XGBoost
99.92%92.35%79.59%85.49%97.84%
Production
Logistic Regression
97.41%85.71%61.22%71.43%95.12%
Baseline
K-Nearest Neighbors
99.65%89.47%69.39%78.16%96.53%
Standby
Decision Tree (AdaBoost)
99.87%91.18%77.55%83.78%97.42%
Evaluation

Live Model Testing

Generate and analyze test transactions in real-time

Click the button above to generate a test transaction and see how the CAREN ensemble model analyzes it.

Feature Importance

Top PCA features contributing to fraud detection

V14
Top 1
18.23%
V4
Top 2
14.56%
V12
Top 3
12.34%
V10
9.87%
V11
8.76%
V17
7.65%
V3
6.54%
V16
5.43%
Amount
4.32%
V7
3.21%

Feature importance is derived from the Random Forest model's Gini impurity reduction. V14, V4, and V12 are the most discriminative PCA components for separating fraudulent from legitimate transactions.