Understand why SMOTE struggles as features grow. Explore the geometry, see the data, and get clear, evidence-based recommendations.
Move the sliders. We simulate how SMOTE creates synthetic points between neighbors. In high dimensions, those points cover less of the space—this is geometric dilution.
The curve drops quickly. That’s the core reason SMOTE can underperform in high-d data.
Loading evaluation results...
Tell us about your dataset. We’ll recommend when SMOTE helps—and when to skip it.
These load from your visualizations/ folder if present.
SMOTE makes new points between nearby minority samples. In low dimensions, that works well. As dimensions grow, the “space” grows even faster—so those linear interpolations cover a tiny fraction. That’s geometric dilution, and it explains why SMOTE often stalls or backfires in high-d datasets.
Repo: smote-geometric-analysis