Daily Dose of Data Science

Daily Dose of Data Science

Home
Sponsor
Premium
Archive
Leaderboard
About

Sklearn

Train Classical ML Models on Large Datasets
Extend the Bagging objective to any ML algorithm.
May 5 • Avi Chawla
A Simple Implementation of Boosting Algorithm
...covered with design principles for Boosting.
Jan 1 • Avi Chawla
7 Categorical Data Encoding Techniques
...summarized in a single frame.
Mar 5, 2025 • Avi Chawla
Accelerate tSNE with GPU
Over 30x faster tSNE than Sklearn.
Feb 24, 2025 • Avi Chawla
ANN-driven KMeans with Faiss
20x speedup over sklearn.
Feb 17, 2025 • Avi Chawla
Categorization of Clustering Algorithms
6 types of clustering algorithms in a single frame.
Nov 14, 2024 • Avi Chawla
Avoid Using PCA for Visualization Unless
...this plot says so.
Nov 7, 2024 • Avi Chawla
KernelPCA vs. PCA for Dimensionality Reduction
...and when to not use KernelPCA.
Nov 4, 2024 • Avi Chawla
DBSCAN++: The Faster and Scalable Alternative to DBSCAN
Addressing major limitations of DBSCAN.
Oct 29, 2024 • Avi Chawla
How Decision Tree Computes Feature Importance?
The underlying mathematical details.
Oct 10, 2024 • Avi Chawla
Sparse Random Projections
An alternative to PCA for highly dimensional datasets.
Oct 5, 2024 • Avi Chawla
Cost Complexity Pruning in Decision Trees
Decision trees always overfit. Prevent it this way.
Sep 20, 2024 • Avi Chawla
© 2026 Avi Chawla · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture