Discussion about this post

User's avatar
Joseph Adams's avatar

“In this case, the dataset overlap between any two trees is expected to be huge compared to the typical random forest.”

Is this a typo, or did I misunderstand? In a batching context isn’t the batch size normally much smaller than the whole dataset? And wouldn’t that imply minimal overlap in datasets between trees compared to a typical random forest? I agree though this would aid the bagging objective and reduce bias.

Expand full comment
2 more comments...

No posts