Warn on RP forest degenerescence #230

Open
wants to merge 1 commit into
base: master
from

Conversation

hamelin
Contributor

@hamelin hamelin commented Nov 10, 2023

Some datasets have properties that yield poor random projection trees, where branching does not divide the data well. In such cases, one gets one-leaf branches, and what remains is a large monolithic leaf. One approach is to keep dividing deeper, but the damage is done nonetheless, as these tiny branches can still cause problems. Thus, when the random projection recursion fails to yield a tree in which all leaves satisfy the leaf size constraint, we will drop that tree. Should we drop all the trees, we will revert to random initialisation.
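To make the intended behaviour concrete, here is a minimal, self-contained sketch of the drop-and-fall-back logic. It is not the code in this PR: `rp_tree_leaves`, `build_forest`, the `max_depth` budget and the warning texts are all hypothetical names chosen for illustration, and the toy splitter sends ties to one side, which is a simplification.

```python
import random
import warnings


def rp_tree_leaves(data, indices, leaf_size, rng, max_depth):
    """Recursively split `indices` with random hyperplanes; return the leaves.

    Hypothetical toy splitter, assumed for illustration only.
    """
    if len(indices) <= leaf_size or max_depth == 0:
        # The leaf may exceed leaf_size if the depth budget ran out.
        return [indices]
    # Hyperplane through the midpoint of two randomly chosen points.
    a, b = rng.sample(indices, 2)
    normal = [x - y for x, y in zip(data[a], data[b])]
    offset = sum(n * (x + y) / 2 for n, x, y in zip(normal, data[a], data[b]))
    left, right = [], []
    for i in indices:
        side = sum(n * x for n, x in zip(normal, data[i])) - offset
        # Simplification: ties (side == 0) always go right.
        (left if side > 0 else right).append(i)
    leaves = []
    for part in (left, right):
        if part:
            leaves.extend(
                rp_tree_leaves(data, part, leaf_size, rng, max_depth - 1)
            )
    return leaves


def build_forest(data, n_trees=4, leaf_size=3, max_depth=8, seed=0):
    """Keep only trees whose leaves all satisfy the leaf size constraint."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        root = list(range(len(data)))
        leaves = rp_tree_leaves(data, root, leaf_size, rng, max_depth)
        if all(len(leaf) <= leaf_size for leaf in leaves):
            forest.append(leaves)
        else:
            warnings.warn("Random projection tree is degenerate; dropping it.")
    if not forest:
        # An empty forest tells the caller to use random initialisation.
        warnings.warn("All trees dropped; reverting to random initialisation.")
    return forest
```

In this sketch an empty return value is the signal to initialise randomly. A dataset made of identical points, for example, never splits, so every tree is dropped and two kinds of warning are emitted: one per dropped tree and one for the fallback.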
