mlpack
2.0.1
|
Public Types | |
typedef neighbor::NeighborSearchTraversalInfo< TreeType > | TraversalInfoType |
Public Member Functions | |
RASearchRules (const arma::mat &referenceSet, const arma::mat &querySet, arma::Mat< size_t > &neighbors, arma::mat &distances, MetricType &metric, const double tau=5, const double alpha=0.95, const bool naive=false, const bool sampleAtLeaves=false, const bool firstLeafExact=false, const size_t singleSampleLimit=20, const bool sameSet=false) | |
double | BaseCase (const size_t queryIndex, const size_t referenceIndex) |
size_t | NumDistComputations () |
size_t | NumEffectiveSamples () |
double | Rescore (const size_t queryIndex, TreeType &referenceNode, const double oldScore) |
Re-evaluate the score for recursion order. More... | |
double | Rescore (TreeType &queryNode, TreeType &referenceNode, const double oldScore) |
Re-evaluate the score for recursion order. More... | |
double | Score (const size_t queryIndex, TreeType &referenceNode) |
Get the score for recursion order. More... | |
double | Score (const size_t queryIndex, TreeType &referenceNode, const double baseCaseResult) |
Get the score for recursion order. More... | |
double | Score (TreeType &queryNode, TreeType &referenceNode) |
Get the score for recursion order. More... | |
double | Score (TreeType &queryNode, TreeType &referenceNode, const double baseCaseResult) |
Get the score for recursion order, passing the base case result (in the situation where it may be needed to calculate the recursion order). More... | |
const TraversalInfoType & | TraversalInfo () const |
TraversalInfoType & | TraversalInfo () |
Private Member Functions | |
void | InsertNeighbor (const size_t queryIndex, const size_t pos, const size_t neighbor, const double distance) |
Insert a point into the neighbors and distances matrices; this is a helper function. More... | |
double | Score (const size_t queryIndex, TreeType &referenceNode, const double distance, const double bestDistance) |
Perform actual scoring for single-tree case. More... | |
double | Score (TreeType &queryNode, TreeType &referenceNode, const double distance, const double bestDistance) |
Perform actual scoring for dual-tree case. More... | |
Private Attributes | |
arma::mat & | distances |
The matrix the resultant neighbor distances should be stored in. More... | |
bool | firstLeafExact |
Whether to do exact computation on the first leaf before any sampling. More... | |
MetricType & | metric |
The instantiated metric. More... | |
arma::Mat< size_t > & | neighbors |
The matrix the resultant neighbor indices should be stored in. More... | |
size_t | numDistComputations |
arma::Col< size_t > | numSamplesMade |
The number of samples made for every query. More... | |
size_t | numSamplesReqd |
The minimum number of samples required per query. More... | |
const arma::mat & | querySet |
The query set. More... | |
const arma::mat & | referenceSet |
The reference set. More... | |
bool | sameSet |
If the query and reference set are identical, this is true. More... | |
bool | sampleAtLeaves |
Whether to sample at leaves or just use all of it. More... | |
double | samplingRatio |
The sampling ratio. More... | |
size_t | singleSampleLimit |
The limit on the largest node that can be approximated by sampling. More... | |
TraversalInfoType | traversalInfo |
Definition at line 25 of file ra_search_rules.hpp.
typedef neighbor::NeighborSearchTraversalInfo<TreeType> mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::TraversalInfoType |
Definition at line 195 of file ra_search_rules.hpp.
mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::RASearchRules | ( | const arma::mat & | referenceSet, |
const arma::mat & | querySet, | ||
arma::Mat< size_t > & | neighbors, | ||
arma::mat & | distances, | ||
MetricType & | metric, | ||
const double | tau = 5 , |
||
const double | alpha = 0.95 , |
||
const bool | naive = false , |
||
const bool | sampleAtLeaves = false , |
||
const bool | firstLeafExact = false , |
||
const size_t | singleSampleLimit = 20 , |
||
const bool | sameSet = false |
||
) |
double mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::BaseCase | ( | const size_t | queryIndex, |
const size_t | referenceIndex | ||
) |
|
private |
Insert a point into the neighbors and distances matrices; this is a helper function.
queryIndex | Index of point whose neighbors we are inserting into. |
pos | Position in list to insert into. |
neighbor | Index of reference point which is being inserted. |
distance | Distance from query point to reference point. |
|
inline |
Definition at line 186 of file ra_search_rules.hpp.
References mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::numDistComputations.
|
inline |
Definition at line 187 of file ra_search_rules.hpp.
References mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::numSamplesMade.
double mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::Rescore | ( | const size_t | queryIndex, |
TreeType & | referenceNode, | ||
const double | oldScore | ||
) |
Re-evaluate the score for recursion order.
A low score indicates priority for recursion, while DBL_MAX indicates that the node should not be recursed into at all (it should be pruned). This is used when the score has already been calculated, but another recursion may have modified the bounds for pruning. So the old score is checked against the new pruning bound.
For rank-approximation, it also checks if the number of samples left for a query to satisfy the rank constraint is small enough at this point of the algorithm, then this node is approximated by sampling and given a new score of 'DBL_MAX'.
double mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::Rescore | ( | TreeType & | queryNode, |
TreeType & | referenceNode, | ||
const double | oldScore | ||
) |
Re-evaluate the score for recursion order.
A low score indicates priority for recursion, while DBL_MAX indicates that the node should not be recursed into at all (it should be pruned). This is used when the score has already been calculated, but another recursion may have modified the bounds for pruning. So the old score is checked against the new pruning bound.
For the rank-approximation, we check if the referenceNode can be approximated by sampling. If it can be, enough samples are made for every query in the queryNode. No further query-tree traversal is performed.
The 'NumSamplesMade' query stat is propagated up the tree. And then if pruning occurs (by distance or by sampling), the 'NumSamplesMade' stat is not propagated down the tree. If no pruning occurs, the stat is propagated down the tree.
queryNode | Candidate query node to recurse into. |
referenceNode | Candidate reference node to recurse into. |
oldScore | Old score produced by Socre() (or Rescore()). |
double mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::Score | ( | const size_t | queryIndex, |
TreeType & | referenceNode | ||
) |
Get the score for recursion order.
A low score indicates priority for recursion, while DBL_MAX indicates that the node should not be recursed into at all (it should be pruned).
For rank-approximation, the scoring function first checks if pruning by distance is possible. If yes, then the node is given the score of 'DBL_MAX' and the expected number of samples from that node are added to the number of samples made for the query.
If no, then the function tries to see if the node can be pruned by approximation. If number of samples required from this node is small enough, then that number of samples are acquired from this node and the score is set to be 'DBL_MAX'.
If the pruning by approximation is not possible either, the algorithm continues with the usual tree-traversal.
queryIndex | Index of query point. |
referenceNode | Candidate node to be recursed into. |
double mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::Score | ( | const size_t | queryIndex, |
TreeType & | referenceNode, | ||
const double | baseCaseResult | ||
) |
Get the score for recursion order.
A low score indicates priority for recursion, while DBL_MAX indicates that the node should not be recursed into at all (it should be pruned).
For rank-approximation, the scoring function first checks if pruning by distance is possible. If yes, then the node is given the score of 'DBL_MAX' and the expected number of samples from that node are added to the number of samples made for the query.
If no, then the function tries to see if the node can be pruned by approximation. If number of samples required from this node is small enough, then that number of samples are acquired from this node and the score is set to be 'DBL_MAX'.
If the pruning by approximation is not possible either, the algorithm continues with the usual tree-traversal.
queryIndex | Index of query point. |
referenceNode | Candidate node to be recursed into. |
baseCaseResult | Result of BaseCase(queryIndex, referenceNode). |
double mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::Score | ( | TreeType & | queryNode, |
TreeType & | referenceNode | ||
) |
Get the score for recursion order.
A low score indicates priority for recursionm while DBL_MAX indicates that the node should not be recursed into at all (it should be pruned).
For the rank-approximation, we check if the referenceNode can be approximated by sampling. If it can be, enough samples are made for every query in the queryNode. No further query-tree traversal is performed.
The 'NumSamplesMade' query stat is propagated up the tree. And then if pruning occurs (by distance or by sampling), the 'NumSamplesMade' stat is not propagated down the tree. If no pruning occurs, the stat is propagated down the tree.
queryNode | Candidate query node to recurse into. |
referenceNode | Candidate reference node to recurse into. |
double mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::Score | ( | TreeType & | queryNode, |
TreeType & | referenceNode, | ||
const double | baseCaseResult | ||
) |
Get the score for recursion order, passing the base case result (in the situation where it may be needed to calculate the recursion order).
A low score indicates priority for recursion, while DBL_MAX indicates that the node should not be recursed into at all (it should be pruned).
For the rank-approximation, we check if the referenceNode can be approximated by sampling. If it can be, enough samples are made for every query in the queryNode. No further query-tree traversal is performed.
The 'NumSamplesMade' query stat is propagated up the tree. And then if pruning occurs (by distance or by sampling), the 'NumSamplesMade' stat is not propagated down the tree. If no pruning occurs, the stat is propagated down the tree.
queryNode | Candidate query node to recurse into. |
referenceNode | Candidate reference node to recurse into. |
baseCaseResult | Result of BaseCase(queryIndex, referenceNode). |
|
private |
Perform actual scoring for single-tree case.
|
private |
Perform actual scoring for dual-tree case.
|
inline |
Definition at line 197 of file ra_search_rules.hpp.
References mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::traversalInfo.
|
inline |
Definition at line 198 of file ra_search_rules.hpp.
References mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::traversalInfo.
|
private |
The matrix the resultant neighbor distances should be stored in.
Definition at line 211 of file ra_search_rules.hpp.
|
private |
Whether to do exact computation on the first leaf before any sampling.
Definition at line 220 of file ra_search_rules.hpp.
|
private |
The instantiated metric.
Definition at line 214 of file ra_search_rules.hpp.
|
private |
The matrix the resultant neighbor indices should be stored in.
Definition at line 208 of file ra_search_rules.hpp.
|
private |
Definition at line 235 of file ra_search_rules.hpp.
Referenced by mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::NumDistComputations().
|
private |
The number of samples made for every query.
Definition at line 229 of file ra_search_rules.hpp.
Referenced by mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::NumEffectiveSamples().
|
private |
The minimum number of samples required per query.
Definition at line 226 of file ra_search_rules.hpp.
|
private |
The query set.
Definition at line 205 of file ra_search_rules.hpp.
|
private |
The reference set.
Definition at line 202 of file ra_search_rules.hpp.
|
private |
If the query and reference set are identical, this is true.
Definition at line 238 of file ra_search_rules.hpp.
|
private |
Whether to sample at leaves or just use all of it.
Definition at line 217 of file ra_search_rules.hpp.
|
private |
The sampling ratio.
Definition at line 232 of file ra_search_rules.hpp.
|
private |
The limit on the largest node that can be approximated by sampling.
Definition at line 223 of file ra_search_rules.hpp.
|
private |
Definition at line 240 of file ra_search_rules.hpp.
Referenced by mlpack::neighbor::RASearchRules< SortPolicy, MetricType, TreeType >::TraversalInfo().