Comments (3)
@naarkhoo thanks for your interest int he package. I am not exactly sure what weighting scheme you had in mind – perhaps you can explain. It is not so likely that I can add this since the underlying ANN library has no support for any weighting feature.
from rann.
In my dataset rows are customers and columns are covariates. I have three covariates, 1)age, 2) category of movie, 3) wether they have bought a movie for someone else. The variance of covariate 3, is much lower, 95% of cases 0 and a few are 1 and the variation in 1 is higher.
So I am thinking of a scheme where one can give weight to each covariate when calculating the distances among rows(customers). If I calculate the distance as it is, I am assuming all covariates are equally important, which I don't think is the case. I believe, this package https://cran.r-project.org/web/packages/distances/distances.pdf provides a weight parameter and has some good examples. Weights either can be based on the variance of each covariance, or manually set.
I hope, I have explained it well - let me know if there is any think unclear. Thank you again.
from rann.
@naarkhoo thanks for the explanation. However this normalisation is something that you need to do before you pass your points to RANN::nn2
, by scaling each column appropriately. This is exactly what the distances
package seems to do. So I am sorry, but it doesn't make sense for RANN to do this.
from rann.
Related Issues (17)
- nn2 should check length query/data explicitly
- release RANN1 to CRAN HOT 6
- fix bug in checking query vs data dimensions HOT 1
- bd tree HOT 16
- nn2 docs say it returns a data.frame (not a matrix)
- return the number of point within a radius HOT 7
- Remove register keyword from ANN source for CRAN
- points to line distance
- Solaris compile error ERR HOT 1
- Don't use CPPFLAGS for compiler flags HOT 1
- Please ship declarations in inst/include so we can linkingTo your c++ functions HOT 2
- tic::use_tic() for automatic deployment of pkgdown and testing on AppVeyor HOT 5
- Consider provinding L1 metric in the RANN package
- Support other Minkowski metrics HOT 20
- RANN.L1 remote dependency HOT 11
- Objects with zero distance HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rann.