Comments (6)
You are right, it is currently not saved. I think we can add it as you suggest, we just need to add something for the x/y interface (where there might be no name).
from ranger.
IMHO for x/y interface, dependent.variable.name
should be NULL
as we are not getting it either from a named argument or a named datastructure (such as a dataframe).
from ranger.
For the xy/ interface we could default to using deparse(substitute(y))
, no? Taking the name of the object might be more informative than a NULL
from ranger.
But then it would be an object name and not a variable name, right? I think people want to use dependent.variable.name
to subset data
later, which wouldn't make sense with x/y.
from ranger.
Ah, sure - I was primarily worried about having default behavior that doesn't yield NULL
, and assumed that x/y interface would be used in a context where target and features are in distinct objects anyway, hence falling back to object names is the next best thing to having variable names. Obviously, if y = somedata$sometarget
then that point is moot, kind of.
from ranger.
Thanks, merged #698.
from ranger.
Related Issues (20)
- classProbs are not in line with the predicted label HOT 4
- Trees summary statistics: height, splits HOT 2
- Matrices without colnames. HOT 2
- A check on inbag size would be nice
- Feature Request: inclusion of the trivial random forest model HOT 2
- compilation failed for package 'ranger' HOT 2
- Clarify Gini index calculation HOT 2
- What is the difference between case.weights, class.weights and sample.fraction? HOT 1
- Identifying out-of-bag observations for each tree HOT 4
- parallel execution of importance_pvalues HOT 1
- Simple fix for installation issue found? HOT 1
- consistent errors with ranger on MacBook Pro Sonoma 14.1.1 HOT 1
- Create a list of forks, extensions etc.
- Multicollinearity and Variable Importance HOT 2
- Error: Too many levels in unordered categorical variable. Only 63 levels allowed on this system. Ranger will EXIT now. HOT 3
- Ranger does not throw an error when predicting with a categorical variable with new/unseen categories HOT 1
- Major Performance Degradation when mtry Parameter is Large HOT 2
- FindBestSplit of C++ version may not find the best split HOT 1
- Question about Calculation of Feature Importance in Regression
- Order of feature in formula seems to matter (last place, particularly) HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ranger.