Comments (9)
@Mateko Could you supply a case for more details?
from toad.
@Secbone I'm trying to test my data, while IV values seem like my function, then gini only returns values from (42-43) for all variables in the dataset - which is definitely wrong.
from toad.
@Mateko the gini value is depends on data, so you can't say "the values is between 42 and 43" is wrong, could you supply a case, so we can find out if it is wrong?
BTW, the gini value in quality
is conditional gini
, not the gini value of the feature.
from toad.
Its looking like constat value, but is not, how the variable with 0.04 IV could have 43 gini??
I probably don't understand something
from toad.
@Mateko the gini value's formula in quality
is , which is to use to measure the correlation between the feature and the target, you can see the order of values in your result, the higher the gini value is, the more useless the feature is.
from toad.
If corellation between feature and default its higher then this feature should be more usefull.
from toad.
@Mateko Yes, you can think the gini value as the negtive correlation value.
from toad.
So u think its good to named it Gini in this place? Maybe be better to get there a default % and get a roc auc score from this?
from toad.
@Mateko Of course it is Gini. The ROC or AUC usually used in binary classification as model metrics, but in this place, it needs a value to measure features in many cases, not only binary.
So I think Gini in this place is not bad at least.
from toad.
Related Issues (20)
- 请问{toad各版本}依赖的{numpy库版本}是什么? HOT 1
- 分箱边界inf的可能bug HOT 2
- ScoreCard的保存过程的优化建议 HOT 2
- 200万数据,500维特征,卡方分箱很慢,有没有好办法? HOT 1
- toad.quality报错 HOT 1
- 导入报错问题 HOT 4
- setup.py不支持macOS系统安装 HOT 1
- 关于selection.py中StatsModel的loglikelihood方法 HOT 1
- toad0.1.1在服务器内核为aarch64上安装失败 HOT 3
- IOS 安装成功,但是无法导入 HOT 1
- toad\c_utils.pyx:40:9: 'number' is not a type identifier Cython.Compiler.Errors.CompileError: toad/c_utils.pyx HOT 4
- OSX-ARM Compatibility HOT 2
- There is a problem about toad.quality HOT 2
- toad.selection执行逐步回归方法时报错:ValueError: at least one array or dtype is required HOT 1
- 使用bin_plot观察分箱时,不支持column为bool型
- Public Datasets
- toad.selection.select 可以处理字符串变量吗? HOT 4
- 请问支持多分类吗?
- Error Installation HOT 7
- toad.quality报错 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from toad.