Comments (6)
Hi @vikrant-sahu!
You got it right from the comments! We are aware of it and will fix it really soon :)
Could you share your use case when you got a wrong result? We will add it to the testing now.
from evidently.
If you want the data. Then sorry I can't share it as it a confidential dataset and can't be shared outside our network.
from evidently.
Hi @vikrant-sahu
Could share something of the below:
- the number of classes in the dataset
- the rough number of observations (are we talking thousands, or tens of thousands, or more)
- whether the metric was incorrect or did not compute at all.
That would help!
We understand it can be sensitive - appreciate the response in any case.
from evidently.
Cool..
- the number of classes in the dataset - 2 classes
- the rough number of observations (are we talking thousands, or tens of thousands, or more) - thousands, within 10K
- whether the metric was incorrect or did not compute at all. It was incorrect. I used sklearn and it gave different metric. Even AUC curve was incorrect. Also please check other curves too.
from evidently.
Hi @vikrant-sahu,
Apologies for the late reply! (🦠)
Thanks for your answers! We are still struggling to reproduce the bug. Here are a couple ideas we have on why the calculation is wrong.
- Could you please check if your target (positive) class is the first in the column-mapping ['prediction'] list?
In other tools including sklearn, in order to calculate the ROC AUC metric, you have to explicitly pass the probabilities for the target class. Evidently expects the class columns to follow the order. If it is reversed, the metrics will be calculated incorrectly.
- Do you pass the target function as is (categorical label) or there is some post-processing (e.g. you pass the class label like "0" or "1")?
If you pass a categorical label to Evidently, it should automatically binarize it. However, the problem might have happened at this stage, and we'd look further!
Overall, we believe that if you:
- pass a binarized target label to the Evidently (1 for target class and 0 for other); and
- make sure that positive class is the first in the column-mapping ['prediction'] list
the calculation should be correct.
If it is not, please let us know!
from evidently.
Hi @vikrant-sahu!
We found a bug and fixed it in #122
from evidently.
Related Issues (20)
- evidently collector service HOT 2
- Weird labelling for y-axis in some error plots
- Collector service cannot process np.nan, np.inf, np.NINF values passed to endpoints
- Multi-output regression metrics HOT 1
- UI 500 when opening a report or tests if one of the metrics or tests ends with an error
- Slow test execution and metric calculation HOT 8
- UI dashboard cannot update if the project does not contain a Git repository
- Report import error in evidently.report package in databricks notebook HOT 1
- UI dashboard doesn't automatically update HOT 1
- Support for using embeded images instead of Plotly interactive component HOT 2
- DataDriftPreset : drift_share threshehold does not work with spark engine
- evidently slow in docker HOT 2
- If current df doesn't have an expected column, running tests raises assertion error HOT 1
- Failed to load module script: Expected a JavaScript module script but the server responded with a MIME type of "text/html". Strict MIME type checking is enforced for module scripts per HTML spec. HOT 1
- New pydantic version crashes import resolving
- Save report to file cause "unclosed file" ResourceWarning
- Custom StatTest Failing in Python Due to missing Python Engine HOT 1
- Feedback and bugs (mostly about Cloud) HOT 1
- Bug: Cannot save html classification report when target column and possible labels do not match. HOT 2
- Missing "Per feature"-metrics in Regression HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from evidently.