Git Product home page Git Product logo

skab's People

Contributors

dependabot[bot] avatar igormozolin avatar kotsoevk avatar kozitsinslava avatar waico avatar ykatser avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

skab's Issues

Data<->Readme mismatch

Hi,
The data in valve1 is labeled 0-15, and in valve2 is labeled 0-3. The README.md file, however states that the experiment labels should be reversed, and that names should be 1-based.

Thanks

Sensor details

Thanks for this very interesting project. Could you provide a few more details about the sensors and their location within your test bed? I am particularly interested in the location of the two accelerometers.

Many thanks.

Notebooks implementations

Hello,

I have some concerns regarding the notebooks provided.
Why is the training made with files having anomalies instead of the free anomaly csv file and then testing on the other files?
Also the results are taking in count the whole file including the training samples but I know that results need to be done on only data that is unknown to the model. Am I missing something?
Thank you.

Changepoint issue

Hi, I have some questions about the column: 'changepoint'.
According to my last post(thanks Kaster! you're really helpful!) : https://www.kaggle.com/datasets/yuriykatser/skoltech-anomaly-benchmark-skab/discussion/325533, the changepoint should indicate the status changing between 'normal' and 'anomaly': i.e. 'normal' to 'anomaly' or 'anomaly' to 'normal'.

However, when I checked the data, I found something unexpected.

I merged all .csv file under data/valve1/ and sort them by the column: datetime. There are 63 records with changepoint = 1, but only 32 indicate a status switching.

About datasets and EDA

Good Morning :)

I wanna know when are you gonna upload the English version of EDA on kaggle?
Also, where can I get upcoming update to v1.0? I want to get more anomaly and anomaly-free dataset :)
Can u plz share more valve datasets?

Thanks for your great work!
Have a great day.

Conv-AE Results not reproducable

With the notebook provided for the Conv-AE Anomaly-Detector I am not able to reproduce the metrics reported by this repo.

I did not change any code in the notebook.

The models result in the following metrics:

grafik

The idea of using SKAB as a forecasting benchmark.

In real technical systems, there are the following tasks of the diagnostic system: early detection of faults (anomalies), finding the root causes, and predicting the development of faults. The last task gives the greatest economic effects since it allows you to determine the time to failure, which makes it possible to optimize MRO. SCAB's capabilities allow it to be used as a predictive benchmark. Now work is underway to bring the idea to life.

irregular sampling rate and gaps

One of the files contains only 104 points before the first anomaly is introduced, which means that it is not possible to train on the first 400 points as suggested.

All the files are irregularly sampled with some intervals 2 seconds instead of 1 second. Three files contain gaps of greater than 50 seconds. Two more have multiple gaps of 3, 4, and 5 seconds throughout.

As such, the data is not really usable for a ML study.

Anomalous indices loop

Hello, I have a question about this snippet of code (the loop helps to find the anomalies):

# data i is an anomaly if samples [(i - timesteps + 1) to (i)] are anomalies

anomalous_data = cnn_residuals > (3/2 * UCL)
anomalous_data_indices = []
    for data_idx in range(N_STEPS - 1, len(X) - N_STEPS + 1):
        if np.all(anomalous_data[data_idx - N_STEPS + 1 : data_idx]):
            anomalous_data_indices.append(data_idx)

comment says that given a point i, if all samples between i-timesteps+1 and i are anomalies then the point i is anomaly.
This loop starts from N_STEPS - 1 which is (lets say N_STEPS=10 ) from 9 and ends to len(X) - N_STEPS + 1 which is (lets say len(X)=100 ) the number 91.

Then the if statement going to check (for each iteration) the samples from data_idx - N_STEPS + 1 to data_idx
So
iteration 1: all samples from 9-10+1 which is 0 to 9 (not inclusive)
iteration 2: all samples from 10-10+1 which is 1 to 10 (not inclusive)
....[going at the end].....
iteration 82: all samples from 10-10+1 which is 81 to 90 (not inclusive)

In other words:
We check i=9 with samples 0 to 8 (inclusive)
We check i=10 with samples 1 to 9 (inclusive)
....[going at the end].....
We check i=90 with samples 81 to 89 (inclusive)

Question
I think we are missing i=92, 93, 94,....,99. Am I wrong? And why?
If am not wrong shouldn't the for data_idx in range(N_STEPS - 1, len(X) - N_STEPS + 1) be replaced with for data_idx in range(N_STEPS - 1, len(X)): in order to iterate until the end?

Thank you in advance

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.