sibyl's People

Contributors

cadereynoldson, wolverine3301

sibyl's Issues

Some rankings work only on categorical columns

  • Information gain, gain ratio, and Gini currently work only on categorical columns, although each can also be calculated for numeric columns. Stack trace from the failing run:
    Exception in thread "main" java.lang.IllegalArgumentException
        at java.base/java.util.PriorityQueue.<init>(PriorityQueue.java:171)
        at info_gain.InformationGain.gain(InformationGain.java:37)
        at scout.CategoryRanker.makeGain(CategoryRanker.java:71)
        at scout.CategoryRanker.<init>(CategoryRanker.java:42)
        at scout.CategoryRan_test.main(CategoryRan_test.java:11)
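One common way to get information gain for a numeric column is to sort the values, try a binary split at each boundary between distinct values, and keep the best gain. The sketch below assumes that approach; the class and method names are hypothetical and not part of sibyl. Separately, the trace points at a PriorityQueue constructor, which throws IllegalArgumentException when the initial capacity is below 1, so the queue may have been sized from an empty list of categorical columns (an assumption, not confirmed by the issue).

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of sibyl: information gain of a numeric
// column using the best binary split threshold.
final class NumericInfoGain {

    // Shannon entropy (base 2) of a label-count map covering `total` rows.
    static double entropy(Map<String, Integer> counts, int total) {
        double h = 0.0;
        for (int c : counts.values()) {
            if (c == 0) continue; // empty classes contribute nothing
            double p = (double) c / total;
            h -= p * (Math.log(p) / Math.log(2));
        }
        return h;
    }

    // Returns the highest gain over all splits between consecutive distinct
    // values, or 0.0 when the column holds a single distinct value.
    static double bestGain(double[] values, String[] labels) {
        int n = values.length;
        Integer[] order = new Integer[n];
        for (int i = 0; i < n; i++) order[i] = i;
        Arrays.sort(order, (a, b) -> Double.compare(values[a], values[b]));

        // Start with every row on the right side of the split.
        Map<String, Integer> right = new HashMap<>();
        for (String l : labels) right.merge(l, 1, Integer::sum);
        double base = entropy(right, n);

        Map<String, Integer> left = new HashMap<>();
        double best = 0.0;
        for (int i = 0; i < n - 1; i++) {
            String l = labels[order[i]];
            left.merge(l, 1, Integer::sum);
            right.merge(l, -1, Integer::sum);
            // A threshold only exists between two different values.
            if (values[order[i]] == values[order[i + 1]]) continue;
            int nl = i + 1;
            int nr = n - nl;
            double conditional = ((double) nl / n) * entropy(left, nl)
                               + ((double) nr / n) * entropy(right, nr);
            best = Math.max(best, base - conditional);
        }
        return best;
    }
}
```

Gain ratio and Gini can reuse the same sweep by swapping the impurity function; that part is left out here.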

Chi2 functionality improvements

Add printing and access for contingency tables, critical values, and degrees of freedom:

  • access and print for the observed table

  • access and print for the expected table

  • access and print for chi2 critical values

  • access and print for degrees of freedom

  • overall print
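The list above could be covered by a small value class that keeps the observed table, derives the expected table, and exposes the statistic and degrees of freedom. This is only a sketch under assumptions: the class name and fields are hypothetical and do not reflect sibyl's Chi2 code. Critical values are left out because they need an inverse chi-square CDF, which the Java standard library does not provide.

```java
// Hypothetical sketch, not sibyl's Chi2 implementation.
final class ChiSquareTables {
    final double[][] observed;
    final double[][] expected;
    final int degreesOfFreedom;
    final double statistic;

    ChiSquareTables(double[][] observed) {
        int rows = observed.length;
        int cols = observed[0].length;
        double[] rowSum = new double[rows];
        double[] colSum = new double[cols];
        double total = 0.0;
        for (int r = 0; r < rows; r++) {
            for (int c = 0; c < cols; c++) {
                rowSum[r] += observed[r][c];
                colSum[c] += observed[r][c];
                total += observed[r][c];
            }
        }
        double[][] exp = new double[rows][cols];
        double chi2 = 0.0;
        for (int r = 0; r < rows; r++) {
            for (int c = 0; c < cols; c++) {
                // Expected count under independence of rows and columns.
                exp[r][c] = rowSum[r] * colSum[c] / total;
                if (exp[r][c] == 0.0) continue; // avoid dividing by zero
                double diff = observed[r][c] - exp[r][c];
                chi2 += diff * diff / exp[r][c];
            }
        }
        this.observed = observed;
        this.expected = exp;
        this.statistic = chi2;
        this.degreesOfFreedom = (rows - 1) * (cols - 1);
    }

    // Formats one table with a title, one row per line.
    static String formatTable(String title, double[][] table) {
        StringBuilder sb = new StringBuilder(title).append('\n');
        for (double[] row : table) {
            for (double v : row) sb.append(String.format("%10.2f", v));
            sb.append('\n');
        }
        return sb.toString();
    }

    // The "overall print": both tables, the statistic, and the degrees of freedom.
    String summary() {
        return formatTable("observed", observed)
             + formatTable("expected", expected)
             + "chi2 = " + statistic + ", df = " + degreesOfFreedom + "\n";
    }
}
```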

Separate column lists by type

Keep a soft-copy (reference-only) list of columns for each column type, along with a count of how many columns each list holds.
Update the lists when the dataframe is created, when a column's type changes, and when a column is added or removed.
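A minimal sketch of such an index, assuming columns can be identified by name and that there are two column types. The enum, class, and method names are hypothetical. Counts are read from the list sizes instead of being stored separately, so a count cannot drift from its list.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.EnumMap;
import java.util.List;
import java.util.Map;

// Hypothetical per-type column index, not sibyl's DataFrame code.
final class ColumnTypeIndex {
    enum ColumnType { NUMERIC, CATEGORICAL } // assumed set of types

    private final Map<ColumnType, List<String>> byType = new EnumMap<>(ColumnType.class);

    ColumnTypeIndex() {
        for (ColumnType t : ColumnType.values()) byType.put(t, new ArrayList<>());
    }

    // Adds a column under a type. Removing it first makes the call safe
    // to repeat and doubles as "change type".
    void add(String name, ColumnType type) {
        remove(name);
        byType.get(type).add(name);
    }

    // Removes a column from whichever list holds it.
    void remove(String name) {
        for (List<String> list : byType.values()) list.remove(name);
    }

    // Read-only view, so callers cannot edit the index behind its back.
    List<String> columns(ColumnType type) {
        return Collections.unmodifiableList(byType.get(type));
    }

    int count(ColumnType type) {
        return byType.get(type).size();
    }
}
```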

Rounding error after standardization

There is a rounding error after a dataframe is standardized.
After standardization, the mean should equal 0 and the standard deviation should equal 1.
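Floating-point arithmetic rarely produces exactly 0 and 1 here, so one option is to check the result against a small tolerance instead of using exact equality. The sketch below assumes population standard deviation (dividing by n); the issue does not say which definition sibyl uses. All names are hypothetical.

```java
// Hypothetical sketch of standardization with a tolerance-based check.
final class StandardizeCheck {

    static double mean(double[] x) {
        double sum = 0.0;
        for (double v : x) sum += v;
        return sum / x.length;
    }

    // Population standard deviation (divides by n); an assumption.
    static double std(double[] x) {
        double m = mean(x);
        double ss = 0.0;
        for (double v : x) ss += (v - m) * (v - m);
        return Math.sqrt(ss / x.length);
    }

    // Returns (x - mean) / std for every element.
    static double[] standardize(double[] x) {
        double m = mean(x);
        double s = std(x);
        if (s == 0.0) throw new IllegalArgumentException("constant column cannot be standardized");
        double[] z = new double[x.length];
        for (int i = 0; i < x.length; i++) z[i] = (x[i] - m) / s;
        return z;
    }

    // Compares doubles within an absolute tolerance.
    static boolean approxEquals(double a, double b, double eps) {
        return Math.abs(a - b) <= eps;
    }
}
```

A check such as approxEquals(mean(z), 0.0, 1e-9) then passes even when the computed mean is a tiny nonzero value.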

Frequency binning

Frequency binning works and puts roughly the same number of items in each bin, even when the item count is not evenly divisible by the bin count. However, the bins are slightly unbalanced in cases where an exactly even split should be possible.
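One way to guarantee balanced bins is to fix the bin sizes up front: every bin gets n / k items and the first n % k bins get one extra. Items are then assigned by sorted rank. This is a sketch of that idea with hypothetical names, not sibyl's binning code. Note that assigning by rank can place equal values in different bins.

```java
import java.util.Arrays;

// Hypothetical equal-frequency binning sketch.
final class EqualFrequencyBins {

    // Sizes of k bins for n items; sizes differ by at most one and are
    // all equal when k divides n.
    static int[] binSizes(int n, int k) {
        if (k < 1) throw new IllegalArgumentException("k must be at least 1");
        int[] sizes = new int[k];
        Arrays.fill(sizes, n / k);
        for (int i = 0; i < n % k; i++) sizes[i]++;
        return sizes;
    }

    // Returns the bin index of each value, assigned by sorted rank.
    static int[] assign(double[] values, int k) {
        int n = values.length;
        Integer[] order = new Integer[n];
        for (int i = 0; i < n; i++) order[i] = i;
        Arrays.sort(order, (a, b) -> Double.compare(values[a], values[b]));

        int[] sizes = binSizes(n, k);
        int[] bin = new int[n];
        int pos = 0;
        for (int b = 0; b < k; b++) {
            for (int j = 0; j < sizes[b]; j++) {
                bin[order[pos++]] = b;
            }
        }
        return bin;
    }
}
```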

Errors with the new column type ArrayLists

When several dataframe manipulation methods are used together, such as Util.splitontarget, acquire, and replace columns, the counts and the ArrayLists of columns fall out of sync. The problem seems to be around setStatistics(), which adds columns to their ArrayList and updates their counts. The errors seen are either columns not being added or duplicate columns being added.
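To narrow down which operation breaks the lists, a consistency check could be run after each manipulation. The sketch below assumes columns are identified by name and compares the full column list against the per-type lists, reporting columns that are missing, duplicated, or stale. The class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical debugging aid, not part of sibyl.
final class ColumnIndexCheck {

    // Returns one message per problem; an empty list means the per-type
    // lists match `allColumns` exactly.
    static List<String> findProblems(List<String> allColumns,
                                     Collection<List<String>> typedLists) {
        // Count how often each name appears across all per-type lists.
        Map<String, Integer> seen = new LinkedHashMap<>();
        for (List<String> list : typedLists) {
            for (String name : list) seen.merge(name, 1, Integer::sum);
        }

        List<String> problems = new ArrayList<>();
        for (String name : allColumns) {
            int k = seen.getOrDefault(name, 0);
            if (k == 0) problems.add("missing: " + name);
            else if (k > 1) problems.add("duplicated: " + name);
        }
        // Names in a per-type list that no longer exist in the dataframe.
        for (String name : seen.keySet()) {
            if (!allColumns.contains(name)) problems.add("stale: " + name);
        }
        return problems;
    }
}
```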

Numeric indexes not initialized

After breaking up a dataframe with acquire, the numeric indexing is not set. Either it is never initialized, or it is reset when acquire is used.
