Git Product home page Git Product logo

bird-audio-detection's People

Contributors

jacobsn avatar weiliansong avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

bird-audio-detection's Issues

live detection script

We should have a script that pulls live audio from the computer and displays whether or not a bird is currently singing.

change run name format

I think a better format would be something like:

[network name][string w/o an underscore (network configuration)][string w/o an underscore (training configuration)]

The two "strings w/o an underscore" would be parsed inside of the respective functions. This would allow us to put more network-specific options in the configuration format... for example, not every network needs two capacity parameters, but we are stuck with that in the current format.

This is not a change I think we should make before the end of the contest, but it is probably worthwhile if we choose to fork this repo to work on another challenge.

Improve generalization to new datasets

Here are a few suggestions:

http://machine-listening.eecs.qmul.ac.uk/2016/11/bird-audio-detection-tips-on-building-robust-detectors/

Probably the first step would be to setup the training/testing code to train on one dataset and test on another so we can really test whether our attempts at improving generalization are working.

Here are some other datasets we could use:

handle deprecation of variable initialization methods

I get the following warnings

WARNING:tensorflow:From evaluate.py:55 in .: initialize_all_variables (from tensorflow.python.ops.variables) is deprecated and will be removed after 2017-03-02.
Instructions for updating:
Use tf.global_variables_initializer instead.
WARNING:tensorflow:From evaluate.py:56 in .: initialize_local_variables (from tensorflow.python.ops.variables) is deprecated and will be removed after 2017-03-02.
Instructions for updating:
Use tf.local_variables_initializer instead.

I tried to fix this at some point but couldn't figure out the correct replacement.

add basic tensorboard summaries

A few useful things:

  • loss
  • learning rate
  • mini-batch accuracy (training data)
  • (maybe) mini-batch accuracy (test data)

This will allow us to more easily monitor training progress and compare different methods. Output to logs/{run_name}/ so we can see them all together.

create visualization scripts

Things we will want to be able to do:

  • find input sequences that maximize the activation for internal nodes of the network
  • prettyprint the network architecture (filter sizes, strides) and activations (blob sizes)

incorporate dropout?

It seems like we might be overfitting (since continuing to lower the model capacity helps with generalization). Would it help to incorporate dropout layers? I suggest we pick a few of the best model settings and add dropout to see how it works.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.