
ALOE: Active Learning of Non-semantic Speech Tasks with Pretrained Models

by Harlin Lee, Aaqib Saeed, Andrea L. Bertozzi @ 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Abstract

Pretraining neural networks with massive unlabeled datasets has become popular as it equips the deep models with a better prior to solve downstream tasks. However, this approach generally assumes that for downstream tasks, we have access to annotated data of sufficient size. In this work, we propose ALOE, a novel system for improving the data- and label-efficiency of non-semantic speech tasks with active learning. ALOE uses pre-trained models in conjunction with active learning to label data incrementally and learns classifiers for downstream tasks, thereby mitigating the need to acquire labeled data beforehand. We demonstrate the effectiveness of ALOE on a wide range of tasks, uncertainty-based acquisition functions, and model architectures. Training a linear classifier on top of a frozen encoder with ALOE is shown to achieve performance similar to several baselines that utilize the entire labeled data.

Running the experiments

We provide an example of running an experiment with the SpeechCommands dataset.

1. Installation

Install packages as follows:

pip3 install -r requirements.txt

2. Extract features with a pretrained model:

python3 fe.py

This will extract embeddings from the train, validation, and test splits of SpeechCommands and save them in the 'features' directory.

3. Run active learning with a linear model:

python3 al.py --by_class --num_per_class=5 --iters=100 --seed=1

This will load the features along with the corresponding labels and run active learning for 100 iterations, starting from 5 labeled examples per class selected in a class-aware manner.
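To make step 3 more concrete, here is a minimal sketch of class-aware seeding followed by uncertainty-based acquisition on precomputed embeddings. It is not the code in al.py. Every function name below is hypothetical, the softmax regression is a simple stand-in for the linear model, and predictive entropy is just one common uncertainty-based acquisition function. It assumes integer labels 0..C-1 and at least `num_per_class` examples in every class.

```python
import numpy as np


def initial_indices_by_class(labels, num_per_class, rng):
    """Class-aware seeding: draw `num_per_class` random examples from each class."""
    chosen = []
    for c in np.unique(labels):
        members = np.flatnonzero(labels == c)
        chosen.extend(rng.choice(members, size=num_per_class, replace=False))
    return np.array(chosen)


def predict_proba(X, W, b):
    """Softmax class probabilities of a linear model."""
    logits = X @ W + b
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(logits)
    return e / e.sum(axis=1, keepdims=True)


def fit_linear(X, y, num_classes, steps=200, lr=0.1):
    """Softmax regression via plain gradient descent (stand-in linear classifier)."""
    W = np.zeros((X.shape[1], num_classes))
    b = np.zeros(num_classes)
    Y = np.eye(num_classes)[y]  # one-hot targets
    for _ in range(steps):
        G = (predict_proba(X, W, b) - Y) / len(X)  # gradient w.r.t. logits
        W -= lr * (X.T @ G)
        b -= lr * G.sum(axis=0)
    return W, b


def entropy(P):
    """Predictive entropy per row; higher means more uncertain."""
    return -(P * np.log(P + 1e-12)).sum(axis=1)


def active_learning(X, y, num_per_class=5, iters=100, seed=1):
    """Label one pool example per iteration, chosen by maximum entropy."""
    rng = np.random.default_rng(seed)
    num_classes = int(y.max()) + 1
    is_labeled = np.zeros(len(X), dtype=bool)
    is_labeled[initial_indices_by_class(y, num_per_class, rng)] = True
    for _ in range(iters):
        pool = np.flatnonzero(~is_labeled)
        if pool.size == 0:
            break  # nothing left to label
        W, b = fit_linear(X[is_labeled], y[is_labeled], num_classes)
        scores = entropy(predict_proba(X[pool], W, b))
        is_labeled[pool[np.argmax(scores)]] = True  # "query the oracle"
    W, b = fit_linear(X[is_labeled], y[is_labeled], num_classes)
    return W, b, np.flatnonzero(is_labeled)
```

In this sketch, the arguments `num_per_class`, `iters`, and `seed` mirror the command-line flags shown above only by name. How al.py interprets them internally is not specified here.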

Citation

@inproceedings{lee2023active,
  title={Active Learning of Non-semantic Speech Tasks with Pretrained Models},
  author={Lee, Harlin and Saeed, Aaqib and Bertozzi, Andrea L},
  booktitle={ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2023},
  organization={IEEE}
}
