
Comments (2)

tobiasblasberg commented on May 17, 2024

Thanks!
I had already tried reducing the batch size, but that did not change much. The sequence length was a good intuition ;) Capping it at 30 tokens lets me train on much larger datasets.

Best,
Tobias

from aspect-based-sentiment-analysis.

rolczynski commented on May 17, 2024

Hey Tobias,
if you have problems with the OOM error, you need to inspect what your training batches look like ☺️

  1. reduce the batch size
  2. reduce the input sequence length (if the texts are too long, use a text splitter; take a look at how the Pipeline works)
  3. distribute the training across several GPUs
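To illustrate option 2, here is a minimal sketch of splitting one long text into shorter pieces. It is not the library's own text splitter: `split_tokens` is a hypothetical helper, and whitespace tokens stand in for whatever tokens the model really uses.

```python
def split_tokens(tokens, max_len):
    """Split a token list into consecutive chunks of at most max_len tokens."""
    if max_len < 1:
        raise ValueError("max_len must be positive")
    return [tokens[i:i + max_len] for i in range(0, len(tokens), max_len)]


text = ("the food was great but the service was painfully slow "
        "and the bill arrived late")

# Assumption: whitespace tokens are only a stand-in for model tokens.
tokens = text.split()

chunks = split_tokens(tokens, max_len=6)
for chunk in chunks:
    print(len(chunk), " ".join(chunk))
```

A fixed-size split like this can cut a sentence in the middle, so splitting on sentence boundaries first is probably the better choice for sentiment tasks.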

I think you should check the training dataset and remove or trim outliers in terms of input sequence length. In other words, your training dataset probably contains texts that are too long. This is a common problem when processing sequences: every batch has to be padded to its longest example, so allocating the tensor can fail even if it is mostly zeros.
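A rough sketch of why a single outlier hurts so much: it counts the token slots allocated when every batch is padded to its longest example, before and after capping the length. The lengths below are made up for illustration, and `padded_cells` is a hypothetical helper, not part of any library.

```python
def padded_cells(lengths, batch_size):
    """Count token slots allocated when each batch is padded to its longest example."""
    total = 0
    for start in range(0, len(lengths), batch_size):
        batch = lengths[start:start + batch_size]
        total += max(batch) * len(batch)
    return total


# Made-up token counts per example; 480 plays the role of the outlier.
lengths = [12, 9, 15, 11, 480, 10, 14, 8]
batch_size = 4

before = padded_cells(lengths, batch_size)

# Trim every example to at most max_len tokens (30 is just an example cap).
max_len = 30
trimmed = [min(n, max_len) for n in lengths]
after = padded_cells(trimmed, batch_size)

print("real tokens:", sum(lengths))
print("padded slots before trimming:", before)
print("padded slots after trimming:", after)
```

With these numbers, the batch containing the long example accounts for almost all of the allocated slots until the cap is applied.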

This is my quick intuition. You need to provide more details ☺️

