
Comments (2)

a3616001 commented on May 30, 2024

Hi Yaobin! How many GPUs are you using for each run? I suspect the difference arises because the model can behave slightly differently depending on the number of GPUs. (My experiments were all run on a single 2080 GPU.)


isLouisHsu commented on May 30, 2024

Hi Zexuan,

I use a single RTX 3090 GPU. I found that the problem is caused by allennlp.nn.util.batched_index_select, which calls torch.index_select. I reimplemented the function with explicit loops (below) and got deterministic results. However, this version is very slow.

import torch

def batched_index_select(target: torch.Tensor, indices: torch.LongTensor) -> torch.Tensor:
    # target: (batch_size, sequence_length, hidden_size)
    # indices: (batch_size, num_spans)
    batch_size, sequence_length, hidden_size = target.size()
    _, num_spans = indices.size()
    indexed = []
    for b in range(batch_size):
        target_b = target[b]
        indices_b = indices[b]
        indexed_b = []
        # Select rows one at a time with plain indexing instead of torch.index_select.
        for i in range(num_spans):
            indexed_b.append(target_b[indices_b[i]])
        indexed_b = torch.stack(indexed_b, dim=0)
        indexed.append(indexed_b)
    # Result: (batch_size, num_spans, hidden_size)
    indexed = torch.stack(indexed, dim=0)
    return indexed

Best,
Yaobin
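
A possible faster variant of the loop-based function above, offered only as a sketch: build a one-hot matrix from the indices and select rows with a batched matrix multiply, so that the backward pass is also a matrix multiply rather than a scatter-style accumulation. The name batched_index_select_onehot is hypothetical (it is not part of allennlp), and whether this is bit-for-bit deterministic on a given GPU and PyTorch version is an assumption that would need to be checked. It also assumes the 3-D target and 2-D indices shapes used above, and it allocates an extra (batch_size, num_spans, sequence_length) tensor.

```python
import torch
import torch.nn.functional as F


def batched_index_select_onehot(target: torch.Tensor, indices: torch.LongTensor) -> torch.Tensor:
    """Hypothetical helper: select rows of `target` via a one-hot matmul.

    target:  (batch_size, sequence_length, hidden_size)
    indices: (batch_size, num_spans), values in [0, sequence_length)
    returns: (batch_size, num_spans, hidden_size)
    """
    sequence_length = target.size(1)
    # (batch_size, num_spans, sequence_length); exactly one 1 per row.
    one_hot = F.one_hot(indices, num_classes=sequence_length).to(target.dtype)
    # Each output row is the single target row picked out by the one-hot vector.
    return torch.bmm(one_hot, target)


if __name__ == "__main__":
    target = torch.randn(2, 5, 3)
    indices = torch.tensor([[0, 4, 2, 2], [1, 1, 3, 0]])
    print(batched_index_select_onehot(target, indices).shape)
```

Separately, calling torch.use_deterministic_algorithms(True) at the start of a run may help confirm which operations are non-deterministic, since PyTorch then either switches to a deterministic implementation or raises an error for operations that lack one.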
