Comments (2)
Hi Yaobin! How many GPUs are you using for each run? I suspect that this is because the model behavior might be slightly different with different #GPUs. (My experiments were based on one single 2080 GPU).
from pure.
Hi Zexuan,
I use one single GTX3090 GPU. I found the problem is caused by allennlp.nn.utils.batched_index_select
which calls torch.index_select
. I implemented the function and got deterministic results. However, the function is really time-consuming.
def batched_index_select(target: torch.Tensor, indices: torch.LongTensor) -> torch.Tensor:
batch_size, sequence_length, hidden_size = target.size()
_, num_spans = indices.size()
indexed = []
for b in range(batch_size):
target_b = target[b]
indices_b = indices[b]
indexed_b = []
for i in range(num_spans):
indexed_b.append(target_b[indices_b[i]])
indexed_b = torch.stack(indexed_b, dim=0)
indexed.append(indexed_b)
indexed = torch.stack(indexed, dim=0)
return indexed
Best,
Yaobin
from pure.
Related Issues (20)
- Multiple issues HOT 2
- different F1 with the same seed HOT 2
- tensorflow版本 HOT 1
- About the relation in datasets HOT 1
- [Paper] What are "gold" entity and relationship types? HOT 2
- Provide full environment
- Input Data Format HOT 5
- How to load models into Python HOT 2
- some code problems reguarding run_relation_approx(get_features_from_file) HOT 2
- where is the code of Efficient Batch Computations
- Approximation Model Training & Inference HOT 1
- entity is S or O ?
- Further question of f1 and e2e_f1
- 版本库问题 HOT 1
- 版本库问题
- ACE dataset
- Training a model on a dataset that is not ace04, ace05, or scierc HOT 1
- training model for WLP -- stuck in suboptimal solution
- Input data format question for custom dataset !
- cuda out of memory
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pure.