Comments (2)
There's this abandoned PR: #124722
The current approach in the PR is too intrusive. It'd be better to just check whether a global dtype is set and, if so, use that. In the general case, you don't want to be passing around the `input_dtype`, but simply take the logic in

    inputs_devices = list(
        {i.device for i in pytree.tree_flatten(aot_example_inputs)[0]}
    ) + [None]
    assert len(inputs_devices) > 0
    lazy_init(input_device=next(iter(inputs_devices)))

and wrap the `lazy_init` call in a `with torch.dtype...` construct.
Would you like to send a fix?
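For illustration, here is a rough sketch of what wrapping the call could look like. The comment above does not say which construct is meant, so this assumes a small hypothetical context manager, `default_dtype`, built on `torch.get_default_dtype` and `torch.set_default_dtype`. `lazy_init_stub` is a stand-in for the real `lazy_init`, and the example inputs are a flat list rather than a pytree.

```python
import contextlib

import torch


@contextlib.contextmanager
def default_dtype(dtype):
    """Temporarily set the global default floating-point dtype (hypothetical helper)."""
    previous = torch.get_default_dtype()
    torch.set_default_dtype(dtype)
    try:
        yield
    finally:
        # Restore the caller's setting even if the body raises.
        torch.set_default_dtype(previous)


def lazy_init_stub(input_device=None):
    """Stand-in for the real lazy_init: reports the dtype a freshly created tensor gets."""
    return torch.zeros(1, device=input_device).dtype


# Assumption: a flat list of example inputs instead of an arbitrary pytree.
aot_example_inputs = [torch.ones(2), torch.ones(3)]
inputs_devices = list({i.device for i in aot_example_inputs}) + [None]

before = torch.get_default_dtype()
with default_dtype(torch.float64):
    # Only this call sees the overridden default dtype.
    seen = lazy_init_stub(input_device=next(iter(inputs_devices)))
after = torch.get_default_dtype()
```

The point of the shape is that nothing dtype-related has to be threaded through function arguments: the override is scoped to the `lazy_init` call and the previous global setting comes back afterwards. Which dtype to apply inside the block is left open here.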
Also, dupe of #124722