Comments (4)
what do you mean specifically and what do you suggest? can you give an example?
from anndata.
So, if I make a new anndata object using:
adata = sc.AnnData(X=adata_training.X, obs = adata_training.obs, uns = adata_training.uns)
The uns field is supposed to be one unstructured annotation for all observations. However, we can still use adata[0].uns , adata[1].uns ....etc. This would be fine if what we added to the uns field is a small dictionary, because replicating it to all observations will not significantly increase the size or the query time of the andata object.
However, once we have more meta data to add in the uns field, the size of the file as well as query/read/write time increase massively.
It could be, that I missed the point of using uns field but then the issue is still valid but related to the documentation of the Anndata object more than to the object itself.
from anndata.
Usually you wouldn't split up your AnnData's into many tiny objects. Also, usually, .uns
doesn't contain a lot of data, just some very light-weight annotations.
from anndata.
Hi @falexwolf .
Usually you wouldn't split up your AnnData's into many tiny objects.
Well, sometimes you just need to parse and process the observations one by one.
Also, usually, .uns doesn't contain a lot of data, just some very light-weight annotations.
That was not my use case.
Anyway thanks for the great package and for your clarification :)
from anndata.
Related Issues (20)
- Does anndata.raw can be automatically modified after define it?
- Make the call to `_check_uniqueness()` optional in `_init_as_actual()` (and `__init__()`) HOT 1
- `concat_on_disk` outer join
- `concat_on_disk` merge strategies are untested/not implemented
- Potentially flaky benchmark HOT 2
- Reading Anndata from only parts of h5ad file: Hack solution HOT 8
- 2D indexing (no “:”) in `sparse_dataset` is not lazy HOT 7
- Set of categories changes when slicing AnnData, but not when slicing DataFrame HOT 1
- Gpu failing because of `cp.full` with `np.nan`
- Add path parameter to write_zarr method HOT 8
- GPU writing HOT 1
- GPU concatenation HOT 1
- TypeError when writing string columns to h5ad HOT 1
- ad.read_zarr doesn't return `.uns` nor `.obsm` HOT 3
- read_csv irmproperly parses first column if wrapped in quotes and has comma in string HOT 2
- i cannot read the h5ad object in a conda environment HOT 1
- Policy for format updates HOT 1
- `KeyError` on Flask app using `pyinstaller` with `setuptools_scm` HOT 1
- `dask` `2024.8.0` breaks with `scipy.sparse.{csr,csc}` HOT 1
- backed="r" is leaking memory HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from anndata.