Comments (3)
My suggestion here would be to make sure that deme names in metadata.tsv
match deme names in lat_longs.tsv
and colors.tsv
. I would suggest snake_casing everything in metadata.tsv
. This was my standard practice for all other builds.
I want an augur clean
module that would take some of the functionality of sacra that would make it easier to get a clean and snake_cased metadata file. Augur clean vs sacra is definitely worth a larger conversation.
In the interim, you could make a simple script to do this if FASTA coming in is capitalized (and has the "non-standard" 2-letter state codes).
from augur.
How about converting both the metadata.tsv defined demes and the lat-long demes to lowercase for comparison only, but the exported demes match those in the metadata.tsv?
from augur.
Sorry, but I'm worried about that approach. The canonical lat-long file will have names like north_dakota
. If metadata comes in as NorthDakota
, then simple lowercasing won't work. This particular example would cause some things to match and some to not. It will be opaque on why these are failing. I favor a simple direct comparison. This should make it obvious how to construct lat_longs.tsv
.
If someone really doesn't want to use snake_case deme names then they can make a lat-long file with names as NorthDakota
etc...
I see now. I was confused.
from augur.
Related Issues (20)
- frequencies: error with `--region` flag HOT 3
- Improve validation output to identify problematic nodes / properties
- `parse` silently removes spaces from record ids in the sequence output but not in the metadata output HOT 1
- `measurements export` does not consistently allow the strain column to be used as a grouping column
- Export schema wrongly fails on gene names starting with 'nuc' due to lookahead
- align: error message when reading a reference sequence does not completely explain the root issue
- Add schema for node-data JSONs HOT 1
- Allow custom date column name to be specified in `refine` - similar to `metadata-id-column` HOT 1
- Add docs regarding bootstraps
- Clarification on augur tree --exclude-sites masking HOT 4
- Make command line option headings linkable
- Augur export error HOT 2
- `augur align --method nextclade` should wrap `nextclade run` HOT 1
- pip/conda dependency version constraints not guaranteed in all environments
- export: Add option to extend the default lat/longs HOT 3
- Number of Nt changes is different from number of mutations (divergence)
- Use PyPI's pyright? HOT 2
- Augur 24.4.0 release
- Review Pyright rule exceptions HOT 2
- Support pandas version 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from augur.