PopTEvo

Study of TE evolution in a population of genomes

Contents

TE_annotation

panEDTA annotation of NAM genomes

  • bin: scripts to summarize panEDTA annotations
  • data:
    • B73v5.NAM-illumina_filtered-pass-only-two-round-gatk-snps.homo.chr.gt.pres.25k.h: 25k NAM SNPs using B73v5 as the reference
    • NAM.EDTA1.9.0.MTEC02052020.TE.v1.anno.intact.LTR.sort.gz: Intact LTR superfamily classification based on TEsorter
    • NAM.EDTA2.0.0.MTEC02052020.TElib.fa: panEDTA library generated for NAM genomes
    • NAM.intact.LTR.genedist.gz: physical distance of intact LTRs to the nearest genes.
    • pan_TE_bootstrap1000.summary26.txt: occurrence of panTE families across 1000 bootstrap resamplings of the NAM genomes
    • soloLTR.txt: All solo LTRs found in NAM
    • TE_Fam_stats.txt: Statistics and descriptions of pangenome TE families
    • individual_genomes: panEDTA annotation results generated for each NAM genome
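
As an illustration of the kind of value stored in NAM.intact.LTR.genedist.gz, the physical distance from an element to its nearest gene can be computed from interval coordinates. This is a minimal sketch, not the repository's script: the function names, the tuple layout, and the example coordinates are hypothetical, and it assumes all intervals lie on the same chromosome.

```python
def interval_distance(a_start, a_end, b_start, b_end):
    """Gap in bp between two intervals; 0 if they overlap or share an endpoint."""
    return max(0, b_start - a_end, a_start - b_end)


def nearest_gene_distance(te, genes):
    """Distance from one TE (start, end) to the closest gene in a list of (start, end).

    Assumption: te and all genes are on the same chromosome.
    """
    return min(interval_distance(te[0], te[1], g[0], g[1]) for g in genes)


# Hypothetical coordinates for illustration only
genes = [(10, 50), (260, 300)]
dist_apart = nearest_gene_distance((100, 200), genes)
dist_overlap = nearest_gene_distance((40, 120), genes)
```

A linear scan over all genes is used here for clarity; for whole-genome data a sorted index or an interval tool would likely be preferable.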

panTE

Population genomics of intact LTR elements

  • 1.pairwise: identification of syntenic LTRs between any two NAM genomes
    • bin: scripts to identify pairwise syntenic LTRs
    • data: syntenic LTR information for each genome pair
  • 2.combine: combine pairwise information to a pangenome table
    • bin: scripts to combine pairwise syntenic LTRs
    • data/final_27_genome_TE_resolved_B73_added_v2.txt.gz: syntenic LTR information in 26 NAM genomes
  • 3.rooting: identify ancestral states of LTR insertions using a teosinte genome
    • bin: scripts to root syntenic LTRs
    • data: syntenic LTR information in a teosinte genome
  • 4.site_frequency_spectrum: SFS studies for both SNP and syntenic LTRs
    • SNP_calling: scripts to call SNPs
    • data: VCF and SFS files for both SNPs and syntenic LTRs
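
To show how a site frequency spectrum relates to a rooted presence/absence table, here is a minimal sketch. It assumes each locus is already polarized (1 = derived state, such as an insertion absent from the outgroup; 0 = ancestral) and that there are no missing calls. The function name and the toy table are hypothetical and do not reflect the file formats in this directory.

```python
from collections import Counter


def unfolded_sfs(calls, n_genomes):
    """Unfolded SFS from per-locus 0/1 calls.

    calls: iterable of lists of length n_genomes (1 = derived, 0 = ancestral).
    Returns a list where entry i-1 is the number of loci whose derived state
    is seen in exactly i genomes, for i = 1 .. n_genomes - 1.
    Loci with 0 or n_genomes derived copies are not segregating and are excluded.
    """
    counts = Counter(sum(locus) for locus in calls)
    return [counts.get(i, 0) for i in range(1, n_genomes)]


# Toy table: 5 loci scored in 4 genomes (hypothetical data)
calls = [
    [1, 0, 0, 0],
    [1, 1, 0, 0],
    [0, 1, 0, 0],
    [1, 1, 1, 1],  # fixed, excluded
    [0, 0, 0, 0],  # absent everywhere, excluded
]
sfs = unfolded_sfs(calls, n_genomes=4)
```

The same function applies to SNPs and to syntenic LTRs once both are encoded as derived/ancestral calls, which is what makes the two spectra comparable.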

methylation

Methylation analysis of intact LTR elements

  • data: UMR information for each NAM genome using B73v5 as the reference

expression

Expression analysis of TE families

  • scripts: scripts to map short reads to each genome
  • TEexpression: scripts to generate per-family read counts
  • data: raw count data for each TE family in each RNA library
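
Per-family read counts can be thought of as per-copy counts summed over all copies of a family. The sketch below illustrates that aggregation only; it is not the TEexpression code, and the function name, the dictionary inputs, and the identifiers are hypothetical.

```python
from collections import defaultdict


def family_counts(copy_counts, copy_to_family):
    """Sum read counts of individual TE copies into per-family totals.

    copy_counts: dict mapping TE copy ID -> number of reads assigned to it.
    copy_to_family: dict mapping TE copy ID -> family name.
    Copies without a family assignment are skipped.
    """
    totals = defaultdict(int)
    for copy_id, n_reads in copy_counts.items():
        family = copy_to_family.get(copy_id)
        if family is not None:
            totals[family] += n_reads
    return dict(totals)


# Hypothetical identifiers for illustration only
copy_counts = {"te1": 5, "te2": 3, "te3": 7, "unannotated": 2}
copy_to_family = {"te1": "famA", "te2": "famA", "te3": "famB"}
totals = family_counts(copy_counts, copy_to_family)
```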

NLR_analyses

LTR neighborhoods of NLR genes

  • nlrannotator: Annotating NLR genes in the NAM genomes
  • ltr: LTR annotation for each NAM line used for the analysis
  • clustering: Clustering of NLR genes and background genes based on physical distance
  • nlr_neighborhood_age: Analysis of young and old LTRs in the neighborhoods of foreground and background gene sets
  • nlr_neighborhood_amplified: Analysis of tropical-amplifying and not tropical-amplifying LTRs in the neighborhoods of foreground and background gene sets in tropical and temperate NAM lines
  • plots: Pre-computed results tables and plotting script for visualising the main results
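
One simple way to cluster genes by physical distance is to merge neighbors on the same chromosome whenever the gap between them is at most a chosen threshold. This is a minimal sketch of that idea, not necessarily the procedure used in the clustering directory; the function name, the tuple layout, and the `max_gap` value are assumptions.

```python
def cluster_by_distance(genes, max_gap):
    """Group genes into clusters of neighbors separated by at most max_gap bp.

    genes: list of (chrom, start, end, gene_id) tuples.
    Returns a list of clusters, each a list of gene IDs in positional order.
    """
    clusters = []
    prev_chrom, prev_end = None, None
    for chrom, start, end, gene_id in sorted(genes):
        if chrom == prev_chrom and start - prev_end <= max_gap:
            # Close enough to the running cluster: extend it
            clusters[-1].append(gene_id)
            prev_end = max(prev_end, end)
        else:
            # New chromosome or gap too large: start a new cluster
            clusters.append([gene_id])
            prev_chrom, prev_end = chrom, end
    return clusters


# Hypothetical genes and threshold for illustration only
genes = [
    ("chr1", 100, 200, "a"),
    ("chr1", 250, 300, "b"),
    ("chr1", 1000, 1100, "c"),
    ("chr2", 10, 20, "d"),
]
clusters = cluster_by_distance(genes, max_gap=100)
```

Running the same function on a foreground set (NLR genes) and a background set with an identical threshold keeps the two clusterings comparable.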

TE_fam_clean2.R

R commands to compile all data (except expression) and generate most figures

TE_fam_exp_clean.R

R commands for expression-related analyses
