Code and data to accompany
./data/ Contains the following raw data:
- GFF3 files downloaded from Ensembl FTP (links here http://www.ensembl.org/info/data/ftp/index.html/) NB: files for mouse and human are too large so are not included here
- Extra UTR regions for S.cerevisiae downloaded from https://github.com/ewallace/yeastutrgff
- Whole exome sequencing capture regions for the UK Biobank capture downloaded from https://biobank.ndph.ox.ac.uk/ukb/refer.cgi?id=3803
./code/ Contains code for two analyses:
- Assessing the proportion of exonic bases with different annotations
- Assessing the overlap of human exonic bases with whole exome sequencing capture regions