Git Product home page Git Product logo

thesis's Introduction

thesis

Hi Emily. Hi Barbara.

Read this thesis in html or as a pdf (Imperial spiral).

Created using the nmfs-opensci/quarto-thesis template. Run make all from thesis.

thesis's People

Contributors

theorashid avatar rlaker avatar

Stargazers

carl colglazier avatar

thesis's Issues

corrections

Required corrections:

  • P25 (and P40): While not strictly a paper, a very classical depiction of inequalities in longevity across small areas actually comes from London (https://journals.sagepub.com/doi/abs/10.1068/a45341)
  • P28: please revise the textual distinction made between areal model using spatial neighbourhood structure and those which use nested hierarchy as they are both a specific case of hierarchical models
  • P31: Please discuss here or elsewhere that in some cases it is not appropriate to have statistical smoothing and how can someone evaluate if that is the case in a specific situation; we discussed using posterior predictive checks and abplots on death rates
  • P32: Here or elsewhere please also discuss potential disadvantages in life expectancy as an outcome measure especially as relevant to the assumptions that you need to make in terms of its calculation
  • P48 the idea of exploring population turnover (and therefore a measure of changing composition) is great. The only thing I’d mention is that turnover itself may be an exposure of ill health (some discussion here https://www.sciencedirect.com/science/article/abs/pii/S1353829218310700)
  • P54: An important topic not considered in detail is model adequacy. Please add a table summarizing outcomes of model adequacy and consistency checks performed.
  • P56: Please elaborate on your prior choices and robustness of your analyses.
  • P57: Please reconsider your statement that “INLA scales badly” in light of the discussion during the viva and our elaboration that the number of hyperparameters are probably less than 20.
  • P57: It would be useful to see a deeper discussion on the age-specific random walk priors, in light of the J-shape of all-cause mortality rates, and different age profiles of cause-specific mortality rates, as well as a discussion on independence across gender. For example, another way of modelling age (as compared to random walks) is through the use of linear splines (e.g., TOPALS): https://www.scielo.br/j/rbepop/a/9szg7XYCXck9dJrKmgBSdrf/?lang=en and the pros and cons would merit discussion.
  • Figure 5.1: what years are used for the country life expectancies? 2002 or 2019?
  • P62: missing from this paper is an evaluation of the uncertainty around life expectancy. This could be measured with a sort of coefficient of variation (SD of all posterior LEs divided over the median LE). The CDC in the US has used 25% as a threshold for poor certainty in some of their papers, though this threshold may be too high.
  • P73: On the discussion about factors driving lower life expectancy in the US (linked to deaths of despair), a recent paper blames the lack of social safety net systems (https://jamanetwork.com/journals/jamapsychiatry/article-abstract/2788767) which would fit well with the authors narrative on the erosion of the welfare system in the UK from 2012 onwards
  • P75: please clarify the discussion on spatial analyses of mortality rates at post code level with a greater appreciation of the pitfalls in such.
  • P76: Please specify the mean and variance of the BetaBinomial as expressed in your parameters so that the precise choice of parameterisation is clear.
  • P77: there appear to be a few life expectancies in the high 90s or even 100s (City of London), which seems unfeasible. There must be wide uncertainty around these, right? Alternatively, the author mentions a mismatch of population estimates due to changes in adjacent areas.
  • P81: please clarify how the uncertainty is visualised (for instance in Fig 6.5). Is this the distribution at district level of the average posterior mean at LSOA level?
  • P82: One thing that is not discussed is the potential for stronger spatial effects in a city like London using LSOAs as compared to the whole of England using MSOAs. The other chapter finds similar results, but segregation patterns over larger areas may differ from smaller areas.
  • P88: please add a description on how cause-of-death is assigned in the UK to clarify the nature of the underlying data; similarly has any uncertainty been considered in the estimated of the contributions of death from each cause and would this be important to consider possibly in future work?
  • P88: It is not clear what the “ICD-10 code in the first position of the death record” is for the classification of deaths for neonates without underlying cause of death. Does this refer to contributory causes?
  • P89: It is not entirely clear on where ill-defined deaths go (R chapter mostly). The chapter does not mention anything, and Table D.1 mentions those deaths being included in the residual causes (but are not listed in those). There’s also a mention to a redistribution without any details.
  • P91: the descriptions on Arriaga’s method are very hard to follow without equations, and please lay out key assumptions also in the main body of the thesis.
  • P91: if the causes of death cover all causes (I don’t see anything missing), why is the rescaling of cause specific mortality needed? (there’s a mention in P103 to the “total mortality rule” which I assume refers to this, but it is unclear)
  • P97: what may be reasons of the far more common contribution of lung cancer to inequality among women compared to men? (this is somewhat talked about in chapter 8)
  • P109: please better justifying the need for this project. There is not much in the introduction about this.
  • P111: as with our earlier comment above on ill-defined causes, it isn’t clear here how they are treated for this analysis.
  • P112: The author indicates there are very few studies looking at geographic patterns and cancer. Spain has been producing reports on that for some time (See here, the newest one is in English: https://www.isciii.es/QueHacemos/Servicios/VigilanciaSaludPublicaRENAVE/EnfermedadesCronicas/Paginas/Atlas.aspx direct link to the newest one here https://www.isciii.es/QueHacemos/Servicios/VigilanciaSaludPublicaRENAVE/EnfermedadesCronicas/Documents/atlas/Atlas_espana_portugal.pdf )
  • P117: reword the definition of posterior probability of a true decline
  • P125: For the discussion on epidemiologic transitions, I suggest https://pubmed.ncbi.nlm.nih.gov/35639549/ and https://academic.oup.com/ije/article/51/4/1054/6525745 for some nuanced discussions. Injuries have definitely been a weak spot of the theory.
  • P145: please provide a full justification of equation A.1, and under what assumptions death probabilities conditional upon survival to age x can be derived from cross-sectional, empiric mortality rates.
  • P145: please provide a worked example to allow readers to build intuition into the approach, we suggest when n=1.
  • P146: please explain in words how the equations ensure that the life table is closed in the sense that everyone must die at some point
  • P146: please provide reasonable detail and assumptions behind the Kannisto Thatcher approach so the thesis is more self-contained.
  • P146 Section A.2: please provide a more detailed justification with greater attention to the underlying assumptions that lead to equation A.6. We did not follow details such as “subtract the propability of surviving”. Further, it seems the construction uses D^i_x from some year t to construct a hypothetical cohort that survives to cause i alone. However there are competing deaths and if these had not occurred, D^i_x would be larger for old ages x. It seems the underpinning are strong and these render interpretation of the calculated cause-specific life exptectancies challenging.

viva questions

If the assessors are reading this, please ask the following:

Chapter 2

In what applications are nonstationary covariance functions useful?

Lengthscales/variances changes by location.

This failure to adapt to variability, or heterogeneity, in the unknown process is of particular importance in environmental, geophysical, and other spatial datasets, in which domain knowledge suggests that the function may be nonstationary. For example, in mountainous regions, environmental variables are likely to vary much more quickly than in flat regions.

https://onlinelibrary.wiley.com/doi/abs/10.1002/env.785
https://www.stat.berkeley.edu/~paciorek/diss/paciorek-thesis.pdf

BYM: Are there any problems of assigning priors to the variances of $U$ and $V$?

Yes, the convolution of the random effects components is difficult to fit without strong constraints on one of the two components, as either component can account for most or all of the individual-level variance.

This had let to the BYM2 parametrisation. Rather than a tap for hot and cold, you have a mixer tap with an overall variance and a weighting parameter.

https://mc-stan.org/users/documentation/case-studies/icar_stan.html

Are spatial models appropriate for the data? They were designed for grids and the spatial units have different sizes

There are concerns that the GMRF representation of space as an adjacency matrix is reductive. Two wildly different geometries can share the same adjacency graph. The ICAR and BYM models were originally proposed for image analysis, where the adjacency matrix for a regular lattice of pixels is more representative of its original form than the graph of a politically- and geographically-influenced boundaries within a country. Further, the ICAR model weights all neighbours equally. Nevertheless, in an epidemiological context, Duncan et al. (2017) compared a BYM model with binary, first-order adjacency matrix to models with a variety of different weighting schemes, including matrix weights based on higher-order degrees of neighbours, distance between neighbours, and distance between covariate values.

Models were consistent when run at district vs aggregate MSOAs to districts – sign that it isn't biased.

https://github.com/athowes/beyond-borders

What is the Modifiable Areal Unit Problem?

The same basic data yield different results when aggregated in different ways when aggregated in different ways:

  • the scale effect, showing major differences depending on the size of units
  • the zonation effect, showing major differences depending on how the study area is divided up

Not an issue for me because I did not define MSOAs/LSOAs, and designed to have some homogeneity within them based on household measures. Districts are administrative so local policy happens at that level.

https://athowes.github.io/beyond-borders/resources.html

What is the difference between indirect and direct standardisation?

Crude mortality rate it $\frac{\text{deaths}}{\text{population}}$. Indirect $\text{SMR} = \frac{\text{actual total number of deaths}}{\text{expected total number of deaths}}$. Direct requires a standard population to compare two sets of age-specific death rates.

https://www.youtube.com/watch?v=zObgZu54mJw

Why are modern death registration systems, such as that of the UK, not entirely complete and accurate?

Questions on how to count deaths from overseas, deaths of visitors, age at death in older ages, etc. Coroners, slow to add to the dataset for specific causes.

What was the outcome of the SAHSU studies?

Point source studies:

  • Elliot 1992: Clear increase in risk of mesothelioma and asbestosis within 3km of the Plymouth docks.
  • Aylin 1999: Small increase in risk but none greater than 10%.
  • Hodgson 2004: Significant excess kidney disease near polluting factories in Runcorn.
  • Elliot 2001: Landfill sites: small unexplained excess of congenital anomalies, no increase risk of cancers or Down's.

Geographic correlation (physics-informed models to create an exposure surface, and assessed the geographic correlations between this surface and the health outcome):

  • Hodgson 2007: plume of mercury pollution
  • Elliot 2010: No increased risk for exposure to mobile phone base stations during pregnancy.
  • Hansell 2013: noise from aircraft near Heathrow
  • Halonen 2015: Road traffic noise in London
  • Parkes 2020: particulate matter from incinerators during pregnancy

Why did you not cite Bouleri and Blangiardo (2020), Spatio-temporal model to estimate life expectancy and to detect unusual trends at the local authority level in England?

Do you believe some parts of England are in different stages of the transition to others?

Time trends of different risk factors such as smoking, alcohol use. These risk factors are not to do with infection so probably not still in stage 2.

What is Malthusian theory?

Population growth is potentially exponential, according to the Malthusian growth model, while the growth of the food supply or other resources is linear, which eventually reduces living standards to the point of triggering a population decline. This event, called a Malthusian check occurs when population growth outpaces agricultural production, causing famine or war, resulting in poverty and depopulation.

What about smallpox rather than typhoid fever?

Smallpox is also a common example and was actually eliminated through vaccination (last case 1977, WHO 1980), but much later than this reference.

What does "burden" mean in GBD studies?

Disease burden is the impact of a health problem as measured by financial cost, mortality (YLL), morbidity (YLD), or other indicators.

What's the difference between an absolute and a compositional model for causes of death?

Absolute models model death rates.

Compositional models relative proportions of each cause of death. Compositional models use an additive logistic normal model (if you apply a multivariate normal model to additive log-ratio it is equivalent to applying a multivariate logistic-normal model to the original compositional dataset). As we are modelling proportions, this does not allow death rates estimation.

Has anyone jointly modelled the overall death rate and the composition and from those you can estimate the cause-specific death rates which can then be input to the cause-specific likelihoods?

Chapter 4

How did you avoid over/underfitting?

The most important thing for us was smoothing. We need death rates between 0 and 1. Followed a Bayesian workflow, starting from a small model and adding effects that we thought were epidemiologically plausible. Effects might not improve model fit and might decrease effective sample size by adding another parameter.

Need sufficient parameters to not have bias at the level of analysis.

Model performance should be evaluated by whether it can predict future data. Use as many methods as you can to justify the model structure - train/test split, argumentation with subject matter experts, WAIC, LOOCV. Given the complexity of the world and the approximate nature of our models, there isn't be “one and preferably only one way” to evaluate how good our model is.

https://discourse.pymc.io/t/do-we-need-a-testing-set/759/5

What do you classify as a rare event? Do deaths in the oldest age group satisfy this?

“The sample size should be equal to or larger than 20 and the probability of a single success, $p$, should be smaller than or equal to 0.05. If $n$ > 100, the approximation is excellent if $np$ is also < 10.”

Hence why we moved to binomial.

https://www.solon-karapanagiotis.com/post/approx_binomial/approximating-binomial-with-poisson/
https://www.itl.nist.gov/div898/handbook/pmc/section3/pmc331.htm

Why does INLA scales badly with the number of hyperparameters? What about TMB?

INLA using the Laplace approximation on the latents (hopefully Gaussian) field. Then it performs quadrature (numerical integration) on the hyperparameters, which is slow and hence scales badly.

https://athowes.github.io/thesis/naomi-aghq.html#marginal-la

TMB optimises the hyperparameters so they are fixed values/point estimates. This is Empirical Bayes/type II ML. Kind of like GP where we just optimise for lengthscale/variance, or SSM etc etc. This can underestimate the variances.

What are the mild conditions for MCMC to converge to the true posterior?

  • Irreducibility/ergodicity. It is possible to eventually get from every state to every other state with positive probability
  • Aperiodicity. Does not return to the state $i$ after some transition with certainty.
  • Stationary/invariant. If the transition function of that chain leaves that distribution unchanged. Each part of the chain gives the same information: if you split up the chain into parts and rearrange, it should look near-identical.

https://www.youtube.com/watch?v=tByUQbJdt14

Why did you not use cohort effects like Bennett 2015?

Much longer timescale, forecasting to 2030.

Time slope for cohort, but not an intercept. Only certain types of age period cohort which are possible due to identifiability.

How did you choose your priors? What about PC priors?

Started with vague/standard/uninformative priors.

Tested sensitivity to priors (in early models) when first switching from U(0,100) to U(0,2) (vague enough, never near upper limit) to half-normal by comparing scatter plots of death rates with abline.

Now, I would do prior predictive simulation.

Penalised Complexity priors (for precisions), which are implemented in INLA but not commonly in most PPLs, penalise departure from the base model such that the probability of the precision above a certain threshold is $\alpha$. So the random effect will not be included unless there is enough information in the data for them to be there. We did not use them because:

  • Have to tune the threshold and $\alpha$
  • These priors bias the model by reducing variance. That is a modelling choice, and our model has lower bias but more variance.

Chapter 5

Figure on probability of dying at different ages. Could there be a survivor effect here? i.e. those in oldest age group have not died young so the SES gradient in the oldest age groups will be weaker than in the younger age groups?

Because they didn't die of this thing earlier, they die of it later so the death rates in older ages might be lower?

Is there anything similar in European countries? e.g. they have had slowdowns in e0 rises, have the worst off deciles gone into reverse?

Slowdown Leon (2019) paper but not as much as E&W.

"In contrast to the United States, and despite the occurrence of an economic crisis, the health of the lowly educated in Europe has improved in recent years, and health inequalities have sometimes narrowed."
^ Mackenbach 2018. 27 European countries, not UK, trends by education level and by age
https://www.pnas.org/doi/abs/10.1073/pnas.1800028115

Chapter 6

What does the beta-binomial likelihood look like? What is $\rho$?

It is a mixture distribution that allows for overdispersion, in the same way the Gamma-Poisson distribution can. The parameter $\rho$ controls the spread of the distribution,

Chapter 7

Why did you group diabetes and nephritis and nephrosis?

Diabetes is the leading cause of kidney disease. Different certifying physicians may assign deaths that have similar natural history to either "diabetes" or "nephritis and nephrosis" depending on their clinical belief on what the underlying cause.

Why did you use ICAR? Why not use BYM?

We tested a BYM but found it made no difference to fit. ICAR also recommended by Chris Paciorek, statistical help for the project.

Also, the age-space interaction is IID and accounts for a lot of the non-spatial variation, while also allowing for interactions.

What causes make up the residual groups?

Women: hypertensive heart disease, liver cirrhosis, oesophagus cancer, falls...

Men: pancreatic cancer, bladder cancer, Parkinson's, self-inflicted injuries, stomach cancer, leukaemia...

Why didn't you use to age 85?

We limited the age range because the probability of death in the absence of competing causes equals 1.0 when the entire life course is considered. We selected 80 years of age as the upper bound because it covers a wide age range but does not include the very oldest ages where multimorbidity makes the assignment of cause of death increasingly difficult.

Why did you use unconditional probability of dying?

We used unconditional probability of death because removing competing causes of death enhances comparability and equity. For example, two populations with similar exposure to cancer risk factors and similar quality of cancer care, have similar need for effective interventions. If one population has a higher risk of dying from another competing disease, for example HIV or road traffic injuries, the conditional probability would be lower, hence downplaying the need for interventions related to cancers.

Why didn't you use ASDR?

We used probability of death because it has an intuitive interpretation, and because, unlike age-standardised death rate, it does not depend on the choice of standard population. We also calculated age-standardised death rates. The correlation coefficients between age-standardised death rates and the probability of dying between birth and 80 years of age ranged from 0.93 to >0.99 across sex-year-cancer combinations for the years 2002 and 2019.

Why are there sex-specific differences in changes for IMPN? Too much smoothing dragging maternal mortality into young ages?

Probably due to a different mix in subcategories. Needs further investigation by separating this category, but difficult to model at a smaller group level. Age at which they die from these causes is young. RW1 is not smooth, similarity.

Districts with the greatest improvements in life expectancy had large contributions from declines in CVD mortality as well as some of the smallest contributions from increases in mortality from dementias. Was it directional?

How many moves occur as a short distance?

Van Dijk et al. "the majority of moves occur over short distances: the 2011 Census records that 57.1% of the individuals aged 16 and over that changed address within the preceding 12 months moved within the same local authority district."

However, as discussed in Bennett et al. 2023, there might be more between-district migration in London, although this would require further digging into the churn and migration data.

What does your study add to that already presented in Bennett et al. 2018?

  • Mapping. Some causes have different spatial patterns, and cannot be attributed only to patterns of deprivation.
  • Which causes are driving life expectancy change and their differences between districts?

Are drug overdoses intentional?

They're listed as intentional on ICD codes.

Although, this is the conservative view that drug use is a personal choice. However, there are issues with this related to poverty/low education/poor housing/etc leading to poor mental health/no social support and drug use.

Drug overdose often happens accidentally so would not be "intentional" and same for alcoholic liver disease.

Chapter 8

Is there variation between districts of quality of CoD assignment?

  • The quality of the cause of death data in the UK has been given the highest rating based on completeness and a low share of deaths assigned to implausible and ill-defined codes. There are national guidelines for death certification, and the Office for National Statistics (ONS) uses standardised coding algorithms and computer-based assignment of the cause of death, which assign the underlying cause of death uniformly so that data are comparable between places and times.
  • Validation of cause of death data for prostate cancer in the UK found good sensitivity and specificity.
  • Around 30% of deaths go to a coroner. Some evidence of locally issued guidance, but this has been withdrawn since: "Hitherto there have been no such regulations and the circumstances of reporting of deaths by medical practitioners to coroners has varied across coroner areas. As with immediate effect any locally issued guidance or direction should be withdrawn and the principles set out in this document used by all coroners to ensure greater consistency over death reporting."

https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/methodologies/userguidetomortalitystatisticsjuly2017#cause-of-death-coding

Why did you not put CrI for X-fold variation or Spearman correlation?

Presentation. It would result in correlations different to figures which use the median. But we did this for the revisions.

Colorectal cancer has correlation with poverty of 0.45 for men but only 0.09 for women. Is there an obvious explanation for this?

No obvious explanation (coauthor Amanda Cross). You could imagine that men in general are less likely to seek healthcare for GI symptoms (symptoms associated with later stage disease). This may be exacerbated by SES? Could be related to sex difference in diet.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.