Abstract

Background

Visualisation techniques can greatly aid authors in getting across the point of their paper.

Objective

This paper aims to demonstrate the utility of polar plots for exploring demographic data, while also demonstrating the value of allowing interactivity in demographic research papers through the use of html and javascript.

Methods

A simple Bayesian model is estimated on weekly UK mortality data, allowing the identification of periods that deviate from seasonal and long-term trends. Polar plots and d3 based interactive graphics are then used to investigate these deviations.

Contribution

The paper demonstrates the value of the use of html as a format for academic papers, as this may allow authors to unlock the power of interactive graphics in order to better communicate both features of the data and the results of statistical modelling exercises. Additionally, the paper shows that polar plots provide a useful method for examining demographic data that display seasonal effects or other periodic recurrences.


1 Introduction

Traditionally, demographers are concerned with analysing data with a frequency of one year. This is partly because of demography’s long association with the practical business of official statistics and population forecasting (Greenhalgh 1996). Statistics collection and aggregation is an expensive and time-consuming business which puts a limit on the frequency of releases, and it is natural to align with preexisting cycles such as the calendar or financial year that also form the basis for governmental decision making. The importance of age to demographic analysis is another reason why the year is the discipline’s basic temporal unit. Our everyday understanding of age is fixed at a temporal resolution of one year, in that when we are asked to report our age, what we supply is ‘age at last birthday’1. The process of ‘ageing on’ in the cohort component projection model that is integral to most demographic projection almost always involves a discrete increase in both time and age of at least one year.

As a result, many demographic visualisations in which time is a dimension are likely to be displaying observations (or predictions) at a frequency of one year. Of course, there are many exceptions to this generalisation; to give but one example, duration models work with directly with dates of events, and so are able to display survivorship curves over days, weeks or months. Willekens (2013) gives a comprehensive overview of the use of chronological objects in demographic analysis, highlighting the problems caused by the irregularities of the Julian calendar.

A need for increased responsiveness on the part of government and an increased technological capacity to collect and process data has led to some statistics being collected and released on a more frequent basis. For instance, the harmonised Eurostat statistics on Asylum applications and decisions are published at a data frequency of one month (Eurostat 2010). In parallel, the drive towards the use of big data, particularly from internet user data collected initially collected for other purposes, will create new opportunities to examine data over smaller periods of time (Billari and Zagheni 2017).

This paper presents an interactive visualisation of recent UK weekly mortality data using polar charts. The cyclical presentation emphasises the periodic nature of the data, while the interactivity allows the user to better understand the data through manipulation.

2 Polar Graphs

At its most basic, creating a graph is primarily a process of converting data into visual characteristics such as distance along a coordinate axis, colour, or shape (Wickham 2009). Data generally consists of individual observations, each of which contains values for one or more dimension (Wickham 2014). Each dimension can be consistently represented by the same visual characteristic, so that individual observations are distinguishable and comparable. Graphs plotted within the Cartesian coordinate space map dimensions of the data to the horizontal (\(x\)) or vertical (\(y\)) axis. In contrast, those using polar coordinates instead use the clockwise angle (\(\theta\)) and the radius (\(r\)). This encoding is most advantageous when the data to be plotted is naturally periodic, so that the dimension mapped to the angular axis has the characteristic that if it is increased enough it will return to its starting point.

In his study of the ‘Golden Age’ of Visualisation, Friendly (2008) notes that one of the earliest use of polar graphs was to display information about wind direction. Compass bearings are clearly periodic; as the compass needle swings clockwise from North to South, its distance from its starting point increases, but if it continues in the same direction it begins to approach North again.

Another early use of polar charts to display quantitative information was also in the field of mortality. Florence Nightingale presented her data on mortality amongst soldiers in the Crimean War using polar charts, so that the circular plotting area is split into 12 segments with equal angles, one for each month (Friendly 2008). The area of each segment represented the number of deaths, with colours mapped to cause of death. Nightingale’s innovative graphics clearly showed that preventable disease, not battle injury was the primary cause of death in field hospitals, and that her work on basic hygiene drastically reduced the death rate [ibid].

More recently, polar charts have been to great effect used to illustrate the acceleration of man-made climate change through the dramatic increase in temperature with the atmospheric carbon dioxide concentration.2

Circular plotting areas are often used in other cases where angular distance is not directly mapped to a continuous data dimension. Radar charts are often used to compare scores of different subjects or products across several dimensions, but these are fraught with difficulty.3 Circular plots have also used where one discrete data dimension has no natural ordering, particularly when interactions between different levels of this dimensions need to be shown. Genomic data, for instance, can be shown in such a manner using the circos plotting library, with chromosomes and the genes the contain located around the circumferences, allowing associations between genes to be displayed with arcs linking them (Krzywinski 2009). Abel and Sander (2014) use a similar technique to good effect in a demographic context to show migration flows between countries.

3 Animation and Interactivity in Demographic Visualisation

As discussed above, visualisation involves converting data dimensions to visual characteristics. However, there is a limit to the amount of dimensions that can be encoded within a single plot before the supply of characteristics run dry, and as the editorial to this collection suggests, over-complexity can hinder rather than help the communication of information. Generally speaking, two dimensions may be mapped to axes, one to colour, and perhaps another to shape, size or line type. Exploiting colour scales through ternary diagrams or qualitative-sequential schemes (Schöley and Willekens 2017) can extend the amount of information that can be imparted by a static plot by allowing compositional data to be included, but again there are limits to such gains. The use of animations or interactivity can allow the form of the plot to remain relatively simple, but provide for additional dimensions to be added. The use of animation is particular effective if the animated dimension is time, as the time dimension in the data unfolds intuitively in real time (at some rate of compression). Interactivity in this context may be as simple as allowing switching between data-sets or between different statistical models, facilitating direct comparisons by the user. For example, a reader may examine the effect on coefficients or on predictions of switching between models with different controls or covariates.

Billari (2015) describes two modes of demographic analysis; discovery and explanation. Interactivity facilitates the exploration and discovery of features of the data. Kerren and Schreiber (2012, p. 3885) describe how interactivity forms a crucial part of the ‘sense-making loop’ in the field of visual analytics, which focuses on the use of visual tools in the analysis and processing of big data. In the context of an academic article, providing interactive tools to the reader of the paper allows them to re-discover points of interest identified by the author, providing a greater understanding of the thought processes behind the analysis. Interactivity can also help to explain difficult methodological concepts. Distill is an online-only journal which focuses on the explanation of algorithms in machine learning, and strongly encourages the use of interactivity. A recent article attempted to show how neural network-based image recognition software ‘sees’ the world (Olah et al. 2018), allow the reader to examine the effect of changing inputs images.

There are some very good examples of interactive demographic visualisations. These include those built by national statistics agencies4 and those built by academics, such as the Human Mortality Explorer5. However, such graphics do not often form part of the exposition of an academic paper. Where animations are included, they are often relegated to the supplemental material, which may or may not by accessed by the casual reader. This seems regrettable, as active exploration of the data or the set of models may provide greater insight into the points made in the paper. A greater emphasis on html in the publication of academic papers may help to facilitate exploration of visualisation opportunities. The advantages of a html-first direction for journal articles are discussed by Peroni et al. (2017), who also provide a simplified html specification (RASH) aimed entirely at the production of scholarly articles. The previously mentioned Distill journal encourages the use of html in submissions for these reasons. While many academics are happy to use latex for the production of pdf articles, fewer are comfortable with creating html documents (this author included), although the latter is a more generally useful technology.

This is partly a cultural issue, as we are trained to think of academic papers as documents that exist mainly as printed artifacts (Swoger 2013), even though this may not be how they are often read. An examination of the author guidelines for leading demographic journals suggest although videos are often permitted, little information is given on how to include other forms of interactive graphics. Although websites of journal publishers might allow inline videos or html-based interactive graphics, taking advantage of this is not something that comes naturally to academic authors. Additionally, the use of colour must generally be coupled with another visual characteristic (such as line type or point shape), so that the graphic is still comprehensible when printed.

The press towards open access publication and replicability may provide provide a route towards more web-based publication of academic research. The Open Science movement provides admirable solutions to sharing code and data that may also facilitate greater experimentation with manuscript format (c.f Masuzzo and Martens 2017), There is still appears to be an underlying assumption that that pdf is the primary format for publishing academic research, however. Another exception to the rule is the Journal for Artificial Societies and Social Simulation, an online-only open-access journal which encourages the use of both colour and animations; the utility of simple animations in presenting results can be seen in section 4 of Fitzpatrick et al. (2015).

The availability of tools to simplify the production of interactive visualisations is also an issue. The d3 javascript plotting library (Bostock et al. 2011) is highly flexible and can produce dynamic client-side visualisations, where the computational work involved with changing the plot in response to interaction takes place on the user’s computer, thus avoiding the need for server hosting. However, the learning curve associated with d3 is relatively steep. Interfaces with common statistical programming languages exist - for instance, the rCharts and htmlwidgets R packages provide interfaces to certain d3-based libraries. The development of pre-defined d3 plots for interactive visualisation of, for instance, R model objects might be one way in which to lower the technical barrier to the production of useful visualisations. The accessibility and permanence of research outputs are also an issue to be considered in this debate. In light of the above discussion, in parallel to the .pdf version of this paper, an .html version will also be provided with containing plots produced using d3. The bookdown package for R is used for this purpose, as it facilitates production of .pdf and .html outputs from a single source file written in a simple markdown format, interleaved with the R code needed to produce plots and graphics (Xie 2018). Snapshots of interactive graphics can also easily be produced in this framework so that the .pdf and html version of the document do not diverge.

4 Weekly Mortality in England and Wales 2010-2019

In order to demonstrate the utility of the two visualisation tools discussed in the preceding discussion, polar charts and interactivity, the rest of this paper explores their use in the exploration of weekly mortality data for England and Wales between 2010 and Februry 2019. This period is particularly interesting and controversial as it coincides with both an apparent slow-down in the rate of mortality decline (Continuous Mortality Investigation 2015; Hiam et al. 2017a) and a period of austerity where government spending on health and social care was not increased at the rate it had been previously (Hiam et al. 2017b; Local Government Association 2015). This latter period of spending restraint also took place as the population aged, so that the demand for healthcare was in any case likely to increase. A series of papers have investigated the characteristics of this deceleration and its potential causes and effects. An early paper by the actuaries of the Continuous Mortality Investigation identified that the rate of decline of the age-standardised mortality rate had slowed considerably between 2010-2014 when compared to the trend up to 2010 (Continuous Mortality Investigation 2015), and inspired the approach of this paper in also examining weekly mortality. Flu out-breaks that disproportionately effect the elderly population (also seen in other parts of Europe) are responsible for some of the year-to-year variation (Public Health England 2016), but it has also been suggested that reductions in the social care budget of local councils and additional pressure on the National Health Services caused by lack of funding may be responsible for the greater than expected number of deaths (Hiam et al. 2017b; Loopstra et al. 2016).

The purpose of this paper is not to shed any new light on the debate over the causes of mortality deceleration in recent years, however, but rather to demonstrate the utility of particular visualisation tools using a relevant dataset. Mortality data show clear seasonal patterns relating to temperature and disease prevalence, and so are periodic in the sense discussed above and suitable for examination using polar charts. The months of December and January, which share similar climatic conditions, are displayed adjacent to each other in polar charts and are thus easily comparable. Data published by the ONS provides weekly deaths counts for England and Wales, with breakdowns along a number of dimensions: sex, broad age group; Government Office Region; and cause of death (broadly categorised as respiratory or other cause) (Office for National Statistics 2018). Figure 4.1 displays raw weekly death data for England and Wales using standard Cartesian coordinates, using the interactive dygraphs d3 library to facilitate user investigation in the html version of this paper. Seasonal patterns are noticeable in both respiratory and other causes of death, and the effect of public holidays and death registration is also evident in sharp falls followed by compensating recoveries around Christmas and Easter. Raw death counts do also appear higher in the second half of the data, but no attempt has yet been made to account for exposure differences, both in absolute terms and with respect to age composition.

Figure 4.1: Time Series of Deaths in England and Wales. Source: ONS (2018)

We can alternatively examine this data in a polar coordinates plot, as shown in Figure 4.2. It is necessary to limit the the period displayed to 2014-2019 to avoid over-plotting. The polar plot emphasises the seasonal nature of the respiratory death series, showing higher counts and greater volatility in the first quarter of the year. The dotted ring provides a visual point of reference, and is set at the mean value over the period. Peaks in respiratory deaths in 2015 and 2018 are clearly visible and comparable with early years.

Polar plot of England and Wales Mortality 2014-2018. Source: ONS 2018

Figure 4.2: Polar plot of England and Wales Mortality 2014-2018. Source: ONS 2018

5 Methodology

Plotting raw death counts provides little information about the underlying mortality trends. For this we need to take into account both seasonality and exposure to risk, and to attempt to unpick the effect of public holidays on death registrations. In the exploratory analysis that follows, a crude interpolation method is used to approximate weekly exposure by age, and a statistical model is used to extract trend and seasonal components, together with fixed effects for public holidays. The approached used is similar in some respects to that used in Chapter 21 of Gelman et al. (2014) for the analysis of daily birth data; although the model here is simpler, and uses a periodic spline rather than a Gaussian process for smoothing.

5.1 Exposure

To approximate exposures by week and age, midyear population totals taken from ONS midyear estimates and, for 2017-19, ONS forecasts, are linearly interpolated from mid-year to mid-year in a similar manner to Continuous Mortality Investigation (2015). The population at the end of week \(w\) in year \(t\) is calculated by taking midyear estimates for years \(t\) and \(t+1\), and moving \(\frac{w}{52}\) of the way along a straight line connecting these estimates. The exposure is then the average of the population at the beginning and end of the week, correcting for the fact that we wish to express exposure in person-years:

\[\begin{align} \begin{split} P_{w,t} &= \left( P_{t} + w \frac{P_{t+1} - P_{t}}{52} \right)\\ E_{w,t} &= \frac{1}{52} \frac{P_{w,t} + P_{w+1,t}}{2}. \end{split} \tag{5.1} \end{align}\]

This is not an ideal method as it assumes deaths occur evenly throughout the year, which directly contradicts the seasonal trends seen in the data we are analysing. However, it is expected for the broad age groups used, the difference will be small enough to conduct rudimentary modelling carried out below.

5.2 Model

Deaths in each week and age group are assumed to follow a Poisson distribution, with the log of the Poisson rate represented using a simple model with a time trend indexed by the number of days \(t\) since the start of the fitting period on the 1st January 2010. A smooth order 4 b-spline \(s(d)\) with a period of one year is also employed to capture the seasonal effect (Wood 2006), which is allowed to vary for different age groups:

\[\begin{align} \begin{split} D_{x,w} \sim \text{Poisson}(\mu_{x,w}E_{x,w}) \\ \text{log}(\mu_{x,w}) = \alpha_x + \gamma_x t + s_x(d) + \mathbf{x}_{w}\boldsymbol{\beta}. \end{split} \tag{5.2} \end{align}\]

This is a function of the day of the year \(d\), which is more precisely the number of days since the 31 December the previous year, and contains 9 basis functions, 3 of which straddle the end of the year, providing for a smooth periodic fit without discontinuities. Without adjustments, the level of this spline will be confounded with the intercept, so the spline is constrained to sum to zero over the whole fitting period, and the random walk smoothing prior on the basis function coefficients is conditioned on this constraint (Hilton et al. 2018; Wood 2006).

\[\begin{align} \begin{split} s_x(d) = B(X)\theta_x \\ \theta_x \sim \text{MVN}(\mathbf{0},\Sigma) \end{split} \tag{5.3} \end{align}\]

Fixed effects \(\beta\) are also included to capture the effect of public holidays. A separate term is included for each different public holiday, so that the effect of Easter and Christmas may have different magnitudes, but the effect of Easters in different years are equal. The vector \(\mathbf{x}_w\) contains an indicator for each holiday that takes the value \(1\) if that week contains that specific public holiday, and the value \(-1\) if the preceding week included it. This aims to capture the idea that deaths not registered because of a holiday are likely to instead be registered the following week (Gelman et al. 2014 Chap. 21). Normal priors are given to all coefficient vectors, with vague half normal priors on the variance parameters.

\[\begin{align} \begin{split} \boldsymbol{\beta} \sim \text{Normal}(0, \sigma_{\beta}^{2}) \\ \boldsymbol{\gamma} \sim \text{Normal}(0, \sigma_{\gamma}^{2}) \\ \boldsymbol{\alpha} \sim \text{Normal}(0, \sigma_{\alpha}^{2}). \end{split} \tag{5.4} \end{align}\]

6 Estimation and Results

6.1 Estimation

Posterior model samples were obtained using the Bayesian sampling software stan (The Stan Development Team 2018). The estimated models are were quick to converge due to their simplicity, taking a matter of seconds to obtain suitable effective posterior sample sizes on a modern desktop. Code is provided with the supplementary material for the replication of results.

6.2 Results for the Crude Mortality Rate

Looking first at a simple model with no distinction by age, where the modeled rate is just the crude mortality rate, a number of features of the posterior distributions are worth mentioning. Firstly, the polar plot in Figure 6.1 displays the magnitude of fixed effects of public holidays on death registration. These effects are reasonably large in size; New Year appears to cause a delay in deaths registrations by about 20%, and Diamond Jubilee of 2011 (for which there were two days off) had a similarly sized effect. The polar chart gives a good visual indication of the distribution of holidays across the year.

Location and magnitude of the posterior distribution of the fixed effect of holidays

Figure 6.1: Location and magnitude of the posterior distribution of the fixed effect of holidays

The estimated seasonal effect was as expected; mortality is higher during the period December-March, and considerably lower during the summer months (Figure 6.2. Figure 6.3 and the interactive graphic in the html version of this paper display the model residuals, defined here as the difference between the observed log rates and the equivalent posterior samples, we can see that the greatest deviations occur in the winters of early 2015 and 2018. This is potentially related to influenza outbreaks that appear to have disproportionately affected care homes and the old (Public Health England 2015, 2018). The interactive version of the plot was constructed in d3 from the outputs of the model.6 Points inside the green circle are below the posterior mean estimate, while outside this circle rates are higher than the simple linear, seasonal and holiday terms together imply. As time progresses, older data is fades and is removed from the plot, with the most recent data displayed in blue. The plot can be paused at any point, and the slider dragged back to any point in the range of the data, between 2010 and February 2019.

Polar plot of Posterior Seasonal Effect

Figure 6.2: Polar plot of Posterior Seasonal Effect

Mean model residual in log-rates

Figure 6.3: Mean model residual in log-rates





Figure 6.4: Interactive plot of model residuals by date



6.3 Results: Rate by broad age group.

The ONS weekly death dataset provides age breakdowns for 7 broad age categories. However, as can be seen from Figure 6.5, the mean age of individuals within these age groups fluctuates, so that changes to the mortality rate experienced by the group as a whole may be related to changes in its age composition. The black vertical lines indicate when the ONS weekly dataset begins. Of particular interest is the ageing of the 85+ age group in the final panel of the plot, which may account for the increasing mortality rate of that group.

Exposure averaged age within each age category

Figure 6.5: Exposure averaged age within each age category

As before, we can examine the posterior mean of the log residuals by week and age. Focusing on the oldest group, we can see that the heat-map of residuals is similar to the pattern for the total population. This is not surprising, as we expect a large proportion of all deaths to occur at these older ages.

Heatmap of posterior means of log residuals by week of the year for Age 85 and above

Figure 6.6: Heatmap of posterior means of log residuals by week of the year for Age 85 and above

As before, we can also examine the evolution of these patterns through time using an animated d3 graph. Using the drop-down menu, one can also switch between different age groups, and the chart and chart axis adjust dynamically. A greater variability is evident at lower ages, where the underlying rates are low. Older ages, in contrast, show more pronounced peaks in particular periods when influenza or lower temperatures may have increased mortality. Pausing at particular time periods and switch between age-groups facilitates direct comparison between them. A similar technique could be fruitfully employed within demographic papers for comparing predictions between different models, or differences in raw rates between sexes. The use of interactivity within a paper in this manner may help authors better communicate their arguments.



Figure 6.7: Interactive plot of model residuals by Age group and date



7 Conclusion

This paper has provided a simple demonstration of the how animated polar charts can be used to map demographic data with a frequency smaller than one year. Polar charts are particularly useful for data that is truly periodic, and in the case of mortality, may emphasise seasonal effects in the data. Used in combination with statistical methods, polar plots may help to tease out the variation from trends and existing seasonal patterns. They may not always be appropriate, however, and can particularly be deceptive when areas are involved, as these do not scale linearly with increases in radius. The editorial to this special edition also suggests that more familiar chart types should be favoured where possible, as audiences are likely to be better able to decypher the information contained in these. However, where the data is strongly seasonal, the use of polar charts may be justifiable.

Many improvements could be made to the analytical work conducted in this paper; estimation of exposures could be made more accurate by utilising information about the distribution of deaths over weeks, and the pattern of deaths by single year of age over whole year periods could be included to infer the weekly pattern of age-specific mortality rates. However, this paper is primarily focused on visualisation rather than substantive issues; the main aim is to show how different visualisation methods can highlight certain features of the data.

To that end, this paper has also demonstrated some relatively crude interactive graphs. The use of animation and interactive plots as a integral part of a demographic paper is an area which is under-explored, largely because of cultural and technical barriers. A greater emphasis on html as a publishing medium may help encourage greater use of interactivity. The development of d3-based tools focused on making it easy to produce common demographic visualisations from existing statistical programming languages (particularly R) will help to make this process painless, and will encourage embedding of interactive elements within dynamic, reproducible documents (Xie 2015).


Acknowledgments

This work was supported by the ESRC Centre for Population Change - phase II (grant ES/K007394/1). Many thanks to all the participants at the Rostock Retreat on Visualisation who provided helpful comments and questions on early versions of the work. All the views presented in this paper are those of the author only.

References

Abel, G. J., & Sander, N. (2014). Quantifying Global International Migration Flows. Science, 343(March), 1520–22.

Billari, F. C. (2015). Integrating macro- and micro-level approaches in the explanation of population change. Population Studies, 69 Suppl 1(April), S11–20. doi:10.1080/00324728.2015.1009712

Billari, F. C., & Zagheni, E. (2017). Big Data and Population Processes: A Revolution?, (June), 1–12. doi:10.17605/OSF.IO/F9VZP

Bostock, M., Ogievetsky, V., & Heer, J. (2011). D3 Data-Driven Documents. IEEE Transactions on Visualization and Computer Graphics, 17(12), 2301–2309. doi:10.1109/TVCG.2011.185

Continuous Mortality Investigation. (2015). Recent Mortality in England and Wales. London: Continuous Mortality Investigation Mortality Projections Committee. https://www.actuaries.org.uk/documents/cmi-working-paper-83-recent-mortality-england-wales

Eurostat. (2010). Eurostat Statistics Database. Online database. http://epp.eurostat.ec.europa.eu

Fitzpatrick, B., Martinez, J., Polidan, E., & Angelis, E. (2015). The big impact of small groups on college drinking. Journal of Artificial Societies and Social Simulation, 18(3), 4. doi:10.18564/jasss.2760

Friendly, M. (2008). The Golden Age of Statistical Graphics. Statistical Science, 23(4), 502–535. doi:10.1214/08-STS268

Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2014). Bayesian Data Analysis (Third.). Abingdon: CRC Press.

Greenhalgh, S. (1996). The Social Construction of Population Science: An Intellectual, Institutional, and Political History of Twentieth-Century Demography. Comparative Studies in Society and History, 38(01), 26. doi:10.1017/S0010417500020119

Hiam, L., Dorling, D., Harrison, D., & McKee, M. (2017a). Why has mortality in England and Wales been increasing? An iterative demographic analysis. Journal of the Royal Society of Medicine, 110(4), 153–162. doi:10.1177/0141076817693599

Hiam, L., Dorling, D., Harrison, D., & McKee, M. (2017b). What caused the spike in mortality in England and Wales in January 2015? Journal of the Royal Society of Medicine, 0(January), 014107681769360. doi:10.1177/0141076817693600

Hilton, J., Dodd, E., Forster, J., & Smith, P. W. F. (2018). Projecting UK Mortality using Bayesian Generalised Additive Models. Journal of the Royal Statistical Society. Series C, Early View(12 August 2018).

Kerren, A., & Schreiber, F. (2012). Towards the Role of Interaction in Visual Analytics. Proceedings of the 2012 Winter Simulation Conference, 3881–3893.

Krzywinski, M. (2009). Circos: an Information Aesthetic for Comparative Genomics. Genome Res, 19(604), 1639–1645. doi:10.1101/gr.092759.109.19

Local Government Association. (2015). Future funding outlook for councils - interim 2015 update (No. June) (p. 1). Local Government Association Publications. http://issuu.com/lgapublications/docs/future_funding_outlook_interim/2?e=16807299/14035821

Loopstra, R., McKee, M., Katikireddi, S. V., Taylor-Robinson, D., Barr, B., & Stuckler, D. (2016). Austerity and old-age mortality in England: a longitudinal cross-local area analysis, 2007–2013. Journal of the Royal Society of Medicine, 109(3), 109–116. doi:10.1177/0141076816632215

Masuzzo, P., & Martens, L. (2017). Do you speak open science? Resources and tips to learn the language. PeerJ, 5:e2689v1, 0–3. doi:10.7287/PEERJ.PREPRINTS.2689V1

Office for National Statistics. (2018). Deaths registered weekly in England and Wales, provisional. Office for National Statistics. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/datasets/weeklyprovisionalfiguresondeathsregisteredinenglandandwales

Olah, C., Satyanarayan, A., Johnson, I., Carter, S., Schubert, L., Ye, K., & Mordvintsev, A. (2018). The Building Blocks of Interpretability. Distill. doi:10.23915/distill.00010

Peroni, S., Osborne, F., Di Iorio, A., Nuzzolese, A. G., Poggi, F., Vitali, F., & Motta, E. (2017). Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles. PeerJ Computer Science, 3, e132. doi:10.7717/peerj-cs.132

Public Health England. (2015). PHE Weekly National Influenza Report (No. Week 13). Public Heath England. https://www.gov.uk/government/statistics/weekly-national-flu-reports-2014-to-2015-season

Public Health England. (2016). Surveillance of influenza and other respiratory viruses in the United Kingdom: Winter 2014 to 2015, 29. https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/526405/Flu_Annual_Report_2015_2016.pdf

Public Health England. (2018). PHE Weekly National Influenza Report (No. Week 19). Public Health England. https://www.gov.uk/government/statistics/weekly-national-flu-reports

Schöley, J., & Willekens, F. (2017). Visualizing compositional data on the Lexis surface. Demographic Research, 36(1), 627–658. doi:10.4054/DemRes.2017.36.21

Swoger, B. (2013). Vestiges of print publication in scientific journals. Scientific American. https://blogs.scientificamerican.com/information-culture/vestiges-of-print-publication-in-scientific-journals/

The Stan Development Team. (2018). The Stan Core Library. http://mc-stan.org

Wickham, H. (2009). Ggplot2: Elegant graphics for data analysis (2nd Editio.). Springer.

Wickham, H. (2014). Tidy Data. Journal of Statistical Software, 59(1), 1–23. doi:10.18637/jss.v059.i10

Willekens, F. J. (2013). Chronological objects in demographic research. Demographic Research, 28(March), 649–680. doi:10.4054/DemRes.2013.28.23

Wood, S. N. (2006). Generalised Additive Models: An Introduction with R. Boca Raton: Chapman; Hall / CRC Press.

Xie, Y. (2015). Dynamic Documents with R and Knitr (2nd Editio.). Boca Raton: CRC Press.

Xie, Y. (2018). bookdown: Authoring Books and Technical Documents with R Markdown. Boca Raton: CRC Press. https://bookdown.org/yihui/bookdown/


  1. The exception to this rule is when we have collected so few years that even the fractions matter!

  2. http://openclimatedata.net/climate-spirals/from-emissions-to-global-warming-line-chart/

  3. http://blog.scottlogic.com/2011/09/23/a-critique-of-radar-charts.html

  4. https://www.gut-leben-in-deutschland.de/static/LB/report/income

  5. https://jschoeley.shinyapps.io/hmdexp/

  6. Code from two simple d3 code ‘blocks’ produced by Mike Bostock and Jane Pong were utilised to help construct this plot. These are available at https://bl.ocks.org/mbostock/4583749 and https://bl.ocks.org/officeofjane/47d2b0bfeecfcb41d2212d06d095c763 respectively.