Research Questions
Road safety interventions have traditionally been prioritised by looking at absolute number of casualties at different locations. Whilst this helps practitioners identify dangerous locations, it will not capture the relative risk per user of different types of infrastructure or route environment.
To assess how characteristics of the route environment affect injury risk for each individual road user, we need to control for exposure (Vandenbulcke, Thomas, and Int Panis 2014; Vanparijs et al. 2015). Epidemiological methods offer ways of doing this. Case-control studies control for exposure by comparing aggregate injury locations to aggregate control locations, with control locations selected from modelled or measured flow data (Miller et al. 2017; Aldred et al. 2018; Meuleners et al. 2019).
By contrast, in case-crossover studies each person serves as their own control, reducing intrapersonal confounding. Control points are selected from a person’s route prior to injury, as in Teschke et al. (2012). This study uses a similar method, however, using a cycling route prediction algorithm to generate cyclist routes rather than recall.
Methods and Data
Britain’s database for police-reported traffic incidents, Stats19, was used to identify cycle injury locations, with casualty home postcode data provided via Transport for London (TfL). Between 2016-2018, 13,700 cycling injuries were reported to police in London. After removing non-London residents and incorrect home postcodes, 11,936 injuries remained.
As we only had home postcode data rather than journey origins, we could only use injuries sustained where a person’s home was likely the starting point. For the purposes of routing, their trip needed to be for utility purposes (A to B; i.e. not a weekend leisure ride which might not follow typical utility routing patterns). Morning commuter travel fits both requirements. Therefore, only collisions occurring between 7-10am on weekdays and involving people over 16 were used. This left a total of 2,876 injury points, most of which (86.7%, or 2,494) were slight injuries. 375 were serious and 7 were fatal.
Routes were predicted between home and injury locations on TfL’s Cynemon network (which contains all roads and other routes that can legally be cycled) using the Network Analyst tool in ArcGIS. The route prediction tool was programmed to use Cynemon generalised costs for commuters, which places route directness above other factors such as infrastructure and traffic volumes. Cynemon is a cycling assignment model which has primarily been developed to support appraisal of cycling interventions and formation of cycling policy. As such, Cynemon can estimate cyclist routes, flows and journey times.
A control point was introduced for each 3km of a cyclist’s route, i.e. 1 point for routes under 3km, 2 points for routes 3-6km. This method resulted in 5,754 control points, i.e. almost double the number of injury points - which was the target. 80% of control points were for routes under 12km (67% for under 9km). Control points were then selected at random points along the route by using the Random Point Generator tool in ArcGIS. Figure 1 shows the location of these points across Greater London.
Table 1 shows route environment datasets used and sources. Figure 2 illustrates cycling infrastructure in London, taken from the TfL Cycle Infrastructure Database, separated into different types of infrastructure, and Figure 3 provides a composite image of different facility types.
The cycling flow data uses 2016 AM peak hour (08:00-09:00) link flows from Cynemon, which have been calibrated and validated using observed counts. Aggregate temporal and seasonal factors from Strava data were applied to the Cynemon flows to increase precision and better align with the injury data. Temporal factors derived from London Travel Demand Survey (LTDS) data were applied to the pedestrian and traffic flow data. Traffic speed data was obtained from Trafficmaster via DfT: this uses data from GPS-enabled vehicles to generate average flows at peak and off-peak times.
Road classes were ‘A’, ‘B’, ‘C’ and ‘Other’. A roads are primary roads, which include TfL-managed ‘red routes’, carrying up to 30% of London’s traffic despite only making up 5% of roads. B roads are secondary roads, while C roads tend to connect local roads with A or B roads, and ‘Other’ roads include residential and private roads.
We performed statistical analysis using similar methodology to Aldred et al. (2018), with binary logistic regression modelling to predict whether a location was an injury or control point. Coefficients (odds ratios) were produced for each variable after controlling for other factors. This indicated whether the variable increased or decreased cycling injury odds and a corresponding significance level (we chose p<0.05 as our threshold). All analysis was undertaken in the statistical package R (https://www.r-project.org/) and the mixed-effects models were fitted using the ‘glmer’ function (https://www.rdocumentation.org/packages/lme4/versions/1.1-25/topics/glmer).
Findings
Table 2 provides descriptive statistics for route environment characteristics included in the final model, alongside the proportion of points for each characteristic that represent injury sites. Across all points, this proportion is 33.3%, so any value higher than this means an over-representation of injury points for that variable category.
Coefficients related to whether a point was in Inner, Outer, or Central London do not reflect differences in the ‘real’ risks in each area. Apparent higher risk in Inner and Central areas is an artefact of the method as control points, by definition, lie closer to home than work. Instead, we included this variable to control for possible differences between these areas not included in the table below.
Table 3 presents the results of the two models, alongside univariate odds ratios. Model 1 controlled for road characteristics whilst model 2 also controlled for road user flows (other cyclists, pedestrians, motor traffic). Variables for which p=>0.1 were removed before re-running the models, which made no substantive difference to results.
Cycling at intersections increases injury odds threefold. This is in line with literature on intersections (e.g. Quddus 2008; Harris et al. 2013) and the odds ratio is similar to Aldred et al’s London study (2018). B roads were relatively risky compared both to residential streets and A roads. These findings differ from Teschke et al. (2012) and Aldred et al. (2018).
Kerb separated cycle infrastructure reduced injury odds substantially; by 40% compared to no infrastructure. Stepped tracks were even more protective, reducing injury odds by 65%, albeit with large confidence intervals due to low numbers (0.15-0.85, CI 95%). These findings are in line with Teschke et al. (2012) and in London, Li, Graham, and Liu (2017). By contrast, painted cycle lanes did not reduce injury. Mandatory painted lanes did not lead to any risk reduction and advisory lanes (which motor vehicles are legally permitted to enter) increased injury odds by over 30%.
Bus lanes reduced injury odds, although less than protected cycle infrastructure. Bus stops were also, surprisingly, associated with reduced risk. Perhaps they provide a respite from parked cars, which increase risk (Teschke et al. 2012) but are not included in this study.
There is evidence of a ‘safety in numbers’ effect as an increase in cycle flow reduced injury odds (e.g. Jacobsen 2003; Marques and Hernández-Herrador 2017; Aldred et al. 2018). The study suggests that a doubling of cyclists in the morning commute reduces cycling injury odds by 13%.
Higher pedestrian density and traffic volume increased risk, although the latter was only borderline significant. A doubling in pedestrian density increased cycling injury odds by 6%. This finding reflects the increased risk of busy high streets, town centres or business hubs.
Counterintuitively, risk decreased as speed increased. When looking at the absolute numbers it appears that roads with speeds less than 10mph are more dangerous, indicating congestion could increase risk. However, an increase in speed is likely to increase severity of a given injury (Elvik 2013).
In conclusion, cycle infrastructure on main roads and junctions must be protected (kerb separated or stepped tracks, rather than painted lanes), particularly when there is high traffic and/or pedestrian activity.
Acknowledgements
A special gratitude goes to Transport for London and the Strategic Analysis department, as this study would not have been possible without its data and support. University College London provided a platform for this research which is also appreciated.