Webinar: Predicting symptomatic COVID-19

April 6, 2020

This article has not been updated recently

ZOE and King's College data science and machine learning teams have been working around the clock to create a machine learning model that uses Symptom Tracker data to predict COVID-19 in the UK. Based on data from the COVID Symptom Tracker app and the assumptions that we lay out below, we estimate that there are a total of 1.9m people in the UK with symptomatic COVID (aged 20-69 only) as of 1st April 2020.

Jonathan Wolf, CEO of ZOE explains the model in our webinar below. You can find a daily feed of maps here.

‍

Our estimate was calculated in 3 steps:

1. Learn which symptoms best predict COVID, based on app users who have been tested

2,932 users of COVID Symptom Tracker both recorded their symptoms and have been tested for COVID, with 1,130 testing positive and 1,802 testing negative. We used machine learning* on this data to learn which symptoms are most predictive of a positive test. The most predictive symptoms, with most important first, were: anosmia (lack of taste & smell), fatigue, shortness of breath, fever and persistent cough.

2. Estimate total number of app users with COVID by applying those rules to all users’ logged symptoms (people who don’t report symptoms are not part of the model)

In total there were 1,626,355 users of COVID Symptom Tracker aged 20-69 who have recorded their symptoms, healthy or not, as of 1st April 2020. We applied the rules learnt from the tested users to estimate that 79,405 out of these total users would be positive if tested (4.9%).

3. Extrapolate to the whole UK population from app users, based on region, age & gender proportions

We segmented the whole UK population by location, age-decade and gender. For each such segment, we applied the percentage predicted as positive by our rules amongst app users, and then combined back to a total UK estimate.

Key assumptions:

Does not include asymptomatic COVID infections: there may be a large additional number of these.
Does not include COVID infections for people aged <20 or >70, since we have too little data logged in the app to model these.
Assumes that tested app users’ symptoms are representative of symptoms for those with positive or negative Covid status in the whole population
Assumes that app users are representative of the whole population within each segment of location, age-decade and gender. This model does not adjust for other demographics or health information.
Assumes healthy and unhealthy people are equally likely to use our app.
Assumes that app users report the severity of their symptoms in the same way.
Assumes no interaction between symptoms in this first logistic regression model.
Does not capture the most serious hospitalised cases well, although these are smaller in number compared to our total estimate.
‍

* Technical model detail: We trained a logistic regression model on app users who had been tested, to learn which symptoms are most predictive of a positive COVID test.

To assess the variability of the model, we trained ten logistic regression models on 80:20 random splits of the data. The mean and standard deviation of the weights for each symptom are given below, with a mean intercept of -1.44 (sd 0.04).

We obtained a mean classification accuracy on the test set of 0.73 (sd 0.01).

Text Link

Webinar: Predicting symptomatic COVID-19

1. Learn which symptoms best predict COVID, based on app users who have been tested

2. Estimate total number of app users with COVID by applying those rules to all users’ logged symptoms (people who don’t report symptoms are not part of the model)

3. Extrapolate to the whole UK population from app users, based on region, age & gender proportions

Key assumptions:

Other updates

ZOE Health Study joins world-leading cancer trial, NHS-Galleri

How your new health reports advance science

COVID numbers have stopped falling

Introducing the ZOE Health Study!

COVID numbers tumble, as new variants fail to take hold

1 in 15 has COVID in new record high

Introducing the new simplified daily report

COVID cases rocket to new high

ZOE government funding has been cancelled, but we’re not stopping

Why scientists need you to complete your ZOE Health Profile

COVID testing is changing: keep logging your tests and symptoms in the ZOE app

Have you done your science today? Easy ways to log your daily health report

COVID déjà vu as cases now falling for second time this year

COVID peaks for second time this year

Is my upset stomach a symptom of Omicron?

What does Omicron mean for the future of COVID?

UK back to 200,000 a day

Omicron bounces back

Omicron falling fast, but new uptick detected in children

Introducing the ZOE Studies Hub

Omicron wave has finally peaked

Omicron spread slows but cases hit vulnerable over 75s

Cases set to break 200,000

COVID cases explode

Everything we know so far about Omicron

Omicron and cold-like symptoms rapidly taking over in London

Map of Omicron spread in England

Strength in numbers: How ZOE is shaping the future of healthcare

Third doses can save Christmas

What are the symptoms of COVID?

Don't cancel Christmas yet

How ZOE is shaping the future of healthcare

Are COVID booster shots safe?

Biggest drop in cases since start of winter wave

Does a COVID infection guarantee protective antibodies?

COVID cases have probably peaked for 2021

Had a COVID antibody test? Here’s how to log your result

Vaccines and the Immunosuppressed

Worryingly close to 100,000 new cases a day

Can a pill cure COVID?

COVID reaches new highs for 2021

What impact has COVID-19 had on cancer?

What's the difference between natural and vaccine immunity against COVID-19?

Living with Long COVID summary

Third wave reaches new peak

Had a seasonal flu vaccine? You can log it with ZOE

Do I have COVID or the flu? How to tell the difference

Future-proofing our COVID estimates

Do I need a COVID vaccine if I’ve had COVID?

COVID cases in children continue to climb

COVID not showing signs of dropping

Does having COVID-19 affect mental health?

Can COVID-19 vaccines affect male fertility?

COVID cases declining but remain high and unpredictable

Do I have COVID or a cold? How to tell the difference

With the highest cases in Europe, UK should trigger Plan B now

You can now log any vaccine, including boosters, with ZOE

COVID-19, pregnancy, fertility and vaccines: Your questions answered

COVID cases no longer climbing as feared

ZOE to fight more health conditions

Double COVID vaccination halves risk of Long COVID

COVID still rising after restriction-free summer

How does COVID-19 affect children?

COVID bounces back as cases start to rise

COVID vaccines: are they working?

Is COVID vaccine protection fading?

What to eat when you have COVID-19 or long COVID

Is loss of smell still an important symptom of COVID-19?

UK cases hold steady

Should you self-isolate after being ‘pinged’? Here are the signs to look out for

Fall in UK COVID cases stalls

We’ve updated the COVID tests screen

COVID cases are falling

Should I still wear a mask?