Skip to content

Commit f431baf

Browse files
authored
Merge pull request #1508 from cmu-delphi/release/delphi-epidata-4.1.25
Release Delphi Epidata 4.1.25
2 parents aa3dedb + 704e898 commit f431baf

39 files changed

+210
-340
lines changed

.bumpversion.cfg

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
[bumpversion]
2-
current_version = 4.1.24
2+
current_version = 4.1.25
33
commit = False
44
tag = False
55

.git-blame-ignore-revs

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -20,3 +20,5 @@ b9ceb400d9248c8271e8342275664ac5524e335d
2020
07ed83e5768f717ab0f9a62a9209e4e2cffa058d
2121
# style(black): format wiki acquisition
2222
923852eafa86b8f8b182d499489249ba8f815843
23+
# lint: trailing whitespace changes
24+
81179c5f144b8f25421e799e823e18cde43c84f9

.github/PULL_REQUEST_TEMPLATE.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
closes|addresses <!--list issues closed or partially-addressed by this PR -->
1+
addresses issue(s) #ISSUE <!--list issue(s) associated with this PR -->
22

33
### Summary:
44

.github/workflows/update_gdocs_data.yml

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,10 @@ jobs:
2121
restore-keys: |
2222
${{ runner.os }}-pipd-
2323
- name: Install Dependencies
24-
run: pip install -r requirements.dev.txt
24+
run: |
25+
pip -V
26+
python -m pip install pip==22.0.2
27+
pip install -r requirements.dev.txt
2528
- name: Update Docs
2629
run: inv update-gdoc
2730
- name: Create pull request into dev

dev/local/setup.cfg

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
[metadata]
22
name = Delphi Development
3-
version = 4.1.24
3+
version = 4.1.25
44

55
[options]
66
packages =

docs/api/covidcast-signals/covid-act-now.md

Lines changed: 21 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -15,11 +15,13 @@ grand_parent: COVIDcast Main Endpoint
1515
* **Time type:** day (see [date format docs](../covidcast_times.md))
1616
* **License:** [CC BY-NC](../covidcast_licensing.md#creative-commons-attribution-noncommercial)
1717

18-
The COVID Act Now (CAN) data source provides COVID-19 testing statistics, such as positivity rates and total tests performed.
19-
The county-level positivity rates and test totals are pulled directly from CAN.
20-
While CAN provides this data potentially from multiple sources, we only use data sourced from the
18+
The [COVID Act Now (CAN)](https://covidactnow.org/) data source provides COVID-19 testing statistics, such as positivity rates and total tests performed.
19+
The county-level positivity rates and test totals are pulled directly from CAN using [their API](https://covidactnow.org/data-api).
20+
While CAN provides this data potentially from multiple sources, we only use data that CAN sources from the
2121
[CDC's COVID-19 Integrated County View](https://covid.cdc.gov/covid-data-tracker/#county-view).
2222

23+
Delphi's mirror of the CAN data was deactivated in December 2021 (last issue 2021-12-10) in favor of the [DSEW CPR data](./dsew-cpr.md), which reports the same information under the `covid_naat_pct_positive_7dav` signal.
24+
2325

2426
| Signal | Description |
2527
|--------------------------------|----------------------------------------------------------------|
@@ -34,9 +36,9 @@ While CAN provides this data potentially from multiple sources, we only use data
3436

3537
## Estimation
3638

37-
The quantities received from CAN / CDC are the county-level positivity rate and total tests,
38-
which are based on the counts of PCR specimens tested.
39-
In particular, they are also already smoothed with a 7-day-average.
39+
We receive county-level positivity rate and total tests from CAN, originating from the CDC.
40+
These quantiles are based on the counts of PCR specimens tested.
41+
They are also already smoothed with a 7-day-average.
4042

4143
For a fixed location $$i$$ and time $$t$$, let $$Y_{it}$$ denote the number of PCR specimens
4244
tested that have a positive result. Let $$N_{it}$$ denote the total number of PCR specimens tested.
@@ -79,38 +81,41 @@ $$
7981

8082
### Smoothing
8183

82-
No additional smoothing is done to avoid double-smoothing, since the data pulled from CAN / CDC
84+
No additional smoothing is done to avoid double-smoothing, since the CAN data
8385
is already smoothed with a 7-day-average.
8486

8587
## Limitations
8688

87-
Estimates for geographical levels beyond counties may be inaccurate due to how aggregations
88-
are done on smoothed values instead of the raw values. Ideally we would aggregate raw values
89+
Estimates for geographical levels beyond counties may be inaccurate because our aggregations
90+
are performed on smoothed values instead of the raw values.
91+
Ideally we would aggregate raw values
8992
then smooth, but the raw values are not accessible in this case.
9093

91-
The positivity rate here should not be interpreted as the population positivity rate as
94+
The reported test positivity rate should not be interpreted as the population positivity rate as
9295
the testing performed are typically not randomly sampled, especially for early data
9396
with lower testing volumes.
9497

9598
A few counties, most notably in California, are also not covered by this data source.
9699

97-
Entries with zero total tests performed are also suppressed, even if it was actually the case that
100+
Entries with zero total tests performed are suppressed, even if it was actually the case that
98101
no tests were performed for the day.
99102

100103
## Lag and Backfill
101104

102105
The lag for these signals varies depending on the reporting patterns of individual counties.
103106
Most counties have their latest data report with a lag of 2 days, while others can take 9 days
104-
or more in the case of California counties.
107+
or more, as is the case with California counties.
105108

106-
These signals are also backfilled as backlogged test results could get assigned to older 7-day timeframes.
107-
Most recent test positivity rates do not change substantially with backfill (having a median delta of close to 0).
108-
However, most recent total tests performed is expected to increase in later data revisions (having a median increase of 7%).
109+
Revisions are sometimes made to the data. For example, backlogged test results can get assigned to past dates.
110+
The majority of recent test positivity rates do not change substantially with backfill (having a median delta of close to 0).
111+
However, the majority of recent total tests performed is expected to increase in later data revisions (having a median increase of 7%).
109112
Values more than 5 days in the past are expected to remain fairly static (with total tests performed
110113
having a median increase of 1% of less), as most major revisions have already occurred.
111114

112115
## Source and Licensing
113116

114-
County-level testing data is scraped by CAN from the
117+
County-level testing data is scraped by [CAN](https://covidactnow.org/) from the
115118
[CDC's COVID-19 Integrated County View](https://covid.cdc.gov/covid-data-tracker/#county-view),
116119
and made available through [CAN's API](https://covidactnow.org/tools).
120+
121+
The data is made available under a [CC BY-NC](../covidcast_licensing.md#creative-commons-attribution-noncommercial) license.

docs/api/covidcast-signals/hhs.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
title: Department of Health & Human Services
3-
parent: Data Sources and Signals
3+
parent: Inactive Signals
44
grand_parent: COVIDcast Main Endpoint
55
---
66

docs/epidata_development.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -49,7 +49,7 @@ $ [sudo] make test pdb=1
4949
$ [sudo] make test test=repos/delphi/delphi-epidata/integrations/acquisition
5050
```
5151

52-
You can read the commands executed by the Makefile [here](../dev/local/Makefile).
52+
You can read the commands executed by the Makefile [here](https://github.com/cmu-delphi/delphi-epidata/blob/dev/dev/local/Makefile).
5353

5454
## Rapid Iteration and Bind Mounts
5555

@@ -87,8 +87,8 @@ You can test your changes manually by:
8787

8888
What follows is a worked demonstration based on the `fluview` endpoint. Before
8989
starting, make sure that you have the `delphi_database_epidata`,
90-
`delphi_web_epidata`, and `delphi_redis` containers running; if you don't, see
91-
the Makefile instructions above.
90+
`delphi_web_epidata`, and `delphi_redis` containers running (with `docker ps`);
91+
if you don't, see the Makefile instructions above.
9292

9393
First, let's insert some fake data into the `fluview` table:
9494

docs/symptom-survey/publications.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,6 +26,10 @@ Pandemic"](https://www.pnas.org/topic/548) in *PNAS*:
2626

2727
Research publications using the survey data include:
2828

29+
- C.K. Ettman, E. Badillo-Goicoechea, E.A. Stuart (2024). [Financial
30+
strain, schooling modality and mental health of US adults living
31+
with children during the COVID-19 pandemic](https://doi.org/10.1136/jech-2023-221672).
32+
*Journal of Epidemiology & Community Health*.
2933
- K. Sasse, R. Mahabir, O. Gkountouna, A. Crooks, A. Croitoru (2024).
3034
[Understanding the determinants of vaccine hesitancy in the United
3135
States: A comparison of social surveys and social media](https://doi.org/10.1371/journal.pone.0301488).
@@ -41,7 +45,7 @@ Research publications using the survey data include:
4145
- Z. Yang, R. Krishnan, and B. Li (2024). [The interplay between individual
4246
mobility, health risk, and economic choice: A holistic model for COVID-19
4347
policy intervention](https://doi.org/10.1287/ijds.2023.0013). *INFORMS
44-
Journal on Data Science*.
48+
Journal on Data Science* 3 (1), 6-27.
4549
- A. Srivastava, J. M. Ramirez, S. Díaz-Aranda, J. Aguilar, A. F. Anta, A. Ortega,
4650
and R. E. Lillo (2024). [Nowcasting temporal trends using indirect surveys](https://doi.org/10.1609/aaai.v38i20.30242).
4751
In *Proceedings of the 38th AAAI Conference on Artificial Intelligence* 38,

0 commit comments

Comments
 (0)