A few weeks ago we added incremental MIRA data to the released ImmuneCODE set. We’re doing that again today, growing the binding pair set from about 90,000 hits to more than 135,000. But this release also opens a new chapter of the immune response story to SARS-CoV-2 — 1,414 complete immunoSEQ repertoires from COVID-19 acute, recovered, and exposed subjects around the world.
All together, we’ve cataloged more than 540,000,000 TCR-Beta sequences in these repertoires with over 352,000,000 unique TCR-Beta sequences. We are incredibly excited to share this expansive dataset with researchers working to understand and contribute to the global fight against COVID-19.
To make the information as usable as possible, we have published a full description of the ImmuneCODE data under peer review. A preprint of the article is available for open access here.
Of course, at this scale the dataset doesn’t fit well into a couple of CSV files, so going forward we’ll be releasing the data for interactive querying and visualization using our public data platform immuneACCESS at https://clients.adaptivebiotech.com/pub/covid-2020. Downloadable versions are also available from the immuneACCESS site.
In just a few months, COVID-19 has spread across the world and impacted almost every facet of our lives. We are proud to be part of the solution by contributing these data, and are confident that ImmuneCODE can help the scientific community crack the code and get us back to living, playing, and working together face-to-face soon.
We’ll keep adding to the dataset over the next few months. Please let us know how you’re using it so we can keep growing our collective understanding of COVID-19 and build on each other’s work.