This data release includes population (pVCF) and individual-level (CRAM, gVCF and more) Whole Genome Sequencing data for all UK Biobank participants. It does not include PLINK or BGEN formatted files, these should be available in the first release of 2024. To view the full details of the data in this November release, please visit the UK Biobank website: https://www.ukbiobank.ac.uk/enable-your-research/about-our-data/genetic-data
The proteomic data for levels of 3,000 proteins in 56,000 people, together with updated imaging files (both of which were included within Data Showcase as part of the October 2023 release) are now also available through UKB-RAP.
Please note: Dispensing of data has been enabled and we are expecting a great amount of interest in this new data, and there may be long queues for data dispensing. We have been working hard to help improve your experience and we would strongly encourage you to read our FAQ before dispensing any data within UKB-RAP: https://www.ukbiobank.ac.uk/media/dovbae03/uk-biobank-final-whole-genome-sequencing-release-faqs_v1-0.pdf
Given the UK Biobank dataset now comprises about 30 petabytes of data, with nearly 18 million individual-level and 600,000 population-level files, we are introducing a new function as part of the data dispensal process. You can select which elements of the data to dispense (i.e. population-level files and/or individual-level), so that you only dispense the data you need – this will make the system more efficient for everyone. This function is now part of the standard data dispensing process.
We are stepping into the unknown in terms of both the scale of data and demand. If you have any questions, please direct them to the very helpful members of the online community forum at https://community.dnanexus.com/s/, or otherwise contact ukbiobank-support@dnanexus.com. If you have an access-related query, such as changing to tier 3 to be able to access this data, then please contact access@ukbiobank.ac.uk.