Before downloading any of the datasets, you are strongly encouraged to read the document Supernova 2.X performance with background information about how we have generated them.
Each final assembly is presented in four forms, as described here.
Before downloading any of the datasets, you are strongly encouraged to read the document Supernova 2.0 performance with background information about how we have generated them.
Each final assembly is presented in four forms, as described here.
Datasets for the manuscript Weisenfeld et al. "Direct determination of genome sequences". The data are from seven human samples, six of which are Coriell cell lines, and one of which was obtained from blood from a Human Genome Project donor (labeled HGP below). For each sample, we provide all the reads from a single 10x Genomics library sequenced on a HiSeq X, and an assembly of a 1200M random sample of the reads, obtained by applying the Supernova Assembler (version 1.1). Each assembly is presented in four forms, as described here. The read data and assemblies are in the process of being submitted to the Short Read Archive and GenBank. For the HGP sample, we also provide a file of 3,431 finished sequences that we obtained from GenBank, and comprising in total 340 Mb.