
assembly file

Description of the assembly file formats that blobtools accepts as input

[block:api-header]
{
  "type": "basic",
  "title": "Why is it needed?"
}
[/block]
A genome assembly is a collection of strings representing DNA sequences (contigs, scaffolds, chromosomes, ...), each composed of the five letters A, G, C, T and N.

[block:api-header]
{
  "type": "basic",
  "title": "Format"
}
[/block]
- [FASTA](https://en.wikipedia.org/wiki/FASTA_format)

[block:api-header]
{
  "type": "basic",
  "title": "Comment"
}
[/block]
> **Caution**: While parsing a [FASTA](https://en.wikipedia.org/wiki/FASTA_format) file, blobtools splits the [sequence ID](https://en.wikipedia.org/wiki/FASTA_format#Description_line) on the first whitespace it encounters, so only the text before that whitespace is used as the ID. This is analogous to the behaviour of other bioinformatics applications.
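
To make the effect of this splitting concrete, here is a minimal Python sketch. It is not taken from the blobtools source: the function name `sequence_id` and the example headers are hypothetical, and the sketch only assumes the "split on first whitespace" behaviour described in the caution.

```python
def sequence_id(header_line):
    """Return the text before the first whitespace of a FASTA description line.

    Hypothetical helper for illustration only; not the blobtools implementation.
    """
    # Drop the leading '>' marker, then split once on any run of whitespace
    # (whitespace directly after the '>' is ignored by str.split).
    fields = header_line.lstrip(">").split(None, 1)
    return fields[0] if fields else ""


# Made-up description lines, as they might appear in an assembly FASTA file.
headers = [
    ">contig_1 length=1200 cov=35.2",
    ">contig_1 length=800 cov=12.0",
    ">scaffold_7",
]

ids = [sequence_id(h) for h in headers]
print(ids)  # ['contig_1', 'contig_1', 'scaffold_7']
```

In this made-up input the first two headers differ only after the first space, so they reduce to the same ID. Under the assumption above, keeping the part of each header before the first whitespace unique avoids such ambiguity.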