|
Host plant
|
---|
Sequencing
|
Z. mays
|
M. truncatula
|
---|
Total raw sequence
|
2.73 Gbp
|
2.91 Gbp
|
Reads Total
|
35,894,662
|
38,228,134
|
Host reads
|
(401,352)
|
(1,588,592)
|
Host filtered
|
35,493,310
|
36,639,542
|
Quality trimmed
|
26,947,737
|
27,325,845
|
Assembly
| | |
Assembly length
|
12.77 Mbp
|
12.25 Mbp
|
Unigenes Total
|
28,126
|
26,709
|
Unigenes >500 bp
|
9,369
|
9,718
|
Min/Max length (bp)
|
197/3,115
|
197/3,265
|
N50
|
525 bp
|
536 bp
|
N50 >500 bp
|
731 bp
|
695 bp
|
Annotation
| | |
Host unigenes
|
(4,967)
|
(7,785)
|
Non-plant unigenes
|
(127)
|
(329)
|
Triphysaria unigenes
|
23,032
|
18,595
|
Triphysaria hits
|
17,887
|
14,352
|
Other Plant hits
|
2,975
|
2,086
|
No hits
|
2,170
|
2,157
|
- Low quality reads were filtered before assembly, and host sequences were filtered both before and after assembly. Unigenes remaining after removal of host plant and non-plant sequences were aligned with BLASTx to sequences detected in any other PPGP transcriptome library of Triphysaria versicolor (http://ppgp.huck.psu.edu/). Unigenes with less than 95% pairwise identity to either host or to other Triphysaria libraries were sorted further if a BLASTx search of the NR (http://www.ncbi.nlm.nih.gov) database yielded alignments of 1e-10 or stronger. The remaining unclassified unigenes were submitted to OrthoMCL DB and InterProScan. Unigenes that remained unclassified after the final screen are called “no hit” unigenes.