Soybean EST Libraries from GenBank Contig Assembly and EST analysis:
We used the following GenBank root libraries for this analysis (sequences and
further information on the libraries can be found here):
Gm-c1004 (8428 sequences)
Gm-c1081 (3603 sequences)
Gm-c1087 (5674 sequences)
Gm-c1060 (1220 sequences)
Gm-c1033 (1369 sequences)
Results of the contig assembly and subsequent EST analysis of these libraries
can be found here
(note: only contigs of 3 or more sequences were selected for analysis, a total
of 1,646 contigs). Below is a list of the top genes (hits) that were
observed in the contigs; sequence counts for contigs with similar hits were summed:
| Name | Number of Sequences | Percentage* |
|---|---|---|
| G.max ADR12 mRNA | 484 | 2.42 |
| G.max proline-rich protein | 385 | 1.92 |
| Arabidopsis thaliana chromosome 1 | 366 | 1.83 |
| Arabidopsis thaliana chromosome 5 | 356 | 1.78 |
| Arabidopsis thaliana chromosome 4 | 270 | 1.35 |
| Arabidopsis thaliana chromosome 3 | 233 | 1.16 |
| G. max gene for ubiquitin | 211 | 1.05 |
| G. max Sali3-2 mRNA | 175 | 0.87 |
| Arabidopsis thaliana chromosome 2 | 169 | 0.84 |
| 40S subunit ribosomal protein | 165 | 0.82 |
| G.max tefS1 gene for elongation factor EF-1a | 158 | 0.79 |
| 60S ribosomal protein | 149 | 0.74 |
| mRNA for metallothionein-like protein | 135 | 0.67 |
| G. max nodulin-26 mRNA | 132 | 0.66 |
| G.max ascorbate peroxidase 2 (APx2) mRNA | 92 | 0.46 |
| Plasma membrane integral protein | 90 | 0.45 |
| G.max lipoxygenase mRNA | 90 | 0.45 |
| Aquaporin-1 (Mip-1) mRNA | 86 | 0.43 |
| mRNA for putative extensin | 84 | 0.42 |
| mRNA for small GTP-binding protein | 84 | 0.42 |
| ferrochelatase mRNA | 73 | 0.36 |

* Percentage of the 20,005 ESTs that were analyzed.
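
The tallying behind the table can be sketched in a few lines of Python. This is only an illustrative sketch, not the pipeline used for the analysis: the function name `summarize_hits`, the input layout (one `(top_hit_name, n_sequences)` pair per contig), and the example contigs are all assumptions. The only values taken from the text are the minimum contig size (3 sequences) and the 20,005-EST denominator.

```python
from collections import Counter

TOTAL_ESTS = 20005   # total ESTs analyzed, from the table footnote
MIN_CONTIG_SIZE = 3  # contigs with fewer sequences were not analyzed


def summarize_hits(contigs, total_ests=TOTAL_ESTS, min_size=MIN_CONTIG_SIZE):
    """Sum sequence counts per hit name and express them as a percentage.

    contigs: iterable of (top_hit_name, n_sequences) pairs, one per contig
             (a hypothetical input layout, assumed for this sketch).
    Returns a list of (hit_name, n_sequences, percentage) tuples,
    sorted by sequence count in descending order.
    """
    counts = Counter()
    for hit, n_seqs in contigs:
        if n_seqs >= min_size:       # keep contigs of 3 or more sequences
            counts[hit] += n_seqs    # add up contigs with the same hit
    return [
        (hit, n, round(100 * n / total_ests, 2))
        for hit, n in counts.most_common()
    ]


# Hypothetical contigs: the split of ADR12 into two contigs is invented
# for illustration; only the totals match the table.
example_contigs = [
    ("G.max ADR12 mRNA", 300),
    ("G.max ADR12 mRNA", 184),
    ("G.max proline-rich protein", 385),
    ("ferrochelatase mRNA", 2),      # below the size cutoff, so dropped
]

rows = summarize_hits(example_contigs)
for name, n_seqs, pct in rows:
    print(f"{name}\t{n_seqs}\t{pct:.2f}")
```

With these example inputs, the two ADR12 contigs sum to 484 sequences, which is 2.42% of 20,005 and matches the first table row.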