Contents of the Secondary Database
- Total number of Polymorphic
sets analyzed: 2319
- Total number of Analysis units: 4471 [Phylogeny]
(average number of
analysis units per polymorphic set: 1.9)- CDS: 2184
- exon: 1167
- intron: 1040
- 5ŽUTR: 31
- 3ŽUTR: 33
- promoter: 16
QUALITY OF
THE ALIGNMENTS:
- Number of Analysis units
according to the "Number of sequences":
- Low number (2-5
=
! ): 1828 (40.9%)
- Medium number (6-10
=
K
): 690 (15.4%)
- High number (>10
=
J):
1953 (43.7%)
- Number of Analysis units
according to the "Percentage of gaps or ambiguous bases":
- High (≥30%
=
! , low quality): 42 (0.9%)
- Medium (≥10%-<30%
=
K
, medium quality): 159 (3.6%)
- Low (<10%
= J
, high quality): 4270 (95.5%)
- Number of Analysis units
according to the "Percentage of difference in size between
the longest and the shortest sequences":
- High (≥30%
=
! , low quality): 424 (9.5%)
- Medium (≥10%-<30%
=
K
, medium quality): 292 (6.5%)
- Low (<10%
= J
, high quality): 3755 (84.0%)
SET
CONFIDENCE:
- Existence in NCBI PopSet*: 28.6%
(1279/4471)
- Consecutive GenBank accession numbers**: 40.1%
(1791/4471)
- One or more shared references**: 57.1%
(2552/4471)
- Journals* (Genetics,
Mol.Biol.Evol., J.Mol.Evol., Mol.Phylogenet.Evol.): 58.6%
(2620/4471)
* At least one sequence in the set ** All the sequences in the set
AVERAGES BY GENE REGIONS:
| |
#Aligns |
#Polym. sets |
Avg. #Seqs |
Avg. Align. length (bases) |
Avg. #Analyzed sites (bases) |
Avg. θ |
Avg. π |
Avg. %G+C |
| 3ŽUTR |
33 | 30 | 13.3 | 609.3 | 514.3 | 0.00662 | 0.00455 | 38.98 | | 5ŽUTR |
31 | 28 | 15.7 | 530.3 | 375.9 | 0.00765 | 0.00585 | 42.46 | | CDS |
2184 | 2020 | 11.8 | 1365.5 | 1048.3 | 0.00692 | 0.00650 | 53.91 | | exon |
1166 | 506 | 15.2 | 589.5 | 497.8 | 0.00587 | 0.00549 | 52.58 | | intron |
1040 | 562 | 15.2 | 382.7 | 335.7 | 0.01200 | 0.01087 | 37.52 | | promoter |
16 | 11 | 38.8 | 603.6 | 324.8 | 0.00794 | 0.00761 | 24.14 |
Get your own lists and graphics from the
Graphical Search or the Comparative SearchTools
Contents of the Primary
the Database
- Total number of Sequences: 57650
- Total number of References
3695
- Published: 1361
- Submitted: 1692
- In press: 85
- Published Only in Database: 10
- Unpublished: 399
- Thesis: 18
- Number of different
Journals:
105
|