
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC141862.21 - phase: 0 /pseudo
(659 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 91 2e-18
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 88 1e-17
BU083646 weakly similar to GP|6642775|gb| gag-pol polyprotein {V... 58 1e-08
NP004897 gag-protease polyprotein 52 1e-06
AW831299 41 2e-04
CO984748 44 3e-04
TC233180 similar to UP|Q944K0 (Q944K0) At1g18030/T10F20_3, parti... 40 0.004
BF324756 similar to GP|29423270|gb gag-pol polyprotein {Glycine ... 39 0.007
BU548265 homologue to GP|18000935|gb| cytochrome b {Mabuya delal... 37 0.021
BG881907 homologue to GP|21432|emb|CA ORF2 {Solanum tuberosum}, ... 31 1.5
CF922226 31 1.5
TC224797 homologue to UP|IF4Y_TOBAC (Q40467) Eukaryotic initiati... 30 2.6
TC215893 similar to UP|Q6K674 (Q6K674) Translation initiation fa... 30 3.4
TC231477 similar to UP|Q6K722 (Q6K722) OTU-like cysteine proteas... 29 5.8
TC212187 weakly similar to UP|Q6USA0 (Q6USA0) Transposase (Fragm... 28 10.0
AW203235 28 10.0
TC215701 similar to GB|CAA89201.2|6469340|ATANTMR adenine nucleo... 28 10.0
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 90.9 bits (224), Expect = 2e-18
Identities = 49/101 (48%), Positives = 61/101 (59%), Gaps = 1/101 (0%)
Frame = +1
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K L + V FG GKIIG G +
Sbjct: 1642 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKL 1818
Query: 619 GNSSI-SINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + S+N V LV GL NL+SISQ CD G++V F+K+ C
Sbjct: 1819 VHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSEC 1941
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 87.8 bits (216), Expect = 1e-17
Identities = 47/101 (46%), Positives = 60/101 (58%), Gaps = 1/101 (0%)
Frame = +1
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K + + V FG GKI G G +
Sbjct: 1645 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKL 1821
Query: 619 GNSSI-SINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + S+N V LV GL NL+SISQ CD G++V F+K+ C
Sbjct: 1822 VHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSEC 1944
>BU083646 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (19%)
Length = 428
Score = 58.2 bits (139), Expect = 1e-08
Identities = 34/99 (34%), Positives = 48/99 (48%), Gaps = 4/99 (4%)
Frame = +1
Query: 564 CLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI---GN 620
C A SW +DSGC+ HMT ++ LF L +VK + G T+ G+
Sbjct: 43 CFAASSST*SWLIDSGCTNHMTYDRELFTELDEVVFSKVKIRNEAYIDVKGKETVAI*GH 222
Query: 621 SSIS-INNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + I+NV V + NLLS+ Q GY V+F NC
Sbjct: 223 TGLKLISNVLYVSEISQNLLSVPQLLKKGYKVLFEDKNC 339
>NP004897 gag-protease polyprotein
Length = 1923
Score = 51.6 bits (122), Expect = 1e-06
Identities = 27/60 (45%), Positives = 32/60 (53%)
Frame = +1
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K + + V FG GKI G G +
Sbjct: 1645 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKL 1821
>AW831299
Length = 334
Score = 40.8 bits (94), Expect(2) = 2e-04
Identities = 15/30 (50%), Positives = 22/30 (73%)
Frame = +2
Query: 571 QRSWYLDSGCSRHMTGEKSLFLTLTMKDGG 600
++ WY+DSGCS+HMTG+ S F ++ K G
Sbjct: 242 RKKWYIDSGCSKHMTGDASNFTHISPKKSG 331
Score = 22.7 bits (47), Expect(2) = 2e-04
Identities = 11/22 (50%), Positives = 15/22 (68%), Gaps = 2/22 (9%)
Frame = +1
Query: 536 GKVKPPEL--TQKDP*RYGYLN 555
G K P+L T KDP ++GYL+
Sbjct: 148 GSQKDPQLILTCKDPIKFGYLS 213
>CO984748
Length = 810
Score = 43.5 bits (101), Expect = 3e-04
Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 6/96 (6%)
Frame = -2
Query: 569 EKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTIGNS------S 622
++ ++WY++ G + H+T + G E F GN G I T+ NS
Sbjct: 581 QQSQNWYINYGATHHVTASPQNLMNEVPTSGNEQVFLGNGQGLPITGSTVFNSPFASNVK 402
Query: 623 ISINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+++NN+ V + NL+S+S F + F ++C
Sbjct: 401 LTLNNLLHVPHITKNLVSVS*FAKDNVFFEFHSSHC 294
>TC233180 similar to UP|Q944K0 (Q944K0) At1g18030/T10F20_3, partial (11%)
Length = 916
Score = 39.7 bits (91), Expect = 0.004
Identities = 32/86 (37%), Positives = 43/86 (49%), Gaps = 3/86 (3%)
Frame = +2
Query: 576 LDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTG-KIIGTGTIG-NSSISINNVWLVDG 633
+DSG + HMT S F + T G + N + IIG G I SS+ +NNV V
Sbjct: 458 IDSGVTDHMTPHSSYFSSYTFLIGNQHIIVANGSHIPIIGCGNIQLQSSLHLNNVLYVPK 637
Query: 634 LKHNLLSISQFC-DNGYDVMFSKTNC 658
L +NLLSI + D V F ++C
Sbjct: 638 LSNNLLSIHKIT*DLNCVVTFFHSHC 715
>BF324756 similar to GP|29423270|gb gag-pol polyprotein {Glycine max},
partial (2%)
Length = 121
Score = 38.9 bits (89), Expect = 0.007
Identities = 18/28 (64%), Positives = 20/28 (71%)
Frame = -1
Query: 565 LRAREKQRSWYLDSGCSRHMTGEKSLFL 592
LRA KQ W++DSGCSRHMTG K L
Sbjct: 91 LRASAKQ-DWHIDSGCSRHMTGVKEFLL 11
>BU548265 homologue to GP|18000935|gb| cytochrome b {Mabuya delalandii},
partial (5%)
Length = 667
Score = 37.4 bits (85), Expect = 0.021
Identities = 29/109 (26%), Positives = 49/109 (44%), Gaps = 9/109 (8%)
Frame = -3
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTG-EKSLFLTLTMKDGGEVKFGGNQTGKIIGTGT 617
L++ + A +SWY DSG S H+T +++ ++ ++ G Q I +G
Sbjct: 662 LQIFLTSSAATPSQSWYPDSGASHHVTNMSQNIQQVAPFEEPIQIIIGNGQGLNINSSGL 483
Query: 618 IGNS-------SISINNVWLVDGLKHNLLSISQFC-DNGYDVMFSKTNC 658
S S+ ++N+ V + NL+ +SQFC DN F C
Sbjct: 482 STFSSPINPQFSLVLSNLLFVPTITKNLIRVSQFCKDNNVYFEFHSYVC 336
>BG881907 homologue to GP|21432|emb|CA ORF2 {Solanum tuberosum}, partial (3%)
Length = 429
Score = 31.2 bits (69), Expect = 1.5
Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Frame = -2
Query: 567 AREKQRSWYLDSGCSRHMTGEKSLFLTL-TMKDGGEVKFGGNQTGKIIGTGTI 618
++ K++SW +DS S HMT + +F + D V+ K++G G++
Sbjct: 179 SKGKRKSWIVDSRASDHMTVDTPVFDNYSSCHDHATVQIADGTLSKVVGKGSV 21
>CF922226
Length = 667
Score = 31.2 bits (69), Expect = 1.5
Identities = 12/24 (50%), Positives = 15/24 (62%)
Frame = -3
Query: 568 REKQRSWYLDSGCSRHMTGEKSLF 591
+ + W +DSGCS HMT KS F
Sbjct: 89 KNPETKWIMDSGCSWHMTPNKSWF 18
>TC224797 homologue to UP|IF4Y_TOBAC (Q40467) Eukaryotic initiation factor
4A-14 (eIF4A-14) (eIF-4A-14), partial (28%)
Length = 681
Score = 30.4 bits (67), Expect = 2.6
Identities = 16/67 (23%), Positives = 30/67 (43%)
Frame = +2
Query: 561 LQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTIGN 620
++ C W L S + F ++ +++ ++ GG+ GKI G G++ N
Sbjct: 272 MKKCCLTSRSFTMW*LRSCLQMLLNSFDEFFCSIILEESWFLRIGGSL*GKIFGCGSLLN 451
Query: 621 SSISINN 627
S + NN
Sbjct: 452 SYMDCNN 472
>TC215893 similar to UP|Q6K674 (Q6K674) Translation initiation factor
IF-3-like, partial (58%)
Length = 1171
Score = 30.0 bits (66), Expect = 3.4
Identities = 12/31 (38%), Positives = 22/31 (70%)
Frame = +2
Query: 443 LKLLVRRLRSHQSQKNSSLRL*QSLIPRLKR 473
+KLL R+L++HQ ++ S L++ L+P+ R
Sbjct: 845 IKLLCRKLKNHQRKETSQLQMKSPLVPKHNR 937
>TC231477 similar to UP|Q6K722 (Q6K722) OTU-like cysteine protease-like,
partial (31%)
Length = 609
Score = 29.3 bits (64), Expect = 5.8
Identities = 16/49 (32%), Positives = 24/49 (48%)
Frame = -1
Query: 528 AFTLRYKGGKVKPPELTQKDP*RYGYLNLNWLKLQMCLRAREKQRSWYL 576
A TL KV PP L + *RY L ++ + + +C + +E W L
Sbjct: 366 AATLLMLSPKVPPPFLLSRLT*RYISLRIDGVVMHLCPKTKENHSFWLL 220
>TC212187 weakly similar to UP|Q6USA0 (Q6USA0) Transposase (Fragment),
partial (45%)
Length = 973
Score = 28.5 bits (62), Expect = 10.0
Identities = 14/41 (34%), Positives = 25/41 (60%), Gaps = 4/41 (9%)
Frame = -2
Query: 107 MNPLRRVNPLLCHLKESLQNLL----KPTKPVNLKKNHLME 143
++ LRR+ + C L SLQN+L +P++P K H+++
Sbjct: 435 LSTLRRMWKMQCRLT*SLQNILVSCAEPSQPAQTKVKHMLK 313
>AW203235
Length = 410
Score = 28.5 bits (62), Expect = 10.0
Identities = 13/34 (38%), Positives = 20/34 (58%)
Frame = +1
Query: 195 VISLLIALIFRKRSLKANPRNQASAPVNLESKSK 228
++ L+ L+ RKR K+ P ++ A L SKSK
Sbjct: 127 MLQLMFCLLIRKRKAKSEPESKPQANPLLRSKSK 228
>TC215701 similar to GB|CAA89201.2|6469340|ATANTMR adenine nucleotide
translocase {Arabidopsis thaliana;} , partial (42%)
Length = 1100
Score = 28.5 bits (62), Expect = 10.0
Identities = 12/18 (66%), Positives = 13/18 (71%)
Frame = -1
Query: 574 WYLDSGCSRHMTGEKSLF 591
WYLDSGCS H+ E LF
Sbjct: 758 WYLDSGCSFHLFLEFFLF 705
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.351 0.154 0.514
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 29,082,621
Number of Sequences: 63676
Number of extensions: 396349
Number of successful extensions: 3461
Number of sequences better than 10.0: 34
Number of HSP's better than 10.0 without gapping: 2525
Number of HSP's successfully gapped in prelim test: 94
Number of HSP's that attempted gapping in prelim test: 899
Number of HSP's gapped (non-prelim): 2667
length of query: 659
length of database: 12,639,632
effective HSP length: 103
effective length of query: 556
effective length of database: 6,081,004
effective search space: 3381038224
effective search space used: 3381038224
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.9 bits)
S2: 62 (28.5 bits)
Medicago: description of AC141862.21