
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC130802.9 + phase: 0 /pseudo
(937 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 171 9e-43
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 122 8e-28
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 91 2e-18
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia... 76 5e-15
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 73 4e-13
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 67 4e-11
BG644690 weakly similar to GP|18542179|gb putative pol protein {... 60 5e-09
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 57 3e-08
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi... 53 5e-07
BG587174 similar to PIR|A47759|A4775 retrovirus-related reverse ... 47 3e-05
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported... 37 0.027
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 34 0.23
CB894419 weakly similar to GP|10177935|db copia-type polyprotein... 32 1.5
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 31 1.9
TC91257 similar to GP|21434|emb|CAA36616.1|| ORF4 {Solanum tuber... 31 2.5
BG453964 30 3.3
TC80722 similar to GP|19699290|gb|AAL91256.1 At1g15070/F9L1_1 {A... 30 5.6
AL366944 30 5.6
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 171 bits (434), Expect = 9e-43
Identities = 93/126 (73%), Positives = 101/126 (79%)
Frame = +3
Query: 796 GNVDTRKSLVGFCVYSL*HDD*LEGKSTIRGGIIYNSSGVHRTC*RCEGGHMVERDDW*V 855
G +K +GFCVYSL HD *LEGKSTIRG II NSSGVH C R E HMVER DW*V
Sbjct: 117 GQCGHKKISIGFCVYSLWHDY*LEGKSTIRGDIINNSSGVHCLCRRGERCHMVERYDW*V 296
Query: 856 RNYSRMCEDTL**PKCHSLGKSSSVS*KDKAH*HSPALCQRHD*NKRDCG*ESGIGGESG 915
RNYSR+CEDTL** KCHSLG+SSSVS*+D AH*HS AL RHD* KRDCG ++GIG ESG
Sbjct: 297 RNYSRICEDTL**SKCHSLGESSSVS*ED*AH*HSLALY*RHD*IKRDCGGKNGIGRESG 476
Query: 916 GCVHQV 921
GCV+QV
Sbjct: 477 GCVYQV 494
Score = 92.4 bits (228), Expect = 7e-19
Identities = 44/51 (86%), Positives = 47/51 (91%)
Frame = +1
Query: 757 QALKWVLRYLNGSLKGGLKYTRAAQDEDALEGYVDADYAGNVDTRKSLVGF 807
QALKWVL+YLN SLK LKYT+AAQ+EDALEGYVDADYAGNVDTRKSL GF
Sbjct: 1 QALKWVLKYLNESLKSSLKYTKAAQEEDALEGYVDADYAGNVDTRKSLSGF 153
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 122 bits (305), Expect = 8e-28
Identities = 82/246 (33%), Positives = 133/246 (53%)
Frame = +2
Query: 565 EESLYGLKQSPREWYRRFDEFLLKTGFVRSGYDSCVYMLKKNEKVILYLLLYVDDILMAS 624
++S+YGLKQ+ R+WY + E L+ G+++S D ++ K + LL+YVDDI++A
Sbjct: 32 QKSIYGLKQASRQWYSKLSESLISFGYLQSSSDFSLFT-KFKDSSFTTLLVYVDDIVLAG 208
Query: 625 SSKDEIMKLKERLNGEFEMKDLGPAKRVLGIDIKRNRDKGELFLSQLGYLKKGGERFRMS 684
+ EI +K L F++KDLG + LG+++ R+ K + L+Q Y + E
Sbjct: 209 NDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARS--KQGILLNQRKYTLELLEDSGNL 382
Query: 685 NSKTVSTPLGHHTKLSIQQCPQSEDEKQLMEGTPYASGVGSIMYGMVCSRPDLAYAVSIV 744
K+ TP KL P DE T Y +G ++Y + +RPD+++AV +
Sbjct: 383 AVKSTLTPYDISLKLHNSDSPLYNDE------TQYRRLIGKLIY-LTTTRPDISFAVQQL 541
Query: 745 SRFMANPGIVHWQALKWVLRYLNGSLKGGLKYTRAAQDEDALEGYVDADYAGNVDTRKSL 804
S+F++ P VH+QA VL+YL + GL Y +A L + D+D+A TRKS+
Sbjct: 542 SQFVSKPQQVHYQAAIRVLQYLKTAPAKGLFY--SATSNLKLSSFADSDWATCPTTRKSV 715
Query: 805 VGFCVY 810
G+ V+
Sbjct: 716 TGYWVF 733
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 91.3 bits (225), Expect = 2e-18
Identities = 55/143 (38%), Positives = 81/143 (56%), Gaps = 1/143 (0%)
Frame = +1
Query: 666 LFLSQLGYLKKGGERFRMSNSKTVSTPLGHHTKLSIQQCPQSEDEKQL-MEGTPYASGVG 724
+++ Q Y+ ERF M S P+ KL +DE + ++ T Y VG
Sbjct: 28 IYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLI-------KDENGVKVDATKYKQIVG 186
Query: 725 SIMYGMVCSRPDLAYAVSIVSRFMANPGIVHWQALKWVLRYLNGSLKGGLKYTRAAQDED 784
+MY + +RPDL Y +S++SRFM P +H A+K VLRYLNG++ G+ Y R ++
Sbjct: 187 CLMY-LAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSEK- 360
Query: 785 ALEGYVDADYAGNVDTRKSLVGF 807
LE Y D+DYAG++D RKS G+
Sbjct: 361 -LEAYTDSDYAGDLDDRKSTSGY 426
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 75.9 bits (185), Expect(2) = 5e-15
Identities = 40/89 (44%), Positives = 58/89 (64%)
Frame = -3
Query: 620 ILMASSSKDEIMKLKERLNGEFEMKDLGPAKRVLGIDIKRNRDKGELFLSQLGYLKKGGE 679
+L+ S+ DEI LK R + E +MKDLGPAK+++G+ I ++ KG L LSQ+ Y+ + +
Sbjct: 271 LLVVGSNIDEIKNLKTRFSKEIDMKDLGPAKKIIGMQIMIDKQKGVL*LSQVEYITRVLQ 92
Query: 680 RFRMSNSKTVSTPLGHHTKLSIQQCPQSE 708
F M N+ VST L H LS +Q PQ+E
Sbjct: 91 IFNMGNAILVSTTLASHFCLSHEQSPQTE 5
Score = 23.9 bits (50), Expect(2) = 5e-15
Identities = 10/27 (37%), Positives = 15/27 (55%)
Frame = -2
Query: 565 EESLYGLKQSPREWYRRFDEFLLKTGF 591
++S+YGLKQ PR+ + GF
Sbjct: 371 KKSMYGLKQGPRQCI*SLKALCTRKGF 291
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 73.2 bits (178), Expect = 4e-13
Identities = 34/100 (34%), Positives = 58/100 (58%)
Frame = +3
Query: 565 EESLYGLKQSPREWYRRFDEFLLKTGFVRSGYDSCVYMLKKNEKVILYLLLYVDDILMAS 624
+ +LYGLKQ+PR WY R + + K GF + Y+ +++ IL + LYVDD++
Sbjct: 468 KRALYGLKQAPRAWYSRIEAYFTKEGFEKCPYEHTLFVKLSEGGKILIISLYVDDLIFIG 647
Query: 625 SSKDEIMKLKERLNGEFEMKDLGPAKRVLGIDIKRNRDKG 664
+ ++ + K+ + EF M DLG LG+++ +N +KG
Sbjct: 648 NDENMFEEFKKSMKKEFNMSDLGKMHYFLGVEVTQN-EKG 764
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 66.6 bits (161), Expect = 4e-11
Identities = 42/122 (34%), Positives = 65/122 (52%)
Frame = +1
Query: 573 QSPREWYRRFDEFLLKTGFVRSGYDSCVYMLKKNEKVILYLLLYVDDILMASSSKDEIMK 632
QSPR+W+ RF + K G+++ D +++ + L++YVDDI + I +
Sbjct: 1 QSPRDWFDRFT*VVKKFGYIQCQTDHAMFIKHSSTVKKAILIVYVDDIFLTGDHGK*IKR 180
Query: 633 LKERLNGEFEMKDLGPAKRVLGIDIKRNRDKGELFLSQLGYLKKGGERFRMSNSKTVSTP 692
LK L EFE+KDLG K LG+++ R + KG +SQ Y+ + RM KT+ P
Sbjct: 181 LKNLLAEEFEIKDLGNLKYFLGMEVARWK-KGS-SISQRKYVLDLLKETRMIGCKTIRDP 354
Query: 693 LG 694
G
Sbjct: 355 YG 360
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 59.7 bits (143), Expect = 5e-09
Identities = 28/55 (50%), Positives = 40/55 (71%)
Frame = -2
Query: 566 ESLYGLKQSPREWYRRFDEFLLKTGFVRSGYDSCVYMLKKNEKVILYLLLYVDDI 620
++LYGLKQ+PR WY R +FLLK GF R D+ +++LK+ E +L + +YVDDI
Sbjct: 163 KTLYGLKQAPRAWYERLSKFLLKNGFKRGKIDNTLFLLKR-E*ELLIIQVYVDDI 2
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 57.4 bits (137), Expect = 3e-08
Identities = 37/108 (34%), Positives = 61/108 (56%), Gaps = 8/108 (7%)
Frame = +1
Query: 708 EDEKQLMEGTPYASGVG------SIMYGM-VCSRPDLAYAVSIVSRFMANPGIVHWQALK 760
E E+ L +G P S +G + G+ +C RPD+ Y+VS++S+FM +P H A
Sbjct: 22 EQEQGLEKGEPQESRIGVVRS*IEVQVGINLC*RPDICYSVSVISKFMHDPRKPHLIAAN 201
Query: 761 WVLRYLNGSLKGGLKYTRAAQDE-DALEGYVDADYAGNVDTRKSLVGF 807
+LRY+ G+++ GL + A+ E L Y D+D+ G+ R+S G+
Sbjct: 202 RILRYVRGTMEYGLLFPYGAKSEVYELICYSDSDWCGD---RRSTSGY 336
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 53.1 bits (126), Expect = 5e-07
Identities = 27/61 (44%), Positives = 37/61 (60%)
Frame = +1
Query: 566 ESLYGLKQSPREWYRRFDEFLLKTGFVRSGYDSCVYMLKKNEKVILYLLLYVDDILMASS 625
+SLYGLKQ+PR WY ++ GF +S D + + +N I YL +YVDDIL+ S
Sbjct: 490 KSLYGLKQAPRAWYEXLTSAQIQFGFTKSRCDPSLLIYNQNGACI-YLXIYVDDILITGS 666
Query: 626 S 626
S
Sbjct: 667 S 669
>BG587174 similar to PIR|A47759|A4775 retrovirus-related reverse
transcriptase homolog - rape retrotransposon copia-like
(fragment), partial (84%)
Length = 249
Score = 47.4 bits (111), Expect = 3e-05
Identities = 25/58 (43%), Positives = 38/58 (65%), Gaps = 2/58 (3%)
Frame = -1
Query: 566 ESLYGLKQSPREWYRRFDEFLLKTGFVRSGYDSCVYMLKKNEKV--ILYLLLYVDDIL 621
+SLYGLKQSPR+WY+RFD + ++ + +G V ++ +YL+LYVDD+L
Sbjct: 168 KSLYGLKQSPRQWYKRFDSY--RSSWATTGVLMTVVST*TR*RMSRYIYLVLYVDDML 1
>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
Arabidopsis thaliana, partial (17%)
Length = 618
Score = 37.4 bits (85), Expect = 0.027
Identities = 16/39 (41%), Positives = 24/39 (61%)
Frame = -1
Query: 565 EESLYGLKQSPREWYRRFDEFLLKTGFVRSGYDSCVYML 603
++++YGLKQSPR WY + L GF +S D ++ L
Sbjct: 129 KKAIYGLKQSPRAWYNKLSTTLNGRGFRKSELDHTLFTL 13
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 34.3 bits (77), Expect = 0.23
Identities = 17/49 (34%), Positives = 29/49 (58%)
Frame = +1
Query: 759 LKWVLRYLNGSLKGGLKYTRAAQDEDALEGYVDADYAGNVDTRKSLVGF 807
+K ++RY+ G+ + + E + GYVD+D+AG+ D RKS G+
Sbjct: 1 VKRIMRYIKGTSGVAVCF---GGSELTVRGYVDSDFAGDHDKRKSTTGY 138
>CB894419 weakly similar to GP|10177935|db copia-type polyprotein
{Arabidopsis thaliana}, partial (2%)
Length = 170
Score = 31.6 bits (70), Expect = 1.5
Identities = 16/50 (32%), Positives = 29/50 (58%)
Frame = +3
Query: 679 ERFRMSNSKTVSTPLGHHTKLSIQQCPQSEDEKQLMEGTPYASGVGSIMY 728
++F+M +SK +STP+ KL+ E + + ++ T Y S +GS+ Y
Sbjct: 27 KKFKMEHSKPISTPVEEKLKLT------RESDGKRVDSTHYKSLIGSLRY 158
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 31.2 bits (69), Expect = 1.9
Identities = 35/121 (28%), Positives = 53/121 (42%)
Frame = +3
Query: 105 AFGSLGSF*DSYSWGRFLFSFYH**LL*ESVGLCFEKQK*HL*KVQRMAHSHRKSNGN*T 164
+F LG+F + W L+ YH** E +GL F K* +Q + +S S+
Sbjct: 108 SF*PLGTFKSYFLWRTPLYDDYH**FSSEGLGLFFAV*K*DFSHIQEVENSC*NSDREEC 287
Query: 165 KRFKN*QWPGVCFRAV**VLQVERNQEA*NRTKNTATKWPCGTHE*DTFGACEVYDSRSW 224
+ N V +**VL N +K + TK C T++ D+ +Y + W
Sbjct: 288 EEAHNR*LIRVL***L**VLHKSWYC*TQNHSKESPTKRCCRTNDQDST*ESSMYALKCW 467
Query: 225 V 225
V
Sbjct: 468 V 470
>TC91257 similar to GP|21434|emb|CAA36616.1|| ORF4 {Solanum tuberosum},
partial (8%)
Length = 854
Score = 30.8 bits (68), Expect = 2.5
Identities = 14/30 (46%), Positives = 22/30 (72%)
Frame = +3
Query: 719 YASGVGSIMYGMVCSRPDLAYAVSIVSRFM 748
Y VG + Y + +RPD++YAVS+VS+F+
Sbjct: 762 YRRLVGKLNY-LTMTRPDISYAVSVVSQFL 848
>BG453964
Length = 647
Score = 30.4 bits (67), Expect = 3.3
Identities = 17/50 (34%), Positives = 31/50 (62%)
Frame = -2
Query: 611 LYLLLYVDDILMASSSKDEIMKLKERLNGEFEMKDLGPAKRVLGIDIKRN 660
+YLL+Y+D+IL+ S ++ L++ L+ F+ KDL K GI + ++
Sbjct: 202 IYLLVYIDEILL-SIVIIPLVCLRQHLSNHFQTKDLDLFKYFSGIVVAQS 56
>TC80722 similar to GP|19699290|gb|AAL91256.1 At1g15070/F9L1_1 {Arabidopsis
thaliana}, partial (16%)
Length = 1001
Score = 29.6 bits (65), Expect = 5.6
Identities = 12/43 (27%), Positives = 21/43 (47%)
Frame = +3
Query: 731 VCSRPDLAYAVSIVSRFMANPGIVHWQALKWVLRYLNGSLKGG 773
VC + L + V + R ++ +Q W+L Y+N S + G
Sbjct: 663 VCQKESLGFLVKVKQRSCLPNYLIRFQNFWWILHYMNNSTR*G 791
>AL366944
Length = 275
Score = 29.6 bits (65), Expect = 5.6
Identities = 15/38 (39%), Positives = 22/38 (57%)
Frame = -3
Query: 734 RPDLAYAVSIVSRFMANPGIVHWQALKWVLRYLNGSLK 771
RP AVS+VS+F + H L W+L+Y+ G+ K
Sbjct: 117 RPKHFIAVSLVSQF*NSSSQRHKDVLIWMLKYIEGARK 4
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.356 0.161 0.595
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 28,415,973
Number of Sequences: 36976
Number of extensions: 414619
Number of successful extensions: 3350
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 981
Number of HSP's successfully gapped in prelim test: 104
Number of HSP's that attempted gapping in prelim test: 2295
Number of HSP's gapped (non-prelim): 1185
length of query: 937
length of database: 9,014,727
effective HSP length: 105
effective length of query: 832
effective length of database: 5,132,247
effective search space: 4270029504
effective search space used: 4270029504
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.6 bits)
S2: 63 (28.9 bits)
Medicago: description of AC130802.9