
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0028b.2
(533 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BP066094 131 4e-35
AV780009 121 3e-28
BP084332 102 2e-22
TC12832 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-rela... 89 2e-18
BP075943 86 1e-17
BP057233 77 5e-15
BP043850 71 5e-13
TC17418 similar to UP|Q94AX2 (Q94AX2) AT5g39790/MKM21_80, partia... 67 5e-12
TC16929 weakly similar to UP|Q9LFY6 (Q9LFY6) T7N9.5, partial (4%) 67 7e-12
TC15664 weakly similar to UP|O81617 (O81617) F8M12.17 protein, p... 62 2e-10
TC10186 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprot... 61 4e-10
AV766665 59 2e-09
AV424544 59 2e-09
TC8952 similar to UP|O82607 (O82607) T2L5.9 protein, partial (3%) 57 7e-09
TC19389 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-rela... 54 8e-08
BP084259 40 0.001
AV767042 39 0.002
TC19474 weakly similar to UP|Q9FJA1 (Q9FJA1) Similarity to retro... 33 0.003
TC10484 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragm... 33 0.028
TC9992 35 0.040
>BP066094
Length = 532
Score = 131 bits (329), Expect(2) = 4e-35
Identities = 65/151 (43%), Positives = 102/151 (67%)
Frame = +2
Query: 98 ARIETIRLVVSIASARNWPIFQLDVKSAFLNGPLEEEVYVSQPPGFEIQGKENSVFKLSK 157
++ E IRL++S + N + Q++VKSAFLNG + EEVYV QPPG E + + +FKL K
Sbjct: 86 SKTEAIRLLISFSVNHNIILHQMNVKSAFLNGYISEEVYVHQPPGXEDEKNSDHIFKLKK 265
Query: 158 ALYGLKQAPRAWNKRIDGFLLQLKFHRCSVEYGVYVRKSHGDTGLLLICLYVDDLLITGS 217
+LYGLKQAPRAW +R+ FLL+ + R V+ ++ + D +L++ +YVDD++ +
Sbjct: 266 SLYGLKQAPRAWYERLSSFLLENEXVRGKVDTTLFCKTYKDD--ILIVQIYVDDIIFGSA 439
Query: 218 DPKELDELKQILKSEFEMSDLGKLGYFLGMQ 248
+P E +++++EFEM +G+L YFLG+Q
Sbjct: 440 NPSLCKEFSEMMQAEFEMRMMGELKYFLGIQ 532
Score = 34.3 bits (77), Expect(2) = 4e-35
Identities = 16/26 (61%), Positives = 20/26 (76%)
Frame = +3
Query: 74 QGKACG*RFSTKPGLDYSEVFAPVAR 99
QGKA R+S + G+DY+E FAPVAR
Sbjct: 15 QGKASCSRYSQQ*GIDYTETFAPVAR 92
>AV780009
Length = 529
Score = 121 bits (303), Expect = 3e-28
Identities = 67/167 (40%), Positives = 95/167 (56%)
Frame = -1
Query: 340 AAKRILRYLQGTTDYGILFPRQKKLGKELELVGYTDSDYSGDLDDRKSTSGYLFQLHGAP 399
AA R+LRY++G G+ F L +L Y+DSD++G D R+S +GY L +
Sbjct: 526 AATRVLRYVKGAPAQGLFFSADSPL----KLQAYSDSDWAGCPDTRRSVTGYSIFLGTSL 359
Query: 400 ISWTSKKQPVIALSSCEAEYIAGSFAACQAVWLEELLKEMGFVTKKPMELMVDNISAISL 459
ISW +KKQ ++ SS EAEY A + C+ WL L + + P+ L DN SA+ +
Sbjct: 358 ISWRTKKQTTVSRSSSEAEYRALAATVCEVQWLSYLFQFLKLNVPLPVPLFCDNQSALHI 179
Query: 460 AKNPISHGRSKHIEVRFHFLREQVNGGKLELVYCPTDEQKADIFTKP 506
A NP H R+KHIE+ H +R ++ G + L+ T Q ADIFTKP
Sbjct: 178 AHNPTFHERTKHIELDCHVVRAKLQAGLIHLLPISTHHQLADIFTKP 38
>BP084332
Length = 368
Score = 102 bits (253), Expect = 2e-22
Identities = 51/113 (45%), Positives = 75/113 (66%)
Frame = +2
Query: 407 QPVIALSSCEAEYIAGSFAACQAVWLEELLKEMGFVTKKPMELMVDNISAISLAKNPISH 466
Q IALS+ EAEYI+ + + Q +W++ L++ + + + + DN +AISL+KNPI H
Sbjct: 32 QSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQ-ILESNIPIYCDNTAAISLSKNPILH 208
Query: 467 GRSKHIEVRFHFLREQVNGGKLELVYCPTDEQKADIFTKPLKTERFVILRKEI 519
R+KHIEV++HF+R+ V G L L + TD Q ADIFTKPL +RF + K +
Sbjct: 209 SRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAEDRFNFILKNL 367
>TC12832 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease] , partial (9%)
Length = 747
Score = 89.0 bits (219), Expect = 2e-18
Identities = 50/147 (34%), Positives = 80/147 (54%), Gaps = 9/147 (6%)
Frame = +2
Query: 174 DGFLLQLKFHRCSVEYGVYVRKSHGDTGLLLICLYVDDLLITGSDPKELDELKQILKSEF 233
D F++ L ++R S ++ Y K D +++ LYVDD+L+ G + + ELK L EF
Sbjct: 5 DSFIMSLGYNRLSSDHCTY-HKRFDDNDFIILLLYVDDMLVVGPNKDRVQELKAQLAREF 181
Query: 234 EMSDLGKLGYFLGMQF--EYTAAGIFMYQQKYIKDILKRFNMGNCHPVTTPSEVNIKLRR 291
+M DLG LGMQ + I++ Q+ Y++ +L+RFNM +C+P++TP VN KL
Sbjct: 182 DMKDLGPANKILGMQIHRDRKDRRIWLSQKNYLQKVLRRFNMQDCNPISTPLPVNYKLSS 361
Query: 292 D-------DEGDVDASLYKQLVGSLRF 311
+ ++ Y VGSL +
Sbjct: 362 SMIPSSEAERMEMSRVPYASAVGSLMY 442
>BP075943
Length = 547
Score = 86.3 bits (212), Expect = 1e-17
Identities = 48/157 (30%), Positives = 81/157 (51%), Gaps = 2/157 (1%)
Frame = -3
Query: 205 ICLYVDDLLITGSDPKELDELKQILKSEFEMSDLGKLGYFLGMQFEYTAAGIFMYQQKYI 264
+ LYVDD+L+ G+D E+ +K L +F + DLG+ YFLG++ + +GI + Q+KY
Sbjct: 521 LLLYVDDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFLGLEIARSTSGIVLNQRKYA 342
Query: 265 KDILKRFNMGNCHPVTTPSEVNIKLRRDDEGD--VDASLYKQLVGSLRFLCNSRLDISYG 322
++ P N + + G D Y+++VG L +L +R DI++
Sbjct: 341 LQLISDSGHFGFQPRFYSHGTNSQTLGTNTGTPLTDIGSYRRIVGRLLYLNTTRPDITFA 162
Query: 323 VGLISKFMCAPKKSHLAAAKRILRYLQGTTDYGILFP 359
V +S+F+ AP H L+YL G+ G+ +P
Sbjct: 161 VNQLSQFLSAPTDIHEQQLTGFLKYL*GSPGSGLFYP 51
>BP057233
Length = 473
Score = 77.4 bits (189), Expect = 5e-15
Identities = 38/79 (48%), Positives = 54/79 (68%)
Frame = -2
Query: 444 KKPMELMVDNISAISLAKNPISHGRSKHIEVRFHFLREQVNGGKLELVYCPTDEQKADIF 503
+KP+ L DN+SA +LA NP+ H RSKHIE+ H++R+QV ++ + Y PT +Q AD
Sbjct: 457 RKPI-LWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNEVVVAYVPTTDQIADCL 281
Query: 504 TKPLKTERFVILRKEIGVV 522
TKPL RF LR ++GV+
Sbjct: 280 TKPLSHTRFSQLRDKLGVI 224
>BP043850
Length = 515
Score = 70.9 bits (172), Expect = 5e-13
Identities = 40/105 (38%), Positives = 58/105 (55%)
Frame = -1
Query: 428 QAVWLEELLKEMGFVTKKPMELMVDNISAISLAKNPISHGRSKHIEVRFHFLREQVNGGK 487
+ +WL LL E+GF+ +P L DN SAI +A NP+ H ++HIEV H +RE +
Sbjct: 512 EIIWLRGLLSELGFLQSQPTPLHADNTSAIQIAANPVYHEWTRHIEVDCHSVREAYDRRV 333
Query: 488 LELVYCPTDEQKADIFTKPLKTERFVILRKEIGVVSLGYLN*GGV 532
+ L + T Q ADI TK L +R L ++ +V *GG+
Sbjct: 332 ITLPHVSTSVQIADILTKSLTRQRHNFLVSKLMLVESPASI*GGM 198
>TC17418 similar to UP|Q94AX2 (Q94AX2) AT5g39790/MKM21_80, partial (20%)
Length = 739
Score = 67.4 bits (163), Expect = 5e-12
Identities = 59/171 (34%), Positives = 80/171 (46%), Gaps = 3/171 (1%)
Frame = -1
Query: 260 QQKYIKDILKRFNMGNCHPVTTP---SEVNIKLRRDDEGDVDASLYKQLVGSLRFLCNSR 316
Q+ Y+ DILK+F M N ++T E+ RR+ VD++ YK L+GS+R+L R
Sbjct: 739 QKIYVDDILKKFKMTNSKYISTTIGGKEIEAG-RRNGGKRVDSTYYKSLIGSVRYLNTVR 563
Query: 317 LDISYGVGLISKFMCAPKKSHLAAAKRILRYLQGTTDYGILFPRQKKLGKELELVGYTDS 376
DI GVGL S+FM P H A+R LRY++GT GI LV +
Sbjct: 562 SDIVCGVGLRSRFM-EP*DCH*QGAQRSLRYIKGTLKDGIFM-----------LVITM*N 419
Query: 377 DYSGDLDDRKSTSGYLFQLHGAPISWTSKKQPVIALSSCEAEYIAGSFAAC 427
+ R GY F L ISW+S + + CE I S C
Sbjct: 418 SLDTQIVTR---PGYAFHLGRGAISWSS---TISCCTLCE*SIIHSS*ELC 284
>TC16929 weakly similar to UP|Q9LFY6 (Q9LFY6) T7N9.5, partial (4%)
Length = 553
Score = 67.0 bits (162), Expect = 7e-12
Identities = 32/94 (34%), Positives = 57/94 (60%)
Frame = +1
Query: 431 WLEELLKEMGFVTKKPMELMVDNISAISLAKNPISHGRSKHIEVRFHFLREQVNGGKLEL 490
WL LL+++ ++P + DN SA +A NP+ H R+KHIE+ H +RE++ G + L
Sbjct: 4 WLTYLLQDLKVPFEQPALVYCDNNSARHIAANPVFHERTKHIEIDCHIVRERIQKGLIHL 183
Query: 491 VYCPTDEQKADIFTKPLKTERFVILRKEIGVVSL 524
+ + E ADI+TK L + F + ++G++++
Sbjct: 184 LPISSSEPLADIYTKALSPQNFHQICAKLGLINI 285
>TC15664 weakly similar to UP|O81617 (O81617) F8M12.17 protein, partial (4%)
Length = 670
Score = 62.4 bits (150), Expect = 2e-10
Identities = 35/107 (32%), Positives = 60/107 (55%)
Frame = +1
Query: 427 CQAVWLEELLKEMGFVTKKPMELMVDNISAISLAKNPISHGRSKHIEVRFHFLREQVNGG 486
C+ WL +L+++ P L D+ SA +A N + H R+KH+++ H +RE++
Sbjct: 1 CEL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAK 180
Query: 487 KLELVYCPTDEQKADIFTKPLKTERFVILRKEIGVVSLGYLN*GGVL 533
L+ + +Q ADI TKPL++ F L ++GV+++ GGVL
Sbjct: 181 LFHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSSACGGVL 321
>TC10186 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein,
partial (4%)
Length = 528
Score = 61.2 bits (147), Expect = 4e-10
Identities = 31/61 (50%), Positives = 41/61 (66%)
Frame = +3
Query: 463 PISHGRSKHIEVRFHFLREQVNGGKLELVYCPTDEQKADIFTKPLKTERFVILRKEIGVV 522
P+ + +IE ++HFLR+QV GK+ L +C TD Q ADI TK LKTERF +R + VV
Sbjct: 45 PMDVVNTLNIETKYHFLRDQVTKGKISLKHCGTDLQVADIMTKGLKTERFRNMRAMLNVV 224
Query: 523 S 523
S
Sbjct: 225 S 227
Score = 44.3 bits (103), Expect = 5e-05
Identities = 20/23 (86%), Positives = 21/23 (90%)
Frame = +2
Query: 451 VDNISAISLAKNPISHGRSKHIE 473
VDN SAI LAKNP+SHGRSKHIE
Sbjct: 2 VDNKSAIDLAKNPVSHGRSKHIE 70
>AV766665
Length = 601
Score = 58.9 bits (141), Expect = 2e-09
Identities = 47/136 (34%), Positives = 65/136 (47%), Gaps = 1/136 (0%)
Frame = +3
Query: 288 KLRRDDEGDVDASLYKQLVGSLRFLCNSRLDISYGVGLISKFMCAPKKSHLAA-AKRILR 346
K+ D G V Y L G + + L + GL+ K + S L RI
Sbjct: 216 KIHLCDRGCVSLIKYIYLKGDVSLI----LVRAQSEGLVQKTYGKNRWSSLKTFTTRIF* 383
Query: 347 YLQGTTDYGILFPRQKKLGKELELVGYTDSDYSGDLDDRKSTSGYLFQLHGAPISWTSKK 406
YL+ + G LF ++ K + G+T +DY G + DR ST GY L G ++W SK+
Sbjct: 384 YLKANSRRGPLFQKEGKSSMD----GFTYADYLGSIVDRLSTMGYYMFLSGNLVTWRSKQ 551
Query: 407 QPVIALSSCEAEYIAG 422
Q +IA SS EAE AG
Sbjct: 552 QNIIARSSGEAELRAG 599
>AV424544
Length = 276
Score = 58.9 bits (141), Expect = 2e-09
Identities = 26/65 (40%), Positives = 42/65 (64%)
Frame = +3
Query: 199 DTGLLLICLYVDDLLITGSDPKELDELKQILKSEFEMSDLGKLGYFLGMQFEYTAAGIFM 258
D +I +YVDDL++ G+D E+ +K L +F + DLG L YFLG++ ++ G+F+
Sbjct: 81 DASFTVILVYVDDLILAGNDLNEIQCVKNKLDIQFRIKDLGTLKYFLGLEVARSSCGLFL 260
Query: 259 YQQKY 263
Q+KY
Sbjct: 261 SQRKY 275
>TC8952 similar to UP|O82607 (O82607) T2L5.9 protein, partial (3%)
Length = 550
Score = 57.0 bits (136), Expect = 7e-09
Identities = 28/81 (34%), Positives = 49/81 (59%)
Frame = +3
Query: 427 CQAVWLEELLKEMGFVTKKPMELMVDNISAISLAKNPISHGRSKHIEVRFHFLREQVNGG 486
C+A+WL L ++ + + + DN SA+ LA N + H R+++IE+ H + +V G
Sbjct: 18 CEALWLTYALADLRIASLLLVVIYCDNRSALHLAANSVFHKRTENIEIDCHIV*VKVLFG 197
Query: 487 KLELVYCPTDEQKADIFTKPL 507
L L++ P+ +Q AD+FTK +
Sbjct: 198 ILHLLHVPSSDQVADVFTKTI 260
>TC19389 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease] , partial (6%)
Length = 498
Score = 53.5 bits (127), Expect = 8e-08
Identities = 35/129 (27%), Positives = 62/129 (47%)
Frame = +3
Query: 397 GAPISWTSKKQPVIALSSCEAEYIAGSFAACQAVWLEELLKEMGFVTKKPMELMVDNISA 456
G ++W S+ Q +ALS+ EAE+IA + A + +W++ L+ F + + +V +
Sbjct: 18 GGAVAWPSRLQKCVALSTAEAEFIAATEACHELLWMKNFLQNAWFHSHPILCCIVITKAL 197
Query: 457 ISLAKNPISHGRSKHIEVRFHFLREQVNGGKLELVYCPTDEQKADIFTKPLKTERFVILR 516
+LA+ + + LR+ +N LEL TD+ AD+ TK L E+ +
Sbjct: 198 FTLARILLFIQDPSTLMFVIIGLRDVLNSKLLELEKIHTDDDGADMMTKSLPREKLEVCD 377
Query: 517 KEIGVVSLG 525
G+ G
Sbjct: 378 MIAGMARFG 404
>BP084259
Length = 385
Score = 40.0 bits (92), Expect = 0.001
Identities = 22/65 (33%), Positives = 35/65 (53%), Gaps = 6/65 (9%)
Frame = -3
Query: 420 IAGSFA------ACQAVWLEELLKEMGFVTKKPMELMVDNISAISLAKNPISHGRSKHIE 473
IAGS+ A + +WL++LLKE+ F K + +NI+A + N + H KH+
Sbjct: 197 IAGSWTRTIGSTAAEFLWLQQLLKELQFSLPKVPCIFSNNINATYVCLNLVFHSHMKHVA 18
Query: 474 VRFHF 478
+ HF
Sbjct: 17 LDCHF 3
>AV767042
Length = 444
Score = 38.9 bits (89), Expect = 0.002
Identities = 17/41 (41%), Positives = 30/41 (72%)
Frame = -3
Query: 87 GLDYSEVFAPVARIETIRLVVSIASARNWPIFQLDVKSAFL 127
G+D ++ F+PV + IR V +IA +R+WPI QL+V++ ++
Sbjct: 340 GVDCNKTFSPVVKPAPIRTVFTIALSRSWPIPQLNVQNLWI 218
>TC19474 weakly similar to UP|Q9FJA1 (Q9FJA1) Similarity to retroelement pol
polyprotein, partial (3%)
Length = 517
Score = 32.7 bits (73), Expect(2) = 0.003
Identities = 25/83 (30%), Positives = 43/83 (51%), Gaps = 8/83 (9%)
Frame = -2
Query: 431 WLEELLKEMGFVTKKPMELMV--------DNISAISLAKNPISHGRSKHIEVRFHFLREQ 482
WL+ L + + KP +L+ DN +A+ + K S R+KHIE+ FHFL ++
Sbjct: 435 WLQPLXQSS--LLPKPRDLVDVSLITHT*DNKAALHMPKI*FSM-RAKHIEIDFHFL*DR 265
Query: 483 VNGGKLELVYCPTDEQKADIFTK 505
+ + +++Q D+FTK
Sbjct: 264 RLYLDISTRFVNSNDQLTDVFTK 196
Score = 24.6 bits (52), Expect(2) = 0.003
Identities = 10/20 (50%), Positives = 14/20 (70%)
Frame = -1
Query: 400 ISWTSKKQPVIALSSCEAEY 419
ISW SK+ ++A S EAE+
Sbjct: 502 ISWKSKETIIVARSRAEAEF 443
>TC10484 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment),
partial (20%)
Length = 479
Score = 33.1 bits (74), Expect(2) = 0.028
Identities = 14/35 (40%), Positives = 21/35 (60%)
Frame = +3
Query: 431 WLEELLKEMGFVTKKPMELMVDNISAISLAKNPIS 465
W+ ++L EMG PM L DN +A+ ++ NP S
Sbjct: 3 WVGQILSEMGIERISPMPLWCDNQAALHISSNPSS 107
Score = 20.8 bits (42), Expect(2) = 0.028
Identities = 8/12 (66%), Positives = 9/12 (74%)
Frame = +2
Query: 466 HGRSKHIEVRFH 477
H R+KHIEV H
Sbjct: 113 HERTKHIEVDCH 148
>TC9992
Length = 508
Score = 34.7 bits (78), Expect = 0.040
Identities = 14/52 (26%), Positives = 33/52 (62%)
Frame = +3
Query: 430 VWLEELLKEMGFVTKKPMELMVDNISAISLAKNPISHGRSKHIEVRFHFLRE 481
+W+++LL ++ + + VDN + IS++ +P+ ++ +++F+FLRE
Sbjct: 81 LWIQKLLLDLPMEPMEHPKFFVDN*AIISISAHPLFQRKTHPSKIQFYFLRE 236
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.331 0.146 0.454
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,773,529
Number of Sequences: 28460
Number of extensions: 114347
Number of successful extensions: 517
Number of sequences better than 10.0: 50
Number of HSP's better than 10.0 without gapping: 509
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 511
length of query: 533
length of database: 4,897,600
effective HSP length: 95
effective length of query: 438
effective length of database: 2,193,900
effective search space: 960928200
effective search space used: 960928200
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 57 (26.6 bits)
Lotus: description of TM0028b.2