
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0166.12
(1518 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AV780009 106 3e-23
BP084332 97 1e-20
BP075943 84 2e-16
BP052632 82 6e-16
TC19389 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-rela... 48 2e-12
BP066094 63 4e-10
BP063694 59 4e-09
AV781229 55 8e-08
AU088980 55 1e-07
AV427921 50 2e-06
AV766665 48 1e-05
TC12832 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-rela... 48 1e-05
TC16929 weakly similar to UP|Q9LFY6 (Q9LFY6) T7N9.5, partial (4%) 47 2e-05
AV424544 47 3e-05
BP057233 45 9e-05
AV766088 44 1e-04
BP043850 44 1e-04
TC15664 weakly similar to UP|O81617 (O81617) F8M12.17 protein, p... 43 3e-04
BP033602 43 4e-04
TC11361 39 0.006
>AV780009
Length = 529
Score = 106 bits (264), Expect = 3e-23
Identities = 53/135 (39%), Positives = 84/135 (61%), Gaps = 2/135 (1%)
Frame = -1
Query: 1380 RIIKYINGTTDYGTFYAHNSNSNLISYCDADWAGNADDGKSTSGGCFFLGNNLISWFNKK 1439
R+++Y+ G G F++ +S L +Y D+DWAG D +S +G FLG +LISW KK
Sbjct: 517 RVLRYVKGAPAQGLFFSADSPLKLQAYSDSDWAGCPDTRRSVTGYSIFLGTSLISWRTKK 338
Query: 1440 QNCVSLSTTEAEYIAAGSWCTQLLWMKHMLR--EYNVKQDVMTLYCENLSAINISKNPIQ 1497
Q VS S++EAEY A + ++ W+ ++ + + NV V L+C+N SA++I+ NP
Sbjct: 337 QTTVSRSSSEAEYRALAATVCEVQWLSYLFQFLKLNVPLPV-PLFCDNQSALHIAHNPTF 161
Query: 1498 HSRTKHIDIRHHFIK 1512
H RTKHI++ H ++
Sbjct: 160 HERTKHIELDCHVVR 116
>BP084332
Length = 368
Score = 97.4 bits (241), Expect = 1e-20
Identities = 40/78 (51%), Positives = 61/78 (77%)
Frame = +2
Query: 1440 QNCVSLSTTEAEYIAAGSWCTQLLWMKHMLREYNVKQDVMTLYCENLSAINISKNPIQHS 1499
Q+ ++LST EAEYI+A TQ+LWMKH L +Y + + + +YC+N +AI++SKNPI HS
Sbjct: 32 QSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILESNIPIYCDNTAAISLSKNPILHS 211
Query: 1500 RTKHIDIRHHFIKDLVEE 1517
R KHI++++HFI+D V++
Sbjct: 212 RAKHIEVKYHFIRDYVQK 265
>BP075943
Length = 547
Score = 84.0 bits (206), Expect = 2e-16
Identities = 47/170 (27%), Positives = 86/170 (49%), Gaps = 1/170 (0%)
Frame = -3
Query: 1233 SDGGKVMIAQIYVDDIVFGGMSDHMVEHFVRHMKSEFEMSLVGELNYFPGLQVKQMQDSI 1292
+ G +YVDD++ G H ++ + +F + +GE YF GL++ + I
Sbjct: 545 ASSGSFTALLLYVDDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFLGLEIARSTSGI 366
Query: 1293 FISQSKYAK*LVKKFGLEGSTHKRTPAATHIKLTKDESGTDV-DQSLYRSMICSLLYLTA 1351
++Q KYA L+ G G + T+ + +GT + D YR ++ LLYL
Sbjct: 365 VLNQRKYALQLISDSGHFGFQPRFYSHGTNSQTLGTNTGTPLTDIGSYRRIVGRLLYLNT 186
Query: 1352 SRPDIAFSIDVCARYQAAPKESHLIHVKRIIKYINGTTDYGTFYAHNSNS 1401
+RPDI F+++ +++ +AP + H + +KY+ G+ G FY +S++
Sbjct: 185 TRPDITFAVNQLSQFLSAPTDIHEQQLTGFLKYL*GSPGSGLFYPASSST 36
>BP052632
Length = 489
Score = 82.0 bits (201), Expect = 6e-16
Identities = 40/74 (54%), Positives = 51/74 (68%)
Frame = -3
Query: 891 TLYELWKGKKPTVKYFYVFESRCYILTDREQRRKLDPKSDEGLFLGYSINSKAYRVFNSR 950
T YELW+G++PTV YF+ F + +IL + K D K D+ + LGYS NSKAY VFNSR
Sbjct: 484 TPYELWRGRRPTVSYFHPFGCK*FILNTNDNIGKFDNKFDKRILLGYSSNSKAYIVFNSR 305
Query: 951 TKTMMESINVVVDD 964
T+ + ESINV DD
Sbjct: 304 TQVVEESINVKFDD 263
>TC19389 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease] , partial (6%)
Length = 498
Score = 47.8 bits (112), Expect(2) = 2e-12
Identities = 20/38 (52%), Positives = 29/38 (75%)
Frame = +3
Query: 1433 ISWFNKKQNCVSLSTTEAEYIAAGSWCTQLLWMKHMLR 1470
++W ++ Q CV+LST EAE+IAA C +LLWMK+ L+
Sbjct: 27 VAWPSRLQKCVALSTAEAEFIAATEACHELLWMKNFLQ 140
Score = 42.4 bits (98), Expect(2) = 2e-12
Identities = 18/36 (50%), Positives = 25/36 (69%)
Frame = +2
Query: 1481 LYCENLSAINISKNPIQHSRTKHIDIRHHFIKDLVE 1516
LYC+N AI + +N HSR+KHID+R+H+ VE
Sbjct: 173 LYCDNQGAIYLGQNSTFHSRSKHIDVRYHWTT*CVE 280
>BP066094
Length = 532
Score = 62.8 bits (151), Expect = 4e-10
Identities = 28/60 (46%), Positives = 41/60 (67%)
Frame = +2
Query: 1225 IDKTLFVKSDGGKVMIAQIYVDDIVFGGMSDHMVEHFVRHMKSEFEMSLVGELNYFPGLQ 1284
+D TLF K+ ++I QIYVDDI+FG + + + F M++EFEM ++GEL YF G+Q
Sbjct: 353 VDTTLFCKTYKDDILIVQIYVDDIIFGSANPSLCKEFSEMMQAEFEMRMMGELKYFLGIQ 532
>BP063694
Length = 511
Score = 59.3 bits (142), Expect = 4e-09
Identities = 44/164 (26%), Positives = 73/164 (43%), Gaps = 10/164 (6%)
Frame = -2
Query: 636 VTNENEEVVMTGVRSKDNCYMWSPQNKAMSSMCLVSMEDEAKLWHQK*GHLNPTSMKKVI 695
+ N V+ R Y+ + ++A+ + +LWH + GH+N +K+ +
Sbjct: 486 IQNRKTGAVLGRXRCDQGLYVLNQDSQALLTTSSPLPRASFELWHSRLGHVNFDIIKQ-L 310
Query: 696 SKEAIRGIPRLKIEEDKVCGECQIGKQTKT----SHKKLTQIGTTRSLQLLHMDLMGPMH 751
K + + + + C CQ+ K + ++K+ + + L L+H DL GP
Sbjct: 309 HKHGCLDVSSI-LPKPICCTSCQMAKSKRLVFHDNNKRASAV-----LDLIHCDLRGPSP 148
Query: 752 KESFGGKRYVFVCVDDFSRFTWVNFLAEKSE------TFEVFRE 789
S G Y + VDDFSRFTW L KS+ F+VF E
Sbjct: 147 VASIDGFSYFVIFVDDFSRFTWFYPLKRKSDFSDVLLRFKVFME 16
>AV781229
Length = 208
Score = 55.1 bits (131), Expect = 8e-08
Identities = 25/32 (78%), Positives = 27/32 (84%)
Frame = +3
Query: 269 KSKGVQCHECEGYEHIKAECATYLKKLGKGMV 300
KSKGV CHE EGY HI++ECATYLKK KGMV
Sbjct: 108 KSKGV*CHEWEGYGHIRSECATYLKKQKKGMV 203
>AU088980
Length = 360
Score = 54.7 bits (130), Expect = 1e-07
Identities = 31/112 (27%), Positives = 52/112 (45%)
Frame = +2
Query: 840 NGVVERKNRTLQESARVMLHAKKVPYHLWAEAINTACYIHYRVTIRSGTDSTLYELWKGK 899
N +VERK+R + R ++ +P +LW A+ A Y+ + Y+
Sbjct: 2 NSIVERKHRHILNVTRALMFHSYLPKNLWTFAVKHAAYLINXLPSPLLKGQCPYQFLNND 181
Query: 900 KPTVKYFYVFESRCYILTDREQRRKLDPKSDEGLFLGYSINSKAYRVFNSRT 951
PT+ VF + CY T +KLDP++ + +FLG+ + Y V + T
Sbjct: 182 SPTLLDLKVFGTLCYSSTLTHNXQKLDPRARKCVFLGFKTGTXGYIVMDLPT 337
>AV427921
Length = 284
Score = 50.4 bits (119), Expect = 2e-06
Identities = 30/82 (36%), Positives = 42/82 (50%)
Frame = +1
Query: 832 SAPITPQQNGVVERKNRTLQESARVMLHAKKVPYHLWAEAINTACYIHYRVTIRSGTDST 891
+A TPQQNGV ERKNRT+ R +L +P EA+ + +I R +
Sbjct: 10 TAAYTPQQNGVSERKNRTILNMVRSLLTMSGLPKSFLPEAVMWSLHILNRSPTLVVQNMM 189
Query: 892 LYELWKGKKPTVKYFYVFESRC 913
E W G++P V +F + SRC
Sbjct: 190 PEEAWSGRQPAVDHFRI--SRC 249
>AV766665
Length = 601
Score = 48.1 bits (113), Expect = 1e-05
Identities = 24/72 (33%), Positives = 39/72 (53%)
Frame = +3
Query: 1380 RIIKYINGTTDYGTFYAHNSNSNLISYCDADWAGNADDGKSTSGGCFFLGNNLISWFNKK 1439
RI Y+ + G + S++ + AD+ G+ D ST G FL NL++W +K+
Sbjct: 372 RIF*YLKANSRRGPLFQKEGKSSMDGFTYADYLGSIVDRLSTMGYYMFLSGNLVTWRSKQ 551
Query: 1440 QNCVSLSTTEAE 1451
QN ++ S+ EAE
Sbjct: 552 QNIIARSSGEAE 587
>TC12832 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease] , partial (9%)
Length = 747
Score = 47.8 bits (112), Expect = 1e-05
Identities = 33/138 (23%), Positives = 65/138 (46%), Gaps = 9/138 (6%)
Frame = +2
Query: 1220 YNKRGIDKTLFVKS-DGGKVMIAQIYVDDIVFGGMSDHMVEHFVRHMKSEFEMSLVGELN 1278
YN+ D + K D +I +YVDD++ G + V+ + EF+M +G N
Sbjct: 29 YNRLSSDHCTYHKRFDDNDFIILLLYVDDMLVVGPNKDRVQELKAQLAREFDMKDLGPAN 208
Query: 1279 YFPGLQVKQMQDS--IFISQSKYAK*LVKKFGLEGSTHKRTPAATHIKL------TKDES 1330
G+Q+ + + I++SQ Y + ++++F ++ TP + KL + +
Sbjct: 209 KILGMQIHRDRKDRRIWLSQKNYLQKVLRRFNMQDCNPISTPLPVNYKLSSSMIPSSEAE 388
Query: 1331 GTDVDQSLYRSMICSLLY 1348
++ + Y S + SL+Y
Sbjct: 389 RMEMSRVPYASAVGSLMY 442
>TC16929 weakly similar to UP|Q9LFY6 (Q9LFY6) T7N9.5, partial (4%)
Length = 553
Score = 47.4 bits (111), Expect = 2e-05
Identities = 20/55 (36%), Positives = 37/55 (66%), Gaps = 1/55 (1%)
Frame = +1
Query: 1464 WMKHMLREYNVKQDVMTL-YCENLSAINISKNPIQHSRTKHIDIRHHFIKDLVEE 1517
W+ ++L++ V + L YC+N SA +I+ NP+ H RTKHI+I H +++ +++
Sbjct: 4 WLTYLLQDLKVPFEQPALVYCDNNSARHIAANPVFHERTKHIEIDCHIVRERIQK 168
>AV424544
Length = 276
Score = 46.6 bits (109), Expect = 3e-05
Identities = 23/80 (28%), Positives = 40/80 (49%)
Frame = +3
Query: 1220 YNKRGIDKTLFVKSDGGKVMIAQIYVDDIVFGGMSDHMVEHFVRHMKSEFEMSLVGELNY 1279
Y + D +LF K + +YVDD++ G + ++ + +F + +G L Y
Sbjct: 36 YIQSAHDHSLFTKFRDASFTVILVYVDDLILAGNDLNEIQCVKNKLDIQFRIKDLGTLKY 215
Query: 1280 FPGLQVKQMQDSIFISQSKY 1299
F GL+V + +F+SQ KY
Sbjct: 216 FLGLEVARSSCGLFLSQRKY 275
>BP057233
Length = 473
Score = 45.1 bits (105), Expect = 9e-05
Identities = 18/35 (51%), Positives = 28/35 (79%)
Frame = -2
Query: 1481 LYCENLSAINISKNPIQHSRTKHIDIRHHFIKDLV 1515
L+C+NLSA ++ NP+ H+R+KHI+I H+I+D V
Sbjct: 445 LWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQV 341
>AV766088
Length = 501
Score = 44.3 bits (103), Expect = 1e-04
Identities = 33/112 (29%), Positives = 48/112 (42%), Gaps = 5/112 (4%)
Frame = -3
Query: 678 LWHQK*GHLNPTSMK-----KVISKEAIRGIPRLKIEEDKVCGECQIGKQTKTSHKKLTQ 732
LWH + GH + ++++ K IS E + P VC C GK +
Sbjct: 307 LWHSRLGHPSSSALRYLRSNKFISYELLNYSP--------VCESCVFGKHVRLPFVSSNN 152
Query: 733 IGTTRSLQLLHMDLMGPMHKESFGGKRYVFVCVDDFSRFTWVNFLAEKSETF 784
+ T +LH DL S G + YV +DDF+ F W L+ KS+ F
Sbjct: 151 V-TVMPFDILHSDLWTSPVLSSAGHRFYVLF-LDDFTDFLWTFPLSNKSQVF 2
>BP043850
Length = 515
Score = 44.3 bits (103), Expect = 1e-04
Identities = 18/60 (30%), Positives = 37/60 (61%), Gaps = 1/60 (1%)
Frame = -1
Query: 1460 TQLLWMKHMLREYNVKQDVMT-LYCENLSAINISKNPIQHSRTKHIDIRHHFIKDLVEEK 1518
++++W++ +L E Q T L+ +N SAI I+ NP+ H T+HI++ H +++ + +
Sbjct: 515 SEIIWLRGLLSELGFLQSQPTPLHADNTSAIQIAANPVYHEWTRHIEVDCHSVREAYDRR 336
>TC15664 weakly similar to UP|O81617 (O81617) F8M12.17 protein, partial (4%)
Length = 670
Score = 43.1 bits (100), Expect = 3e-04
Identities = 20/59 (33%), Positives = 37/59 (61%), Gaps = 1/59 (1%)
Frame = +1
Query: 1461 QLLWMKHMLREYNVKQ-DVMTLYCENLSAINISKNPIQHSRTKHIDIRHHFIKDLVEEK 1518
+L W+ ++L++ V LYC++ SA +I+ N + H RTKH+DI H +++ ++ K
Sbjct: 4 EL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAK 180
>BP033602
Length = 533
Score = 42.7 bits (99), Expect = 4e-04
Identities = 33/148 (22%), Positives = 66/148 (44%), Gaps = 5/148 (3%)
Frame = +3
Query: 625 LNVSSNKAGCTVTNENEEVVMTGVRSKDNCYMWSP--QNKAMSSMC---LVSMEDEAKLW 679
+N+S ++ C + ++++ + + KD Y + +NKA +C S+ D+ LW
Sbjct: 105 INLSFIQS*CCYSGT*QKMIGSA-KMKDGLYYFEDTFRNKAAQGLCGMSCTSVRDQIMLW 281
Query: 680 HQK*GHLNPTSMKKVISKEAIRGIPRLKIEEDKVCGECQIGKQTKTSHKKLTQIGTTRSL 739
H + GH N ++ + + + + +E C C + K + + +R
Sbjct: 282 HNRLGHPNFQYLRHLFP-DLFKNVNCSSLE----CESCVLAKNQRAPYYS-QPYHASRPF 443
Query: 740 QLLHMDLMGPMHKESFGGKRYVFVCVDD 767
L+H D+ GP + GKR+ +DD
Sbjct: 444 YLIHSDVWGPSKITTQFGKRWFVTFIDD 527
>TC11361
Length = 549
Score = 38.9 bits (89), Expect = 0.006
Identities = 15/34 (44%), Positives = 20/34 (58%)
Frame = +3
Query: 487 CHHCGKHGHIRPYCYRLYGTPQHWRTTQMHTSTN 520
C HCG GH + CYRL G P H++ ++ S N
Sbjct: 42 CTHCGILGHTKDKCYRLIGFPPHYKKSKSVASVN 143
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.329 0.141 0.437
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,901,178
Number of Sequences: 28460
Number of extensions: 339153
Number of successful extensions: 2069
Number of sequences better than 10.0: 62
Number of HSP's better than 10.0 without gapping: 2022
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2059
length of query: 1518
length of database: 4,897,600
effective HSP length: 102
effective length of query: 1416
effective length of database: 1,994,680
effective search space: 2824466880
effective search space used: 2824466880
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 61 (28.1 bits)
Lotus: description of TM0166.12