
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0039.4
(1435 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AU089582 109 9e-39
AV427422 143 2e-34
TC19412 similar to UP|Q84KB0 (Q84KB0) Pol protein, partial (7%) 95 3e-23
BP054462 95 7e-20
BE122516 95 9e-20
BP055130 95 9e-20
BG662087 93 3e-19
BF177840 92 8e-19
TC18927 similar to PIR|AI2934|AI2934 chromate transport protein ... 74 2e-13
AV410603 74 2e-13
BP046863 60 3e-09
TC11573 similar to UP|Q9M6N4 (Q9M6N4) Pol protein integrase regi... 52 7e-07
BP085845 47 3e-05
TC9039 41 0.002
TC18698 40 0.002
TC12574 38 0.013
BP055633 37 0.029
TC17929 37 0.029
BP064003 35 0.064
AV421607 35 0.084
>AU089582
Length = 383
Score = 109 bits (273), Expect(2) = 9e-39
Identities = 51/80 (63%), Positives = 63/80 (78%)
Frame = +2
Query: 1130 SEKLARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQS 1189
+ + A+IY+ EIV LHGVP +I+SDR +F S FW S ALGTRL +S+A+HPQ+DGQS
Sbjct: 11 ASQYAKIYLDEIVSLHGVPVSIISDRGAQFTSHFWRSFQTALGTRLKMSTAFHPQTDGQS 190
Query: 1190 ERTIQTLEDMLRACVLDYKG 1209
ERTIQ LEDMLRACV D +G
Sbjct: 191 ERTIQILEDMLRACVXDLRG 250
Score = 69.3 bits (168), Expect(2) = 9e-39
Identities = 28/42 (66%), Positives = 36/42 (85%)
Frame = +3
Query: 1208 KGSWEDFLPLAEFSYNNSYHSSLGMAPFEALYGRRCKTPLCW 1249
+GSW+ +L L EF+YNNSY SS+ MAPFEALYGRRC++P+ W
Sbjct: 246 EGSWDQYLSLMEFAYNNSYRSSI*MAPFEALYGRRCRSPIGW 371
>AV427422
Length = 417
Score = 143 bits (360), Expect = 2e-34
Identities = 67/139 (48%), Positives = 98/139 (70%)
Frame = +1
Query: 1096 SGLPRTTTGHDAIWVIVDRLTKSAHFIAVNMTFPSEKLARIYVKEIVRLHGVPANIVSDR 1155
+GLP++ G++A+ V+VDRL+K +HF+ + + ++ +A I+V+E+VRLHGVP +IVSDR
Sbjct: 4 TGLPKSK-GYEAVLVVVDRLSKFSHFVPLKHPYTAKVIADIFVREVVRLHGVPLSIVSDR 180
Query: 1156 DPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFL 1215
DP F+S FW L + GT+L +S+AYHP+SDGQ+E + LE LR + D SW ++
Sbjct: 181 DPLFMSNFWKELFKMQGTKLKMSTAYHPESDGQTEVVNRCLETYLRCFIADQPKSWAHWV 360
Query: 1216 PLAEFSYNNSYHSSLGMAP 1234
P AE+ YN SYH S G P
Sbjct: 361 PWAEYWYNTSYHVSTGQTP 417
>TC19412 similar to UP|Q84KB0 (Q84KB0) Pol protein, partial (7%)
Length = 519
Score = 95.1 bits (235), Expect(2) = 3e-23
Identities = 41/80 (51%), Positives = 64/80 (79%)
Frame = +3
Query: 1306 RVTPITGVGRSIHSKKLTPKYLGPYQILDRIGAVAYRIALPPSLSNLHDVFHISQLRKYL 1365
RV+P+ GV R KL+P+++GP+++L+R+G+V+YR+ALPP LS +H VFH+S LRKYL
Sbjct: 3 RVSPMKGVLRFGKKGKLSPRFIGPFEVLERVGSVSYRLALPPDLSAVHPVFHVSMLRKYL 182
Query: 1366 PDSSHVIEPDNIELEENLTY 1385
D SHVI ++++L+ +L+Y
Sbjct: 183 YDPSHVIRHEDVQLDVHLSY 242
Score = 32.0 bits (71), Expect(2) = 3e-23
Identities = 14/34 (41%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Frame = +2
Query: 1403 RTVPLVKLAW-SDDNQDATWELEESARKRYPSLF 1435
+ V VK+ W ++ATWE E+ R++YP LF
Sbjct: 296 KDVGSVKVLWRGPSGEEATWEAEDIMREKYPHLF 397
>BP054462
Length = 422
Score = 95.1 bits (235), Expect = 7e-20
Identities = 56/143 (39%), Positives = 86/143 (59%), Gaps = 3/143 (2%)
Frame = +1
Query: 1223 NNSYHSSLGMAPFEALYGRRCKTPLCWLSGEDKITLGPELLQEMTEKVRSIREKLRIAQD 1282
N++Y+ S M+PF+ALYGR L + KI +L E + +R L +QD
Sbjct: 1 NSNYNRSAKMSPFQALYGREPPVLLQGTTIPSKIAAVNDLQVGRDELLSDLRANLLKSQD 180
Query: 1283 RQKSYYDKRHKPLEFQEGDHVFLRVTPITGVGRSIHSK---KLTPKYLGPYQILDRIGAV 1339
++Y +K+ + +++Q GD VFL++ P RS+ K KL+P+Y GPY I+ +IGAV
Sbjct: 181 MMRTYANKKRRDVDYQIGDEVFLKLQPYR--RRSLAKKMNEKLSPRYYGPYPIVAKIGAV 354
Query: 1340 AYRIALPPSLSNLHDVFHISQLR 1362
AYR+ L P+ S +H VFH+S L+
Sbjct: 355 AYRLEL-PAHSRVHPVFHVSLLK 420
>BE122516
Length = 364
Score = 94.7 bits (234), Expect = 9e-20
Identities = 53/114 (46%), Positives = 73/114 (63%), Gaps = 6/114 (5%)
Frame = +2
Query: 416 LGMNWLSINNVLLDCRLRVPIFLQKYKEKHTASLPEK-----EPSAYLILFSSEGTKRPA 470
+GMNWL+ N+ L+CR + F + +K E + ++L + E K
Sbjct: 14 VGMNWLTANDATLNCRKKTVTFGTSEGDAKRVKRTDKVGKASECESDVLLGALETDKSDT 193
Query: 471 -MEDIPVVREFPEVFPEDMTELPPEREVEFAIDVIPGTTPISAAPYRISPLELA 523
+E IPVVREF +VFPE+++ELPPEREVEF+ID +PGT PIS APYR+S +ELA
Sbjct: 194 GVEGIPVVREFSDVFPEEVSELPPEREVEFSID*VPGTGPISIAPYRMSLVELA 355
>BP055130
Length = 567
Score = 94.7 bits (234), Expect = 9e-20
Identities = 49/112 (43%), Positives = 66/112 (58%)
Frame = +2
Query: 1104 GHDAIWVIVDRLTKSAHFIAVNMTFPSEKLARIYVKEIVRLHGVPANIVSDRDPRFVSKF 1163
G I V+VDRL+K HF + S ++A +V IV+LHG+P IVSDRD F S F
Sbjct: 179 GFTVIIVVVDRLSKYGHFAPHRANYTSSQVAETFVSTIVKLHGMPRAIVSDRDKAFTSAF 358
Query: 1164 WGSLHEALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFL 1215
W + GT L++SS+YHPQ+DGQ+E + LE LR V + W +L
Sbjct: 359 WKHFFKLHGTTLNMSSSYHPQTDGQTEALNKCLELYLRCFVHETPRLWVSYL 514
>BG662087
Length = 373
Score = 92.8 bits (229), Expect = 3e-19
Identities = 46/119 (38%), Positives = 69/119 (57%)
Frame = +1
Query: 557 GSMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQK 616
G R+ VDY LNK K+ YPLP ID L+D + S +D SGYHQI++ D K
Sbjct: 16 GKWRMWVDYTDLNKACPKDSYPLPSIDKLVDGASDNELLSLMDAYSGYHQIKMHPSDEDK 195
Query: 617 TAFRTRYGHYEYLVMPFGVTNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEH 675
TAF T +Y Y +PFG+ NA A + M+R+F + + + V++D++++ S + H
Sbjct: 196 TAFMTARVNYCYQTIPFGLKNAGATYQXLMDRVFXDXVGRNMEVYLDNMIVKSALRANH 372
>BF177840
Length = 410
Score = 91.7 bits (226), Expect = 8e-19
Identities = 50/130 (38%), Positives = 76/130 (58%)
Frame = +2
Query: 1150 NIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKG 1209
+IVSDRD +F+S FW +L +GT+L S+ HPQ+DGQ+E +TL +LR+ +
Sbjct: 14 SIVSDRDTKFISHFWRTLWGKVGTKLLYSTTCHPQTDGQTEVVNKTLSTLLRSVLERNLK 193
Query: 1210 SWEDFLPLAEFSYNNSYHSSLGMAPFEALYGRRCKTPLCWLSGEDKITLGPELLQEMTEK 1269
WE +LP EF+YN HS+ +PFE +YG TPL L + L + + E
Sbjct: 194 MWETWLPHIEFAYNRVVHSTTKHSPFEIVYGYNPLTPLDLLPMPNTYLLKHKDGKAKAEF 373
Query: 1270 VRSIREKLRI 1279
V+ + EK+++
Sbjct: 374 VKRLHEKIKL 403
>TC18927 similar to PIR|AI2934|AI2934 chromate transport protein chrA
[imported] - Agrobacterium tumefaciens
(strain C58, Dupont) {Agrobacterium tumefaciens;},
partial (6%)
Length = 561
Score = 73.9 bits (180), Expect = 2e-13
Identities = 48/157 (30%), Positives = 82/157 (51%), Gaps = 18/157 (11%)
Frame = -2
Query: 215 NKTSKRREERKKPYSPRNY-KPELQNRNYGGA--------RPTNPN-SHVTCYRCGKEGH 264
N+++ R +R K + + + +P+ + + G + RPT + S + C+RC K+GH
Sbjct: 482 NRSAPGRFDRNKSFQKKPFQRPQNRGTSSGYSHSFGNFVPRPTQSDTSEIVCHRCSKKGH 303
Query: 265 KS--------WSCTATTTSGQTNLKPPVVGGTNTTSAPRNSAGASAQKPGRPVNKGKVFA 316
+ W+C T SG+ P V TN +A R + A+ K RPV +V+
Sbjct: 302 FANRCPDLVCWNCQKTGHSGKDCTNPKVEAATNAIAARRPAPAANKGK--RPVASARVYT 129
Query: 317 MTGAEANTSEDFIQGTCFLCDISLVVLYDSGATHSFI 353
++GAE++ ++ I+ + L +L+DSGATHSFI
Sbjct: 128 VSGAESHRADGLIRSVGSVNCKPLTILFDSGATHSFI 18
>AV410603
Length = 162
Score = 73.9 bits (180), Expect = 2e-13
Identities = 30/53 (56%), Positives = 44/53 (82%)
Frame = +1
Query: 572 TIKNRYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYG 624
T+K+ +P+P +D+L+D+L+G++ FSK+DLRSGYHQI VK +D KT FRT +G
Sbjct: 4 TVKDSFPMPTVDELLDELRGSQFFSKLDLRSGYHQILVKPEDRHKTVFRTHHG 162
>BP046863
Length = 580
Score = 59.7 bits (143), Expect = 3e-09
Identities = 34/104 (32%), Positives = 65/104 (61%), Gaps = 2/104 (1%)
Frame = +1
Query: 1262 LLQEMTEKVRSIREKLRIAQDRQKSYYDKRHKPLEFQEGDHVFLRVTP--ITGVGRSIHS 1319
L++E + +R L AQD+ ++ +K + +++Q G+ VFL++ P + + + +
Sbjct: 274 LIEERDALLLELRGNLLKAQDQMRAQANKHRRYVDYQVGNWVFLKLQPYKLQNLAQR-KN 450
Query: 1320 KKLTPKYLGPYQILDRIGAVAYRIALPPSLSNLHDVFHISQLRK 1363
+KL+P++ GP+++L+R+ VAY + L S S +H VFH+S L K
Sbjct: 451 QKLSPRFYGPFKVLERVVQVAY*LDL-XSESRVHPVFHLSLLEK 579
>TC11573 similar to UP|Q9M6N4 (Q9M6N4) Pol protein integrase region
(Fragment), partial (10%)
Length = 572
Score = 52.0 bits (123), Expect = 7e-07
Identities = 27/90 (30%), Positives = 46/90 (51%)
Frame = +2
Query: 1158 RFVSKFWGSLHEALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFLPL 1217
+F SK + +G ++ SS HPQ++GQ+E + + ++ + + +G W D LP+
Sbjct: 20 QFTSKQTQDFCDGMGIQMRFSSVKHPQTNGQTEAANKVILKGIKRRLYEAEGRWIDELPI 199
Query: 1218 AEFSYNNSYHSSLGMAPFEALYGRRCKTPL 1247
+SYN SS+ PF YG P+
Sbjct: 200 VLWSYNTMPQSSIKETPF*LTYGADTMLPV 289
>BP085845
Length = 464
Score = 46.6 bits (109), Expect = 3e-05
Identities = 24/93 (25%), Positives = 50/93 (52%)
Frame = -2
Query: 1320 KKLTPKYLGPYQILDRIGAVAYRIALPPSLSNLHDVFHISQLRKYLPDSSHVIEPDNIEL 1379
+K K +GP++IL+ + +AY + L H V H++ L + L D S I + + L
Sbjct: 415 RKTESKIIGPFKILEMVCLIAY*LTHSLYLLAAHIVLHVTLLWRNLYDQSQNICHEGVXL 236
Query: 1380 EENLTYPTQPVKILERREKQLRKRTVPLVKLAW 1412
E+ ++ P+ +++ + + +R + + VK+ W
Sbjct: 235 GEHWSHMEHPIVMVDMKVRCMRPKNIDDVKVIW 137
>TC9039
Length = 1218
Score = 40.8 bits (94), Expect = 0.002
Identities = 22/72 (30%), Positives = 32/72 (43%)
Frame = +2
Query: 199 EENDRARREYYKSSKFNKTSKRREERKKPYSPRNYKPELQNRNYGGARPTNPNSHVTCYR 258
+E +R ++E +S+ F TSK + +RKK P+N + P TCY
Sbjct: 695 QEEERLKQERKESAHFVSTSKDKGKRKKTVEPKNEAAD-------APAPKKQKEDDTCYF 853
Query: 259 CGKEGHKSWSCT 270
C GH CT
Sbjct: 854 CNVSGHMKKKCT 889
>TC18698
Length = 808
Score = 40.4 bits (93), Expect = 0.002
Identities = 19/61 (31%), Positives = 34/61 (55%)
Frame = -2
Query: 615 QKTAFRTRYGHYEYLVMPFGVTNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEE 674
+KT + +Y Y VMP G+ N + M++IFH + K V V+++D+++ S +
Sbjct: 804 KKTTLKINRVNYYYQVMPLGLKNI*TTYQRLMDKIFHKQI*KNVEVYVEDMIVKSSQE*F 625
Query: 675 H 675
H
Sbjct: 624 H 622
>TC12574
Length = 325
Score = 37.7 bits (86), Expect = 0.013
Identities = 16/33 (48%), Positives = 24/33 (72%)
Frame = +2
Query: 642 FMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEE 674
F + +N IF + + F+IVFI+DIL Y++ KEE
Sbjct: 2 FKNSVNHIFESFFEHFMIVFINDILSYTEDKEE 100
>BP055633
Length = 528
Score = 36.6 bits (83), Expect = 0.029
Identities = 37/137 (27%), Positives = 59/137 (43%), Gaps = 3/137 (2%)
Frame = -2
Query: 101 VLTWSIFKEAFLEKYFPADVKGKKETEFLELKQG---EMFVGQYAARFEELSQFHPYYGT 157
V TW+ FKEA LE+ F L LKQ E FVGQ+ RF + +
Sbjct: 527 VTTWTKFKEAMLEQ-FQLTSNSSPFAALLALKQEGSVEEFVGQF-ERFAGMLK------- 375
Query: 158 TADDASKCIRFECGLRPDIRAAIGHQQIRTFTVLVEKCRIFEENDRARREYYKSSKFNKT 217
D+ F GL+ +I A I + ++ T++V+K + E+ + A + T
Sbjct: 374 GIDEEHYMDIFVNGLKEEIAAEIKLYEPKSLTIMVKKALMVEQKNLA----VSKAGIGST 207
Query: 218 SKRREERKKPYSPRNYK 234
S+ K P+ ++
Sbjct: 206 SRYNSSFKPPFRSTTFQ 156
>TC17929
Length = 791
Score = 36.6 bits (83), Expect = 0.029
Identities = 14/32 (43%), Positives = 19/32 (58%)
Frame = +2
Query: 246 RPTNPNSHVTCYRCGKEGHKSWSCTATTTSGQ 277
RP + TCYRCG+ GHK +C +SG+
Sbjct: 32 RPNDSKFRQTCYRCGESGHKMRNCPKEHSSGE 127
>BP064003
Length = 509
Score = 35.4 bits (80), Expect = 0.064
Identities = 28/95 (29%), Positives = 46/95 (47%), Gaps = 3/95 (3%)
Frame = +1
Query: 864 GVKFTIYSDHQSLKYLFDQKTLNMRQRRWVEFLED---YDFKLQYHPGKANVVADALSRK 920
G +F I +D+ + QK R R W +FL + ++Q ++N VADALSRK
Sbjct: 115 GSRFVIKTDNVATSSFLTQKRAPTRAR-WQDFLSGGVRHGPRVQAREDQSNKVADALSRK 291
Query: 921 SLHAARLMIEETELIEKFRDMNLIMETLPQGTRLG 955
+ A + +E+ DM+ + +P+ R G
Sbjct: 292 AELA-------SNKLEEIADMSQLKGAIPERIREG 375
>AV421607
Length = 245
Score = 35.0 bits (79), Expect = 0.084
Identities = 20/59 (33%), Positives = 28/59 (46%), Gaps = 1/59 (1%)
Frame = +3
Query: 256 CYRCGKEGHKSWSCTATTTSGQTNLKPPVVGGTNTTSAPRNSAGASAQ-KPGRPVNKGK 313
CY+CG+ GH S C ++ + N P NTT+ P +S+ KP V K K
Sbjct: 54 CYKCGRPGHWSRDCPSSAPNSNPNPNP------NTTTTPNPLLPSSSSFKPRSAVEKPK 212
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.319 0.136 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,788,784
Number of Sequences: 28460
Number of extensions: 343948
Number of successful extensions: 1653
Number of sequences better than 10.0: 52
Number of HSP's better than 10.0 without gapping: 1634
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1646
length of query: 1435
length of database: 4,897,600
effective HSP length: 102
effective length of query: 1333
effective length of database: 1,994,680
effective search space: 2658908440
effective search space used: 2658908440
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)
Lotus: description of TM0039.4