
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0004b.4
(929 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC18927 similar to PIR|AI2934|AI2934 chromate transport protein ... 177 9e-45
BG662087 95 4e-20
BE122516 89 2e-18
AV410603 75 6e-14
AU251673 47 1e-05
TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti... 44 2e-04
TC18698 39 0.003
TC12574 38 0.008
TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti... 37 0.011
BI418821 37 0.011
TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, par... 33 0.16
AV779679 32 0.60
TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 30 1.3
TC12773 30 1.3
BP043519 30 1.7
TC9039 30 1.7
BP079571 26 2.1
TC11419 similar to GB|AAM15519.1|20198312|U78721 Expressed prote... 29 3.0
TC18767 29 3.0
BP062983 28 5.1
>TC18927 similar to PIR|AI2934|AI2934 chromate transport protein chrA
[imported] - Agrobacterium tumefaciens
(strain C58, Dupont) {Agrobacterium tumefaciens;},
partial (6%)
Length = 561
Score = 177 bits (448), Expect = 9e-45
Identities = 96/192 (50%), Positives = 118/192 (61%), Gaps = 10/192 (5%)
Frame = -2
Query: 230 KTKGIEAIDNARGKFQGLNKPYQGSGGPARTNQGRGDKGRHFQKKPYVRPQGRGTTSGSF 289
K K IEAIDN R + +P QGSGGP R+ GR D+ + FQKKP+ RPQ RGT+SG +
Sbjct: 560 KAKSIEAIDNLRSR--PAFRPNQGSGGPNRSAPGRFDRNKSFQKKPFQRPQNRGTSSG-Y 390
Query: 290 YPTGGNAIALRTPSGNREDVTCFRCNKKGHYANHCSESLAACWNCNKPGHTAAECRIPKV 349
+ GN + T S E + C RC+KKGH+AN C + + CWNC K GH+ +C PKV
Sbjct: 389 SHSFGNFVPRPTQSDTSE-IVCHRCSKKGHFANRCPDLV--CWNCQKTGHSGKDCTNPKV 219
Query: 350 EAAANVAGARRPT----------AGGRVYSISGTEAEEDDGLIRSTCEIAGNSLIALFDS 399
EAA N ARRP A RVY++SG E+ DGLIRS + L LFDS
Sbjct: 218 EAATNAIAARRPAPAANKGKRPVASARVYTVSGAESHRADGLIRSVGSVNCKPLTILFDS 39
Query: 400 GATHSFIDIACA 411
GATHSFID+ACA
Sbjct: 38 GATHSFIDLACA 3
>BG662087
Length = 373
Score = 95.1 bits (235), Expect = 4e-20
Identities = 48/119 (40%), Positives = 70/119 (58%)
Frame = +1
Query: 615 GRSRLCVDYRQLNKVTIKNRYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQK 674
G+ R+ VDY LNK K+ YPLP ID L+D + S +D SGYHQI++ D K
Sbjct: 16 GKWRMWVDYTDLNKACPKDSYPLPSIDKLVDGASDNELLSLMDAYSGYHQIKMHPSDEDK 195
Query: 675 TAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNREEH 733
TAF T +Y Y +PFG+ NA A + M+R+F + R + V++D++++ S R H
Sbjct: 196 TAFMTARVNYCYQTIPFGLKNAGATYQXLMDRVFXDXVGRNMEVYLDNMIVKSALRANH 372
>BE122516
Length = 364
Score = 89.4 bits (220), Expect = 2e-18
Identities = 48/113 (42%), Positives = 68/113 (59%), Gaps = 1/113 (0%)
Frame = +2
Query: 469 IGMDWLSHHHVLLDCANKVVIFPDA-GLAEFLNSYFSKLSLRKGALSSLMSTTVVEAKEN 527
+GM+WL+ + L+C K V F + G A+ + + L+ + +
Sbjct: 14 VGMNWLTANDATLNCRKKTVTFGTSEGDAKRVKRTDKVGKASECESDVLLGALETDKSDT 193
Query: 528 GVHGIAVVQDFEDVFPEDVPGIPPVRDMEFTIDIVPGTGPISIAPYRMAPAEL 580
GV GI VV++F DVFPE+V +PP R++EF+ID VPGTGPISIAPYRM+ EL
Sbjct: 194 GVEGIPVVREFSDVFPEEVSELPPEREVEFSID*VPGTGPISIAPYRMSLVEL 352
>AV410603
Length = 162
Score = 74.7 bits (182), Expect = 6e-14
Identities = 31/53 (58%), Positives = 43/53 (80%)
Frame = +1
Query: 630 TIKNRYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQKTAFRTRYG 682
T+K+ +P+P +D+L+D+L+G+ FSK+DLRSGYHQI VK ED KT FRT +G
Sbjct: 4 TVKDSFPMPTVDELLDELRGSQFFSKLDLRSGYHQILVKPEDRHKTVFRTHHG 162
>AU251673
Length = 413
Score = 47.4 bits (111), Expect = 1e-05
Identities = 32/109 (29%), Positives = 51/109 (46%), Gaps = 1/109 (0%)
Frame = +2
Query: 109 VDYETYLLTGEAEYWWRGARAMMEADHQAITWECFRGAFLDKYFPRSARAAKEAQFLRLR 168
V+ ++ L G A W+ TW F F++++ P+S R F RL
Sbjct: 23 VELASFQLEGVARDWYNVLTRAKPVGSPPWTWADFSAEFMNRFLPQSVRDGFVRDFERLE 202
Query: 169 QG-GMTVAEYAAKLESLAKHFRYFRGQIDEGYMCERFIEGLCYELQRAV 216
Q GMTV+EY+A L+++ Y + E +RF+ GL L ++V
Sbjct: 203 QAEGMTVSEYSAHFTHLSRYVPY---PLLEEERVKRFVRGLKEYLFKSV 340
>TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (26%)
Length = 555
Score = 43.5 bits (101), Expect = 2e-04
Identities = 18/41 (43%), Positives = 23/41 (55%)
Frame = +2
Query: 304 GNREDVTCFRCNKKGHYANHCSESLAACWNCNKPGHTAAEC 344
G D C C + GH+A C ++A C NC PGH A+EC
Sbjct: 323 GFSRDNLCKNCKRPGHFARECP-NVAICHNCGLPGHIASEC 442
Score = 42.7 bits (99), Expect = 3e-04
Identities = 15/34 (44%), Positives = 20/34 (58%)
Frame = +2
Query: 311 CFRCNKKGHYANHCSESLAACWNCNKPGHTAAEC 344
C C GH A+ C+ + CWNC +PGH A+ C
Sbjct: 401 CHNCGLPGHIASECTTK-SLCWNCKEPGHMASSC 499
>TC18698
Length = 808
Score = 39.3 bits (90), Expect = 0.003
Identities = 20/69 (28%), Positives = 36/69 (51%)
Frame = -2
Query: 673 QKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNREE 732
+KT + +Y Y VMP G+ N + M++IFH + + V V+++D+++ S
Sbjct: 804 KKTTLKINRVNYYYQVMPLGLKNI*TTYQRLMDKIFHKQI*KNVEVYVEDMIVKSSQE*F 625
Query: 733 HEEHLRQVL 741
H L + L
Sbjct: 624 HRGDLSRDL 598
>TC12574
Length = 325
Score = 37.7 bits (86), Expect = 0.008
Identities = 15/33 (45%), Positives = 24/33 (72%)
Frame = +2
Query: 700 FMDYMNRIFHPFLDRFVVVFIDDILIYSRNREE 732
F + +N IF F + F++VFI+DIL Y+ ++EE
Sbjct: 2 FKNSVNHIFESFFEHFMIVFINDILSYTEDKEE 100
>TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (9%)
Length = 450
Score = 37.4 bits (85), Expect = 0.011
Identities = 15/36 (41%), Positives = 17/36 (46%)
Frame = +3
Query: 309 VTCFRCNKKGHYANHCSESLAACWNCNKPGHTAAEC 344
V C C + GH + C L C NC GH A EC
Sbjct: 3 VVCRNCQQLGHMSRDCMGPLMICHNCGGRGHLAYEC 110
>BI418821
Length = 614
Score = 37.4 bits (85), Expect = 0.011
Identities = 18/55 (32%), Positives = 22/55 (39%), Gaps = 8/55 (14%)
Frame = +2
Query: 311 CFRCNKKGHYANHCSESL--------AACWNCNKPGHTAAECRIPKVEAAANVAG 357
C+ C GH A C S AAC+NC GH A +C + AG
Sbjct: 392 CYNCGDTGHLARDCHRSNNNGGGGGGAACYNCGDAGHLARDCNRSNNNSGGGGAG 556
Score = 37.0 bits (84), Expect = 0.014
Identities = 16/48 (33%), Positives = 20/48 (41%), Gaps = 7/48 (14%)
Frame = +2
Query: 304 GNREDVTCFRCNKKGHYANHCSESL-------AACWNCNKPGHTAAEC 344
G C+ C GH A C+ S A C+NC GH A +C
Sbjct: 455 GGGGGAACYNCGDAGHLARDCNRSNNNSGGGGAGCYNCGDTGHLARDC 598
>TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (62%)
Length = 684
Score = 33.5 bits (75), Expect = 0.16
Identities = 17/55 (30%), Positives = 22/55 (39%), Gaps = 11/55 (20%)
Frame = +2
Query: 302 PSGNREDV---------TCFRCNKKGHYANHC--SESLAACWNCNKPGHTAAECR 345
P GNRE + CF C GH+A C + C+ C GH C+
Sbjct: 317 PRGNREYLGRGPPPGSGRCFNCGLDGHWARDCKAGDWKNKCYRCGDRGHVERNCK 481
>AV779679
Length = 440
Score = 31.6 bits (70), Expect = 0.60
Identities = 14/28 (50%), Positives = 19/28 (67%)
Frame = +3
Query: 618 RLCVDYRQLNKVTIKNRYPLPRIDDLMD 645
+LC DY QL+ VTI N+ LP +D+ D
Sbjct: 351 QLCDDYMQLDYVTIPNKSLLPHLDEWSD 434
>TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana
{Arabidopsis thaliana;}, partial (89%)
Length = 912
Score = 30.4 bits (67), Expect = 1.3
Identities = 16/46 (34%), Positives = 23/46 (49%), Gaps = 4/46 (8%)
Frame = +1
Query: 331 CWNCNKPGHTAAECRI----PKVEAAANVAGARRPTAGGRVYSISG 372
C+ C +PGH A ECR+ + + + R P+ G R YS G
Sbjct: 379 CYECGEPGHFARECRMRGGSGRRRSRSPPRFRRSPSYGRRSYSPRG 516
>TC12773
Length = 420
Score = 30.4 bits (67), Expect = 1.3
Identities = 10/22 (45%), Positives = 13/22 (58%)
Frame = +2
Query: 325 SESLAACWNCNKPGHTAAECRI 346
S S CW C +PGH A +C +
Sbjct: 329 SLSTYECWKCQRPGHMAEDCLV 394
>BP043519
Length = 456
Score = 30.0 bits (66), Expect = 1.7
Identities = 21/60 (35%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Frame = +3
Query: 635 YPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQ-KTAFRTRYGHYEYLVMPFGV 693
Y LP + LMD+ GA D Y + D++ K AFR +YG + VMPF +
Sbjct: 237 YALPMLSILMDKGTGAVTSVPSDAPDDYMALL----DLKSKPAFRAKYGVKDEWVMPFEI 404
>TC9039
Length = 1218
Score = 30.0 bits (66), Expect = 1.7
Identities = 13/35 (37%), Positives = 17/35 (48%)
Frame = +2
Query: 302 PSGNREDVTCFRCNKKGHYANHCSESLAACWNCNK 336
P +ED TC+ CN GH C++ A W K
Sbjct: 818 PKKQKEDDTCYFCNVSGHMKKKCTKYHA--WRARK 916
>BP079571
Length = 414
Score = 25.8 bits (55), Expect(2) = 2.1
Identities = 9/22 (40%), Positives = 11/22 (49%)
Frame = -1
Query: 311 CFRCNKKGHYANHCSESLAACW 332
CFR + GH A C +A W
Sbjct: 357 CFRFGEVGHLARDCDGGVAVTW 292
Score = 22.3 bits (46), Expect(2) = 2.1
Identities = 6/9 (66%), Positives = 8/9 (88%)
Frame = -3
Query: 331 CWNCNKPGH 339
C++C KPGH
Sbjct: 262 CFHCGKPGH 236
>TC11419 similar to GB|AAM15519.1|20198312|U78721 Expressed protein
{Arabidopsis thaliana;}, partial (61%)
Length = 626
Score = 29.3 bits (64), Expect = 3.0
Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 13/90 (14%)
Frame = -1
Query: 378 DDGLIRSTCEIAGNSLIALFDSGATHSFIDIACAARLKLEVSKLP-----FELTVSTPA- 431
+ G++RSTC + + S THS I AA ++LP FELT T +
Sbjct: 308 ESGVMRSTCSLRAVKTMVPVSSPTTHSAIRGGLAASTDGRPTRLPTAFTGFELTTFTVSV 129
Query: 432 -------SKSLVTNTACLECPWMYLDKKFV 454
S S TNT L + +D + V
Sbjct: 128 *PLVPGLS*STFTNTGFLFSICVAIDSEIV 39
>TC18767
Length = 1004
Score = 29.3 bits (64), Expect = 3.0
Identities = 12/35 (34%), Positives = 17/35 (48%)
Frame = +2
Query: 326 ESLAACWNCNKPGHTAAECRIPKVEAAANVAGARR 360
E + C+NC H+ EC P+ A N A +R
Sbjct: 152 EDASRCFNCGSYNHSLRECSRPRDNVAVNSARKQR 256
>BP062983
Length = 470
Score = 28.5 bits (62), Expect = 5.1
Identities = 11/43 (25%), Positives = 20/43 (45%)
Frame = -3
Query: 100 FLRTTAEMKVDYETYLLTGEAEYWWRGARAMMEADHQAITWEC 142
FL T + + + + +G A+ WW G + + + WEC
Sbjct: 468 FLYETLVXRKENDCFCRSGSAKCWWHGLKGVKITNT*PYRWEC 340
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.321 0.137 0.413
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,066,562
Number of Sequences: 28460
Number of extensions: 202480
Number of successful extensions: 1035
Number of sequences better than 10.0: 52
Number of HSP's better than 10.0 without gapping: 1006
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1024
length of query: 929
length of database: 4,897,600
effective HSP length: 99
effective length of query: 830
effective length of database: 2,080,060
effective search space: 1726449800
effective search space used: 1726449800
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (27.7 bits)
Lotus: description of TM0004b.4