
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0067b.6
(404 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC18927 similar to PIR|AI2934|AI2934 chromate transport protein ... 101 2e-22
TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti... 43 8e-05
BI418821 40 7e-04
TC9521 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, part... 39 0.001
TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, par... 39 0.002
AV421607 38 0.003
TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, ... 38 0.003
AV773507 37 0.006
TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 36 0.013
AV420911 34 0.038
TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 34 0.038
TC18767 34 0.050
BP039068 32 0.25
AU240127 31 0.42
TC17929 30 0.55
TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti... 30 0.55
BP079571 30 0.94
TC19143 28 2.7
TC8167 similar to PIR|JC7519|JC7519 subtilisin-like serine prote... 28 2.7
AV777865 27 4.7
>TC18927 similar to PIR|AI2934|AI2934 chromate transport protein chrA
[imported] - Agrobacterium tumefaciens
(strain C58, Dupont) {Agrobacterium tumefaciens;},
partial (6%)
Length = 561
Score = 101 bits (252), Expect = 2e-22
Identities = 70/190 (36%), Positives = 88/190 (45%), Gaps = 18/190 (9%)
Frame = -2
Query: 222 KATEVELMKNRRLNRA-----GTGGPMRSGSQNFQSREKFQSRGPYQRPAGTGFTSGSYR 276
KA +E + N R A G+GGP RS F + FQ + P+QRP G +SG Y
Sbjct: 560 KAKSIEAIDNLRSRPAFRPNQGSGGPNRSAPGRFDRNKSFQKK-PFQRPQNRGTSSG-YS 387
Query: 277 PMTGAAGGSGDQTLKKETTCFRCGEPGHYANACPDTRPKCFNCNRMGHTAGQCRAPKTEP 336
G Q+ E C RC + GH+AN CPD C+NC + GH+ C PK E
Sbjct: 386 HSFGNFVPRPTQSDTSEIVCHRCSKKGHFANRCPDL--VCWNCQKTGHSGKDCTNPKVEA 213
Query: 337 TVNTA----------RGKRPAAKARVYTMDG---ERAEEFARGERKNDGNFLTILSYSSI 383
N +GKRP A ARVYT+ G RA+ R + LTIL S
Sbjct: 212 ATNAIAARRPAPAANKGKRPVASARVYTVSGAESHRADGLIRSVGSVNCKPLTILFDSGA 33
Query: 384 TCSITLVACA 393
T S +ACA
Sbjct: 32 THSFIDLACA 3
>TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (26%)
Length = 555
Score = 43.1 bits (100), Expect = 8e-05
Identities = 25/89 (28%), Positives = 37/89 (41%), Gaps = 3/89 (3%)
Frame = +2
Query: 244 RSGSQNFQSREKFQSRGPYQRPAGT---GFTSGSYRPMTGAAGGSGDQTLKKETTCFRCG 300
R S + +SR + +SR P R + + YR + ++ C C
Sbjct: 200 RKMSSDSRSRSRSRSRSPMDRKIRSDRFSYRDAPYRR-------DSRRGFSRDNLCKNCK 358
Query: 301 EPGHYANACPDTRPKCFNCNRMGHTAGQC 329
PGH+A CP+ C NC GH A +C
Sbjct: 359 RPGHFARECPNV-AICHNCGLPGHIASEC 442
Score = 42.4 bits (98), Expect = 1e-04
Identities = 29/101 (28%), Positives = 43/101 (41%), Gaps = 2/101 (1%)
Frame = +2
Query: 231 NRRLNRAGTGGPM--RSGSQNFQSREKFQSRGPYQRPAGTGFTSGSYRPMTGAAGGSGDQ 288
+R +R+ + PM + S F R+ PY+R + GF+ + G +
Sbjct: 218 SRSRSRSRSRSPMDRKIRSDRFSYRD-----APYRRDSRRGFSRDNLCKNCKRPGHFARE 382
Query: 289 TLKKETTCFRCGEPGHYANACPDTRPKCFNCNRMGHTAGQC 329
C CG PGH A+ C T+ C+NC GH A C
Sbjct: 383 C-PNVAICHNCGLPGHIASECT-TKSLCWNCKEPGHMASSC 499
Score = 30.0 bits (66), Expect = 0.72
Identities = 9/19 (47%), Positives = 15/19 (78%)
Frame = +2
Query: 293 ETTCFRCGEPGHYANACPD 311
++ C+ C EPGH A++CP+
Sbjct: 449 KSLCWNCKEPGHMASSCPN 505
>BI418821
Length = 614
Score = 40.0 bits (92), Expect = 7e-04
Identities = 23/73 (31%), Positives = 27/73 (36%), Gaps = 8/73 (10%)
Frame = +2
Query: 265 PAGTGFTSGSYRPMTGAAGGSGDQTLKKETTCFRCGEPGHYANACPDTR--------PKC 316
P G G G R G GG G C+ CG+ GH A C + C
Sbjct: 320 PRGFGGWRGGERRNGGGGGGGGGG-------CYNCGDTGHLARDCHRSNNNGGGGGGAAC 478
Query: 317 FNCNRMGHTAGQC 329
+NC GH A C
Sbjct: 479 YNCGDAGHLARDC 517
Score = 35.8 bits (81), Expect = 0.013
Identities = 18/62 (29%), Positives = 23/62 (37%), Gaps = 7/62 (11%)
Frame = +2
Query: 275 YRPMTGAAGGSGDQTLKKETTCFRCGEPGHYANACPDTRPK-------CFNCNRMGHTAG 327
+R GG G C+ CG+ GH A C + C+NC GH A
Sbjct: 434 HRSNNNGGGGGG-------AACYNCGDAGHLARDCNRSNNNSGGGGAGCYNCGDTGHLAR 592
Query: 328 QC 329
C
Sbjct: 593 DC 598
>TC9521 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (56%)
Length = 598
Score = 39.3 bits (90), Expect = 0.001
Identities = 17/41 (41%), Positives = 22/41 (53%), Gaps = 3/41 (7%)
Frame = +1
Query: 296 CFRCGEPGHYANACP--DTRPKCFNCNRMGHTAGQCR-APK 333
CF CG GH+A C D + KC+ C GH C+ +PK
Sbjct: 391 CFNCGIDGHWARDCKAGDWKNKCYRCGERGHIEKNCKNSPK 513
Score = 27.3 bits (59), Expect = 4.7
Identities = 20/67 (29%), Positives = 26/67 (37%), Gaps = 3/67 (4%)
Frame = +1
Query: 252 SREKFQSRGPYQRPAGTGFTSGSYRPMTGAAGGSGDQTLKK---ETTCFRCGEPGHYANA 308
SRE R + G G GS R G + K + C+RCGE GH
Sbjct: 316 SREGGGGRDRDREYMGRGPPPGSGRCFNCGIDGHWARDCKAGDWKNKCYRCGERGHIEKN 495
Query: 309 CPDTRPK 315
C ++ K
Sbjct: 496 CKNSPKK 516
>TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (62%)
Length = 684
Score = 38.9 bits (89), Expect = 0.002
Identities = 17/41 (41%), Positives = 22/41 (53%), Gaps = 3/41 (7%)
Frame = +2
Query: 296 CFRCGEPGHYANACP--DTRPKCFNCNRMGHTAGQCR-APK 333
CF CG GH+A C D + KC+ C GH C+ +PK
Sbjct: 371 CFNCGLDGHWARDCKAGDWKNKCYRCGDRGHVERNCKNSPK 493
Score = 26.6 bits (57), Expect = 8.0
Identities = 10/22 (45%), Positives = 12/22 (54%)
Frame = +2
Query: 310 PDTRPKCFNCNRMGHTAGQCRA 331
P +CFNC GH A C+A
Sbjct: 353 PPGSGRCFNCGLDGHWARDCKA 418
>AV421607
Length = 245
Score = 38.1 bits (87), Expect = 0.003
Identities = 15/36 (41%), Positives = 21/36 (57%)
Frame = +3
Query: 279 TGAAGGSGDQTLKKETTCFRCGEPGHYANACPDTRP 314
+G GSG T T C++CG PGH++ CP + P
Sbjct: 12 SGPGSGSGSGTA---TGCYKCGRPGHWSRDCPSSAP 110
>TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, partial
(58%)
Length = 881
Score = 38.1 bits (87), Expect = 0.003
Identities = 19/67 (28%), Positives = 31/67 (45%)
Frame = +2
Query: 244 RSGSQNFQSREKFQSRGPYQRPAGTGFTSGSYRPMTGAAGGSGDQTLKKETTCFRCGEPG 303
+ S++ ++++ RG + G S Y G+ GG CF+CG+PG
Sbjct: 293 QGSSRDRDDGDRYRERGRDRDDRGDRDRSRGYGGSRGSNGGE----------CFKCGKPG 442
Query: 304 HYANACP 310
H+A CP
Sbjct: 443 HFARECP 463
>AV773507
Length = 496
Score = 37.0 bits (84), Expect = 0.006
Identities = 17/45 (37%), Positives = 23/45 (50%)
Frame = +2
Query: 266 AGTGFTSGSYRPMTGAAGGSGDQTLKKETTCFRCGEPGHYANACP 310
A G+ G Y + G+GD+ + + CF CG PGH A CP
Sbjct: 266 ADQGYRGGGYSSGGRGSYGAGDRVGQDD--CFECGRPGHRARDCP 394
>TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana
{Arabidopsis thaliana;}, partial (89%)
Length = 912
Score = 35.8 bits (81), Expect = 0.013
Identities = 17/43 (39%), Positives = 21/43 (48%)
Frame = +1
Query: 267 GTGFTSGSYRPMTGAAGGSGDQTLKKETTCFRCGEPGHYANAC 309
G G G R + GG G +K C+ CGEPGH+A C
Sbjct: 304 GGGGGGGGGRGGGRSGGGGGGSDMK----CYECGEPGHFAREC 420
>AV420911
Length = 418
Score = 34.3 bits (77), Expect = 0.038
Identities = 14/30 (46%), Positives = 17/30 (56%)
Frame = +2
Query: 280 GAAGGSGDQTLKKETTCFRCGEPGHYANAC 309
G G G + LK C+ CGEPGH+A C
Sbjct: 338 GGGRGRGGEDLK----CYECGEPGHFAREC 415
>TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana
{Arabidopsis thaliana;}, partial (42%)
Length = 587
Score = 34.3 bits (77), Expect = 0.038
Identities = 14/30 (46%), Positives = 17/30 (56%)
Frame = +1
Query: 280 GAAGGSGDQTLKKETTCFRCGEPGHYANAC 309
G G G + LK C+ CGEPGH+A C
Sbjct: 130 GGGRGRGGEDLK----CYECGEPGHFAREC 207
Score = 27.7 bits (60), Expect = 3.6
Identities = 11/37 (29%), Positives = 18/37 (47%)
Frame = +1
Query: 315 KCFNCNRMGHTAGQCRAPKTEPTVNTARGKRPAAKAR 351
KC+ C GH A +CR+ + + R + P+ R
Sbjct: 163 KCYECGEPGHFARECRSRGGSRGLGSGRRRSPSPYRR 273
>TC18767
Length = 1004
Score = 33.9 bits (76), Expect = 0.050
Identities = 12/35 (34%), Positives = 20/35 (56%)
Frame = +2
Query: 315 KCFNCNRMGHTAGQCRAPKTEPTVNTARGKRPAAK 349
+CFNC H+ +C P+ VN+AR +R + +
Sbjct: 164 RCFNCGSYNHSLRECSRPRDNVAVNSARKQRKSRR 268
>BP039068
Length = 467
Score = 31.6 bits (70), Expect = 0.25
Identities = 18/72 (25%), Positives = 29/72 (40%), Gaps = 6/72 (8%)
Frame = +3
Query: 276 RPMTGAAGGSGDQTLKKETTCFRCGEPGHYANACPDTRPKCFNC------NRMGHTAGQC 329
R T + + + ++T C C E GH +N CP + N + G T+ C
Sbjct: 252 RGPTASTSSNVNHNNPRQTVCMNCQETGHASNDCPSLTLRSRNAQHPRMNQQSGETSVSC 431
Query: 330 RAPKTEPTVNTA 341
+ T + TA
Sbjct: 432 SSCGTPCALRTA 467
>AU240127
Length = 300
Score = 30.8 bits (68), Expect = 0.42
Identities = 25/66 (37%), Positives = 34/66 (50%), Gaps = 5/66 (7%)
Frame = +1
Query: 230 KNRRLNRAGT---GGPMRSGSQNFQSREKFQSRGPYQRPAG--TGFTSGSYRPMTGAAGG 284
+NRR N + G P S S++ QS F S GPY P+ T +SGS R A+ G
Sbjct: 46 RNRRSNPDPSHEPGSPSLSSSKSHQSPAAFSSAGPYSSPSSHPTCSSSGS-RTRGPASYG 222
Query: 285 SGDQTL 290
S ++L
Sbjct: 223 SAARSL 240
>TC17929
Length = 791
Score = 30.4 bits (67), Expect = 0.55
Identities = 11/20 (55%), Positives = 12/20 (60%)
Frame = +2
Query: 291 KKETTCFRCGEPGHYANACP 310
K TC+RCGE GH CP
Sbjct: 47 KFRQTCYRCGESGHKMRNCP 106
>TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (9%)
Length = 450
Score = 30.4 bits (67), Expect = 0.55
Identities = 12/34 (35%), Positives = 15/34 (43%)
Frame = +3
Query: 296 CFRCGEPGHYANACPDTRPKCFNCNRMGHTAGQC 329
C C + GH + C C NC GH A +C
Sbjct: 9 CRNCQQLGHMSRDCMGPLMICHNCGGRGHLAYEC 110
>BP079571
Length = 414
Score = 29.6 bits (65), Expect = 0.94
Identities = 10/18 (55%), Positives = 12/18 (66%)
Frame = -3
Query: 293 ETTCFRCGEPGHYANACP 310
ET CF CG+PGH+ P
Sbjct: 271 ETHCFHCGKPGHFC*GMP 218
Score = 28.1 bits (61), Expect = 2.7
Identities = 14/33 (42%), Positives = 14/33 (42%)
Frame = -1
Query: 277 PMTGAAGGSGDQTLKKETTCFRCGEPGHYANAC 309
P G GG G CFR GE GH A C
Sbjct: 387 PSNGGVGGGG---------CFRFGEVGHLARDC 316
>TC19143
Length = 371
Score = 28.1 bits (61), Expect = 2.7
Identities = 14/53 (26%), Positives = 26/53 (48%), Gaps = 2/53 (3%)
Frame = -2
Query: 145 PTSARDERESQFLTLRQGGMSVPEFASKLESLAKHFQFF--HDHVNERYMCKR 195
PT R ++ ++ ++Q V F + S K++ + H N++Y CKR
Sbjct: 322 PTYKRKKKVQNYILIQQNYYKVQRFDPRYNSKKKNYNENS*YKHTNKQYNCKR 164
>TC8167 similar to PIR|JC7519|JC7519 subtilisin-like serine proteinase -
Arabidopsis thaliana {Arabidopsis thaliana;} ,
partial (65%)
Length = 2760
Score = 28.1 bits (61), Expect = 2.7
Identities = 12/25 (48%), Positives = 14/25 (56%)
Frame = +2
Query: 262 YQRPAGTGFTSGSYRPMTGAAGGSG 286
+ RP G + SGS RP AGG G
Sbjct: 1262 FTRPGGGSWKSGSLRPWVERAGGEG 1336
>AV777865
Length = 440
Score = 27.3 bits (59), Expect = 4.7
Identities = 18/81 (22%), Positives = 31/81 (38%), Gaps = 1/81 (1%)
Frame = -1
Query: 248 QNFQSREKFQSRGPYQRPAGTGFTSGSYRPMTGAAGGSGDQTLKKETTCFRCGEPGHYAN 307
+ +Q ++ S R AG+G G+ + GAA + + C+ G+ +
Sbjct: 419 EQYQQQQAAASGAE*LRGAGSGLGGGAAEELGGAAAPAAGDVQVADEQCYDVGQRRPHRR 240
Query: 308 ACPDTRPKCFN-CNRMGHTAG 327
R +C C GH G
Sbjct: 239 RHVPRRRQCQA*CEAQGHRLG 177
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.318 0.131 0.389
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,140,086
Number of Sequences: 28460
Number of extensions: 81122
Number of successful extensions: 474
Number of sequences better than 10.0: 47
Number of HSP's better than 10.0 without gapping: 443
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 463
length of query: 404
length of database: 4,897,600
effective HSP length: 92
effective length of query: 312
effective length of database: 2,279,280
effective search space: 711135360
effective search space used: 711135360
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (26.2 bits)
Lotus: description of TM0067b.6