
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147428.3 - phase: 0
(364 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BP031175 221 1e-58
TC9934 similar to UP|BAC98494 (BAC98494) AG-motif binding protei... 166 4e-42
TC20115 similar to PIR|T52104|T52104 GATA-binding transcription ... 147 2e-36
TC14877 similar to GB|AAP37701.1|30725358|BT008342 At4g32890 {Ar... 130 3e-31
BP048741 122 1e-28
TC10150 similar to UP|BAC98494 (BAC98494) AG-motif binding prote... 120 5e-28
TC15996 similar to UP|BAC98491 (BAC98491) AG-motif binding prote... 116 5e-27
TC16671 similar to PIR|T52104|T52104 GATA-binding transcription ... 113 4e-26
AV767608 82 2e-16
AV408469 65 2e-11
AV408760 60 4e-10
TC17952 similar to UP|GRP1_PETHY (P09789) Glycine-rich cell wall... 55 1e-08
AU251628 50 5e-07
CN825365 50 8e-07
BP040189 46 9e-06
BU494175 44 4e-05
AV780606 43 1e-04
TC10447 similar to UP|BAC98495 (BAC98495) AG-motif binding prote... 42 1e-04
AV428413 39 0.001
TC12776 37 0.007
>BP031175
Length = 537
Score = 221 bits (563), Expect = 1e-58
Identities = 101/116 (87%), Positives = 110/116 (94%), Gaps = 1/116 (0%)
Frame = -2
Query: 250 GEGRKCLHCATDKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNS 309
G+GR+CLHCATDKTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPAASPTF+LTKHSNS
Sbjct: 533 GDGRRCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFMLTKHSNS 354
Query: 310 HRKVQELRRQKEMMRAQQHQLLQLQHHHSIMFEGPSNGDDYLIHQHV-GPDFTHLI 364
HRKV ELRRQKEMM+AQQ QLLQ+QHH S+MF+ SNGDDYLIHQHV GPDF+HLI
Sbjct: 353 HRKVLELRRQKEMMKAQQQQLLQMQHHQSMMFDASSNGDDYLIHQHVAGPDFSHLI 186
>TC9934 similar to UP|BAC98494 (BAC98494) AG-motif binding protein-4,
partial (37%)
Length = 1050
Score = 166 bits (421), Expect = 4e-42
Identities = 107/269 (39%), Positives = 140/269 (51%), Gaps = 7/269 (2%)
Frame = +2
Query: 60 NTNNVNTAASDHFIVEDLFDFSN----EDVAIEDPTFEESPPTNSNDSPPLETNPTSNFF 115
N NNV + F V+DL DFSN +D E+ EE + + SN
Sbjct: 8 NANNV--VVGEDFSVDDLLDFSNGKEMDDYEEEEEEEEEEKGSTVSGGSQDRIEEDSN-- 175
Query: 116 TDNSCQNSADGPFSGELSVPYDDLAELEWVSKFAEESFSSEDLHKLQLISGLKAPNNVAS 175
++++ D F+GEL+VP DD+A LEWVS F ++S L +L L+ ++A A
Sbjct: 176 SNSTAAGDYDSIFAGELAVPADDVAGLEWVSHFVDDS-----LPELSLLYPVRAEQTRAR 340
Query: 176 KPYEESNPTVHSQVSVPAKARSKRSRVPPCNWTSRLLVLSPTTTTTTTTTTSSHSDTMAP 235
E + + K R+ R+R P + +P ++ S P
Sbjct: 341 AEPEARPGSDRAPFPFRTKPRTTRTRKP----IGHVWSFNPMGFSSMAPPLSGEGFRQPP 508
Query: 236 PKKPS--PRKRDPNDGGE-GRKCLHCATDKTPQWRTGPLGPKTLCNACGVRYKSGRLVPE 292
KK P +D G + R+C HC KTPQWR GPLGPKTLCNACGVR+KSGRL PE
Sbjct: 509 MKKQKKKPEAQDHTGGPQFQRRCSHCQVQKTPQWRAGPLGPKTLCNACGVRFKSGRLFPE 688
Query: 293 YRPAASPTFVLTKHSNSHRKVQELRRQKE 321
YRPA SPTF HSNSHRKV E+RR+KE
Sbjct: 689 YRPACSPTFSGEIHSNSHRKVLEMRRRKE 775
>TC20115 similar to PIR|T52104|T52104 GATA-binding transcription factor
homolog 2 [imported] - Arabidopsis
thaliana {Arabidopsis thaliana;}, partial (31%)
Length = 635
Score = 147 bits (372), Expect = 2e-36
Identities = 73/104 (70%), Positives = 84/104 (80%), Gaps = 6/104 (5%)
Frame = +3
Query: 230 SDTMAPPKKPSPRKR----DPNDGGEG--RKCLHCATDKTPQWRTGPLGPKTLCNACGVR 283
SD +A S KR + + GEG R+C HCA++KTPQWRTGPLGPKTLCNACGVR
Sbjct: 12 SDVIASVTGKSRSKRAAAVEGSPAGEGGVRRCSHCASEKTPQWRTGPLGPKTLCNACGVR 191
Query: 284 YKSGRLVPEYRPAASPTFVLTKHSNSHRKVQELRRQKEMMRAQQ 327
+KSGRLVPEYRPAASPTFVL++HSNSHRKV ELRRQKE++R QQ
Sbjct: 192 FKSGRLVPEYRPAASPTFVLSQHSNSHRKVMELRRQKELIRHQQ 323
>TC14877 similar to GB|AAP37701.1|30725358|BT008342 At4g32890 {Arabidopsis
thaliana;}, partial (25%)
Length = 1039
Score = 130 bits (328), Expect = 3e-31
Identities = 85/220 (38%), Positives = 104/220 (46%), Gaps = 44/220 (20%)
Frame = +3
Query: 142 LEWVSKFAEESFSSED--------LHKLQLISGLKAPNNVASKPYEESNPTVHSQVSVPA 193
LEW+S+F E+ FS+ + S K KP E +P + SVP
Sbjct: 9 LEWLSEFVEDCFSTPSGCVLVPASVQTASTSSKSKPLGTTVQKPLESESPLQKN--SVPG 182
Query: 194 KARSKRSRV-------PPCNWTSRLLVLSPT---------------------TTTTTTTT 225
KARSKR R+ P W+ L + T
Sbjct: 183 KARSKRKRLSAPRTKDPLSMWSHHLNPQNEAFGPDPPLLKQDYWLADSELIVKKEEQEVT 362
Query: 226 TSSHSDTMAPPKKPSPRKRDPNDG-----GEG---RKCLHCATDKTPQWRTGPLGPKTLC 277
T + + KK K + DG G+ R+C HC + +TPQWR GPLGPKTLC
Sbjct: 363 TKEEQEVVVVVKKEIIVKEEFEDGEVSNNGQNPMPRRCTHCLSQRTPQWRAGPLGPKTLC 542
Query: 278 NACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVQELR 317
NACGVRYKSGRL PEYRPA SPTFV HSNSH+KV E+R
Sbjct: 543 NACGVRYKSGRLHPEYRPAKSPTFVSFLHSNSHKKVMEMR 662
>BP048741
Length = 558
Score = 122 bits (305), Expect = 1e-28
Identities = 55/87 (63%), Positives = 63/87 (72%)
Frame = -1
Query: 235 PPKKPSPRKRDPNDGGEGRKCLHCATDKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYR 294
P K+P + +D R+C HC KTPQWRTGP G KTLCNACGVR+KSGRL+PEYR
Sbjct: 516 PAKRPKRIRNSCSDQAAPRRCSHCGVQKTPQWRTGPHGAKTLCNACGVRFKSGRLLPEYR 337
Query: 295 PAASPTFVLTKHSNSHRKVQELRRQKE 321
PA SPTF HSN HRKV E+RR+KE
Sbjct: 336 PACSPTFSSELHSNHHRKVLEMRRKKE 256
>TC10150 similar to UP|BAC98494 (BAC98494) AG-motif binding protein-4,
partial (22%)
Length = 552
Score = 120 bits (300), Expect = 5e-28
Identities = 58/91 (63%), Positives = 65/91 (70%), Gaps = 3/91 (3%)
Frame = +3
Query: 236 PKKPSPRKRDPNDGGEG---RKCLHCATDKTPQWRTGPLGPKTLCNACGVRYKSGRLVPE 292
P K RK + +G R+C HC KTPQWRTGPLG KTLCNACGVRYKSGRL E
Sbjct: 3 PAKKLKRKAEEVEGAGAQSQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLCSE 182
Query: 293 YRPAASPTFVLTKHSNSHRKVQELRRQKEMM 323
YRPA SPTF HSNSHRKV E+RR+KE++
Sbjct: 183 YRPACSPTFCGEIHSNSHRKVLEIRRRKEVV 275
>TC15996 similar to UP|BAC98491 (BAC98491) AG-motif binding protein-1,
partial (19%)
Length = 555
Score = 116 bits (291), Expect = 5e-27
Identities = 50/65 (76%), Positives = 55/65 (83%)
Frame = +2
Query: 253 RKCLHCATDKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRK 312
RKC+HC KTPQWR GP+GPKTLCNACGVRY+SGRL PEYRPAASPTFV HSN H+K
Sbjct: 161 RKCMHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPAVHSNCHKK 340
Query: 313 VQELR 317
V E+R
Sbjct: 341 VLEMR 355
>TC16671 similar to PIR|T52104|T52104 GATA-binding transcription factor
homolog 2 [imported] - Arabidopsis
thaliana {Arabidopsis thaliana;}, partial (20%)
Length = 684
Score = 113 bits (283), Expect = 4e-26
Identities = 59/105 (56%), Positives = 70/105 (66%), Gaps = 15/105 (14%)
Frame = +2
Query: 275 TLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVQELRRQKEMMRAQQHQLLQLQ 334
TLCNACGVR+KSGRLV EYRPAASPTFV +HSNSHRKV ELRRQK+M R QQ QL+
Sbjct: 2 TLCNACGVRFKSGRLVAEYRPAASPTFVSARHSNSHRKVLELRRQKDMQRQQQQQLMS-- 175
Query: 335 HHHSIMFEGPSNGDDYLI---------------HQHVGPDFTHLI 364
S +F G + GD+++I QH GP+F H+I
Sbjct: 176 --QSSIFGGSNGGDEFMIRHHHQHHHHQQQQQQQQHCGPEFRHVI 304
>AV767608
Length = 398
Score = 81.6 bits (200), Expect = 2e-16
Identities = 39/53 (73%), Positives = 43/53 (80%)
Frame = -1
Query: 270 PLGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVQELRRQKEM 322
P GPKTLCNACGVR+KSGRLV E RPA+SPTF HSNS RKV E+R+QK M
Sbjct: 398 PHGPKTLCNACGVRFKSGRLVLENRPASSPTFRAELHSNSPRKVMEMRKQKHM 240
>AV408469
Length = 429
Score = 64.7 bits (156), Expect = 2e-11
Identities = 47/132 (35%), Positives = 61/132 (45%), Gaps = 40/132 (30%)
Frame = +2
Query: 189 VSVPA-KARSKRSRVPPCNWTSRLLVLSPTTTTTTTTTT-----------SSHSDTMAP- 235
+ VP +ARSKR R N S + ++SP ++ T SS S+ A
Sbjct: 32 IPVPCGRARSKRPRPASFNPRSAMQLISPASSFIGENNTMKPNYHHHVRTSSDSENFAES 211
Query: 236 ---------PKKPS--PRKRD---------PNDGGE-------GRKCLHCATDKTPQWRT 268
PK+ S P+K+ P+DG RKC+HC KTPQWR
Sbjct: 212 QPVITKMMMPKQASGEPKKKKKIKVPLPLAPSDGNNIQNGSQPVRKCMHCEITKTPQWRA 391
Query: 269 GPLGPKTLCNAC 280
GP+GPKTLCNAC
Sbjct: 392 GPMGPKTLCNAC 427
>AV408760
Length = 425
Score = 60.5 bits (145), Expect = 4e-10
Identities = 44/111 (39%), Positives = 59/111 (52%), Gaps = 2/111 (1%)
Frame = +2
Query: 50 QEFFQSDNNSNTNNV--NTAASDHFIVEDLFDFSNEDVAIEDPTFEESPPTNSNDSPPLE 107
+E + + N+N N V + F VEDLFDFSN D ++ D EE NS+ S +
Sbjct: 113 EEIWCLNANANANAVVAGGGGGEDFAVEDLFDFSNGD-SLHD---EEEEKDNSSLSSHSQ 280
Query: 108 TNPTSNFFTDNSCQNSADGPFSGELSVPYDDLAELEWVSKFAEESFSSEDL 158
+ S NS S D FS EL+VP DL +LEWVS F ++S S +L
Sbjct: 281 DSTNS-----NSTGVSNDSIFSAELTVPAGDLDDLEWVSHFVDDSLSLPEL 418
>TC17952 similar to UP|GRP1_PETHY (P09789) Glycine-rich cell wall structural
protein 1 precursor, partial (5%)
Length = 441
Score = 55.5 bits (132), Expect = 1e-08
Identities = 38/123 (30%), Positives = 56/123 (44%), Gaps = 21/123 (17%)
Frame = +3
Query: 47 MEAQEFFQSD----------NNSNTNNVNTAASDHFIVEDLFDFSNEDVAIEDPTFEESP 96
MEA E+F + + + + + F ++DLFDFSN D + D F+ +
Sbjct: 72 MEAPEYFVGGYYGAGGQDQFSPAEKRHTDQKPGEPFAIDDLFDFSNADAIMSDGFFD-NV 248
Query: 97 PTNSNDSPPLETNPTSNFFTDNSCQN-----------SADGPFSGELSVPYDDLAELEWV 145
NS DS + + N N ++D SGEL VPYD++AELEW+
Sbjct: 249 AGNSTDSSTVTAVDSCNSSISGGDNNRFAAAIVPRGFASDLHLSGELCVPYDEMAELEWL 428
Query: 146 SKF 148
S F
Sbjct: 429 SNF 437
>AU251628
Length = 350
Score = 50.4 bits (119), Expect = 5e-07
Identities = 22/43 (51%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Frame = +3
Query: 239 PSPRKRDPNDGGEGRK-CLHCATDKTPQWRTGPLGPKTLCNAC 280
P+ + G E +K C C T KTP WR GP GPK+LCNAC
Sbjct: 222 PNSAAAAASSGSEQKKTCADCGTSKTPLWRGGPAGPKSLCNAC 350
>CN825365
Length = 511
Score = 49.7 bits (117), Expect = 8e-07
Identities = 23/56 (41%), Positives = 30/56 (53%)
Frame = +3
Query: 243 KRDPNDGGEGRKCLHCATDKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPAAS 298
K+D + G +G C HC TP WR GP LCNACG R+++ + Y P S
Sbjct: 132 KKD*DMGKQG-PCYHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGTLANYTPLHS 296
>BP040189
Length = 501
Score = 46.2 bits (108), Expect = 9e-06
Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 6/123 (4%)
Frame = +2
Query: 65 NTAASDHFIVEDLFDFSNEDVAIEDPTFEESPPTNSNDSPPLETNPTSNFFTDNSCQNSA 124
N +D F V+DL DFS+ V E+P ++ + + + S + + S
Sbjct: 101 NGGQNDDFSVDDLLDFSHA-VEEEEPEQQQQKVEQEHSASACVSLQQSREISKPAA-TSN 274
Query: 125 DGPF-----SGELSVPYDDLAELEWVSKFAEES-FSSEDLHKLQLISGLKAPNNVASKPY 178
PF +GEL+VP DD+A+LEW+S F E+S FS+ + + V ++P
Sbjct: 275 HHPFKDEFPTGELTVPVDDVADLEWLSHFVEDSEFSAAAAFPAAVALPVNPKKEVKTEPE 454
Query: 179 EES 181
E+
Sbjct: 455 PET 463
>BU494175
Length = 475
Score = 43.9 bits (102), Expect = 4e-05
Identities = 18/34 (52%), Positives = 23/34 (66%), Gaps = 2/34 (5%)
Frame = +2
Query: 253 RKCLHCATDK--TPQWRTGPLGPKTLCNACGVRY 284
R+C HC + TP R GP GP+TLCNACG+ +
Sbjct: 146 RRCQHCGVSENNTPAMRRGPDGPRTLCNACGLMW 247
>AV780606
Length = 585
Score = 42.7 bits (99), Expect = 1e-04
Identities = 18/38 (47%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Frame = -1
Query: 247 NDGGEGRKCLHCATDK--TPQWRTGPLGPKTLCNACGV 282
+D C HC T TP R GP GP++LCNACG+
Sbjct: 459 DDSQAETSCTHCGTSSKSTPMMRRGPSGPRSLCNACGL 346
>TC10447 similar to UP|BAC98495 (BAC98495) AG-motif binding protein-5,
partial (15%)
Length = 501
Score = 42.4 bits (98), Expect = 1e-04
Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 3/82 (3%)
Frame = +2
Query: 70 DHFIVEDLFDFSNEDVAIEDPTFEESPPTNSNDSPPLETNP--TSNF-FTDNSCQNSADG 126
DH ++DL DF EDV T + +N + P T +F +D+ ++
Sbjct: 260 DH--IDDLLDFPVEDVDGTATTLPSAAAAAANCTSLASIWPPETDSFPGSDSVFSGNSGS 433
Query: 127 PFSGELSVPYDDLAELEWVSKF 148
S ELSVPY+D+ +LEW+S F
Sbjct: 434 DLSAELSVPYEDIVQLEWLSNF 499
Score = 32.3 bits (72), Expect = 0.13
Identities = 18/51 (35%), Positives = 25/51 (48%)
Frame = +3
Query: 191 VPAKARSKRSRVPPCNWTSRLLVLSPTTTTTTTTTTSSHSDTMAPPKKPSP 241
V K+ S R + +WTS S TT+TT++ + S S PP P P
Sbjct: 183 VSGKSHSPRKWLDQTSWTS*TAEASLTTSTTSSISPSRTSTAPPPPYHPPP 335
>AV428413
Length = 416
Score = 39.3 bits (90), Expect = 0.001
Identities = 33/88 (37%), Positives = 44/88 (49%), Gaps = 2/88 (2%)
Frame = +1
Query: 50 QEFFQSDNNSNTNNV--NTAASDHFIVEDLFDFSNEDVAIEDPTFEESPPTNSNDSPPLE 107
+E + + N+N N V + F VEDLFDFSN D ++ D EE NS+ S +
Sbjct: 76 EEIWCLNANANANAVVAGGGGGEDFAVEDLFDFSNGD-SLHD---EEEEKDNSSLSSHSQ 243
Query: 108 TNPTSNFFTDNSCQNSADGPFSGELSVP 135
+ S NS S D FS EL+VP
Sbjct: 244 DSTNS-----NSTGVSNDSIFSAELTVP 312
>TC12776
Length = 560
Score = 36.6 bits (83), Expect = 0.007
Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 1/70 (1%)
Frame = +3
Query: 181 SNPTVHSQVSVPAKARSKRSRVPPCNWTSRLLVLSPTTTTTTTTTTSSHSDTMAPPKK-P 239
S+PT S P A S+ S +PP T SPT+T +T+T S + AP ++ P
Sbjct: 150 SSPTTQSPTPSPPMASSQSSSIPPATST------SPTST*STSTPASPENSPTAPSQESP 311
Query: 240 SPRKRDPNDG 249
R R + G
Sbjct: 312 GSRPRYSSSG 341
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.313 0.129 0.388
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,592,829
Number of Sequences: 28460
Number of extensions: 127965
Number of successful extensions: 2032
Number of sequences better than 10.0: 228
Number of HSP's better than 10.0 without gapping: 1499
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1766
length of query: 364
length of database: 4,897,600
effective HSP length: 91
effective length of query: 273
effective length of database: 2,307,740
effective search space: 630013020
effective search space used: 630013020
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (26.2 bits)
Medicago: description of AC147428.3