
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0103.12
(1541 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC208955 similar to UP|O04698 (O04698) Chloroplast DNA-binding p... 218 2e-56
AW277379 147 4e-35
TC208016 similar to UP|Q9SSE9 (Q9SSE9) MLP3.6 protein, partial (... 143 5e-34
TC229359 similar to UP|O48794 (O48794) F24O1.3, partial (15%) 139 1e-32
BM309653 homologue to PIR|T06461|T0 DNA-binding protein PD3 chl... 95 2e-32
TC208954 weakly similar to UP|Q41700 (Q41700) ENBP1 protein, par... 130 4e-30
TC206872 similar to UP|O04024 (O04024) F7G19.7 protein, partial ... 124 3e-28
TC222090 weakly similar to UP|O82022 (O82022) ENBP1 protein, par... 118 2e-26
TC208017 107 5e-23
BU548357 80 7e-15
TC222431 similar to UP|O48794 (O48794) F24O1.3, partial (9%) 79 2e-14
AW133328 weakly similar to PIR|T10955|T10 early nodulin binding ... 66 1e-10
TC229360 homologue to UP|O48794 (O48794) F24O1.3, partial (8%) 61 4e-09
BE821849 50 8e-06
CO984522 47 5e-05
BG238864 45 3e-04
TC234962 similar to UP|Q9FXE1 (Q9FXE1) F12A21.9, partial (17%) 44 4e-04
TC210174 43 0.001
TC231109 weakly similar to UP|Q9LUP7 (Q9LUP7) Gb|AAD25583.1, par... 39 0.013
BI974743 38 0.030
>TC208955 similar to UP|O04698 (O04698) Chloroplast DNA-binding protein PD3,
partial (11%)
Length = 809
Score = 218 bits (555), Expect = 2e-56
Identities = 96/110 (87%), Positives = 106/110 (96%)
Frame = +3
Query: 1432 VKNDISSNNFFQNDDHMETQFGSAVWDIFRRQDVPKLTEYLKKHHKEFRHANNLPVDSVT 1491
VKND SSNNF QNDDH+ETQ+GSAVWDIFRRQDVPKLTEYLKKHH+EFRH NNLPV+SV
Sbjct: 12 VKNDTSSNNFCQNDDHLETQYGSAVWDIFRRQDVPKLTEYLKKHHREFRHINNLPVNSVI 191
Query: 1492 HPIHDQILYLNEKHKRQLKKEYGIEPWTFEQHLGEAVFIPAGCPHQVRNR 1541
HPIHDQILYLNEKHK+QLK+E+G+EPWTFEQHLG+AVF+PAGCPHQVRNR
Sbjct: 192 HPIHDQILYLNEKHKKQLKQEFGVEPWTFEQHLGDAVFVPAGCPHQVRNR 341
>AW277379
Length = 459
Score = 147 bits (371), Expect = 4e-35
Identities = 72/150 (48%), Positives = 97/150 (64%), Gaps = 2/150 (1%)
Frame = +2
Query: 1078 EHFQRHWIRGEPVIVRNVFEKASGLSWHPMVMWRAF-RGANKILKEEPTTFKAIDCLDWC 1136
++F++HW GEP+IV+ VF+ +S SW PMV+WR ++ K+E KAIDCLD
Sbjct: 5 DNFRKHWKTGEPIIVKQVFDGSSISSWDPMVIWRGILETIDEKAKDENRMVKAIDCLDGS 184
Query: 1137 EVQINIFQFFKGYLEGRRYRNGWPEMLKLKDWPPSNSFEECFPRHGAEFIAMLPFSDYTH 1196
E+ I + QF KGY EG NGWP++LKLKDWP ++ EE EFI+ LP Y H
Sbjct: 185 EIDIELAQFMKGYFEGLILENGWPQLLKLKDWPSPSASEEFLLYQRPEFISKLPLLQYIH 364
Query: 1197 PKFGILNLATKLPAV-LKPDLGPKTYIAYG 1225
K+G+LN+A KLP L+ D+ PK YI+YG
Sbjct: 365 SKWGLLNVAAKLPHYSLQNDVXPKIYISYG 454
>TC208016 similar to UP|Q9SSE9 (Q9SSE9) MLP3.6 protein, partial (13%)
Length = 799
Score = 143 bits (361), Expect = 5e-34
Identities = 70/127 (55%), Positives = 85/127 (66%)
Frame = +3
Query: 1414 CSSPVMPCSKIQIDKTVPVKNDISSNNFFQNDDHMETQFGSAVWDIFRRQDVPKLTEYLK 1473
C S + K++I I ++ F+ D A+WDIFRRQDVPKL EYLK
Sbjct: 27 CGSDLKEIDKVKI---------IQESDLFRGDASE-----GALWDIFRRQDVPKLQEYLK 164
Query: 1474 KHHKEFRHANNLPVDSVTHPIHDQILYLNEKHKRQLKKEYGIEPWTFEQHLGEAVFIPAG 1533
KH +EFRH + P+ V HPIHDQ YL +HK++LK+EYGIEPWTF Q LG+AVFIPAG
Sbjct: 165 KHFREFRHIHCCPLKQVIHPIHDQTFYLTMEHKKKLKEEYGIEPWTFTQKLGDAVFIPAG 344
Query: 1534 CPHQVRN 1540
CPHQVRN
Sbjct: 345 CPHQVRN 365
>TC229359 similar to UP|O48794 (O48794) F24O1.3, partial (15%)
Length = 942
Score = 139 bits (350), Expect = 1e-32
Identities = 64/96 (66%), Positives = 76/96 (78%)
Frame = +1
Query: 1445 DDHMETQFGSAVWDIFRRQDVPKLTEYLKKHHKEFRHANNLPVDSVTHPIHDQILYLNEK 1504
++ MET GSA+WDIF+R+D KL YL+KH KEFRH PV+ V HPIHDQ YL +
Sbjct: 34 NESMET--GSALWDIFQREDSEKLETYLRKHSKEFRHTYCSPVEQVVHPIHDQCFYLTWE 207
Query: 1505 HKRQLKKEYGIEPWTFEQHLGEAVFIPAGCPHQVRN 1540
HK++LK+E G+EPWTFEQ LGEAVFIPAGCPHQVRN
Sbjct: 208 HKKKLKEELGVEPWTFEQKLGEAVFIPAGCPHQVRN 315
>BM309653 homologue to PIR|T06461|T0 DNA-binding protein PD3 chloroplast -
garden pea, partial (7%)
Length = 432
Score = 95.1 bits (235), Expect(2) = 2e-32
Identities = 41/50 (82%), Positives = 48/50 (96%)
Frame = +2
Query: 1467 KLTEYLKKHHKEFRHANNLPVDSVTHPIHDQILYLNEKHKRQLKKEYGIE 1516
KLTEYLKKHH+EFRH NNLPV+SV HPIHDQILYLNEKHK+QLK+E+G++
Sbjct: 2 KLTEYLKKHHREFRHINNLPVNSVIHPIHDQILYLNEKHKKQLKQEFGMK 151
Score = 64.7 bits (156), Expect(2) = 2e-32
Identities = 25/28 (89%), Positives = 28/28 (99%)
Frame = +3
Query: 1514 GIEPWTFEQHLGEAVFIPAGCPHQVRNR 1541
G+EPWTFEQHLG+AVF+PAGCPHQVRNR
Sbjct: 213 GVEPWTFEQHLGDAVFVPAGCPHQVRNR 296
>TC208954 weakly similar to UP|Q41700 (Q41700) ENBP1 protein, partial (7%)
Length = 627
Score = 130 bits (327), Expect = 4e-30
Identities = 62/93 (66%), Positives = 71/93 (75%)
Frame = +1
Query: 1420 PCSKIQIDKTVPVKNDISSNNFFQNDDHMETQFGSAVWDIFRRQDVPKLTEYLKKHHKEF 1479
PCS I ++K VKND SSNNF QNDDH+ETQ+GSAVWDI DVP LTEYL H +E
Sbjct: 340 PCSDINVEKIESVKNDTSSNNFCQNDDHLETQYGSAVWDISPMLDVPMLTEYL*IHLREL 519
Query: 1480 RHANNLPVDSVTHPIHDQILYLNEKHKRQLKKE 1512
RH NLP +SV HPIHDQILYLNE H ++LK+E
Sbjct: 520 RHIYNLPCNSVIHPIHDQILYLNEMHYKRLKQE 618
>TC206872 similar to UP|O04024 (O04024) F7G19.7 protein, partial (13%)
Length = 1851
Score = 124 bits (311), Expect = 3e-28
Identities = 55/100 (55%), Positives = 75/100 (75%)
Frame = -3
Query: 1441 FFQNDDHMETQFGSAVWDIFRRQDVPKLTEYLKKHHKEFRHANNLPVDSVTHPIHDQILY 1500
F QN D E +WD+FRRQDVP LT+YLK H KEF +N+L + V P++D ++
Sbjct: 1624 FNQNGDVSEKTHPGVLWDVFRRQDVPILTKYLKIHWKEFGKSNDLGNEFVEWPLYDGAIF 1445
Query: 1501 LNEKHKRQLKKEYGIEPWTFEQHLGEAVFIPAGCPHQVRN 1540
L++ HKR+LK+E+G+EPW+FEQ+LGEA+F+PAGCP Q RN
Sbjct: 1444 LDKHHKRKLKEEFGVEPWSFEQNLGEAIFVPAGCPFQARN 1325
>TC222090 weakly similar to UP|O82022 (O82022) ENBP1 protein, partial (8%)
Length = 1124
Score = 118 bits (295), Expect = 2e-26
Identities = 57/110 (51%), Positives = 77/110 (69%), Gaps = 4/110 (3%)
Frame = +1
Query: 1435 DISSNNFFQNDDHME----TQFGSAVWDIFRRQDVPKLTEYLKKHHKEFRHANNLPVDSV 1490
D + N F+N + + T+ A WD+FRRQDVPKL EYLK+H EF + N+ + +
Sbjct: 361 DHNPRNPFENSNSDKRKKFTENSGAHWDVFRRQDVPKLLEYLKRHSDEFSY-NSECHEKM 537
Query: 1491 THPIHDQILYLNEKHKRQLKKEYGIEPWTFEQHLGEAVFIPAGCPHQVRN 1540
HPI DQ +L+ HK +LK+E+ IEPWTFEQH+GEAV IP+GCP+Q+RN
Sbjct: 538 VHPILDQSFFLDNTHKMRLKEEFKIEPWTFEQHVGEAVIIPSGCPYQIRN 687
>TC208017
Length = 626
Score = 107 bits (266), Expect = 5e-23
Identities = 45/68 (66%), Positives = 55/68 (80%)
Frame = +2
Query: 1473 KKHHKEFRHANNLPVDSVTHPIHDQILYLNEKHKRQLKKEYGIEPWTFEQHLGEAVFIPA 1532
+KH +EFRH + P+ V HPIHDQ YL +HKR+LK+EYGIEPWTF Q +G+AVF+PA
Sbjct: 2 RKHFREFRHIHCCPLKQVIHPIHDQTFYLTVEHKRKLKEEYGIEPWTFIQKVGDAVFVPA 181
Query: 1533 GCPHQVRN 1540
GCPHQVRN
Sbjct: 182 GCPHQVRN 205
>BU548357
Length = 629
Score = 80.1 bits (196), Expect = 7e-15
Identities = 43/115 (37%), Positives = 64/115 (55%), Gaps = 23/115 (20%)
Frame = +1
Query: 919 DNCNTSIVNFHRSCPNPNCRYDLCLTCCMELRNG-------------------LHYED-I 958
D+C TSI++ HRSCPN C Y+LCL+CC E+R+G +H D +
Sbjct: 295 DHCATSIIDLHRSCPN--CSYELCLSCCQEIRDGSITPRAELKFPYVNRGYDYMHGGDPL 468
Query: 959 PASGNEETID---EPPITSAWRAEINGRIPCPPKARGGCGTSILSLRRLFEANWV 1010
P + ET + EP ++ W+A+ +G I C PK GGCG+++L L +F W+
Sbjct: 469 PVPCDLETSEGHIEP--STVWKAKXDGSISCAPKELGGCGSAVLELXCIFPDGWI 627
>TC222431 similar to UP|O48794 (O48794) F24O1.3, partial (9%)
Length = 664
Score = 79.0 bits (193), Expect = 2e-14
Identities = 40/90 (44%), Positives = 55/90 (60%), Gaps = 1/90 (1%)
Frame = +3
Query: 1050 VRKAASRETNHDNFLYCPDAVDMGDTEYEHFQRHWIRGEPVIVRNVFEKASGLSWHPMVM 1109
+RK A RE +DN +Y P++ + FQ+HW GEP+IVR+V ++ +GLSW PMVM
Sbjct: 126 LRKEAIREGINDNNIYYPESSNTQKEGLLLFQKHWANGEPIIVRDVLKQGTGLSWEPMVM 305
Query: 1110 WRAF-RGANKILKEEPTTFKAIDCLDWCEV 1138
WRA + + + KAIDCL CEV
Sbjct: 306 WRALCENMVSEISSKMSEVKAIDCLANCEV 395
>AW133328 weakly similar to PIR|T10955|T10 early nodulin binding protein 1 -
spring vetch, partial (2%)
Length = 421
Score = 66.2 bits (160), Expect = 1e-10
Identities = 44/107 (41%), Positives = 58/107 (54%), Gaps = 4/107 (3%)
Frame = +2
Query: 672 VKPKRGRPKGSKNKMKSIANKARNKFGKVRNMRGRPKGSLRKKNETAYCLDSQNERNSL- 730
V+ K GRPKG K+K A +A N F K RGRPKGS RK+ E+AY DS ER+ L
Sbjct: 8 VRTKHGRPKGYKSK----AEEAGNNFDKGGKKRGRPKGSHRKEKESAYHFDSLIERHGLV 175
Query: 731 ---DGRTSTEAAYRNDVDLHRGHCSQEELLRMLSVEHKNIQGVGVEE 774
+G TS E+A +ND + S + R+ K Q G++E
Sbjct: 176 AEKEGXTSAESASKNDTGQEKKIYSCQRSSRITRQAIKQSQSRGLKE 316
Score = 35.8 bits (81), Expect = 0.15
Identities = 33/113 (29%), Positives = 49/113 (43%), Gaps = 16/113 (14%)
Frame = +2
Query: 500 KCGRPKGSKNKQKNIVQVSQEVAGSADCEIAGPKKCGRPKGSMKKRKSLV--CASILEGA 557
K GRPKG K+K ++E + D G KK GRPKGS +K K S++E
Sbjct: 17 KHGRPKGYKSK-------AEEAGNNFD---KGGKKRGRPKGSHRKEKESAYHFDSLIERH 166
Query: 558 GGITRE--------------GLENKMLSNLCQEHIEYTQPVVRGGRPKGSRNK 596
G + + G E K+ S CQ T+ ++ + +G + K
Sbjct: 167 GLVAEKEGXTSAESASKNDTGQEKKIYS--CQRSSRITRQAIKQSQSRGLKEK 319
Score = 33.1 bits (74), Expect = 0.96
Identities = 20/47 (42%), Positives = 24/47 (50%)
Frame = +2
Query: 466 KRGRPRGSKNKVNNIMEVSKKAASGGDCKIAGPKKCGRPKGSKNKQK 512
K GRP+G K+K A G+ G KK GRPKGS K+K
Sbjct: 17 KHGRPKGYKSK----------AEEAGNNFDKGGKKRGRPKGSHRKEK 127
Score = 32.7 bits (73), Expect = 1.3
Identities = 21/45 (46%), Positives = 25/45 (54%)
Frame = +2
Query: 404 KHGRPKSSKCRKKNIMEAGDEAAGEIGGDKKLGRPKGSLNKLKNT 448
KHGRPK K + + EAG+ G KK GRPKGS K K +
Sbjct: 17 KHGRPKGYKSKAE---EAGNNFDK---GGKKRGRPKGSHRKEKES 133
>TC229360 homologue to UP|O48794 (O48794) F24O1.3, partial (8%)
Length = 421
Score = 60.8 bits (146), Expect = 4e-09
Identities = 25/27 (92%), Positives = 26/27 (95%)
Frame = +3
Query: 1514 GIEPWTFEQHLGEAVFIPAGCPHQVRN 1540
G+EPWTFEQ LGEAVFIPAGCPHQVRN
Sbjct: 12 GVEPWTFEQKLGEAVFIPAGCPHQVRN 92
>BE821849
Length = 443
Score = 50.1 bits (118), Expect = 8e-06
Identities = 19/34 (55%), Positives = 21/34 (60%)
Frame = -2
Query: 798 LRCHQCWRNSWSGVVICAKCKRKQYCYECITKWY 831
L CHQC RN VV C CKRK++C CI WY
Sbjct: 265 LMCHQCQRNDKGRVVRCTSCKRKRFCVHCIENWY 164
>CO984522
Length = 758
Score = 47.4 bits (111), Expect = 5e-05
Identities = 20/29 (68%), Positives = 25/29 (85%)
Frame = -2
Query: 1455 AVWDIFRRQDVPKLTEYLKKHHKEFRHAN 1483
A+WDIF RQDVPKL EYLKK+ +EFR+ +
Sbjct: 103 ALWDIFWRQDVPKLQEYLKKNFREFRYVH 17
Score = 34.3 bits (77), Expect = 0.43
Identities = 16/37 (43%), Positives = 25/37 (67%)
Frame = -2
Query: 1247 VNILTHTEEVKAPLWQPKIIKKLQKKYEAEDMRDLYG 1283
VN+LTH EVK Q +I+KL++K+ ++ R+L G
Sbjct: 490 VNVLTHIAEVKLDCDQLTVIEKLKQKHLEQEKRELLG 380
>BG238864
Length = 363
Score = 44.7 bits (104), Expect = 3e-04
Identities = 19/33 (57%), Positives = 22/33 (66%)
Frame = +1
Query: 920 NCNTSIVNFHRSCPNPNCRYDLCLTCCMELRNG 952
NC TSI ++HRSC C +DLCL CC EL G
Sbjct: 1 NCKTSIFDYHRSCTK--CSFDLCLICCRELLMG 93
Score = 30.8 bits (68), Expect = 4.8
Identities = 14/35 (40%), Positives = 18/35 (51%)
Frame = +3
Query: 976 WRAEINGRIPCPPKARGGCGTSILSLRRLFEANWV 1010
W A NG IPC PK G C L LR + +++
Sbjct: 258 WHA*SNGNIPC-PKVNGECNHGFLELRTILGKHFI 359
>TC234962 similar to UP|Q9FXE1 (Q9FXE1) F12A21.9, partial (17%)
Length = 524
Score = 44.3 bits (103), Expect = 4e-04
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 9/75 (12%)
Frame = -2
Query: 800 CHQCWRNSWSGVVICAKCKRKQ-----YCYECITKWYPGKTREEIEIA----CPFCLRNC 850
CHQC + + V C K+ + +C++C+ Y G+ E++E CP C C
Sbjct: 523 CHQCRQKTRDFAVSCKNMKKGKPCPINFCHKCLLNRY-GEKAEKVEQLGNWMCPKCRNFC 347
Query: 851 NCRLCLKKDISVMTG 865
NC C KK + TG
Sbjct: 346 NCSFCRKKQGELPTG 302
>TC210174
Length = 861
Score = 43.1 bits (100), Expect = 0.001
Identities = 29/109 (26%), Positives = 52/109 (47%), Gaps = 5/109 (4%)
Frame = +2
Query: 800 CHQCWRNSWSGVVICAKCK--RKQYCYECITKWYPGKTREEIEIA---CPFCLRNCNCRL 854
CHQC + + C++C + Q+C +C+ Y E ++ CP C CNC L
Sbjct: 107 CHQCRQKTLGYRTSCSQCNMVQGQFCGDCLYMRYGEHVLEALQNPTWLCPVCRGICNCSL 286
Query: 855 CLKKDISVMTGSGEADTGVILQKLLYLLNKTLPLLQHIQREQISEMEVE 903
C + G A TG + +K+ L K+ + ++ + + SE++V+
Sbjct: 287 CRQ-------AKGWAPTGTLYKKISTLGYKS--VAHYLIQTRRSEIDVK 406
>TC231109 weakly similar to UP|Q9LUP7 (Q9LUP7) Gb|AAD25583.1, partial (8%)
Length = 1061
Score = 39.3 bits (90), Expect = 0.013
Identities = 35/123 (28%), Positives = 51/123 (41%), Gaps = 1/123 (0%)
Frame = -1
Query: 121 KGEEGQQVHSGGFGEGCGRMGQVLGDYGVEYAE-DRNAVAGLGAFRNVGNEDHGCVAGRN 179
+G + H G EG G G+VL G E E D VGN V G
Sbjct: 686 EGAGDEDAHGAGAAEGVGHAGEVLPGVGFEGVELDLGIEGAFLEIHEVGNAQELVVVGG- 510
Query: 180 VCVNDRLGLPSEGIESLIGEEPGFGSFQALLCKDRGCAEDVIFIGDVTGFEGLSGENTHG 239
G+ +E +++ GEE G G+ +A+ ++ AE+ + GDV G + E G
Sbjct: 509 -------GVVAEAVDAG-GEEGGLGAVEAVGGENTVLAEERVPEGDVVSNVGDAEEAVGG 354
Query: 240 FRD 242
D
Sbjct: 353 AED 345
>BI974743
Length = 421
Score = 38.1 bits (87), Expect = 0.030
Identities = 36/159 (22%), Positives = 63/159 (38%), Gaps = 9/159 (5%)
Frame = +3
Query: 682 SKNKMKSIANKARNKFGKVRNMRGRPKGSLRKKNETAYCLDSQNERNSLDGRTS--TEAA 739
SKNK + NK +NK K RK+ + C + R L + A+
Sbjct: 3 SKNKKNN--NKKKNK----------KKEKRRKEEKEELCYTKEELRRELPNGVMEISPAS 146
Query: 740 YRNDVDLHRGHCSQEELLRMLSV-----EHKNIQGV--GVEETIDYGLRSSGLMGDTERK 792
D + HC + + +V KN+ V G + + YG
Sbjct: 147 PTRDYNNVGSHCDVKVGVDSKTVTPRYFRSKNVDRVPAGKLQIVPYG----------SNL 296
Query: 793 KETRILRCHQCWRNSWSGVVICAKCKRKQYCYECITKWY 831
K+ + +CH C R+ ++ C+ C+R+ +C +C+ + Y
Sbjct: 297 KKGKRKKCHWCQRSESGNLIQCSSCQREFFCMDCVKERY 413
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.315 0.135 0.409
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 71,583,259
Number of Sequences: 63676
Number of extensions: 1088665
Number of successful extensions: 4704
Number of sequences better than 10.0: 68
Number of HSP's better than 10.0 without gapping: 4549
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4693
length of query: 1541
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1431
effective length of database: 5,635,272
effective search space: 8064074232
effective search space used: 8064074232
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0103.12