
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149547.3 + phase: 0
(492 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E value
TC86799 similar to GP|22655368|gb|AAM98276.1 At2g17760/At2g17760... 433 e-122
TC84053 similar to PIR|T04698|T04698 hypothetical protein F4B14.... 383 e-106
TC91764 similar to GP|22655368|gb|AAM98276.1 At2g17760/At2g17760... 155 2e-89
CA858223 similar to GP|22655368|gb At2g17760/At2g17760 {Arabidop... 243 1e-71
TC81854 weakly similar to PIR|T50012|T50012 hypothetical protein... 147 1e-35
TC79170 similar to GP|10177232|dbj|BAB10606. protease-like prote... 136 2e-32
TC90871 similar to PIR|T45765|T45765 hypothetical protein F24M12... 110 1e-24
TC79244 weakly similar to PIR|B86193|B86193 hypothetical protein... 105 4e-23
TC77221 weakly similar to PIR|T01996|T01996 nucleoid DNA-binding... 100 2e-21
AW586678 similar to PIR|T50012|T50 hypothetical protein T31P16.7... 79 9e-21
BF641229 weakly similar to GP|22655368|gb| At2g17760/At2g17760 {... 96 3e-20
TC86774 similar to PIR|T06000|T06000 aspartic proteinase homolog... 94 1e-19
TC77267 similar to GP|9759559|dbj|BAB11161.1 nucleoid DNA-bindin... 93 3e-19
TC78047 similar to PIR|T02706|T02706 hypothetical protein At2g03... 92 3e-19
BG451032 similar to PIR|T50012|T500 hypothetical protein T31P16.... 92 5e-19
TC77266 similar to GP|9759559|dbj|BAB11161.1 nucleoid DNA-bindin... 91 1e-18
TC86488 similar to GP|9665144|gb|AAF97328.1| Unknown protein {Ar... 89 5e-18
TC89288 weakly similar to GP|21740390|emb|CAD40869. OSJNBa0064H2... 84 1e-16
TC83406 weakly similar to GP|19699359|gb|AAL91289.1 At1g79720/F1... 76 3e-14
BG648486 similar to PIR|T45858|T45 hypothetical protein F3A4.130... 72 4e-13
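The hit list above can be read programmatically. What follows is a minimal sketch in Python, assuming every summary line has the shape "<id> <description> <bits> <E value>" with whitespace between fields. The name parse_hit_line and the regular expression are illustrative only and are not part of BLAST. This report prints very small E values without a mantissa (for example "e-122" in the first row), which float() rejects, so the sketch prefixes a "1" in that case.

```python
import re

# Hypothetical pattern for one summary line of the hit list:
#   <sequence id> <free-text description> <bit score> <E value>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+"
    r"(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>(?:\d+(?:\.\d+)?)?e-\d+|\d+(?:\.\d+)?)$"
)


def parse_hit_line(line):
    """Return (id, description, bit_score, e_value) for one summary line."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    evalue_text = match.group("evalue")
    # "e-122" has no mantissa; float() needs "1e-122".
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return (
        match.group("id"),
        match.group("desc"),
        float(match.group("bits")),
        float(evalue_text),
    )


if __name__ == "__main__":
    example = ("TC86799 similar to GP|22655368|gb|AAM98276.1 "
               "At2g17760/At2g17760... 433 e-122")
    print(parse_hit_line(example))
```

The description is matched lazily so that the last two whitespace-separated fields are always taken as score and E value, even when the description itself contains digits.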
>TC86799 similar to GP|22655368|gb|AAM98276.1 At2g17760/At2g17760
{Arabidopsis thaliana}, partial (48%)
Length = 1850
Score = 433 bits (1113), Expect = e-122
Identities = 222/454 (48%), Positives = 300/454 (65%), Gaps = 11/454 (2%)
Frame = +1
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
+ +I+ ++ C + F F +HHR+S+PVK P+KGS EYY +A
Sbjct: 103 LTLILFLVSQSQRCYGSSSFGFDIHHRFSDPVK-----GILGIDNIPDKGSREYYVAMAH 267
Query: 67 RDRFLRGRRLSQFDAG------LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDT 120
RDR RGRRL+ D G L FS N+T++IS G+LH+ + +GTP ++VALDT
Sbjct: 268 RDRVFRGRRLA--DGGDVDQKLLTFSPDNTTYQISLFGYLHFANVSVGTPASSYLVALDT 441
Query: 121 GSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCL 180
GSDLFW+PC+CT+C + +A ++Y+ SSTSK V CN+SLC + QC
Sbjct: 442 GSDLFWLPCNCTKCVHGIQLSTGQKIA----FNIYDNKESSTSKNVACNSSLCEQKTQCS 609
Query: 181 GTFSN-CPYMVSYVSAETSTSGILVEDVLHL-TQPDDNHDLVEANVIFGCGQVQSGSFLD 238
+ CPY V Y+S TST+G LVEDVLHL T DD + FGCGQVQ+G+FLD
Sbjct: 610 SSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQTQHANPLITFGCGQVQTGAFLD 789
Query: 239 VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGD-KGSLDQDETPFN 297
AAPNGLFGLGM +SVPS+L+++G T++SFSMCF DG+GRI+FGD SLDQ +TPFN
Sbjct: 790 GAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMCFAADGLGRITFGDNNSSLDQGKTPFN 969
Query: 298 VNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRR 357
+ PSH TYNIT+ Q+ VG D+EF A+FD+GTSFTYL +P Y ++++SF S+++ +R
Sbjct: 970 IRPSHSTYNITVTQIIVGGNSADLEFNAIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRH 1149
Query: 358 --PPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVV 415
+PF+YCYD+ + T +P+++LTM GG + V DPII + V CLAV+
Sbjct: 1150 SFSNSDDLPFEYCYDLR-TNQTIEVPNINLTMKGGDNYFVMDPIITSGGGNNGVLCLAVL 1326
Query: 416 KSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
KS +NIIGQNFMTGYR+VFDR + LGWK+S+C
Sbjct: 1327 KSNNVNIIGQNFMTGYRIVFDRENMTLGWKESNC 1428
>TC84053 similar to PIR|T04698|T04698 hypothetical protein F4B14.150 -
Arabidopsis thaliana, partial (19%)
Length = 590
Score = 383 bits (983), Expect = e-106
Identities = 183/184 (99%), Positives = 184/184 (99%)
Frame = +3
Query: 1 MLSFTKIIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEY 60
MLSFTKIIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEY
Sbjct: 39 MLSFTKIIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEY 218
Query: 61 YAELADRDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDT 120
YAELADRDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDT
Sbjct: 219 YAELADRDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDT 398
Query: 121 GSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCL 180
GSDLFWVPCDCTRCSATR+SAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCL
Sbjct: 399 GSDLFWVPCDCTRCSATRNSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCL 578
Query: 181 GTFS 184
GTFS
Sbjct: 579 GTFS 590
>TC91764 similar to GP|22655368|gb|AAM98276.1 At2g17760/At2g17760
{Arabidopsis thaliana}, partial (20%)
Length = 996
Score = 155 bits (393), Expect(3) = 2e-89
Identities = 77/85 (90%), Positives = 80/85 (93%)
Frame = +1
Query: 365 FDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAELNIIG 424
+ + + SPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAELNIIG
Sbjct: 385 YSFLWFNSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAELNIIG 564
Query: 425 QNFMTGYRVVFDRGKLILGWKKSDC 449
QNFMTGYRVVFDR KLILGWKKSDC
Sbjct: 565 QNFMTGYRVVFDREKLILGWKKSDC 639
Score = 143 bits (361), Expect(3) = 2e-89
Identities = 68/69 (98%), Positives = 68/69 (98%)
Frame = +2
Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSE FHSQVEDRRRPPDSR
Sbjct: 119 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSEIFHSQVEDRRRPPDSR 298
Query: 363 IPFDYCYDM 371
IPFDYCYDM
Sbjct: 299 IPFDYCYDM 325
Score = 70.1 bits (170), Expect(3) = 2e-89
Identities = 31/31 (100%), Positives = 31/31 (100%)
Frame = +1
Query: 271 MCFGRDGIGRISFGDKGSLDQDETPFNVNPS 301
MCFGRDGIGRISFGDKGSLDQDETPFNVNPS
Sbjct: 16 MCFGRDGIGRISFGDKGSLDQDETPFNVNPS 108
>CA858223 similar to GP|22655368|gb At2g17760/At2g17760 {Arabidopsis
thaliana}, partial (32%)
Length = 813
Score = 243 bits (621), Expect(2) = 1e-71
Identities = 120/210 (57%), Positives = 152/210 (72%), Gaps = 3/210 (1%)
Frame = +3
Query: 176 RNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGS 235
+ QC + S+C Y V Y+S +TS+SG LVEDVLHL +D ++ + GCGQVQ+G
Sbjct: 12 QTQCHSSGSSCRYEVEYLSNDTSSSGFLVEDVLHLITDNDQTKDIDTQITIGCGQVQTGV 191
Query: 236 FLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETP 295
FL+ AAPNGLFGLGME +SVPS+L+++G +DSFSMCFG DG GRI+FGD GS DQ +TP
Sbjct: 192 FLNGAAPNGLFGLGMENVSVPSILAQKGLISDSFSMCFGSDGSGRITFGDTGSSDQGKTP 371
Query: 296 FNVNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDR 355
FN+ SHPTYN+TI Q+ VG D EF A+FDSGTSFTYL DP Y+ +SE F+S V+
Sbjct: 372 FNLRESHPTYNVTITQIIVGGYAADHEFHAIFDSGTSFTYLNDPAYTLISEKFNSLVKAN 551
Query: 356 RR---PPDSRIPFDYCYDMSPDSNTSLIPS 382
R PDS +PF+YCYDM D+ +SL S
Sbjct: 552 RHSPLSPDSDLPFEYCYDMRSDN*SSLFES 641
Score = 45.1 bits (105), Expect(2) = 1e-71
Identities = 22/51 (43%), Positives = 34/51 (66%), Gaps = 1/51 (1%)
Frame = +1
Query: 380 IPSMSLTMGGGSRFVVYDPIIIISTQSE-LVYCLAVVKSAELNIIGQNFMT 429
+P ++LTM GG + V DPI+ +S++ E + CL + KS LNIIG+ + T
Sbjct: 625 VPFLNLTMKGGDDYYVTDPIVPVSSEVEGNLLCLGIQKSDNLNIIGREYTT 777
>TC81854 weakly similar to PIR|T50012|T50012 hypothetical protein T31P16.70
- Arabidopsis thaliana, partial (29%)
Length = 1124
Score = 147 bits (370), Expect = 1e-35
Identities = 87/214 (40%), Positives = 121/214 (55%), Gaps = 4/214 (1%)
Frame = +2
Query: 241 APNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPF-NVN 299
AP+GL GLG + SVPS L++ G DSFS+CF D GR+ FGD+GS Q TPF V+
Sbjct: 2 APDGLIGLGPGESSVPSFLAKSGLIRDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVD 181
Query: 300 PSHPTYNITINQVRVGTTLIDV-EFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRP 358
TY + + +G + V F A FDSGTSFT+L Y ++E F QV + R
Sbjct: 182 GMFSTYIVGVETCCIGNSCPKVTSFNAQFDSGTSFTFLPGHAYGAIAEEFDKQV-NATRS 358
Query: 359 PDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIII-ISTQSELVYCLAVVKS 417
P++YCY + IP+++L + FVVY+P+ + + Q +CLA+ +
Sbjct: 359 TFQGSPWEYCY-VPSSQQLPKIPTLTLMFQQNNSFVVYNPVFVSYNEQGVDGFCLAIQPT 535
Query: 418 -AELNIIGQNFMTGYRVVFDRGKLILGWKKSDCK 450
+ IGQNFMTGYR+VFDR L W S+C+
Sbjct: 536 EGGMGTIGQNFMTGYRLVFDRENKKLAWSHSNCQ 637
>TC79170 similar to GP|10177232|dbj|BAB10606. protease-like protein
{Arabidopsis thaliana}, partial (65%)
Length = 1607
Score = 136 bits (342), Expect = 2e-32
Identities = 110/365 (30%), Positives = 166/365 (45%), Gaps = 20/365 (5%)
Frame = +2
Query: 62 AELADRDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTG 121
++L RD LR RRL G+A + TF +G L++T + LGTP V+F V +DTG
Sbjct: 536 SQLKARD-LLRHRRLQSSSNGVADFAVHGTFDPFQVG-LYFTKVLLGTPPVEFYVQIDTG 709
Query: 122 SDLFWVPC-DCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQ-- 178
SD+ WV C C C T +L+ ++P SSTS ++C++ C Q
Sbjct: 710 SDVLWVSCSSCNGCPQTS--------GLQIELNFFDPRSSSTSSLISCSDKRCNSGIQSS 865
Query: 179 ---CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLV--EANVIFGCGQVQS 233
C G + C Y Y + TSG V D +HL + A V+FGC QS
Sbjct: 866 DATCSGQTNQCSYTFQYGDG-SGTSGYYVSDTMHLDTIFEGSVSTNSSAPVVFGCSNQQS 1042
Query: 234 GSFL-DVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRD--GIGRISFGDKGSLD 290
G A +G+FG G +++SV S LS +G + FS C D G G + G+ +
Sbjct: 1043 GDLTKSDRAVDGIFGFGQQQMSVISQLSSQGIASGVFSHCLRGDSSGGGILVLGEIVEPN 1222
Query: 291 QDETPFNVNPSHPTYNITINQVRVGTTLIDVEFT---------ALFDSGTSFTYLVDPTY 341
TP + PS P YN+ + + V + V+ + + DSGT+ YL + Y
Sbjct: 1223 IVYTP--LVPSQPHYNLNLQSISVNGQALQVDPSVFATSSNRGTIVDSGTTLAYLAEEAY 1396
Query: 342 SRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIII 401
+ + + R SR + C+ + DS + + P +SL GG+ V+ +
Sbjct: 1397 DPFVNAITATIPQSVRTVVSR--GNQCF-LITDSVSDIFPQVSLNFAGGASMVLRPQDYL 1567
Query: 402 ISTQS 406
I S
Sbjct: 1568 IQQNS 1582
>TC90871 similar to PIR|T45765|T45765 hypothetical protein F24M12.380 -
Arabidopsis thaliana, partial (7%)
Length = 507
Score = 110 bits (275), Expect = 1e-24
Identities = 53/92 (57%), Positives = 69/92 (74%), Gaps = 1/92 (1%)
Frame = +2
Query: 359 PDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSE-LVYCLAVVKS 417
PDS +PF+YCYDMSPD T +P ++LTM GG + V DPI+ +S++ E + CL + KS
Sbjct: 23 PDSDLPFEYCYDMSPDQ-TIEVPFLNLTMKGGDDYYVTDPIVPVSSEVEGNLLCLGIQKS 199
Query: 418 AELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
LNIIGQNFMTGYR+VFDR + LGWK+S+C
Sbjct: 200 DNLNIIGQNFMTGYRIVFDRENMNLGWKESNC 295
>TC79244 weakly similar to PIR|B86193|B86193 hypothetical protein [imported]
- Arabidopsis thaliana, partial (57%)
Length = 1114
Score = 105 bits (262), Expect = 4e-23
Identities = 89/243 (36%), Positives = 112/243 (45%), Gaps = 11/243 (4%)
Frame = +3
Query: 72 RGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPC-D 130
RGR L+ D L GN SS G L+YT + LG+P +F V +DTGSD+ WV C
Sbjct: 408 RGRFLAAIDVPLG---GNGL--PSSTG-LYYTKVGLGSPAKEFYVQVDTGSDILWVNCAG 569
Query: 131 CTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHR-----NQCLGTFSN 185
CT C DL++Y+PNGS TS V C + CT + C S
Sbjct: 570 CTACPKKSGLG--------MDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMS- 722
Query: 186 CPYMVSYVSAETSTSGILVEDVLHLTQPDDN-HDLVE-ANVIFGCGQVQSGSFLDVA--A 241
CPY ++Y T TSG V D L + N H + ++VIFGCG QSGS + A
Sbjct: 723 CPYSITYGDGST-TSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEA 899
Query: 242 PNGLFGLGMEKISVPSMLSREGFTADSFSMCF-GRDGIGRISFGDKGSLDQDETPFNVNP 300
+G+ G G SV S L+ G FS C G G +S G E FN P
Sbjct: 900 LDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGILSIG-----QVMEPKFNTTP 1064
Query: 301 SHP 303
P
Sbjct: 1065 LVP 1073
>TC77221 weakly similar to PIR|T01996|T01996 nucleoid DNA-binding protein
cnd41 chloroplast - common tobacco, partial (63%)
Length = 2150
Score = 99.8 bits (247), Expect = 2e-21
Identities = 96/356 (26%), Positives = 150/356 (41%), Gaps = 18/356 (5%)
Frame = +1
Query: 101 HYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGS 160
++ + LGTP + DTGSDL W T+C S + +Y+P S
Sbjct: 592 YFVVLGLGTPKKDLSLIFDTGSDLTW-----TQCQPCVGSCYKQ------QDEIYDPTKS 738
Query: 161 STSKKVTCNNSLCTHRN-------QCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQP 213
++ +TC +S CT + +C + C Y + Y S E + + P
Sbjct: 739 TSYYNITCTSSDCTQLSSATGNDPRCAKVSNACVYGIQYGDQSFSVGYFSRERL--IVNP 912
Query: 214 DDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF 273
D D + +FGCGQ G F GL GLG IS S++ S+ +
Sbjct: 913 TDAID----SFLFGCGQDNEGLF---GGSAGLLGLGRHPISFVQQTSQKYQKTFSYCLPS 1071
Query: 274 GRDGIGRISFGDKGSLDQDETPFN-VNPSHPTYNITINQVRVGTTLIDVEFT------AL 326
G+G ++FG + T F+ V+ S+ Y + I + VG T + + + A+
Sbjct: 1072 TSSGVGHLTFGASDNKYVKYTSFSTVSRSNSFYGLDIAGISVGGTKLPISSSIFSSGGAI 1251
Query: 327 FDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLT 386
DSGT T L Y+ L +SF + P I D CYD+S S IP +S
Sbjct: 1252 IDSGTVITRLPPTAYASLRDSFKKGMTKYPVAPAVSI-LDTCYDLSGYKIVS-IPKISFF 1425
Query: 387 MGGGSRFVVYDP-IIIISTQSELVYCLAVVKS---AELNIIGQNFMTGYRVVFDRG 438
+GGG + P I+ +++ + CLA + +++ I G VV+D G
Sbjct: 1426 LGGGVTVEIAAPGILYVASLKQA--CLAFAPNGDDSDITIFGNVQQRTLEVVYDVG 1587
>AW586678 similar to PIR|T50012|T50 hypothetical protein T31P16.70 -
Arabidopsis thaliana, partial (20%)
Length = 558
Score = 79.0 bits (193), Expect(2) = 9e-21
Identities = 37/73 (50%), Positives = 46/73 (62%)
Frame = +1
Query: 224 VIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISF 283
V+ GC G +LD AP+GL GLG + SVPS L++ G DSFS+CF D GR+ F
Sbjct: 4 VVVGCFMKHCGGYLDGTAPDGLIGLGPGESSVPSFLAKSGLIRDSFSLCFNEDDSGRLFF 183
Query: 284 GDKGSLDQDETPF 296
GD+GS Q TPF
Sbjct: 184 GDQGSTVQQSTPF 222
Score = 39.3 bits (90), Expect(2) = 9e-21
Identities = 24/67 (35%), Positives = 34/67 (49%), Gaps = 1/67 (1%)
Frame = +2
Query: 304 TYNITINQVRVGTTLIDV-EFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
TY + + +G + V F A FDSGTSFT+L Y ++E F QV + R
Sbjct: 341 TYIVGVETCCIGNSCPKVTSFNAQFDSGTSFTFLPGHAYGAIAEEFDKQV-NATRSTFQG 517
Query: 363 IPFDYCY 369
P++Y Y
Sbjct: 518 SPWEYXY 538
>BF641229 weakly similar to GP|22655368|gb| At2g17760/At2g17760 {Arabidopsis
thaliana}, partial (16%)
Length = 465
Score = 95.9 bits (237), Expect = 3e-20
Identities = 50/125 (40%), Positives = 76/125 (60%), Gaps = 5/125 (4%)
Frame = +3
Query: 7 IIVIILIILHLSM----CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYA 62
+++++L++L LS+ C + F +HHR+S+PV + P KG+ +YYA
Sbjct: 96 VLLLLLMVLVLSLSSHSCYSLGKFGLDIHHRFSDPVTEIL--GIGNDELLPHKGTPQYYA 269
Query: 63 ELADRDRFLRGRRLSQF-DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTG 121
+ RDR GRRL+ D + F+ GN T I++ GFLH+ + +GTP + F+VALDTG
Sbjct: 270 AMVHRDRVFHGRRLADDRDTPITFAAGNETHXIAAFGFLHFANVSVGTPPLWFLVALDTG 449
Query: 122 SDLFW 126
SDLFW
Sbjct: 450 SDLFW 464
>TC86774 similar to PIR|T06000|T06000 aspartic proteinase homolog F17M5.250
- Arabidopsis thaliana, partial (81%)
Length = 1895
Score = 94.0 bits (232), Expect = 1e-19
Identities = 105/415 (25%), Positives = 175/415 (41%), Gaps = 32/415 (7%)
Frame = +2
Query: 67 RDRFLRGRRLSQFDAGLAFSDGNSTF-----RISSLGFLHYTTIELGTPGVKFMVALDTG 121
R+ L G +S + + + G+S + +GF + T+ +G P + + +DTG
Sbjct: 467 RNSILPGEAMSSRPSLMNHAAGSSIVFPIYGNVYPVGFYN-VTLNIGQPPRPYFLDVDTG 643
Query: 122 SDLFWVPCD--CTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLC-----T 174
S+L W+ CD C++CS T + + +DF + C + LC T
Sbjct: 644 SELTWLQCDAPCSQCSETPHPLYKPS--NDF---------------IPCKDPLCASLQPT 772
Query: 175 HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSG 234
C + C Y + Y + ST G+L+ DV L N ++ + GCG Q
Sbjct: 773 DDYTCEDP-NQCDYEIKYAD-QYSTLGVLLNDVYLLNFT--NGVQLKVRMALGCGYDQIF 940
Query: 235 SFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGD-KGSLDQDE 293
S +G+ GLG K S+ S L+ +G + C G G I FG+ S
Sbjct: 941 SPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYIFFGNVYDSSRMSW 1120
Query: 294 TPFNVNPSHPTYNITINQVRVGTTLIDV-EFTALFDSGTSFTYLVDPTY----SRLSESF 348
TP + S Y+ ++ G V +FD+G+S+TY Y S L++
Sbjct: 1121 TPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFNSQAYQAMISLLNKEL 1300
Query: 349 HSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIP------SMSLTMGGGSR---FVVYDPI 399
H + + P D +P + + P + + + ++S T GG + + +
Sbjct: 1301 HRK-PIKAAPDDQTLPMCW-HGKRPFRSINEVKKYFKPLTLSFTNGGRVKPQFEIPPEAY 1474
Query: 400 IIISTQSELVYCLAVVKS-----AELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
+IIS + CL ++ ELN+IG M +VFD K ++GW +DC
Sbjct: 1475 LIISNMGNV--CLGILNGPEVGLGELNLIGDISMLDKVMVFDNEKQLIGWGPADC 1633
>TC77267 similar to GP|9759559|dbj|BAB11161.1 nucleoid DNA-binding-like
protein {Arabidopsis thaliana}, partial (90%)
Length = 1653
Score = 92.8 bits (229), Expect = 3e-19
Identities = 97/356 (27%), Positives = 155/356 (43%), Gaps = 25/356 (7%)
Frame = +1
Query: 106 ELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKK 165
++GTP ++A+DT +D W+P CT C S+ FA P S+T K
Sbjct: 307 KIGTPPQTLLLAMDTSNDAAWIP--CTACDGCASTLFA-------------PEKSTTFKN 441
Query: 166 VTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHL-TQPDDNHDLVEANV 224
V+C C S+C + ++Y S +S + LV+D + L T P ++
Sbjct: 442 VSCAAPECKQVPNPGCGVSSCNFNLTYGS--SSIAANLVQDTITLATDPVPSY------- 594
Query: 225 IFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFG 284
FGC +G+ A P GL GLG +S+ S + +FS C ++F
Sbjct: 595 TFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--KTQNLYQSTFSYCL--PSFKSLNFS 753
Query: 285 DK---GSLDQDE----TPFNVNPSHPT-YNITINQVRVGTTLIDVEFTAL---------- 326
G + Q + TP NP + Y + + +RVG ++D+ AL
Sbjct: 754 GSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGT 933
Query: 327 -FDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSL 385
FDSGT FT LV P Y + + F +V + S FD CY++ ++P+++
Sbjct: 934 IFDSGTVFTRLVAPVYVAVRDEFRRKV-GPKLTVTSLGGFDTCYNV-----PIVVPTITF 1095
Query: 386 TMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAE-----LNIIGQNFMTGYRVVFD 436
G + + D I+I ST CLA+ + + LN+I +RV++D
Sbjct: 1096 IFTGMNVTLPQDNILIHSTAGSTT-CLAMAGAPDNVNSVLNVIANMQQQNHRVLYD 1260
>TC78047 similar to PIR|T02706|T02706 hypothetical protein At2g03200
[imported] - Arabidopsis thaliana, partial (67%)
Length = 1727
Score = 92.4 bits (228), Expect = 3e-19
Identities = 91/373 (24%), Positives = 152/373 (40%), Gaps = 21/373 (5%)
Frame = +2
Query: 105 IELGTPGVKFMVALDTGSDLFWVPCD-CTRCSATRSSAFASALASDFDLSVYNPNGSSTS 163
+ +GTP + + LDTGSDL W C+ C++C + +++P SST
Sbjct: 557 LSIGTPPISYPAVLDTGSDLIWTQCEPCSQCYKQPT-------------PIFDPKKSSTF 697
Query: 164 KKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEA- 222
K++C+++LC + + C Y+ SY + T GIL + T DD + V
Sbjct: 698 SKLSCSSNLCNALPSPTCSNNGCNYVYSY-GDYSMTQGILGSET--FTFGDDKKNQVSVK 868
Query: 223 NVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRIS 282
N+ FGCG+ G + A +GL GLG +S+ S L + F+ SM + +
Sbjct: 869 NIGFGCGEDNEGKGFEQA--SGLVGLGRGPLSLVSQLQEQEFSYCLTSMDEHKTKCSFVR 1042
Query: 283 FGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLI--------------DVEFTALFD 328
F K + +++ H + I ++V + D + D
Sbjct: 1043 FSSKC*CHKTGKQHHLSQIHCNHLFIIFHLKVSLLVTQNCQLRQSTFEVSDDGSGGVIID 1222
Query: 329 SGTSFTYLVDPTYSRLSESFHSQVEDRRRPPD--SRIPFDYCYDMSPDSNTSLIPSMSLT 386
SGT+ TY+ + + L + F SQ + P D D C+ + IP +
Sbjct: 1223 SGTTITYIEENAFDSLKKEFTSQT---KLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFH 1393
Query: 387 MGGGSRFVVYDPIIIISTQSELVYCLAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKK 446
GG + +I+ S V CLA+ S ++I G V D K + +
Sbjct: 1394 FKGGD-LELPGENYMIADSSLGVACLAMGASNGMSIFGNIQQQNILVNHDLQKETITFIP 1570
Query: 447 SDCKWL---FFCH 456
+ C + + CH
Sbjct: 1571 TQCNKVVIEYLCH 1609
>BG451032 similar to PIR|T50012|T500 hypothetical protein T31P16.70 -
Arabidopsis thaliana, partial (15%)
Length = 636
Score = 92.0 bits (227), Expect = 5e-19
Identities = 54/139 (38%), Positives = 78/139 (55%), Gaps = 2/139 (1%)
Frame = +3
Query: 26 FTFTMHHRYSEPVK-KWSHSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAGLA 84
F+ + HR+S+ K ++ WP++GS EY+ L + D + +L D
Sbjct: 219 FSSRIIHRFSDEAKVHLRNNGGENVQSWPKRGSSEYFRLLLNSDLTRQKMKLGSQDQSFY 398
Query: 85 FSDGNSTFRISS-LGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFA 143
S+G+ T + +LHYT I++GTP V F+VALDTGSD+FWVPCDC C A AF
Sbjct: 399 PSEGSKTLSFGNDFVWLHYTWIDIGTPNVSFLVALDTGSDMFWVPCDCIXC-APLXXAFY 575
Query: 144 SALASDFDLSVYNPNGSST 162
+AL D DL+ P+ S+
Sbjct: 576 NAL--DRDLNXXXPSLXSS 626
>TC77266 similar to GP|9759559|dbj|BAB11161.1 nucleoid DNA-binding-like
protein {Arabidopsis thaliana}, partial (90%)
Length = 1604
Score = 90.9 bits (224), Expect = 1e-18
Identities = 98/370 (26%), Positives = 156/370 (41%), Gaps = 26/370 (7%)
Frame = +1
Query: 106 ELGTPGVKFMVALDTGSDLFWVPCD-CTRCSATRSSAFASALASDFDLSVYNPNGSSTSK 164
+ GTP ++ALDT SD W+PC C CS ++ A P S++ +
Sbjct: 319 KFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKPFA---------------PIKSTSFR 453
Query: 165 KVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHL-TQPDDNHDLVEAN 223
V+C + C S C + +Y S+ + S +V+D L L T P +
Sbjct: 454 NVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLATDPIPGY------ 609
Query: 224 VIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISF 283
FGC +GS A GL GLG +S+ S + +FS C I+F
Sbjct: 610 -TFGCVNKTTGS---SAPQQGLLGLGRGPLSLLS--QSQNLYKSTFSYCL--PSFKSINF 765
Query: 284 GDK---GSLDQDE----TPFNVNPSHPT-YNITINQVRVGTTLIDVEFTAL--------- 326
G + Q + TP NP + Y + + ++VG ++D+ AL
Sbjct: 766 SGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAG 945
Query: 327 --FDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMS 384
FDSGT FT L +P Y+ + F +V + P + FD CY++ ++P+++
Sbjct: 946 TIFDSGTVFTRLAEPVYTAVRNEFRRRV-GPKLPVTTLGGFDTCYNV-----PIVVPTIT 1107
Query: 385 LTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAE-----LNIIGQNFMTGYRVVFDRGK 439
G + + D I+I ST CLA+ + + LN+I +RV+FD
Sbjct: 1108 FLFSGMNVTLPPDNIVIHSTAGSTT-CLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPN 1284
Query: 440 LILGWKKSDC 449
+G + C
Sbjct: 1285 SRIGIARELC 1314
>TC86488 similar to GP|9665144|gb|AAF97328.1| Unknown protein {Arabidopsis
thaliana}, partial (74%)
Length = 1753
Score = 88.6 bits (218), Expect = 5e-18
Identities = 95/372 (25%), Positives = 142/372 (37%), Gaps = 22/372 (5%)
Frame = +3
Query: 101 HYTTIELGTPGVKFMVALDTGSDLFWVPC-DCTRCSATRSSAFASALASDFDLSVYNPNG 159
++T I +GTP + LDTGSD+ W+ C C +C + V++P
Sbjct: 450 YFTRIGVGTPARYVFMVLDTGSDVVWLQCAPCRKCYSQAD-------------PVFDPTK 590
Query: 160 SSTSKKVTCNNSLCTHRNQ--CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNH 217
S + + C LC + C C Y VSY + E +
Sbjct: 591 SRSYAGIPCGAPLCRRLDTAGCNTKTKVCQYQVSYGDGSFTFGDFSTETLTF-------R 749
Query: 218 DLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-GRD 276
A V GCG G F+ A GL GLG ++S P R FS C R
Sbjct: 750 KTRVARVALGCGHDNEGLFVGAA---GLLGLGRGRLSFPVQTGRR--FNQKFSYCLVDRS 914
Query: 277 GIGR---ISFGDKG-SLDQDETPFNVNPSHPT-YNITINQVRVGTTLIDVEFTALF---- 327
+ + FGD S TP NP T Y + + + VG + +LF
Sbjct: 915 ATSKPSSVVFGDSAVSRTARFTPLLKNPKLDTFYYVGLLGISVGGAPVRGVSASLFKLDT 1094
Query: 328 --------DSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSL 379
DSGTS T L P Y L ++F ++ P+ + FD C+D+S +
Sbjct: 1095 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRLGATHLKKAPEFSL-FDTCFDLSGLTEVK- 1268
Query: 380 IPSMSLTMGGGS-RFVVYDPIIIISTQSELVYCLAVVKSAELNIIGQNFMTGYRVVFDRG 438
+P++ L G + +I + + A S L+IIG G+RV +D
Sbjct: 1269 VPTLVLHFQGADVSLPAQNYLIPVDNSGSFCFAFAGTMSG-LSIIGNIQQQGFRVSYDLA 1445
Query: 439 KLILGWKKSDCK 450
+G+ C+
Sbjct: 1446 TSRVGFAPKGCE 1481
>TC89288 weakly similar to GP|21740390|emb|CAD40869. OSJNBa0064H22.14 {Oryza
sativa}, partial (22%)
Length = 1358
Score = 84.0 bits (206), Expect = 1e-16
Identities = 91/336 (27%), Positives = 141/336 (41%), Gaps = 26/336 (7%)
Frame = +3
Query: 105 IELGTPGVKFMVALDTGSDLFWVPC-DCTRCSATRSSAFASALASDFDLSVYNPNGSSTS 163
+ +GTP +K DTGSDL W C CT+C ++ F +P SS+
Sbjct: 246 LSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMF-------------DPRSSSSY 386
Query: 164 KKVTCNNSLCTHRNQ--CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVE 221
+TC C + C C Y SY + T G+L ++ L LT +
Sbjct: 387 TNITCGTESCNKLDSSLCSTDQKTCNYTYSYAD-NSITQGVLAQETLTLTS-TTGEPVAF 560
Query: 222 ANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSML-SREGFTADSFSMC---FGRDG 277
+IFGCG SG F D GL GLG +S+ S + S G ++ FS C F D
Sbjct: 561 QGIIFGCGHNNSG-FND--REMGLIGLGRGPLSLISQIGSSLGAGSNMFSQCLVPFNTDP 731
Query: 278 --IGRISFGDKGS------------LDQDETPFNVNPSHPTYNITINQVRV----GTTLI 319
+++FG KGS + +D T + I++ + + G++L
Sbjct: 732 SITSQMNFG-KGSEVLGNGTVSTPLISKDGTGYFAT----LLGISVEDINLPFSNGSSLG 896
Query: 320 DV-EFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTS 378
+ + L DSGT+ TYL + Y RL E ++V P ++ CY + N
Sbjct: 897 TITKGNILIDSGTTITYLPEEFYHRLIEQVRNKV---ALEPFRIDGYELCYQTPTNLNG- 1064
Query: 379 LIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAV 414
P++++ GG V+ P + + +C AV
Sbjct: 1065--PTLTIHFEGGD--VLLTPAQMFIPVQDDNFCFAV 1160
>TC83406 weakly similar to GP|19699359|gb|AAL91289.1 At1g79720/F19K16_30
{Arabidopsis thaliana}, partial (51%)
Length = 1197
Score = 76.3 bits (186), Expect = 3e-14
Identities = 78/270 (28%), Positives = 113/270 (40%), Gaps = 23/270 (8%)
Frame = +2
Query: 105 IELGTPGVKFMVALDTGSDLFWVPCD-CTRCSATRSSAFASALASDFDLSVYNPNGSSTS 163
+ +G V +DTGSDL WV CD C C + + V+NP+ SS+
Sbjct: 419 VTIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQG-------------PVFNPSNSSSY 559
Query: 164 KKVTCNNSLCTHRNQCLGTF--------SNCPYMVSYVSAETSTSGILVEDVLHLTQPDD 215
+ CN+S C + G S+C Y VSY + + VE HL+
Sbjct: 560 NSLLCNSSTCQNLQFTTGNTETCESNNPSSCNYTVSYGDGSFTDGELGVE---HLS---- 718
Query: 216 NHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFT-ADSFSMCF- 273
+ +N +FGCG+ G F V +G+ GLG + SM+S+ T FS C
Sbjct: 719 FGGISVSNFVFGCGRNNKGLFGGV---SGIMGLGRSNL---SMISQTNTTFGGVFSYCLP 880
Query: 274 --GRDGIGRISFGDKGSLDQDETP--FNVNPSHPT----YNITINQVRVGTTLI-DVEF- 323
G + G++ SL ++ TP + S+P Y + + + VG I D F
Sbjct: 881 TTDSGASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFG 1060
Query: 324 --TALFDSGTSFTYLVDPTYSRLSESFHSQ 351
L DSGT T L Y+ L F Q
Sbjct: 1061 NGGILIDSGTVITRLAPSLYNALKAEFLKQ 1150
>BG648486 similar to PIR|T45858|T45 hypothetical protein F3A4.130 -
Arabidopsis thaliana, partial (34%)
Length = 795
Score = 72.4 bits (176), Expect = 4e-13
Identities = 54/181 (29%), Positives = 84/181 (45%), Gaps = 2/181 (1%)
Frame = +3
Query: 101 HYTT-IELGTPGVKFMVALDTGSDLFWVPCD-CTRCSATRSSAFASALASDFDLSVYNPN 158
+YTT + +GTP +F + +DTGS + +VPC C C + F P+
Sbjct: 162 YYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKF-------------QPD 302
Query: 159 GSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHD 218
S T + V C T C G + C Y Y +S+SG+L EDV+ + +
Sbjct: 303 LSETYQPVKC-----TPDCNCDGDTNQCMYDRQYAEM-SSSSGVLGEDVVSF---GNLSE 455
Query: 219 LVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI 278
L +FGC ++G A +G+ GLG +S+ L + +DSFS+C+G +
Sbjct: 456 LAPQRAVFGCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV 632
Query: 279 G 279
G
Sbjct: 633 G 635
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.323 0.137 0.425
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,300,660
Number of Sequences: 36976
Number of extensions: 263033
Number of successful extensions: 1774
Number of sequences better than 10.0: 104
Number of HSP's better than 10.0 without gapping: 1693
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1711
length of query: 492
length of database: 9,014,727
effective HSP length: 100
effective length of query: 392
effective length of database: 5,317,127
effective search space: 2084313784
effective search space used: 2084313784
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (27.7 bits)
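The search-space and score figures in this trailer are consistent with the standard Karlin-Altschul relations, and can be rechecked with a short Python sketch. All constants are copied from the report; the function names are illustrative. Two assumptions are made: the database length of 9,014,727 is taken to be the nucleotide count divided by three (which matches the numbers here), and the E value is estimated as search space times 2^(-bit score), so results agree with the report only up to rounding of the printed bit scores.

```python
import math

# Values copied from the report trailer.
QUERY_LEN = 492            # "length of query"
DB_LETTERS = 27_044_181    # nucleotide letters in MTGI
DB_SEQS = 36_976           # sequences in MTGI
EFF_HSP_LEN = 100          # "effective HSP length"
GAPPED_LAMBDA = 0.267      # gapped Lambda
GAPPED_K = 0.0410          # gapped K


def effective_search_space():
    """Effective query length times effective database length."""
    # Assumption: TBLASTN reports database length in translated residues.
    db_len_residues = DB_LETTERS // 3
    eff_query = QUERY_LEN - EFF_HSP_LEN
    eff_db = db_len_residues - DB_SEQS * EFF_HSP_LEN
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    space = effective_search_space()
    print("effective search space:", space)
    print("bits for raw score 1113:", round(bit_score(1113), 1))
    print("E for 433 bits:", expect_value(433, space))
```

Running the sketch reproduces the trailer's effective lengths (392 and 5,317,127) as intermediate values, and the top hit's raw score of 1113 converts to roughly the 433 bits shown in its alignment header.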