
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC124214.2 + phase: 0
(402 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC78988 similar to GP|6862912|gb|AAF30301.1| unknown protein {Ar... 163 1e-40
TC87263 similar to GP|9759298|dbj|BAB09804.1 emb|CAB82953.1~gene... 157 9e-39
TC86527 weakly similar to PIR|T51373|T51373 hypothetical protein... 155 3e-38
TC77779 similar to GP|18377638|gb|AAL66969.1 unknown protein {Ar... 147 5e-36
TC87744 weakly similar to GP|17104519|gb|AAL34148.1 unknown prot... 115 4e-26
TC87950 weakly similar to GP|17104519|gb|AAL34148.1 unknown prot... 113 1e-25
TC82783 similar to GP|22136764|gb|AAM91701.1 unknown protein {Ar... 106 2e-23
TC81392 weakly similar to GP|19310589|gb|AAL85025.1 unknown prot... 86 3e-17
BI308089 similar to GP|22136748|gb unknown protein {Arabidopsis ... 83 2e-16
TC90384 similar to PIR|T51373|T51373 hypothetical protein F1N13_... 80 1e-15
TC90342 similar to PIR|C96757|C96757 hypothetical protein T18K17... 80 1e-15
TC93535 weakly similar to PIR|A96816|A96816 F9K20.25 [imported] ... 80 1e-15
TC90222 similar to GP|15451108|gb|AAK96825.1 putative protein {A... 80 2e-15
BF646181 similar to GP|17104519|gb unknown protein {Arabidopsis ... 78 5e-15
TC92054 similar to GP|21592371|gb|AAM64322.1 unknown {Arabidopsi... 78 7e-15
BM812987 weakly similar to GP|16226759|gb At2g30010/F23F1.7 {Ara... 78 7e-15
CB892704 weakly similar to GP|17104519|gb unknown protein {Arabi... 77 1e-14
CA921717 weakly similar to PIR|T51373|T51 hypothetical protein F... 75 4e-14
TC89733 weakly similar to GP|19571157|dbj|BAB86580. hypothetical... 75 4e-14
BG450971 similar to GP|21553616|gb unknown {Arabidopsis thaliana... 75 6e-14
>TC78988 similar to GP|6862912|gb|AAF30301.1| unknown protein {Arabidopsis
thaliana}, partial (68%)
Length = 2203
Score = 163 bits (413), Expect = 1e-40
Identities = 117/424 (27%), Positives = 190/424 (44%), Gaps = 31/424 (7%)
Frame = +3
Query: 1 MSPKLPSHLFPFLIILTLASLYYF----------SSFISLNTLSSSSSTSPSLTIST--- 47
+ P L F F+ IL + +Y + S + S+S S+ P + T
Sbjct: 231 LEPSLTILGFFFVSILFITCFFYVDYKGILHSRGTKLFSFHFSSTSPSSPPPIQFLTQKG 410
Query: 48 --CNLFKGTWVFDPNHTPLYDDTCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRID 105
C++F G WV+D + + C F + C N R + + W+W P+ C++ R D
Sbjct: 411 DNCDVFDGNWVWDETYPLYHSSNCSFLDQGFRCSENGRPD-AFYTKWRWQPKDCNLPRFD 587
Query: 106 PFRFLGSMRNKNVGIVGDSLNENLLVSFLCTL--------RVADGGARKWKKKGAWRGAY 157
+ L ++RNK + VGDS+ N S LC L V + K +
Sbjct: 588 ARKMLENIRNKRLVFVGDSIGRNQWESLLCMLSSAVTNKSSVYEVNGNPITKHTGFLAFK 767
Query: 158 FPKFNVTVAYHRAVLLSKYKWQPKQSESGMQDGSEGIHRVD-VDVPADEWAKIAGFYDVL 216
F FN TV Y+R+ L P G + RVD +D + W DVL
Sbjct: 768 FEDFNCTVEYYRSPFLVVQGRPP----HGAPYRVKLTLRVDHMDWTSHRWRDA----DVL 923
Query: 217 LFNTGHWWNHDKFPKEKPLVFYKAGQPIVPPLEMLDGLKVVLGNMITYIEKEFPRN-TLK 275
+ N GHWWN++K K +++ G+ + + D ++ + ++ +I +E RN T
Sbjct: 924 VLNAGHWWNYEKTVKMG--CYFQIGEQVKMNMSTEDAFRLSVETVVDWIAREVNRNKTYV 1097
Query: 276 FWRLQSPRHFYGGDWNQNGSCLFNKPLEENEL----DLWFEPRNNGVNKEARQMNFVIEK 331
+R +P HF GGDWN G C + + D+ F N +++ A + + +
Sbjct: 1098 LFRTYAPVHFRGGDWNTGGGCHSETLPDLGSVPAISDIHFSTVTNVLSQRASKSHVL--- 1268
Query: 332 VLQGTNIHVVDFTHLSEFRADAHPAIWL--GRKDAVAIWGQDCMHWCLPGVPDTWVDILS 389
N+ +++ T +S R D H +I+ K ++ QDC HWCLPGVPD+W +IL
Sbjct: 1269 -----NLDLLNITQMSARRKDGHASIYYIGPDKGPASMQRQDCSHWCLPGVPDSWNEILY 1433
Query: 390 QLII 393
L++
Sbjct: 1434 ALLL 1445
>TC87263 similar to GP|9759298|dbj|BAB09804.1
emb|CAB82953.1~gene_id:MPH15.5~strong similarity to
unknown protein {Arabidopsis thaliana}, partial (65%)
Length = 2229
Score = 157 bits (396), Expect = 9e-39
Identities = 110/361 (30%), Positives = 180/361 (49%), Gaps = 16/361 (4%)
Frame = +2
Query: 44 TISTCNLFKGTWVFDPNHTPLYDD-TCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMN 102
++ C+ F G W+ D ++ PLY +C +NCIRN R + +KW P+GC +
Sbjct: 809 SLMKCDFFDGEWIKDDSY-PLYKPGSCSIIDEQFNCIRNGRPDKDY-QKYKWKPKGCSLP 982
Query: 103 RIDPFRFLGSMRNKNVGIVGDSLNENLLVSFLCTLRVADGGARK-WKKKGA--WRGAYFP 159
R+D R L +R K + VGDSLN N+ S +C L+ + +K ++ G +RG
Sbjct: 983 RLDGHRMLDLLRGKRLVFVGDSLNRNMWESLICILKNSVKDKKKVYEANGRVHFRGEASY 1162
Query: 160 KFNVTVAYHRAVLLSKYKWQPKQSESGMQDGS-EGIHRVDVDVPADEWAKIAGFYDVLLF 218
F V Y +V L + ++ E ++G+ + R+D+ + + K A D+++F
Sbjct: 1163 SF-VFKDYKFSVELFVSPFLVQEWEMPDKNGTKKETLRLDLVGRSSDQYKDA---DIIVF 1330
Query: 219 NTGHWWNHDKFPKEKPLVFYKAGQPIVPPLEMLDGLKVVLGNMITYIEKEF-PRNTLKFW 277
NTGHWW HDK K K +Y+ G + L +L+ + + +++ P ++ +
Sbjct: 1331 NTGHWWTHDKTSKGKD--YYQEGSHVYDELNVLEAFRRAITTWGRWVDANVNPTKSIVLF 1504
Query: 278 RLQSPRHFYGGDWNQNGSCLF-NKPLEENELDLWFEPRNNGVNKEARQMNFVIEKVLQGT 336
R S HF GG WN G C P++ + + P+ V+EKVL+
Sbjct: 1505 RGYSASHFSGGQWNSGGQCDHETAPIDNEKYLTEYPPKMR-----------VLEKVLKNM 1651
Query: 337 N--IHVVDFTHLSEFRADAHPAIWLGRKDAVA-------IWGQDCMHWCLPGVPDTWVDI 387
+ ++ T +++FR D HP+I+ RK ++ + QDC HWCLPGVPD W +I
Sbjct: 1652 KNPVSYLNITRMTDFRKDGHPSIY--RKQNLSPEERKSPLRFQDCSHWCLPGVPDAWNEI 1825
Query: 388 L 388
L
Sbjct: 1826 L 1828
>TC86527 weakly similar to PIR|T51373|T51373 hypothetical protein F1N13_40 -
Arabidopsis thaliana, partial (78%)
Length = 1637
Score = 155 bits (391), Expect = 3e-38
Identities = 104/371 (28%), Positives = 163/371 (43%), Gaps = 13/371 (3%)
Frame = +2
Query: 35 SSSSTSPSLTISTCNLFKGTWVFDPNHTPLYDDTCPFHRNAWNCIRNQRQNLSLINSWKW 94
+ S++ PS +I C++F G WV +P + TC NC++ R + + WKW
Sbjct: 248 NESASLPSTSIKKCDIFTGEWVPNPKGPYYTNKTCWAIHEHQNCMKYGRPDSDYLK-WKW 424
Query: 95 VPRGCDMNRIDPFRFLGSMRNKNVGIVGDSLNENLLVSFLCTL-RVADGGARKWKKKGAW 153
P GC++ +PF+FL +R K++ VGDS+ N + S +C L RV + +
Sbjct: 425 KPNGCELPIFNPFQFLEIVRGKSMAFVGDSVGRNQMQSMICLLSRVEWPVVVSHSQNDYF 604
Query: 154 RGAYFPKFNVTVAYHRAVLLSKYKWQP---KQSESGMQ-DGSEGIHRVDVDVPADEWAKI 209
++P +N T+A W P + ES ++ GS G+ + +D P + W
Sbjct: 605 MRWHYPTYNFTMASF---------WTPHLVRSKESDLKGPGSTGLFDLYIDEPDENWITQ 757
Query: 210 AGFYDVLLFNTGHWWNHDK--FPKEKPLVFYKAGQPIVPPLEMLDGLKVVLGNMITYIEK 267
++ ++ N GHW+ + K+K + + VP L M G + I
Sbjct: 758 IEDFNYVILNGGHWFTRSMVFYEKQKIVGCHYCLLENVPDLTMYHGYRRAFRTAFKAINN 937
Query: 268 EFPRNTLKFWRLQSPRHFYGGDWNQNGSCLFNKPLEENELDLWFEPRNNGVNKEARQMNF 327
+ F R SP HF G WNQ G+C+ KP NE L G N E +
Sbjct: 938 LQNFKGITFLRTFSPSHFENGLWNQGGNCVRTKPFRSNETQL------EGFNLEFYMIQL 1099
Query: 328 VIEKVLQ------GTNIHVVDFTHLSEFRADAHPAIWLGRKDAVAIWGQDCMHWCLPGVP 381
K+ + G + D T + R D HP+ + + DC+HWCLPG
Sbjct: 1100 EEFKIAEKEARKKGLKFRLYDTTQATMLRPDGHPSKYGHWPNENVTLYNDCVHWCLPGPI 1279
Query: 382 DTWVDILSQLI 392
DTW D L ++
Sbjct: 1280 DTWSDFLLDML 1312
>TC77779 similar to GP|18377638|gb|AAL66969.1 unknown protein {Arabidopsis
thaliana}, partial (85%)
Length = 1450
Score = 147 bits (372), Expect = 5e-36
Identities = 107/385 (27%), Positives = 175/385 (44%), Gaps = 6/385 (1%)
Frame = +2
Query: 10 FPFLIILTLASLYY--FSSFISLNTLSSSSSTSPSLTISTCNLFKGTWVFDPNHTPLYDD 67
F + +L L SL++ + +SS P +T + CNLF G+WV DP++ PLYD
Sbjct: 191 FRAITLLLLFSLFHQLLLGEAKFHNVSSLRGKKPVVT-NGCNLFIGSWVVDPSY-PLYDS 364
Query: 68 TCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRIDPFRFLGSMRNKNVGIVGDSLNE 127
+CPF +NC + R + + + W P C + D FL R K + VGDSL+
Sbjct: 365 SCPFIDPEFNCQKYGRPDKQYLK-YSWKPDSCALPSFDGKDFLNKWRGKKIMFVGDSLSL 541
Query: 128 NLLVSFLCTLRVADGGAR-KWKKKGAWRGAYFPKFNVTVAYHRAVLLSKYKWQPKQSESG 186
N+ S C + + + + ++ A F + VT+ +R L
Sbjct: 542 NMWESLSCMIHASVPNVKTSFLRREAQSTVTFQDYGVTIQLYRTPYLVDI---------- 691
Query: 187 MQDGSEGIHRVDVDVPADEWAKIAGFYDVLLFNTGHWWNHDKFPKEKPLVFYKAGQPIVP 246
+++ + +D V + W + D+L+FN+ HWW H + + + G +V
Sbjct: 692 IRENVGRVLTLDSIVAGNAWKGM----DMLVFNSWHWWTHKG--SSQGWDYIRDGSKLVK 853
Query: 247 PLEMLDGLKVVLGNMITYIEKEF-PRNTLKFWRLQSPRHFYGGDWNQ-NGSCLFN-KPLE 303
++ L L +++ P T F++ SP H+ G +WNQ SC +PL
Sbjct: 854 NMDRLVAYNKGLTTWAKWVDLNVDPTKTKVFFQGISPTHYMGKEWNQPKNSCSGQLEPLS 1033
Query: 304 ENELDLWFEPRNNGVNKEARQMNFVIEKVLQGTNIHVVDFTHLSEFRADAHPAIWLGRKD 363
+ P +N +N + M + ++++D T LS+ R DAHP+ + G
Sbjct: 1034 GSTYPAGLPPSSNILNNVLKSMK---------SPVYLLDITLLSQLRKDAHPSSYSGDHA 1186
Query: 364 AVAIWGQDCMHWCLPGVPDTWVDIL 388
G DC HWCLPG+PDTW +L
Sbjct: 1187 -----GNDCSHWCLPGLPDTWNQLL 1246
>TC87744 weakly similar to GP|17104519|gb|AAL34148.1 unknown protein
{Arabidopsis thaliana}, partial (69%)
Length = 1387
Score = 115 bits (287), Expect = 4e-26
Identities = 106/388 (27%), Positives = 162/388 (41%), Gaps = 11/388 (2%)
Frame = +3
Query: 17 TLASLYYFSSFISLNTLSSSSSTSPSLTISTCNLFKGTWVFDPNHTPLYDDT-CPFHRNA 75
T+ S+ F S I + + + S C+LF+G WV+D ++ PLY + CPF
Sbjct: 48 TMKSIIIFVSLIHVLLMIHVHGKTIGFAKSGCDLFQGKWVYDESY-PLYQTSQCPFIEKE 224
Query: 76 WNCIRNQRQNLSLINSWKWVPRGCDMNRIDPFRFLGSMRNKNVGIVGDSLNENLLVSFLC 135
++C N R + + ++W P CD+ R + FL R K++ VGDSL+ N S C
Sbjct: 225 FDCQNNGRPDKFYLK-YRWQPTKCDLPRFNGEDFLRRYRGKSILFVGDSLSLNQWQSLTC 401
Query: 136 TLRVADGGAR-KWKKKGAWRGAYFPKFNVTVAYHRAVLLSKYKWQPKQSESGMQDGSEGI 194
L +A A + G F + V V + R L + SE I
Sbjct: 402 MLHIAVPQAHYTLVRIGDLSIFTFTTYGVKVMFSRNAFL-------------VDIFSENI 542
Query: 195 HRV---DVDVPADEWAKIAGFYDVLLFNTGHWWNHDKFPKEKPLVFYKAGQPIVPPLEML 251
RV D A W I DVL+F++ HWW H +++P + G ++ L
Sbjct: 543 GRVLKLDSIQSARNWKGI----DVLIFDSWHWWLHT--GRKQPWDLIQEGNNTFRDMDRL 704
Query: 252 DGLKVVLGNMITYIEKEFP-RNTLKFWRLQSPRHFYGGDWNQNGSCLF---NKPLEENEL 307
+ L +I+ T F++ SP H W + KPL +
Sbjct: 705 VAYEKGLKTWAKWIDDNVDITKTKVFFQGISPDHLNSRQWGDPKANFCEGQEKPLSGSMY 884
Query: 308 DLWFEPRNNGVNKEARQMNFVIEKVLQGTNIHVVDFTHLSEFRADAHPAIW--LGRKDAV 365
P + + R M ++++D T LS+ R D HP+++ G +D
Sbjct: 885 PGGPVPAQLALERVIRAMK---------KPVYLLDITTLSQLRKDGHPSVYGHGGHRD-- 1031
Query: 366 AIWGQDCMHWCLPGVPDTWVDILSQLII 393
DC HWCL GVPDTW +L +I
Sbjct: 1032 ----MDCSHWCLAGVPDTWNQLLYASLI 1103
>TC87950 weakly similar to GP|17104519|gb|AAL34148.1 unknown protein
{Arabidopsis thaliana}, partial (69%)
Length = 938
Score = 113 bits (283), Expect = 1e-25
Identities = 84/300 (28%), Positives = 136/300 (45%), Gaps = 4/300 (1%)
Frame = +1
Query: 5 LPSHLFPFLIILTLASLYYFSSFISLNTLSSSSSTSPSLTISTCNLFKGTWVFDPNHTPL 64
L + LFP L LTL + + N+ S+ ++ S TCNLF+G WV+D ++ PL
Sbjct: 31 LVNALFPTLF-LTLFLYSHQTKSEDFNSFSNETNFEVSNIAGTCNLFRGKWVYDASY-PL 204
Query: 65 YD-DTCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRIDPFRFLGSMRNKNVGIVGD 123
YD TCPF +NC ++ R++ L ++W+P C+M R + FL + K + VGD
Sbjct: 205 YDPSTCPFIDPQFNCQKHGRKD-KLYQKYRWMPFSCNMPRFNGLNFLKGNKGKKIMFVGD 381
Query: 124 SLNENLLVSFLCTLRVADGGARK-WKKKGAWRGAYFPKFNVTVAYHRAVLLSKYKWQPKQ 182
SL+ N S C + A +R ++++ A F ++ + + +R L
Sbjct: 382 SLSLNQFNSLACMIHAAVPNSRSTFRQRDAISSVTFEEYGLELFLYRTAYLVDLD----- 546
Query: 183 SESGMQDGSEGIHRVDVDVPADEWAKIAGFYDVLLFNTGHWWNHDKFPKEKPLVFYKAGQ 242
D + ++D + W + DVL+FNT HWW H +P + + +
Sbjct: 547 -----HDKEGRVLKLDSIKSGEAWRGM----DVLIFNTWHWWTHT--GSSQPWDYIQENK 693
Query: 243 PIVPPLEMLDGLKVVLGNMITYIEKEF-PRNTLKFWRLQSPRHFYGGDWNQ-NGSCLFNK 300
+ + L ++E P T F+ SP H+ G DWNQ SC+ K
Sbjct: 694 KLYKDMNRFVAFYKGLQTWARWVEMNVNPAQTKVFFLGISPVHYQGRDWNQPTKSCMSEK 873
>TC82783 similar to GP|22136764|gb|AAM91701.1 unknown protein {Arabidopsis
thaliana}, partial (36%)
Length = 823
Score = 106 bits (264), Expect = 2e-23
Identities = 60/183 (32%), Positives = 94/183 (50%), Gaps = 8/183 (4%)
Frame = +1
Query: 214 DVLLFNTGHWWNHDKFPKEKPLVFYKAGQPIVPPLEMLDGLKVVLGNMITYIEKEFPRN- 272
++++FNTGHWW H+K + + +Y+ G + P L+ LD L +++++ N
Sbjct: 139 NIIVFNTGHWWTHEKTSEGEE--YYQEGNHVYPRLKALDAYMRALTTWAKWVDRKINANH 312
Query: 273 TLKFWRLQSPRHFYGGDWNQNGSCLFNKPLEENELDLWFEPRNNGVNKEARQMNFVIEKV 332
T F+R S HF+GG WN G C NE L P + R + VI+ +
Sbjct: 313 TQVFFRGYSVTHFWGGQWNSGGQCHKETEPIYNETYLQKHP------SKMRALEHVIQNM 474
Query: 333 LQGTNIHVVDFTHLSEFRADAHPAIWLGRKDAVAIWGQ-------DCMHWCLPGVPDTWV 385
T + ++ + L+++R D HP+++ RKD Q DC HWCLPGVPDTW
Sbjct: 475 K--TEVIYMNISRLTDYRKDGHPSVY--RKDYKTSMKQNSSSLYEDCSHWCLPGVPDTWN 642
Query: 386 DIL 388
++L
Sbjct: 643 ELL 651
>TC81392 weakly similar to GP|19310589|gb|AAL85025.1 unknown protein
{Arabidopsis thaliana}, partial (34%)
Length = 1277
Score = 85.9 bits (211), Expect = 3e-17
Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 9/187 (4%)
Frame = +1
Query: 47 TCNLFKGTWVFDPNHTPLYDDT-CPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRID 105
+C+ G WV D PLY+ T C R NCI N R + S + W+W P CD+ D
Sbjct: 166 SCDYLIGNWVHD-KRGPLYNGTTCNEIRENQNCIVNGRPDTSYLY-WRWKPNECDLPIFD 339
Query: 106 PFRFLGSMRNKNVGIVGDSLNENLLVSFLCTLRVAD--------GGARKWKKKGAWRGAY 157
P FL M N N+ VGDSL+ N L S +C L G +W Y
Sbjct: 340 PNTFLKLMSNMNIVFVGDSLSRNQLESLICLLSTVSKPKYINHIGSIGRW---------Y 492
Query: 158 FPKFNVTVAYHRAVLLSKYKWQPKQSESGMQDGSEGIHRVDVDVPADEWAKIAGFYDVLL 217
FP +N + + A L K + K+ + + + +D ++ WAK D+++
Sbjct: 493 FPSYNANLTSYWAPFLVKGDQRRKEGPN--------YNTIHLDHVSENWAKDIDQMDLIM 648
Query: 218 FNTGHWW 224
+ GHW+
Sbjct: 649 LSFGHWF 669
Score = 58.2 bits (139), Expect = 6e-09
Identities = 47/151 (31%), Positives = 63/151 (41%), Gaps = 10/151 (6%)
Frame = +2
Query: 255 KVVLGNMITYIEKEFPRNTLKFWRLQSPRHFYGGDWNQNGSCLFNKPLEENELDLWFEPR 314
K+V GN I I + F SP HF G W++ G+C P + E L
Sbjct: 818 KMVKGNEIDVIVRTF-----------SPTHFEGS-WDKGGTCSKKNPYKYEEKKL----- 946
Query: 315 NNGVNKEARQMNF-----VIEKVLQ-GTNIHVVDFTHLSEFRADAHPAIWLG----RKDA 364
G+ + R M EK Q G N+ V+D T L+ R D H + K
Sbjct: 947 -EGMEAKIRSMEIEEAENAKEKSKQVGLNLKVLDITKLALLRPDGHAGAYRYPFPFAKTI 1123
Query: 365 VAIWGQDCMHWCLPGVPDTWVDILSQLIIDG 395
DC HWCLPG DTW +I +++ G
Sbjct: 1124 PKNVQNDCTHWCLPGPIDTWNEIFLEMMKKG 1216
>BI308089 similar to GP|22136748|gb unknown protein {Arabidopsis thaliana},
partial (49%)
Length = 849
Score = 83.2 bits (204), Expect = 2e-16
Identities = 69/286 (24%), Positives = 118/286 (41%), Gaps = 10/286 (3%)
Frame = +2
Query: 77 NCIRNQRQNLSLINSWKWVPRGCDMNRIDPFRFLGSMRNKNVGIVGDSLNENLLVSFLCT 136
NC N R + +W+W P CD+ R DP +FL MR K + +GDS+ N + S LC
Sbjct: 35 NCQGNGRPD-KYYENWRWKPFQCDIPRFDPRKFLELMRGKTLAFIGDSVARNQMESMLCI 211
Query: 137 LRVADGGARKWKKKGAWRGAYFPKFNVTVAYHRAVLLSKYKWQPKQSESGMQDGSEGIHR 196
L + + + + YF +V + + W K + G+ +
Sbjct: 212 LWQVEVPKNRGNRN--MQRYYFRSASVMIVRIWS------SWLVKVTSEPFDYAPAGVDK 367
Query: 197 VDVDVPADEWAKIAGFYDVLLFNTGHWWNHDKFPKEKPLVFYK---AGQPIVP------P 247
+ +D P + + +DV++ ++GHW F K+ + GQ P
Sbjct: 368 LHLDAPDPKLMENIPNFDVVVLSSGHW-----FAKKSVYILNNEIVGGQLWWPDKSKQMK 532
Query: 248 LEMLDGLKVVLGNMITYIEKEFPRNTLKFWRLQSPRHFYGGDWNQNGSCLFN-KPLEENE 306
+ + + + ++T + L R SP H+ GG WN GSC KPL E
Sbjct: 533 VNNIQAYAISVETILTALATHPTYKGLAIVRSYSPDHYEGGAWNTGGSCTGKVKPLALGE 712
Query: 307 LDLWFEPRNNGVNKEARQMNFVIEKVLQGTNIHVVDFTHLSEFRAD 352
L + N ++ N +EK + + ++D T + ++R D
Sbjct: 713 L-VENAYTNTMHEQQVTGFNRAMEKAANKSKLRLMDITQVFQYRHD 847
>TC90384 similar to PIR|T51373|T51373 hypothetical protein F1N13_40 -
Arabidopsis thaliana, partial (32%)
Length = 905
Score = 80.5 bits (197), Expect = 1e-15
Identities = 57/227 (25%), Positives = 104/227 (45%), Gaps = 13/227 (5%)
Frame = +3
Query: 9 LFPFLIILTLASLY--YFSSFISLNTLSSSSSTS----------PSLTISTCNLFKGTWV 56
+F +I+L Y + SSF+ + + S ++S PS ++ C++F G WV
Sbjct: 243 IFVIIIVLVTPLSYPLFGSSFLLMMSKSKQPNSSNVDSIEKENLPSTSLKNCDIFSGEWV 422
Query: 57 FDPNHTPLYDDTCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRIDPFRFLGSMRNK 116
+P + TC NC++ R + + W+W P C++ +PF+FL +R K
Sbjct: 423 PNPKGPYYTNKTCWAIHEHQNCMKYGRPDSEFMK-WRWKPNECELPIFNPFQFLEIVRGK 599
Query: 117 NVGIVGDSLNENLLVSFLCTL-RVADGGARKWKKKGAWRGAYFPKFNVTVAYHRAVLLSK 175
++ VGDS+ N + S +C L RV + K + +P +N T+A + L K
Sbjct: 600 SMAFVGDSVGRNHMQSLICLLSRVEWPIDVSYTKDDYFMRWKYPSYNFTMAAYWTPFLVK 779
Query: 176 YKWQPKQSESGMQDGSEGIHRVDVDVPADEWAKIAGFYDVLLFNTGH 222
Q E+ G++ + ++ ++W +D ++ N GH
Sbjct: 780 -----AQRENSDGPTHTGLYNLYLNEFDEKWTSQIEDFDYVIINGGH 905
>TC90342 similar to PIR|C96757|C96757 hypothetical protein T18K17.20
[imported] - Arabidopsis thaliana, partial (80%)
Length = 1533
Score = 80.5 bits (197), Expect = 1e-15
Identities = 40/125 (32%), Positives = 68/125 (54%), Gaps = 2/125 (1%)
Frame = +3
Query: 48 CNLFKGTWVFDPNHTPLYDD-TCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRIDP 106
CN+F+G WV+D PLY++ +CP+ C++N R + S +W+W P C++ R DP
Sbjct: 210 CNIFEGKWVWDNVSYPLYEEESCPYLVKQTTCMKNGRPD-SFYTNWRWQPHECNLPRFDP 386
Query: 107 FRFLGSMRNKNVGIVGDSLNENLLVSFLCTLR-VADGGARKWKKKGAWRGAYFPKFNVTV 165
+ L +RNK + +GDSL S +C ++ V G + ++ + +FN T+
Sbjct: 387 LKLLHMLRNKRMMFIGDSLQRGQFESMICLVQSVIPEGKKSLQRIPPMKIFRVEEFNATI 566
Query: 166 AYHRA 170
Y+ A
Sbjct: 567 EYYWA 581
Score = 67.4 bits (163), Expect = 9e-12
Identities = 52/176 (29%), Positives = 73/176 (40%), Gaps = 14/176 (7%)
Frame = +2
Query: 223 WWNHDKFPKEKPLVFYKAGQPI-VPPLEMLDGLKVVLGNMITYIEKEF-PRNTLKFWRLQ 280
WW H P + G P V + K+ L ++E P N F+
Sbjct: 719 WWMHS------PFINATHGSPDEVQEYNVTTAYKLALKTWANWLESNIQPLNQYVFFMSM 880
Query: 281 SPRHFYGGDWNQNG--SCLFNKPLEENELDLWFEPRNNGVNKEARQMNFVIEKVLQGTNI 338
SP H + +W +C FN+ W G N E + ++ LQ I
Sbjct: 881 SPTHLWSWEWKPGSDENC-FNESYPIQGSSYW----GTGSNLEIMK---ILHDSLQELKI 1036
Query: 339 HV--VDFTHLSEFRADAHPAIWLGRKDAVAIWGQ--------DCMHWCLPGVPDTW 384
V ++ T LSE+R DAH +++ RK + Q DC+HWCLPGVPDTW
Sbjct: 1037 DVTLLNITQLSEYRKDAHTSVYGERKGKLLTKEQRSNPKSFADCIHWCLPGVPDTW 1204
>TC93535 weakly similar to PIR|A96816|A96816 F9K20.25 [imported] -
Arabidopsis thaliana, partial (38%)
Length = 660
Score = 80.1 bits (196), Expect = 1e-15
Identities = 55/188 (29%), Positives = 88/188 (46%), Gaps = 9/188 (4%)
Frame = +1
Query: 48 CNLFKGTWVFDPNHT----PLYDDT--CPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDM 101
C+ +G WV D + T PLYD + CPF ++C+ N R + I ++W P C++
Sbjct: 79 CDYSQGNWVNDDDSTTFSYPLYDASKDCPFIGQGFDCLGNGRTDKDYIK-YRWKPSRCNL 255
Query: 102 NRIDPFRFLGSMRNKNVGIVGDSLNENLLVSFLCTLRVADGGARKWKKKGAWRGAYF--P 159
R D +FL + K + VGDS++ N+ S C L +A A K + F P
Sbjct: 256 PRFDGGKFLERYKGKKILFVGDSISNNMWQSLTCLLHIAIPNANYTLTKQTNQLTVFSIP 435
Query: 160 KFNVTVAYHRAVLLSKYKWQPKQSESGMQDGSEG-IHRVDVDVPADEWAKIAGFYDVLLF 218
++ ++ + + L + D +G I R+D ++W +DVL+F
Sbjct: 436 EYEASIMWLKNGFLVDL----------VHDKEKGRILRLDTISSGNQWKG----FDVLIF 573
Query: 219 NTGHWWNH 226
NT HWW H
Sbjct: 574 NTYHWWTH 597
>TC90222 similar to GP|15451108|gb|AAK96825.1 putative protein {Arabidopsis
thaliana}, partial (41%)
Length = 624
Score = 79.7 bits (195), Expect = 2e-15
Identities = 41/104 (39%), Positives = 55/104 (52%)
Frame = +3
Query: 48 CNLFKGTWVFDPNHTPLYDDTCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRIDPF 107
C+ G WV+D ++ PLYD CP+ A C +N R + S WKW P GC + R D
Sbjct: 153 CDFSVGKWVYDESY-PLYDPNCPYLSTAVTCQKNGRPD-SDYEKWKWKPSGCSIPRFDAL 326
Query: 108 RFLGSMRNKNVGIVGDSLNENLLVSFLCTLRVADGGARKWKKKG 151
+FLG MR K + +VGDS+ N S +C + W KKG
Sbjct: 327 KFLGKMRRKRIMLVGDSIMRNQWESLVCLSTRCNS---NW*KKG 449
>BF646181 similar to GP|17104519|gb unknown protein {Arabidopsis thaliana},
partial (41%)
Length = 656
Score = 78.2 bits (191), Expect = 5e-15
Identities = 50/143 (34%), Positives = 75/143 (51%), Gaps = 8/143 (5%)
Frame = +3
Query: 3 PKLPSHLFPFLIILTLASLY------YFSSFISLNTLSSSSSTSPSLTIS-TCNLFKGTW 55
PK L P L +L L S + + S+ + T+SSSSS+S ++ CN F+G W
Sbjct: 21 PKKMGFLLPTLFLLLLFSSHQTKAYEFNSNTTATTTISSSSSSSNERKLAGRCNWFRGKW 200
Query: 56 VFDPNHTPLYD-DTCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRIDPFRFLGSMR 114
V+DP++ PLYD +CPF +NC + R + + ++W P C + R + FL R
Sbjct: 201 VYDPSY-PLYDPSSCPFIDPQFNCQKYGRPD-TQYQKYRWQPFTCSIPRFNALDFLAKYR 374
Query: 115 NKNVGIVGDSLNENLLVSFLCTL 137
K + VGDSL+ N S C +
Sbjct: 375 GKKIMFVGDSLSLNQFNSLACMI 443
>TC92054 similar to GP|21592371|gb|AAM64322.1 unknown {Arabidopsis
thaliana}, partial (17%)
Length = 636
Score = 77.8 bits (190), Expect = 7e-15
Identities = 38/97 (39%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Frame = +2
Query: 43 LTISTCNLFKGTWVFDPNHTPLYDD-TCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDM 101
L + C+L+ GTWV D ++ P+Y+ +CP+ A++C N R N + W+W P GCD+
Sbjct: 323 LALKECDLYNGTWVEDGDY-PIYEPGSCPYVDEAYDCKINGR-NDTRYTKWRWKPHGCDL 496
Query: 102 NRIDPFRFLGSMRNKNVGIVGDSLNENLLVSFLCTLR 138
R FL ++ K + +VGDS+N N S LC LR
Sbjct: 497 PRFSAKDFLARLKGKQLMLVGDSMNRNQFESILCVLR 607
>BM812987 weakly similar to GP|16226759|gb At2g30010/F23F1.7 {Arabidopsis
thaliana}, partial (32%)
Length = 622
Score = 77.8 bits (190), Expect = 7e-15
Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 16/190 (8%)
Frame = +3
Query: 215 VLLFNTGHWWNHDKFPKEKPLVFYKAGQPIVPPLEMLDGLKVVLGNMITYIEKEFPRN-T 273
VL+FNTGHWW+H + V + G P ++ L L+ + +++ R+ T
Sbjct: 3 VLVFNTGHWWSHQGSLQGWDYV--ELGGNFYPDMDRLVALERGMKTWANWVDANIDRSRT 176
Query: 274 LKFWRLQSPRHFYGGDWNQ----------NGSCLFNK-PLEENELDLWFEPRNNGVNKEA 322
++ SP H+ +WN +C P+ D F +++
Sbjct: 177 HVLFQAISPTHYDENEWNSAVGRATSVTTTKNCYGETAPISGTTTD--FGGGETYTDQQM 350
Query: 323 RQMNFVIEKVLQGTNIHVVDFTHLSEFRADAHPAIWLG----RKDAVAIWGQDCMHWCLP 378
R +N VI ++ +++D T LSE R D HP+I+ G ++ DC HWCLP
Sbjct: 351 RVVNMVIREMRDPA--YLLDITMLSEMRKDGHPSIYSGELSSQQKTDPDHSADCSHWCLP 524
Query: 379 GVPDTWVDIL 388
G+PDTW +L
Sbjct: 525 GLPDTWNQLL 554
>CB892704 weakly similar to GP|17104519|gb unknown protein {Arabidopsis
thaliana}, partial (27%)
Length = 765
Score = 77.0 bits (188), Expect = 1e-14
Identities = 41/115 (35%), Positives = 64/115 (55%), Gaps = 1/115 (0%)
Frame = +3
Query: 32 TLSSSSSTSPSLTISTCNLFKGTWVFDPNHTPLYD-DTCPFHRNAWNCIRNQRQNLSLIN 90
+ S+ ++ S TCNLF+G WV+D ++ PLYD TCPF +NC ++ R++ L
Sbjct: 3 SFSNETNFEVSNIAGTCNLFRGKWVYDASY-PLYDPSTCPFIDPQFNCQKHGRKD-KLYQ 176
Query: 91 SWKWVPRGCDMNRIDPFRFLGSMRNKNVGIVGDSLNENLLVSFLCTLRVADGGAR 145
++W+P C+M R + FL + K + VGDSL+ N S C + A +R
Sbjct: 177 KYRWMPFSCNMPRFNGLNFLKGNKGKKIMFVGDSLSLNQFNSLACMIHAAVPNSR 341
>CA921717 weakly similar to PIR|T51373|T51 hypothetical protein F1N13_40 -
Arabidopsis thaliana, partial (32%)
Length = 755
Score = 75.1 bits (183), Expect = 4e-14
Identities = 46/149 (30%), Positives = 62/149 (40%), Gaps = 6/149 (4%)
Frame = -2
Query: 250 MLDGLKVVLGNMITYIEKEFPRNTLKFWRLQSPRHFYGGDWNQNGSCLFNKPLEENELDL 309
M G + LG + + F R +P HF G WNQ G+CL KP + N
Sbjct: 715 MYMGYRKALGTAFRALNSLENFKGVTFLRTFAPSHFENGIWNQGGNCLRTKPFKSN---- 548
Query: 310 WFEPRNNGVNKEARQMNFVIEKVLQ------GTNIHVVDFTHLSEFRADAHPAIWLGRKD 363
E R G N E + K+ Q G ++D T R D HP+ +
Sbjct: 547 --EARLEGTNMELYMIQLEEYKISQKKAKRNGLKFRLLDTTQAMLLRPDGHPSRYGHLPQ 374
Query: 364 AVAIWGQDCMHWCLPGVPDTWVDILSQLI 392
DC+HWCLPG DTW D L +++
Sbjct: 373 ENVTLYNDCVHWCLPGPIDTWSDFLLEML 287
>TC89733 weakly similar to GP|19571157|dbj|BAB86580. hypothetical protein
{Oryza sativa (japonica cultivar-group)}, partial (27%)
Length = 768
Score = 75.1 bits (183), Expect = 4e-14
Identities = 56/219 (25%), Positives = 95/219 (42%), Gaps = 3/219 (1%)
Frame = +1
Query: 9 LFPFLIILTLASLYYFSSFISLNTLSSSSSTSPSLTISTCNLFKGTWVFDPNHTPLYDDT 68
LFP +++ + L F ++ P +T S NLF GTWV P Y+ T
Sbjct: 121 LFPLTVLILVILLPLLIHFNQTSSKVLYPIVEP-ITTSCNNLFLGTWV-PYLKQPYYNQT 294
Query: 69 CPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRIDPFRFLGSMRNKNVGIVGDSLNEN 128
CPF NC+ + R + + W+W P C++ D +F ++ K++ VGDS+ N
Sbjct: 295 CPFITEKQNCLIHGRPDSDFLK-WRWKPDNCELPLFDATQFFKIVKGKSMAFVGDSIGRN 471
Query: 129 LLVSFLCTLR-VADGGARKWKKKGAWRGAYFPKFNVTVAYHRAVLLSKYKWQPKQSESGM 187
+ S LC L VA K + + K+ Y+ + + + K S+S +
Sbjct: 472 QMESLLCLLNSVAHPEEITTKYVSSIEDLTYFKWWFYADYNATIAMLWSPFLVKSSKSYI 651
Query: 188 QDGSEGI--HRVDVDVPADEWAKIAGFYDVLLFNTGHWW 224
+ S + +D P W +D ++F+ G W+
Sbjct: 652 YNSSNFYKPESLYLDEPDTAWTSRIENFDYVIFSGGQWF 768
>BG450971 similar to GP|21553616|gb unknown {Arabidopsis thaliana}, partial
(35%)
Length = 675
Score = 74.7 bits (182), Expect = 6e-14
Identities = 39/128 (30%), Positives = 67/128 (51%), Gaps = 2/128 (1%)
Frame = +1
Query: 48 CNLFKGTWVFDPNHTPLYDD-TCPFHRNAWNCIRNQRQNLSLINSWKWVPRGCDMNRIDP 106
CN+ G W+F+ + PLY D +CP+ ++C++N R + ++ W+W P C + + +P
Sbjct: 244 CNVANGKWIFNSSIKPLYSDKSCPYIDKQFSCVKNGRNDSDYLH-WEWQPEDCTLPQFNP 420
Query: 107 FRFLGSMRNKNVGIVGDSLNENLLVSFLCTLRVADGGARKWKKKGAWRGAYFPK-FNVTV 165
L + K + VGDSL N SF+C ++ K K+G R + K +N T+
Sbjct: 421 EIALKKLEGKRLLFVGDSLQRNQWESFVCLVQGIIPEKEKSMKRGRVRSVFKAKEYNATI 600
Query: 166 AYHRAVLL 173
++ A L
Sbjct: 601 EFYWAPFL 624
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.322 0.139 0.464
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,600,694
Number of Sequences: 36976
Number of extensions: 291094
Number of successful extensions: 2033
Number of sequences better than 10.0: 98
Number of HSP's better than 10.0 without gapping: 1917
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1950
length of query: 402
length of database: 9,014,727
effective HSP length: 98
effective length of query: 304
effective length of database: 5,391,079
effective search space: 1638888016
effective search space used: 1638888016
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (27.3 bits)
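Note on the statistics above: the reported bit scores and E-values can be roughly reproduced from the gapped Lambda and K and the effective lengths, using the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = m*n*2^(-S'). The Python sketch below is illustrative only. The function names are hypothetical, the constants are copied from this report, and because Lambda and K are printed to three significant digits the results agree with the report only approximately.

```python
import math

# Constants copied from the statistics block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_QUERY_LENGTH = 304
EFFECTIVE_DB_LENGTH = 5_391_079


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (hypothetical helper)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space):
    """Estimate the E-value for a raw score in a given search space."""
    return search_space * 2.0 ** (-bit_score(raw_score))


# Effective search space is the product of the two effective lengths.
search_space = EFFECTIVE_QUERY_LENGTH * EFFECTIVE_DB_LENGTH

if __name__ == "__main__":
    # Raw scores are the parenthesised values on the "Score =" lines.
    for raw in (413, 396, 211, 139):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw, search_space):.0e}")
```

The product of the effective query and database lengths matches the "effective search space" line, which is a quick consistency check when comparing reports run against different database releases.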