
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC126784.4 - phase: 0 /pseudo
(525 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB79796.1| putative protein [Arabidopsis thaliana] gi|28973... 394 e-108
gb|AAP49526.1| At2g24100 [Arabidopsis thaliana] gi|15809913|gb|A... 378 e-103
gb|AAD03372.2| expressed protein [Arabidopsis thaliana] gi|18400... 374 e-102
ref|XP_468236.1| hypothetical protein [Oryza sativa (japonica cu... 301 3e-80
emb|CAC84497.1| hypothetical protein [Pinus pinaster] 229 2e-58
dbj|BAD72448.1| unknown protein [Oryza sativa (japonica cultivar... 167 8e-40
gb|AAF26079.1| unknown protein [Arabidopsis thaliana] gi|1523004... 133 2e-29
gb|AAD25610.1| Hypothetical protein [Arabidopsis thaliana] gi|25... 132 2e-29
gb|AAM09320.2| similar to Dictyostelium discoideum (Slime mold).... 132 2e-29
ref|NP_175832.1| hypothetical protein [Arabidopsis thaliana] 115 4e-24
ref|XP_638504.1| hypothetical protein DDB0218621 [Dictyostelium ... 89 3e-16
gb|EAK85872.1| hypothetical protein UM04928.1 [Ustilago maydis 5... 42 0.062
ref|XP_641873.1| hypothetical protein DDB0205043 [Dictyostelium ... 38 0.69
ref|XP_636103.1| hypothetical protein DDB0188566 [Dictyostelium ... 38 0.69
gb|AAK54089.2| histidine kinase DhkH [Dictyostelium discoideum] 38 0.90
ref|XP_641414.1| histidine kinase [Dictyostelium discoideum] gi|... 38 0.90
ref|XP_641068.1| hypothetical protein DDB0206062 [Dictyostelium ... 37 1.2
ref|NP_765049.1| hypothetical protein SE1494 [Staphylococcus epi... 37 1.5
dbj|BAC42834.1| unknown protein [Arabidopsis thaliana] 37 1.5
gb|AAL68158.2| AT30755p [Drosophila melanogaster] 37 2.0
>emb|CAB79796.1| putative protein [Arabidopsis thaliana] gi|28973227|gb|AAO63938.1|
unknown protein [Arabidopsis thaliana]
gi|5725442|emb|CAB52451.1| putative protein [Arabidopsis
thaliana] gi|28393759|gb|AAO42289.1| unknown protein
[Arabidopsis thaliana] gi|15234853|ref|NP_194807.1|
expressed protein [Arabidopsis thaliana]
gi|25407680|pir||C85360 hypothetical protein AT4g30780
[imported] - Arabidopsis thaliana
Length = 589
Score = 394 bits (1011), Expect = e-108
Identities = 263/595 (44%), Positives = 334/595 (55%), Gaps = 88/595 (14%)
Query: 2 VRFMGSKNPRSQWESSSATSSISPKFEI-EDSIQDQHAPLNKRHK-------------AT 47
+R G P S+ ++ + K EI ED ++++H PLNKR + A
Sbjct: 4 MRKSGGLRPESE-SAARCRDELPVKLEIAEDDLEEEHGPLNKRSRLWSPGTSSSTMAPAK 62
Query: 48 NDILNEPSPLGLSLRKSPSLLDLIQM--TLCQE----NSVNANTANDNLNSKANKNGRA- 100
+ L+EPSPLGLSL+KSPSLL+LIQM T C + ++ A L ++ A
Sbjct: 63 YNPLDEPSPLGLSLKKSPSLLELIQMKITHCGDPKAAETLKAGALGSGLKRESKTIAAAA 122
Query: 101 ---------SVEKLKASNFPATHLKIGSWEYKSKYEGDLVAKCYFAKQKLVWEVLEGELK 151
S+EKLKASNFPA+ LKIG WEYKS+YEGDLVAKCYFAK KLVWEVLE LK
Sbjct: 123 SVGPTLAPGSIEKLKASNFPASLLKIGQWEYKSRYEGDLVAKCYFAKHKLVWEVLEQGLK 182
Query: 152 SKIEIQWSDISQLKANCPDDGPSTLTLMVARQPLFFRETNPQPRKHTLWQSTTDFTGGQA 211
SKIEIQWSDI LKANCP+DGP TLTL++ARQPLFFRETNPQPRKHTLWQ+T+DFT GQA
Sbjct: 183 SKIEIQWSDIMALKANCPEDGPGTLTLVLARQPLFFRETNPQPRKHTLWQATSDFTDGQA 242
Query: 212 SIHRRHVLQCEQGLLIKHYEKLVQCNDRLKFLSQQPEIMVDSPHFDPRSAAIENPHNLKD 271
S++R+H LQC QG++ KH+EKLVQC+ RL LS+QPEI +DSP+FD R + E+P K
Sbjct: 243 SMNRQHFLQCAQGIMNKHFEKLVQCDHRLFHLSRQPEIAIDSPYFDARQSIFEDPSESKG 302
Query: 272 ---CDLHQGNGSAVSCFQNMGSPHSSLSPS----FTTEHSDPSAI--------------- 309
+L+ G ++S QN+ SP + S S + E PS++
Sbjct: 303 HPFGNLNLSTGPSISGTQNLASPVGAQSSSEHMYLSHEAPSPSSVIDARANEGIGGSEAV 362
Query: 310 ----TLDSVPCEAPS-SSSA*IMKFSSKIC------QMSII*NL**LCKSVIVVLAC*FH 358
D EAP S + F + +C +++ ++ L +S+ V +
Sbjct: 363 NSRNRTDCGQIEAPGIHQSMSLSDFLAVLCDTKNTTDLNLADDVDGLHQSMSVSDFVAY- 421
Query: 359 Q*FNMLGSRN---WDQIKLPGLRPSMSMSDFLGHIEHHISKEMASGDPSFSAERLEYQQM 415
+ SRN DQIK+PGL SMS+SDF+G + + A G E+ E +
Sbjct: 422 ----LSDSRNITDSDQIKVPGLHQSMSVSDFVG-----LLSDSAGGSHPEHMEKFEIMK- 471
Query: 416 MDGITQHLLNDNQVTTDSDEKSLMSRVNSLRCLLQMDPPAVPNSHDNTGFIEGPNDAKVN 475
Q LL+DN DEKSLM RVNSL LL DP NS NT G
Sbjct: 472 -----QQLLSDNIQFEAPDEKSLMPRVNSLFNLLYKDPNVAANSQLNTEMSVGLKSEPKG 526
Query: 476 IDIKATEENSR-----DVYGGNPAPGMSRKDSFGDLLLSLPRIASLPKFLFDISE 525
I N+ D + GM RKDSF DLLL LPRI SLPKFL +ISE
Sbjct: 527 IVSDNNNNNNNNNRVLDTASSSKPQGMLRKDSFSDLLLHLPRITSLPKFLSNISE 581
>gb|AAP49526.1| At2g24100 [Arabidopsis thaliana] gi|15809913|gb|AAL06884.1|
At2g24100/F27D4.1 [Arabidopsis thaliana]
gi|14596217|gb|AAK68836.1| Unknown protein [Arabidopsis
thaliana] gi|25371341|pir||F84632 hypothetical protein
At2g24100 [imported] - Arabidopsis thaliana
Length = 466
Score = 378 bits (970), Expect = e-103
Identities = 240/537 (44%), Positives = 304/537 (55%), Gaps = 85/537 (15%)
Query: 1 MVRFMGSKNPRSQWESSSATSSISPKFEI-EDSIQDQHAPLNKRHKATND--------IL 51
MV M S+N Q S ++ K EI EDS++++HAPLNKR K ++ +L
Sbjct: 1 MVEMMRSENHLRQ-ASKHRNNNFPVKLEIIEDSLEEEHAPLNKRSKLWSNGTSVSKFSLL 59
Query: 52 NEPSPLGLSLRKSPSLLDLIQMTLCQENSVNANTANDNLNSKANKNGRASVEKLKASNFP 111
EPSPLGLSL+KSPS +LI+M L Q + + +N G +VEKLKASNFP
Sbjct: 60 EEPSPLGLSLKKSPSFQELIEMKLSQ----SGDDSNSVKKESFGFGGVGTVEKLKASNFP 115
Query: 112 ATHLKIGSWEYKSKYEGDLVAKCYFAKQKLVWEVLEGELKSKIEIQWSDISQLKANCPDD 171
AT L+IG WEYKS+YEGDLVAKCYFAK KLVWEVLE LKSKIEIQWSDI LKAN P+D
Sbjct: 116 ATILRIGQWEYKSRYEGDLVAKCYFAKHKLVWEVLEQGLKSKIEIQWSDIMALKANLPED 175
Query: 172 GPSTLTLMVARQPLFFRETNPQPRKHTLWQSTTDFTGGQASIHRRHVLQCEQGLLIKHYE 231
P TLT+++AR+PLFFRETNPQPRKHTLWQ+T+DFT GQAS++R+H LQC G++ KH+E
Sbjct: 176 EPGTLTIVLARRPLFFRETNPQPRKHTLWQATSDFTDGQASMNRQHFLQCPPGIMNKHFE 235
Query: 232 KLVQCNDRLKFLSQQPEIMVDSPHFDPRSAAIENPHNLKDCDLHQGNGSAVSCFQNMGSP 291
KLVQC+ RL LS+QPEI + +P FD R + E+P ++ G A S +++
Sbjct: 236 KLVQCDHRLFCLSRQPEINLAAPFFDSRLSIFEDPSVSGSHNIASPVG-AQSSSEHVSLS 294
Query: 292 HSSLSPSFTTEHSDPSAITLDSVPCEAPSSSSA*IMKFSSKICQMSII*NL**LCKSVIV 351
H +LSPS +D+ E S
Sbjct: 295 HDALSPS----------SVMDARAIEGVGGS----------------------------- 315
Query: 352 VLAC*FHQ*FNMLGSRNWDQIKLPGLRPSMSMSDFLGHIEHHISKEMASGDPSFSAERLE 411
+ + W QIK+PGL S+SM+DFL + S + E
Sbjct: 316 ---------IDSRNTNGWSQIKMPGLHQSISMNDFLTFL---------------SDQACE 351
Query: 412 YQQMMDGITQHLLNDNQVTTDSDEKSLMSRVNSLRCLLQMDPPAVPNSHDNTGFIEGPND 471
Q + + Q LL+DN T SDEKS+MS+VNS LLQ + NS N +
Sbjct: 352 NNQEFEEMKQLLLSDNTQTDPSDEKSVMSKVNSFCNLLQ----SAANSQLNIETADTERV 407
Query: 472 AKVNIDIKATEENSRDV---YGGNPAPGMSRKDSFGDLLLSLPRIASLPKFLFDISE 525
V+ + E R V P GMSRKDSF DLL+ LPRI SLPKFLF+ISE
Sbjct: 408 VGVDNNRHMPEGGKRVVDPASSSKPLQGMSRKDSFSDLLVHLPRITSLPKFLFNISE 464
>gb|AAD03372.2| expressed protein [Arabidopsis thaliana]
gi|18400458|ref|NP_565562.1| expressed protein
[Arabidopsis thaliana]
Length = 463
Score = 374 bits (961), Expect = e-102
Identities = 231/508 (45%), Positives = 292/508 (57%), Gaps = 83/508 (16%)
Query: 29 IEDSIQDQHAPLNKRHKATND--------ILNEPSPLGLSLRKSPSLLDLIQMTLCQENS 80
IEDS++++HAPLNKR K ++ +L EPSPLGLSL+KSPS +LI+M L Q
Sbjct: 26 IEDSLEEEHAPLNKRSKLWSNGTSVSKFSLLEEPSPLGLSLKKSPSFQELIEMKLSQ--- 82
Query: 81 VNANTANDNLNSKANKNGRASVEKLKASNFPATHLKIGSWEYKSKYEGDLVAKCYFAKQK 140
+ + +N G +VEKLKASNFPAT L+IG WEYKS+YEGDLVAKCYFAK K
Sbjct: 83 -SGDDSNSVKKESFGFGGVGTVEKLKASNFPATILRIGQWEYKSRYEGDLVAKCYFAKHK 141
Query: 141 LVWEVLEGELKSKIEIQWSDISQLKANCPDDGPSTLTLMVARQPLFFRETNPQPRKHTLW 200
LVWEVLE LKSKIEIQWSDI LKAN P+D P TLT+++AR+PLFFRETNPQPRKHTLW
Sbjct: 142 LVWEVLEQGLKSKIEIQWSDIMALKANLPEDEPGTLTIVLARRPLFFRETNPQPRKHTLW 201
Query: 201 QSTTDFTGGQASIHRRHVLQCEQGLLIKHYEKLVQCNDRLKFLSQQPEIMVDSPHFDPRS 260
Q+T+DFT GQAS++R+H LQC G++ KH+EKLVQC+ RL LS+QPEI + +P FD R
Sbjct: 202 QATSDFTDGQASMNRQHFLQCPPGIMNKHFEKLVQCDHRLFCLSRQPEINLAAPFFDSRL 261
Query: 261 AAIENPHNLKDCDLHQGNGSAVSCFQNMGSPHSSLSPSFTTEHSDPSAITLDSVPCEAPS 320
+ E+P ++ G A S +++ H +LSPS +D+ E
Sbjct: 262 SIFEDPSVSGSHNIASPVG-AQSSSEHVSLSHDALSPS----------SVMDARAIEGVG 310
Query: 321 SSSA*IMKFSSKICQMSII*NL**LCKSVIVVLAC*FHQ*FNMLGSRNWDQIKLPGLRPS 380
S + + W QIK+PGL S
Sbjct: 311 GS--------------------------------------IDSRNTNGWSQIKMPGLHQS 332
Query: 381 MSMSDFLGHIEHHISKEMASGDPSFSAERLEYQQMMDGITQHLLNDNQVTTDSDEKSLMS 440
+SM+DFL + S + E Q + + Q LL+DN T SDEKS+MS
Sbjct: 333 ISMNDFLTFL---------------SDQACENNQEFEEMKQLLLSDNTQTDPSDEKSVMS 377
Query: 441 RVNSLRCLLQMDPPAVPNSHDNTGFIEGPNDAKVNIDIKATEENSRDV---YGGNPAPGM 497
+VNS LLQ + NS N + V+ + E R V P GM
Sbjct: 378 KVNSFCNLLQ----SAANSQLNIETADTERVVGVDNNRHMPEGGKRVVDPASSSKPLQGM 433
Query: 498 SRKDSFGDLLLSLPRIASLPKFLFDISE 525
SRKDSF DLL+ LPRI SLPKFLF+ISE
Sbjct: 434 SRKDSFSDLLVHLPRITSLPKFLFNISE 461
>ref|XP_468236.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
gi|47497146|dbj|BAD19195.1| hypothetical protein [Oryza
sativa (japonica cultivar-group)]
gi|47497593|dbj|BAD19663.1| hypothetical protein [Oryza
sativa (japonica cultivar-group)]
Length = 523
Score = 301 bits (772), Expect = 3e-80
Identities = 205/496 (41%), Positives = 272/496 (54%), Gaps = 81/496 (16%)
Query: 40 LNKRHKATNDILNEPSPLGLSLRKSPSLLDLIQMTLCQENSVNANTANDNLNSKANKNGR 99
L R K IL+ SPLGL LRKSPSLL+LIQM L EN T +++ S++
Sbjct: 83 LGDRLKWDMHILDGSSPLGLRLRKSPSLLELIQMKLAMEN-----TKKEDIKSRS----L 133
Query: 100 ASVEKLKASNFPATHLKIGSWEYKSKYEGDLVAKCYFAKQKLVWEVLEGELKSKIEIQWS 159
+ E++KASNF A LKIG+WE S+YEGDLVAKCYFAK KLVWEVL+ LK KIEIQWS
Sbjct: 134 IASERVKASNFAADFLKIGTWECTSQYEGDLVAKCYFAKHKLVWEVLDAGLKRKIEIQWS 193
Query: 160 DISQLKANCPDDGPSTLTLMVARQPLFFRETNPQPRKHTLWQSTTDFTGGQASIHRRHVL 219
DI LKA CP++G TL L++AR P FF+ET+PQPRKHTLWQ +DFTGGQASI RRH+L
Sbjct: 194 DIIALKATCPENGIGTLDLVLARPPTFFKETDPQPRKHTLWQVASDFTGGQASIKRRHIL 253
Query: 220 QCEQGLLIKHYEKLVQCNDRLKFLSQQPEIMVDSPHFDPRS--AAIENPHNLKDCDLHQG 277
QC+ LL K++EKL+QC+ RL +LS QP M+DSP F P++ + ENP+ K + G
Sbjct: 254 QCQSSLLSKNFEKLIQCDQRLNYLSLQP-YMIDSPVFRPKTEGSIFENPNKSKS---YHG 309
Query: 278 NGSAVSCFQNMGSPHSSLSPSFTTEHSDPSAITLDSVPCEAPSSSSA*IMKFSSKICQMS 337
F + H S + S PC+ P S MK Q S
Sbjct: 310 -------FSYLEGEHESHLSKYIDHVS----------PCDFPLMSKKDGMKDDIANQQQS 352
Query: 338 II*NL**LCKSVIVVLAC*FHQ*FNMLGSRNWD-------QIKLP-----GLRPSMSMSD 385
F + N G+ + D ++K P S+S+ D
Sbjct: 353 -------------------FSRPINW-GASDVDLQVDVSQELKSPHPNSLSQARSLSIDD 392
Query: 386 FLGHIEHHISKEMASG-DPSFSAERLEYQQMMDGITQHLLNDNQVTTDSDEKSLMSRVNS 444
L H++ I ++ +G +PS ++++ ITQ LL+D+ V SDEK +M+RV S
Sbjct: 393 LLSHLDDCIVEQKPAGNNPSLPISEASSNELLEKITQQLLSDSHVAPASDEKRVMARVGS 452
Query: 445 LRCLLQMD------PPAVPNSHDNTGFIEGPNDAKVNIDIKATEENSRDVYGGNPAPGMS 498
L LLQ D P PN G +E + +++ I G NP PG+S
Sbjct: 453 LLSLLQKDAVPANLPKFEPNDSGKIGVVEVGISSALDMGI---------ANGTNP-PGIS 502
Query: 499 RKDSFGDLLLSLPRIA 514
RKDS+ +LL +L I+
Sbjct: 503 RKDSYEELLSNLFNIS 518
>emb|CAC84497.1| hypothetical protein [Pinus pinaster]
Length = 197
Score = 229 bits (584), Expect = 2e-58
Identities = 111/177 (62%), Positives = 136/177 (76%)
Query: 103 EKLKASNFPATHLKIGSWEYKSKYEGDLVAKCYFAKQKLVWEVLEGELKSKIEIQWSDIS 162
+KLKASNFP ++L+IG+WE S+YEGDLVAKCYFAK KLVWEVL+G LKSKIEIQWSDI+
Sbjct: 15 DKLKASNFPVSNLRIGTWECISRYEGDLVAKCYFAKHKLVWEVLDGGLKSKIEIQWSDIT 74
Query: 163 QLKANCPDDGPSTLTLMVARQPLFFRETNPQPRKHTLWQSTTDFTGGQASIHRRHVLQCE 222
LKA+ +D P TL + V+R PLFFRETNPQPRKHTLWQ+T+DFTGGQA+I RRH LQC
Sbjct: 75 ALKASYLEDEPGTLDIEVSRPPLFFRETNPQPRKHTLWQATSDFTGGQATICRRHFLQCP 134
Query: 223 QGLLIKHYEKLVQCNDRLKFLSQQPEIMVDSPHFDPRSAAIENPHNLKDCDLHQGNG 279
GLL +HYEKL+QC+ RL LS++ + SP FD +SA ++ H+ G
Sbjct: 135 HGLLNRHYEKLIQCDPRLNLLSKKGFLSEASPLFDSKSAVFQDLDEQSSYAKHKKRG 191
>dbj|BAD72448.1| unknown protein [Oryza sativa (japonica cultivar-group)]
Length = 299
Score = 167 bits (423), Expect = 8e-40
Identities = 114/334 (34%), Positives = 171/334 (51%), Gaps = 60/334 (17%)
Query: 213 IHRRHVLQCEQGLLIKHYEKLVQCNDRLKFLSQQPEIMVDSPHFDPRSAAIENP------ 266
++RRH LQC LL K++EKL+QC+ RL LSQQP+I++DSP F+PR + E+P
Sbjct: 1 MNRRHFLQCPSSLLSKNFEKLLQCDQRLNQLSQQPDIILDSPVFEPRCSIFEDPVESKCQ 60
Query: 267 --HNLKD-CDLHQGNGSAVSCFQNMGSPHSSLSPSFTTE--------HSDPSAITLDSVP 315
NLKD +L +GS C + S ++ S T+ + PSA+ + V
Sbjct: 61 GFTNLKDEHELSGFSGSLSPCAGSSMSAKIEVNDSIATQAGFLAQPGNPGPSAVNVQGVS 120
Query: 316 CEAPSSSSA*IMKFSSKICQMSII*NL**LCKSVIVVLAC*FHQ*FNMLGSRNWDQIKLP 375
+ I + W Q+K+P
Sbjct: 121 RNVNGAPELNIPSW---------------------------------------WSQLKVP 141
Query: 376 GLRPSMSMSDFLGHIEHHISKEMASGDPSFSAERLEYQQMMDGITQHLLNDNQ--VTTDS 433
GLRPSMS+ D + H+ + IS+++ S +P+ + + ++ ++ I Q+LL D Q + S
Sbjct: 142 GLRPSMSVDDLVNHLGNCISEQITSVNPTLPSNEVPTKETLEEIAQYLLGDAQGPPASTS 201
Query: 434 DEKSLMSRVNSLRCLLQMDPPAV--PNSHDNTGFIEGPNDAKVNIDIKATEENSRDVYGG 491
DE+SLM+RV+SL CL+Q D P V P N G + + + + ++ ++ G
Sbjct: 202 DERSLMARVDSLCCLIQKDTPPVAQPKPEPNDSDSIGGDGTEGSDEEFSSAASTVKTTGP 261
Query: 492 NPAPGMSRKDSFGDLLLSLPRIASLPKFLFDISE 525
P MSRKDSFGDLL++LPRIASLP+FLF I E
Sbjct: 262 AQPPAMSRKDSFGDLLMNLPRIASLPQFLFKIPE 295
>gb|AAF26079.1| unknown protein [Arabidopsis thaliana] gi|15230043|ref|NP_187228.1|
expressed protein [Arabidopsis thaliana]
Length = 410
Score = 133 bits (334), Expect = 2e-29
Identities = 85/197 (43%), Positives = 112/197 (56%), Gaps = 14/197 (7%)
Query: 48 NDILNEPSPLGLSLRKSPSLLDLIQMTL-CQENSVNANTANDNLNSKANKNGRASVEKLK 106
N ++E L L L K+P L++ I+ L + T N + S K S EKLK
Sbjct: 17 NRFVDEGPRLNLPLTKTPELINKIESYLKVHYTCPHQQTENSSKTSTLPK----SPEKLK 72
Query: 107 ASNFPATHLKIGSWEYKSKYEGDLVAKCYFAKQKLVWEVLEGE-------LKSKIEIQWS 159
A NFP + +KIG + +K D+VAK YFAK+KL+WE L GE LKSKIEIQW+
Sbjct: 73 AMNFPISTIKIGDCVFVAKNPDDIVAKFYFAKKKLLWEFLFGEPVANMPRLKSKIEIQWN 132
Query: 160 DISQLKANCPD-DGPSTLTLMVARQPLFFRETNPQPRKHTLW-QSTTDFTGGQASIHRRH 217
D+S + + D L + + ++P FF ETNPQ KHT W Q DFTG QAS +RRH
Sbjct: 133 DVSSFEESINSRDETGILKIELKKRPTFFTETNPQAGKHTQWKQLDYDFTGDQASYYRRH 192
Query: 218 VLQCEQGLLIKHYEKLV 234
L G+L K+ EKL+
Sbjct: 193 TLHFPPGVLQKNLEKLL 209
>gb|AAD25610.1| Hypothetical protein [Arabidopsis thaliana] gi|25405713|pir||E96584
hypothetical protein F20D21.12 [imported] - Arabidopsis
thaliana
Length = 444
Score = 132 bits (333), Expect = 2e-29
Identities = 90/211 (42%), Positives = 119/211 (55%), Gaps = 15/211 (7%)
Query: 57 LGLSLRKSPSLLDLIQMTLCQENSV-NANTANDNLNSKANKNGRASVEKLKASNFPATHL 115
L LSL KSP L++ I+ L + + T N + S K S EKLKA NFP + +
Sbjct: 8 LNLSLTKSPELINKIESYLNGHCTCPHQQTENSSKRSTLPK----SPEKLKAMNFPISTI 63
Query: 116 KIGSWEYKSKYEGDLVAKCYFAKQKLVWEVLEGE-------LKSKIEIQWSDISQLKANC 168
+IG W +K D+VAK YFAK+KL+WE L GE LK KIEIQW+D+S + +
Sbjct: 64 RIGGWVVVAKNPDDIVAKFYFAKKKLIWEFLFGEPETNTLRLKRKIEIQWNDVSSFEESI 123
Query: 169 PD-DGPSTLTLMVARQPLFFRETNPQPRKHTLW-QSTTDFTGGQASIHRRHVLQCEQGLL 226
D L + + ++P FF ETNPQ KHT W Q DFTG AS +RRH L G+L
Sbjct: 124 SSRDETGILKIELKKRPTFFIETNPQAGKHTQWKQLDHDFTGDHASNYRRHTLHFPPGVL 183
Query: 227 IKHYEKLVQCNDRLKFLSQQPEIMVDSPHFD 257
K+ EKLV + K L + P + +S +FD
Sbjct: 184 QKNLEKLVTDSFWSK-LYEVPFPVHESRYFD 213
>gb|AAM09320.2| similar to Dictyostelium discoideum (Slime mold).
Homeobox-containing protein (Fragment)
gi|66820947|ref|XP_644015.1| hypothetical protein
DDB0167808 [Dictyostelium discoideum]
gi|60472067|gb|EAL70020.1| hypothetical protein
DDB0167808 [Dictyostelium discoideum]
Length = 1108
Score = 132 bits (333), Expect = 2e-29
Identities = 70/191 (36%), Positives = 116/191 (60%), Gaps = 11/191 (5%)
Query: 69 DLIQMTLCQENSVNANTANDNLNSKA--NKNGRASVEKLKASNFPATHLKIGSWEYKSKY 126
+++ T +S+ NT N +N A + +G KLK SN AT +K+G WE + +
Sbjct: 442 NILSSTSTSSSSIPINT-NPLINDSAVDDDDGLTETSKLKPSNLAATKIKVGEWECSASF 500
Query: 127 EGDLVAKCYFAKQKLVWEVLEGELKSKIEIQWSDISQLKANCPDDGPSTLTLMVARQPLF 186
GD++AK YF K+++VWE+L+ LKSK+ I +S+I+ +K D +TL++ +++ P F
Sbjct: 501 PGDIIAKFYFTKKQIVWELLKLGLKSKMVISFSEITAVKVEDTSDDSATLSIEISKAPKF 560
Query: 187 FRETNPQPRKHTLWQSTTDFTGGQ-ASIHRRHVLQCEQGLLIKHYEKLVQCNDRLKFLSQ 245
++E NPQP+K+T W + DFT Q AS +RRH+L ++ L K+ KL + + +LK L
Sbjct: 561 YKEVNPQPKKNTTWNLSNDFTDNQSASTYRRHILTFKKSSLAKNIVKLHKSDQKLKKL-- 618
Query: 246 QPEIMVDSPHF 256
+++ HF
Sbjct: 619 -----IENQHF 624
>ref|NP_175832.1| hypothetical protein [Arabidopsis thaliana]
Length = 314
Score = 115 bits (287), Expect = 4e-24
Identities = 70/158 (44%), Positives = 93/158 (58%), Gaps = 10/158 (6%)
Query: 109 NFPATHLKIGSWEYKSKYEGDLVAKCYFAKQKLVWEVLEGE-------LKSKIEIQWSDI 161
NFP + ++IG W +K D+VAK YFAK+KL+WE L GE LK KIEIQW+D+
Sbjct: 2 NFPISTIRIGGWVVVAKNPDDIVAKFYFAKKKLIWEFLFGEPETNTLRLKRKIEIQWNDV 61
Query: 162 SQLKANCPD-DGPSTLTLMVARQPLFFRETNPQPRKHTLW-QSTTDFTGGQASIHRRHVL 219
S + + D L + + ++P FF ETNPQ KHT W Q DFTG AS +RRH L
Sbjct: 62 SSFEESISSRDETGILKIELKKRPTFFIETNPQAGKHTQWKQLDHDFTGDHASNYRRHTL 121
Query: 220 QCEQGLLIKHYEKLVQCNDRLKFLSQQPEIMVDSPHFD 257
G+L K+ EKLV + K L + P + +S +FD
Sbjct: 122 HFPPGVLQKNLEKLVTDSFWSK-LYEVPFPVHESRYFD 158
>ref|XP_638504.1| hypothetical protein DDB0218621 [Dictyostelium discoideum]
gi|60467112|gb|EAL65150.1| hypothetical protein
DDB0218621 [Dictyostelium discoideum]
Length = 1222
Score = 89.0 bits (219), Expect = 3e-16
Identities = 50/164 (30%), Positives = 87/164 (52%), Gaps = 1/164 (0%)
Query: 79 NSVNANTANDNLNSKANKNGRASVEKLKASNFPATHLKIGSWEYKSKYEGDLVAKCYFAK 138
N+ N N +N N N++ N + S + K S+ P+ +KIG+WE S + GDLVA+ + +
Sbjct: 610 NNSNNNNSNANSNNQNNSSNNISPQVYKFSDLPSISMKIGNWEKISSFCGDLVARFSYTE 669
Query: 139 QKLVWEVL-EGELKSKIEIQWSDISQLKANCPDDGPSTLTLMVARQPLFFRETNPQPRKH 197
+K +WE+ G+ +K+E+ + D++ + N D +TL + + P F+ +
Sbjct: 670 RKFMWEIFNNGKSLTKMEMFFDDVTSVDINLLDGDKVEMTLHLKKCPTFYEAIFIENIPS 729
Query: 198 TLWQSTTDFTGGQASIHRRHVLQCEQGLLIKHYEKLVQCNDRLK 241
++QS DFTGG+A+ +H L + L E L + R K
Sbjct: 730 MVYQSCGDFTGGEATKSNKHTLCFIKNALANPLEALSSTDPRFK 773
>gb|EAK85872.1| hypothetical protein UM04928.1 [Ustilago maydis 521]
gi|49077346|ref|XP_402543.1| hypothetical protein
UM04928.1 [Ustilago maydis 521]
Length = 1084
Score = 41.6 bits (96), Expect = 0.062
Identities = 51/197 (25%), Positives = 76/197 (37%), Gaps = 48/197 (24%)
Query: 78 ENSVNANTANDNLNSKANKNGRASVEKLKAS--NFPATHLKIGSWEYKSKYEGDLVAKCY 135
+ ++ +A DN + GR +L S P + L IG+W S C+
Sbjct: 328 DKDAHSKSAGDNA-----RLGRGDPAELVPSVTAIPTSALCIGTWRRVSPLI------CF 376
Query: 136 FAK--QKLVWEVLEGELKSKIEIQWSDISQLKANCPDDGPST----------------LT 177
F++ Q L W + + K+EI W+ I + DGPS
Sbjct: 377 FSRRLQSLTWYLTSESIGFKLEIPWTSI----RSAYFDGPSNPSIAERAEGVRVPLGHFV 432
Query: 178 LMVARQPLFFRE----------TNPQPRKHTLWQSTTDFT-GGQASIHRRHVLQCEQGLL 226
+ + R P FF E +N +P+ W+ DFT QA RHV+ L
Sbjct: 433 IDLERPPTFFMEVFRSAPLKEGSNEEPK--VSWRQCEDFTEDHQAMTVSRHVINGPYEEL 490
Query: 227 IKHYEKLVQCNDRLKFL 243
+ L +CND LK L
Sbjct: 491 RIAVQALARCNDVLKRL 507
>ref|XP_641873.1| hypothetical protein DDB0205043 [Dictyostelium discoideum]
gi|60469913|gb|EAL67896.1| hypothetical protein
DDB0205043 [Dictyostelium discoideum]
Length = 1045
Score = 38.1 bits (87), Expect = 0.69
Identities = 32/102 (31%), Positives = 48/102 (46%), Gaps = 10/102 (9%)
Query: 1 MVRFMGSKNPRSQWESSSATSSISPKFEIEDSIQDQHAPLNKR--HKATNDILNEPSPLG 58
M +F+GS P W + SPK +I D I H L K +D N+ S G
Sbjct: 472 MTKFVGSIVPDRAWIEKES----SPKTQIIDMIFSPHGDLIMTVVKKENHDHKNKSSN-G 526
Query: 59 LSLRKSPSLLDLIQMTLCQENSVNANT---ANDNLNSKANKN 97
+ + PS L+L+ +NS ++NT +N+N N+ N N
Sbjct: 527 IGFQCIPSTLELLNFNSPTDNSNSSNTSLSSNNNNNNNNNNN 568
>ref|XP_636103.1| hypothetical protein DDB0188566 [Dictyostelium discoideum]
gi|60464445|gb|EAL62592.1| hypothetical protein
DDB0188566 [Dictyostelium discoideum]
Length = 1966
Score = 38.1 bits (87), Expect = 0.69
Identities = 27/90 (30%), Positives = 41/90 (45%)
Query: 8 KNPRSQWESSSATSSISPKFEIEDSIQDQHAPLNKRHKATNDILNEPSPLGLSLRKSPSL 67
+N SQ +S S+TSS SPK E + + Q K + D + P+ + S + S
Sbjct: 1648 QNKLSQQQSPSSTSSSSPKEEQQQQQKQQQQQQQTIKKRSFDEITLPTSIKRSKSDNQSS 1707
Query: 68 LDLIQMTLCQENSVNANTANDNLNSKANKN 97
T+ N+ N N N+N N+ N N
Sbjct: 1708 SPSSTTTVNNNNNNNNNNNNNNNNNNNNNN 1737
>gb|AAK54089.2| histidine kinase DhkH [Dictyostelium discoideum]
Length = 1147
Score = 37.7 bits (86), Expect = 0.90
Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 1/51 (1%)
Query: 60 SLRKSPSLLDLIQMTLCQEN-SVNANTANDNLNSKANKNGRASVEKLKASN 109
SL SPS+ L +L N +++ N +N+N+N+ N +G ++ +KLK SN
Sbjct: 847 SLSSSPSIQGLTNSSLSINNINISGNNSNNNINNNNNNSGSSTPKKLKKSN 897
>ref|XP_641414.1| histidine kinase [Dictyostelium discoideum]
gi|60469435|gb|EAL67428.1| histidine kinase
[Dictyostelium discoideum]
Length = 1378
Score = 37.7 bits (86), Expect = 0.90
Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 1/51 (1%)
Query: 60 SLRKSPSLLDLIQMTLCQEN-SVNANTANDNLNSKANKNGRASVEKLKASN 109
SL SPS+ L +L N +++ N +N+N+N+ N +G ++ +KLK SN
Sbjct: 1078 SLSSSPSIQGLTNSSLSINNINISGNNSNNNINNNNNNSGSSTPKKLKKSN 1128
>ref|XP_641068.1| hypothetical protein DDB0206062 [Dictyostelium discoideum]
gi|60469102|gb|EAL67098.1| hypothetical protein
DDB0206062 [Dictyostelium discoideum]
Length = 946
Score = 37.4 bits (85), Expect = 1.2
Identities = 31/122 (25%), Positives = 57/122 (46%), Gaps = 5/122 (4%)
Query: 8 KNPRSQWESSSATSSISPKFEIEDSIQDQHAPLNKRHKATNDILNEPSPLGLSLRKSPSL 67
K+ S+ +S S+T+S+S +I NK+ ++T+ + P + S SPSL
Sbjct: 462 KSKNSKSKSHSSTTSLSSLLANNANISSSTCLSNKKSQSTSPS-SAPISIPKSSPSSPSL 520
Query: 68 LDLIQMTLCQENSVN--ANTANDNLNSKANKNGRASVEKLKASNFPATHLKIGSWEYKSK 125
++ + N +N ANT+ +N + N N ++ K SN ++ I + K
Sbjct: 521 MNSNNNSNSNNNGINILANTSENNNINNTNNNINININNNKNSN--NNNINISPMKINGK 578
Query: 126 YE 127
Y+
Sbjct: 579 YD 580
>ref|NP_765049.1| hypothetical protein SE1494 [Staphylococcus epidermidis ATCC 12228]
gi|27315959|gb|AAO05093.1| conserved hypothetical
protein [Staphylococcus epidermidis ATCC 12228]
Length = 765
Score = 37.0 bits (84), Expect = 1.5
Identities = 32/109 (29%), Positives = 47/109 (42%), Gaps = 10/109 (9%)
Query: 397 EMASGDPSFSAERLEYQQMMDGITQHLLNDNQVTTDSDEKSLMSRVNSLRCLLQMDPPAV 456
E A A+ + + M+D + DNQ + DEKS ++ N LQ D
Sbjct: 460 ENARSSAKSVADEIHDKHMLDNAGSYNKRDNQ--KNKDEKSFINPDNPN---LQQDKKQG 514
Query: 457 PNSHDNTGFIEGPNDAKVNIDIKATEENSRDVYGGNPAPGMSRKDSFGD 505
NS+ +TG ND+K+N TEE + G + KDS G+
Sbjct: 515 ENSYSDTG-----NDSKLNKSNNQTEEAEDNKNGHDKNIDSKEKDSLGN 558
>dbj|BAC42834.1| unknown protein [Arabidopsis thaliana]
Length = 107
Score = 37.0 bits (84), Expect = 1.5
Identities = 24/62 (38%), Positives = 32/62 (50%), Gaps = 3/62 (4%)
Query: 274 LHQGNGSAVSCFQNMGSPHSSLSPSFTTEHSD---PSAITLDSVPCEAPSSSSA*IMKFS 330
L + + A SC + S S S SF+T S P+ + S C APSSSS+ I + S
Sbjct: 7 LSKKDALAASCSSSSTSSKSKFSRSFSTSASSSKAPAFVRSSSTKCSAPSSSSSSISRSS 66
Query: 331 SK 332
SK
Sbjct: 67 SK 68
>gb|AAL68158.2| AT30755p [Drosophila melanogaster]
Length = 1193
Score = 36.6 bits (83), Expect = 2.0
Identities = 38/166 (22%), Positives = 75/166 (44%), Gaps = 24/166 (14%)
Query: 16 SSSATSSISPKF---EIEDSIQDQHAPLNKRHKATNDILNEPSPLGLSLRKSPSLLDLIQ 72
S +A + IS + + +D+ + H + ++ + ++ P GL+ + SP +LDL
Sbjct: 362 SDAALTVISSRLTAGQEDDNARIGHGHITNTSAESDHMTSKSIPSGLTTKDSPDVLDL-- 419
Query: 73 MTLCQENSVNANTANDNLNSKANKNG---------RASVEKLKASNFPATHLKIGSWEYK 123
C+ ++++ N L A N + ++K +A+NF A I S
Sbjct: 420 --SCESEPLHSSKQNSQLMEDAETNFTERLKVNQLKEQIDKQEAANFKAQENLIKSIRQT 477
Query: 124 SKYEGD------LVAKCYFAKQ--KLVWEVLEGELKSKIEIQWSDI 161
+ GD L+AK + +L +V +LKS++E + S++
Sbjct: 478 IQALGDKEKMENLIAKSKMEEDPFQLKSQVESSQLKSQVEGESSEL 523
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.322 0.135 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 903,535,339
Number of Sequences: 2540612
Number of extensions: 38440243
Number of successful extensions: 146716
Number of sequences better than 10.0: 47
Number of HSP's better than 10.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 34
Number of HSP's that attempted gapping in prelim test: 146455
Number of HSP's gapped (non-prelim): 194
length of query: 525
length of database: 863,360,394
effective HSP length: 133
effective length of query: 392
effective length of database: 525,458,998
effective search space: 205979927216
effective search space used: 205979927216
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)
Medicago: description of AC126784.4