KMC015242A_c01
[Fasta Sequence]
[Nr Search]
[EST assemble image]
Fasta Sequence
>KMC015242A_C01 KMC015242A_c01
ctctcattgcagactccgacgcgcacccgcacccgcacccaaactctctttctcttgccg
cctccttcgatttccccaaaGCTCAAATCCCCAAAAACCCAATCATCACTATCTCCGTCT
ACAAATCCCCAACAACCCCCTCCTGCATCTTCACCTCCGCCAAACTCCTCGGCAAGGTCA
CGCTGCCTCTCGATCTCACCATGGCGGACTCGCGACCCTGCATGTTCCAAAACGGATGGC
TTCCTCTCGGCCGCAAAACACACAACACTCAGTTGTTGCACTTAACACTCCGGGCCGAAC
CGGACCCGAGGTTCGTTTTCCGCTTCGACGGTGAACCGGAGTGCAGCCCGCAGGTTTTTC
AAGTCAAAGGAGATGTTAAGCAGCCGGTTTTCACTTGCAAGTTCAGTTTCAGAGATAGAA
ACCCGGTTCAGTTCCCCTCCTCGACCACCGCGAACGAGCGGAAAGGATGGTCCATCACGG
TGCACGATTTATCGGGTTCTCCGGTTGCGGCGGCGTCTATGGCCACTCCGTTCGTTCCCT
CACCCGGTTCACAGCGGGTCAGCAAGTCCAACCCGGGAGCCTGGCTCATCATCCGACCCG
ACGGTGATTGGCACTTGAAGCCTTGGGGCCGCCTCGAGAGCGTGGGTGAAnccaacaact
ccaacgccgtcggg
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC015242A_C01 KMC015242A_c01
(674 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_175426.1| hypothetical protein; protein id: At1g50040.1, ... 218 6e-56
ref|NP_172473.1| unknown protein; protein id: At1g10020.1 [Arabi... 213 3e-54
gb|AAL60032.1| unknown protein [Arabidopsis thaliana] 203 2e-51
ref|NP_188602.1| unknown protein; protein id: At3g19680.1, suppo... 203 2e-51
ref|NP_194660.1| putative protein; protein id: At4g29310.1, supp... 195 4e-49
>ref|NP_175426.1| hypothetical protein; protein id: At1g50040.1, supported by cDNA:
gi_18389271, supported by cDNA: gi_20258934 [Arabidopsis
thaliana] gi|25354742|pir||H96536 hypothetical protein
F2J10.8 [imported] - Arabidopsis thaliana
gi|8569096|gb|AAF76441.1|AC015445_8 ESTs gb|AI994059,
gb|T43740 come from this gene. [Arabidopsis thaliana]
gi|18389272|gb|AAL67079.1| unknown protein [Arabidopsis
thaliana] gi|20258935|gb|AAM14183.1| unknown protein
[Arabidopsis thaliana]
Length = 460
Score = 218 bits (555), Expect = 6e-56
Identities = 123/235 (52%), Positives = 157/235 (66%), Gaps = 24/235 (10%)
Frame = +3
Query: 42 NSLSLAASFDFPKAQIP------KNPIITISVYKSPTTPSCIFTSA---KLLGKVTLPLD 194
N ++AA F K+QI K ++++ VY S + SC F +A KL+G+ + LD
Sbjct: 78 NVSTVAACFSLSKSQIETSLKKAKWSVLSVEVY-SRRSASCGFVAASGEKLIGRFQVTLD 136
Query: 195 LTMADSRPCMFQNGWLPLGRKTHNTQL------LHLTLRAEPDPRFVFRFDGEPECSPQV 356
L A+S+ C+ NGW+ LG K+ N + LH+++R EPD RFVF+FDGEPECSPQV
Sbjct: 137 LKAAESKTCLAHNGWVDLGTKSKNNKKSGSDPELHVSVRVEPDTRFVFQFDGEPECSPQV 196
Query: 357 FQVKGDVKQPVFTCKFSFR---DRNPVQFPSSTTAN------ERKGWSITVHDLSGSPVA 509
FQV+G+ KQ VFTCKF FR DRN SS T+ ERKGWSIT+HDLSGSPVA
Sbjct: 197 FQVQGNAKQAVFTCKFGFRNSGDRNLSLSLSSVTSGKEQFSKERKGWSITIHDLSGSPVA 256
Query: 510 AASMATPFVPSPGSQRVSKSNPGAWLIIRPDGDWHLKPWGRLESVGEXNNSNAVG 674
ASM TPFVPSPGS RVS+S+PGAWLI+RPDG + KPW RL++ E S+ +G
Sbjct: 257 MASMVTPFVPSPGSNRVSRSSPGAWLILRPDG-YTWKPWVRLQAWREPGVSDVLG 310
>ref|NP_172473.1| unknown protein; protein id: At1g10020.1 [Arabidopsis thaliana]
gi|7487490|pir||T00621 hypothetical protein T27I1.4 -
Arabidopsis thaliana gi|3540181|gb|AAC34331.1| Unknown
protein [Arabidopsis thaliana]
Length = 461
Score = 213 bits (541), Expect = 3e-54
Identities = 111/234 (47%), Positives = 149/234 (63%), Gaps = 33/234 (14%)
Frame = +3
Query: 39 PNSLSLAASFDFPKAQIPK---------NPIITISVYKSPTTPSCIFTSAKLLGKVTLPL 191
P +LAA+F + I + P + I +Y +C S +LL KV++PL
Sbjct: 63 PEIQTLAATFHLSSSDIQRLASRSIFTSKPCLKILIYTGRAGAACGVHSGRLLAKVSVPL 122
Query: 192 DLTMADSRPCMFQNGWLPLGR---KTHNTQLLHLTLRAEPDPRFVFRFDGEPECSPQVFQ 362
DL+ S+PC+F NGW+ +G+ K+ ++ HL ++AEPDPRFVF+FDGEPECSPQV Q
Sbjct: 123 DLSGTQSKPCVFHNGWISVGKGAGKSSSSAQFHLNVKAEPDPRFVFQFDGEPECSPQVVQ 182
Query: 363 VKGDVKQPVFTCKFSFRD-----RNPVQFPSSTTAN----------------ERKGWSIT 479
++G+++QPVFTCKFS R + P+ T+ + ERKGWSIT
Sbjct: 183 IQGNIRQPVFTCKFSCRHTGDRTQRSRSLPTETSVSRSWLNSFGSERERPGKERKGWSIT 242
Query: 480 VHDLSGSPVAAASMATPFVPSPGSQRVSKSNPGAWLIIRPDGDWHLKPWGRLES 641
VHDLSGSPVA AS+ TPFV SPG+ RVS+SNPG+WLI+RP GD +PWGRLE+
Sbjct: 243 VHDLSGSPVAMASIVTPFVASPGTDRVSRSNPGSWLILRP-GDCTWRPWGRLEA 295
>gb|AAL60032.1| unknown protein [Arabidopsis thaliana]
Length = 491
Score = 203 bits (517), Expect = 2e-51
Identities = 122/268 (45%), Positives = 157/268 (58%), Gaps = 46/268 (17%)
Frame = +3
Query: 9 ADSDAHPHPHPNSLSLAASFDFPKAQI------PKNPIITISVYKSPTTP--------SC 146
++S+ N ++AA F KAQI PK ++++ Y + SC
Sbjct: 67 SESETRCSSSGNVSTVAACFSLSKAQIEASLKKPKFSVLSVEAYSRGNSDGDDGVSGASC 126
Query: 147 IFTSA--KLLGKVTLPLDLTMADSRPCMFQNGWLPLGRK-----THNTQLLHLTLRAEPD 305
+A KLLG+ + LDL A+++ + NGW+ L K T + LH+++R EPD
Sbjct: 127 GLATAGEKLLGRFEVSLDLKSAETKSFLAHNGWVALPSKKTKSKTGSDPELHVSVRVEPD 186
Query: 306 PRFVFRFDGEPECSPQVFQVKGDVKQPVFTCKFSFR-----DRNPVQFPSSTT------- 449
PRFVF+FDGEPECSPQVFQV+G+ KQ VFTCKF R DRN + S +
Sbjct: 187 PRFVFQFDGEPECSPQVFQVQGNTKQAVFTCKFGSRNSNSGDRNLLHSSSMMSEISSTRS 246
Query: 450 ------------ANERKGWSITVHDLSGSPVAAASMATPFVPSPGSQRVSKSNPGAWLII 593
+ ERKGWSITVHDLSGSPVA ASM TPFVPSPGS RV++S+PGAWLI+
Sbjct: 247 CISSMNSEKEQPSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSNRVTRSSPGAWLIL 306
Query: 594 RPDG-DWHLKPWGRLESVGEXNNSNAVG 674
RPDG W KPWGRLE+ E S+ +G
Sbjct: 307 RPDGCTW--KPWGRLEAWREAGYSDTLG 332
>ref|NP_188602.1| unknown protein; protein id: At3g19680.1, supported by cDNA:
gi_18176371 [Arabidopsis thaliana]
gi|25354743|pir||T52398 hypothetical protein MMB12.17
[imported] - Arabidopsis thaliana
gi|9294435|dbj|BAB02555.1|
gb|AAC34331.1~gene_id:MMB12.17~similar to unknown
protein [Arabidopsis thaliana]
gi|23297407|gb|AAN12962.1| unknown protein [Arabidopsis
thaliana]
Length = 491
Score = 203 bits (517), Expect = 2e-51
Identities = 122/268 (45%), Positives = 157/268 (58%), Gaps = 46/268 (17%)
Frame = +3
Query: 9 ADSDAHPHPHPNSLSLAASFDFPKAQI------PKNPIITISVYKSPTTP--------SC 146
++S+ N ++AA F KAQI PK ++++ Y + SC
Sbjct: 67 SESETRCSSSGNVSTVAACFSLSKAQIEASLKKPKFSVLSVEAYSRGNSDGDDGVSGASC 126
Query: 147 IFTSA--KLLGKVTLPLDLTMADSRPCMFQNGWLPLGRK-----THNTQLLHLTLRAEPD 305
+A KLLG+ + LDL A+++ + NGW+ L K T + LH+++R EPD
Sbjct: 127 GLATAGEKLLGRFEVSLDLKSAETKSFLAHNGWVALPSKKTKSKTGSDPELHVSVRVEPD 186
Query: 306 PRFVFRFDGEPECSPQVFQVKGDVKQPVFTCKFSFR-----DRNPVQFPSSTT------- 449
PRFVF+FDGEPECSPQVFQV+G+ KQ VFTCKF R DRN + S +
Sbjct: 187 PRFVFQFDGEPECSPQVFQVQGNTKQAVFTCKFGSRNSNSGDRNLLHSSSMMSEISSTRS 246
Query: 450 ------------ANERKGWSITVHDLSGSPVAAASMATPFVPSPGSQRVSKSNPGAWLII 593
+ ERKGWSITVHDLSGSPVA ASM TPFVPSPGS RV++S+PGAWLI+
Sbjct: 247 CISSMNSEKEQPSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSNRVTRSSPGAWLIL 306
Query: 594 RPDG-DWHLKPWGRLESVGEXNNSNAVG 674
RPDG W KPWGRLE+ E S+ +G
Sbjct: 307 RPDGCTW--KPWGRLEAWREAGYSDTLG 332
>ref|NP_194660.1| putative protein; protein id: At4g29310.1, supported by cDNA:
gi_20260629 [Arabidopsis thaliana]
gi|7487111|pir||T13451 hypothetical protein T17A13.130 -
Arabidopsis thaliana gi|7269829|emb|CAB79689.1| putative
protein [Arabidopsis thaliana]
gi|20260630|gb|AAM13213.1| unknown protein [Arabidopsis
thaliana] gi|28059515|gb|AAO30065.1| unknown protein
[Arabidopsis thaliana]
Length = 424
Score = 195 bits (496), Expect = 4e-49
Identities = 113/217 (52%), Positives = 133/217 (61%), Gaps = 27/217 (12%)
Frame = +3
Query: 105 ITISVYKSPTTPSCIFTSAKLLGKVTLPLDLTMADSRPCMFQNGWLPLGRKTHNTQL-LH 281
+ +SVY T +C S KLLGKV + +DL A SR F NGW LG LH
Sbjct: 92 LRVSVYAGRTGHTCGVASGKLLGKVEVAVDLAAALSRTVAFHNGWKKLGGDGDKPSARLH 151
Query: 282 LTLRAEPDPRFVFRFDGEPECSPQVFQVKGDVKQPVFTCKFSFRDRN------PVQFPSS 443
L + AEPDPRFVF+F GEPECSP V+Q++ ++KQPVF+CKFS DRN P F S
Sbjct: 152 LLVCAEPDPRFVFQFGGEPECSPVVYQIQDNLKQPVFSCKFS-SDRNGRSRSLPSGFTYS 210
Query: 444 TT----------------ANERKGWSITVHDLSGSPVAAASMATPFVPSPGSQRVSKSNP 575
+ A ERKGW IT+HDLSGSPVAAASM TPFV SPGS RVS+SNP
Sbjct: 211 SRGWITRTLSGDQWEKKQARERKGWMITIHDLSGSPVAAASMITPFVASPGSDRVSRSNP 270
Query: 576 GAWLIIRPDG----DWHLKPWGRLESVGEXNNSNAVG 674
GAWLI+RP G W KPWGRLE+ E + +G
Sbjct: 271 GAWLILRPHGTCVSSW--KPWGRLEAWRERGAIDGLG 305
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 692,303,925
Number of Sequences: 1393205
Number of extensions: 18283580
Number of successful extensions: 137899
Number of sequences better than 10.0: 896
Number of HSP's better than 10.0 without gapping: 84538
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 124197
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 29704274460
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
EST assemble image
|
|
|
|
clone |
accession |
position |
1 |
MWM063g02_f |
AV765712 |
1 |
344 |
2 |
SPDL009b08_f |
BP052513 |
181 |
674 |
|
Lotus japonicus
Kazusa DNA Research Institute