KMC016381A_c01
Fasta Sequence
>KMC016381A_C01 KMC016381A_c01
aTGAATGAATAAATAAATAAATTCAAATTACTTCTCACTTTTACAACTTTAACATTAATT
AAATTAAAATCATACAAATGATTAAAAAAAACTTCTTGGAATGATCACTCACAAATCACA
TTTTATCTATCGAACCCAACATGCCTTTGTCTTTGAGCAGTTTGAGGCACTTGAGCAGCC
AAACAGTCCCAATCATCATGCCCTCTGAGCTGTGTTCTTTCTGCATCTCAGGAGAACCAA
TTAATCATAGAATGAATTTCTCTTGTGTTATTATTTCCAATAATTCTCCAACACTATCAA
TCATTTATATGTGCAAGAATGCTTGAGGGGGATTGGAGCACTGAGCTTCTTGGGATCCAA
GGTGATGGAACATTGAATCCTCTTGTAGTACTTGGGCTTCACCAATTTCCCTAGGACATA
AGCTCTGGATCGCAGCACAAAGTTCAGGTTCAATGGCACTGGCACAGTTGGCATACCAGT
TGTGCTGCTCAAGCTAGCACCACTTCCATACAGAGGGATCTTGTTGCCCATCACTGCCAC
ACTCACCAACCTGTGACTCCTTCTATGTTGATAAAACTCCTTCATATTCCCTGCAGCAAT
CACAATTTCTGAATAGGACAGTTCTAAGGGTGTAGATGCAACATGAACCCCAAAGAATGT
GCCAGTGTTACGGTATGTGAATTTCAAAGTAGAGTTCATGGAGATCATATCAGTAGCCAC
CCCAGTGGAATCTGAACCAGCTTGGACTTGAACATGATCAAACTTTATGCTCTTGATAAA
AATCTTGGGTTTCATGGGTCTGCTAGCACCCCAGAGAATAAGCGAAAACAGTGTGAAGAG
GAGAAGAAATCCCAGAAGAAAAATGAGGAAGTAGCAGCGACGAGAGAGAGTTCTGTCACG
ATCTTCCCCTTGGAGAAGCCCTTCTTCCTCAATGACATCGATCTGCTTCCATGGCTTGAG
ACTGTGGTGGTGGTGGTGGTTGTCTTTCTTGCGGTGAGGAGCAGAGAACCGAGTGGAGGA
TGAAGAATGAGGAGGGGAAGCGTTGGGGCTGAGAACAGGAGTGGAGTGGAAGGAAGTGGT
GACGGTTTTCTCGCCGTCGTGAGAGTCCCTTGAGGGGCTCTGAACAAAGTAGAGAGGACG
GCGAGGTGGGGATCTTGCAGGGGATGATGCAGAAATgctggtcacctctgagtctgtctt
ggcatgcatttttccttctttgatggaacctgtccttgaaagaatgtgaatgacag
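The BLASTX hits reported for this contig are all in frame -3, so the matching protein is read from the reverse complement of the sequence above. The sketch below illustrates this on the last 33 bases that precede query coordinate 1209, where the alignments begin. The helper names (`reverse_complement`, `translate`) are illustrative, and the standard genetic code is assumed; this is not the code BLASTX itself runs.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate a DNA string codon by codon, starting at its first base."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


# Bases 1177-1209 of the contig, copied from the FASTA record above.
fragment = "gctggtcacctctgagtctgtcttggcatgcat"
peptide = translate(reverse_complement(fragment))
print(peptide)  # MHAKTDSEVTS
```

The printed peptide matches the first residues of the query line in the alignments (`MHAKTDSEVTS...`).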
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC016381A_C01 KMC016381A_c01
(1256 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                      Score    E
Sequences producing significant alignments:                          (bits)  Value
ref|NP_564495.1| expressed protein; protein id: At1g45688.1, sup...    348   1e-94
ref|NP_199100.1| putative protein; protein id: At5g42860.1 [Arab...    343   4e-93
gb|AAO42869.1| At5g42860 [Arabidopsis thaliana]                        341   2e-92
emb|CAB53482.1| CAA30379.1 protein [Oryza sativa]                      337   2e-91
ref|NP_181730.1| unknown protein; protein id: At2g41990.1 [Arabi...    205   1e-51
>ref|NP_564495.1| expressed protein; protein id: At1g45688.1, supported by cDNA: 8255.,
supported by cDNA: gi_20466719 [Arabidopsis thaliana]
gi|25405173|pir||A96511 unknown protein [imported] -
Arabidopsis thaliana
gi|12321012|gb|AAG50630.1|AC083835_15 unknown protein
[Arabidopsis thaliana] gi|20466720|gb|AAM20677.1| unknown
protein [Arabidopsis thaliana] gi|21595730|gb|AAM66126.1|
unknown [Arabidopsis thaliana] gi|23198230|gb|AAN15642.1|
unknown protein [Arabidopsis thaliana]
Length = 342
Score = 348 bits (893), Expect = 1e-94
Identities = 189/340 (55%), Positives = 236/340 (68%), Gaps = 40/340 (11%)
Frame = -3
Query: 1209 MHAKTDSEVTSISASSPARSPPRRPLYFVQSPSRDSHDGEKTVTTSFHSTPVLSPNASPP 1030
MHAKTDSEVTS++ASSPARSP RRP+Y+VQSPSRDSHDGEKT T SFHSTPVLSP SPP
Sbjct: 1 MHAKTDSEVTSLAASSPARSP-RRPVYYVQSPSRDSHDGEKTAT-SFHSTPVLSPMGSPP 58
Query: 1029 HS----------SSSTRFSA---PHRKKDNHHH------HHSLKPWKQIDVIEEEGLLQG 907
HS SSS+RFS P +K N + H K WK+ VIEEEGLL
Sbjct: 59 HSHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 118
Query: 906 EDRDRTLSRRCYFLIFLLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDS 727
DRD + RRCY L F++GF +LF FSLIL+GA++PMKPKI +KSI F+ +++QAG D+
Sbjct: 119 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 178
Query: 726 TGVATDMISMNSTLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRL 547
GV TDMI+MN+TL+ YRNTGTFFGVHV STP++LS+S+I I +G++K+FYQ R+S R
Sbjct: 179 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERT 238
Query: 546 VSVAVMGNKIPLYGSGASLSSTT---------------------GMPTVPVPLNLNFVLR 430
V V V+G KIPLYGSG++L P PVP+ L+FV+R
Sbjct: 239 VLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVR 298
Query: 429 SRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKHSCT 310
SRAYVLGKLV+PK+YK+I+C I + K L+ I + +CT
Sbjct: 299 SRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNCT 338
>ref|NP_199100.1| putative protein; protein id: At5g42860.1 [Arabidopsis thaliana]
gi|9758574|dbj|BAB09187.1|
emb|CAB53482.1~gene_id:MBD2.5~similar to unknown protein
[Arabidopsis thaliana]
Length = 320
Score = 343 bits (879), Expect = 4e-93
Identities = 188/323 (58%), Positives = 230/323 (71%), Gaps = 23/323 (7%)
Frame = -3
Query: 1209 MHAKTDSEVTSISASSPARSPPRRPLYFVQSPSRDSHDGEKTVTTSFHSTPVL-SPNASP 1033
MHAKTDSEVTS+SASSP RSP RRP YFVQSPSRDSHDGEKT T SFHSTPVL SP SP
Sbjct: 1 MHAKTDSEVTSLSASSPTRSP-RRPAYFVQSPSRDSHDGEKTAT-SFHSTPVLTSPMGSP 58
Query: 1032 PHS-SSSTRFSAPHRKKDNHHHHHSLKPWKQIDVIEEEGLLQGEDRDR-TLSRRCYFLIF 859
PHS SSS+RFS + K H KQ +IEEEGLL DR++ L RRCY L F
Sbjct: 59 PHSHSSSSRFSKINGSKRKGHAGE-----KQFAMIEEEGLLDDGDREQEALPRRCYVLAF 113
Query: 858 LLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKF 679
++GF LLF FSLIL+ A++P KPKI +KSI F+ ++VQAG D+ G+ TDMI+MN+TL+
Sbjct: 114 IVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMITMNATLRM 173
Query: 678 TYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSG 499
YRNTGTFFGVHV S+P++LS+S+I I +G++K+FYQ R+S R V V V+G+KIPLYGSG
Sbjct: 174 LYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGDKIPLYGSG 233
Query: 498 ASLSS--------------------TTGMPTVPVPLNLNFVLRSRAYVLGKLVKPKYYKR 379
++L P PVP+ LNF +RSRAYVLGKLV+PK+YKR
Sbjct: 234 STLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAYVLGKLVQPKFYKR 293
Query: 378 IQCSITLDPKKLSAPIPLKHSCT 310
I C I + KKLS IP+ ++CT
Sbjct: 294 IVCLINFEHKKLSKHIPITNNCT 316
>gb|AAO42869.1| At5g42860 [Arabidopsis thaliana]
Length = 320
Score = 341 bits (874), Expect = 2e-92
Identities = 187/323 (57%), Positives = 229/323 (70%), Gaps = 23/323 (7%)
Frame = -3
Query: 1209 MHAKTDSEVTSISASSPARSPPRRPLYFVQSPSRDSHDGEKTVTTSFHSTPVL-SPNASP 1033
MHAKTDSEVTS+SASSP RSP RRP YFVQSPSRDSHDGEKT T SFHSTPVL SP SP
Sbjct: 1 MHAKTDSEVTSLSASSPTRSP-RRPAYFVQSPSRDSHDGEKTAT-SFHSTPVLTSPMGSP 58
Query: 1032 PHS-SSSTRFSAPHRKKDNHHHHHSLKPWKQIDVIEEEGLLQGEDRDR-TLSRRCYFLIF 859
PHS SSS+RFS + K H KQ +IEEEGLL DR++ L RRCY L F
Sbjct: 59 PHSHSSSSRFSKINGSKRKGHAGE-----KQFAMIEEEGLLDDGDREQEALPRRCYVLAF 113
Query: 858 LLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKF 679
++GF LLF FSLIL+ A++P KPKI +KSI F+ ++VQAG D+ G+ TDMI+MN+TL+
Sbjct: 114 IVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMITMNATLRM 173
Query: 678 TYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSG 499
YRNTGTFFG HV S+P++LS+S+I I +G++K+FYQ R+S R V V V+G+KIPLYGSG
Sbjct: 174 LYRNTGTFFGXHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGDKIPLYGSG 233
Query: 498 ASLSS--------------------TTGMPTVPVPLNLNFVLRSRAYVLGKLVKPKYYKR 379
++L P PVP+ LNF +RSRAYVLGKLV+PK+YKR
Sbjct: 234 STLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAYVLGKLVQPKFYKR 293
Query: 378 IQCSITLDPKKLSAPIPLKHSCT 310
I C I + KKLS IP+ ++CT
Sbjct: 294 IVCLINFEHKKLSKHIPITNNCT 316
>emb|CAB53482.1| CAA30379.1 protein [Oryza sativa]
Length = 835
Score = 337 bits (864), Expect = 2e-91
Identities = 180/319 (56%), Positives = 228/319 (71%), Gaps = 14/319 (4%)
Frame = -3
Query: 1221 KEGKMHAKTDSEVTSISASSPARSPPRR---PLYFVQSPSRDSHDGEKTVTTSFHSTPVL 1051
K KMHAKTDSEVTS++ SSP RSP R P+Y+VQSPSRDSHDGEKT T S HSTP L
Sbjct: 517 KTRKMHAKTDSEVTSLAPSSPPRSPTSRGGRPVYYVQSPSRDSHDGEKTAT-SVHSTPAL 575
Query: 1050 SPNASPPHS----SSSTRFSA-PHRKKDNHHHHHSLKP----WKQIDVIEEEGLLQGEDR 898
SP SP HS SSS+RFS P RK D P W++I VIEEEGLL ED
Sbjct: 576 SPMGSPRHSVGRDSSSSRFSGHPKRKGDKSSSGRKGAPAGKGWQEIGVIEEEGLLDDEDE 635
Query: 897 DRTLSRRC-YFLIFLLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDSTG 721
R + +RC YFLIF+LGF++LF+ F+L+LWGASR KP+I IKSI F++ +QAG+D++
Sbjct: 636 RRGIPKRCKYFLIFVLGFVVLFSFFALVLWGASRSQKPQIVIKSITFENFIIQAGTDASL 695
Query: 720 VATDMISMNSTLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVS 541
V TDM + NST+K TYRNTGTFFG+HV + P LSYS++ +A+G++ +FYQ R S R VS
Sbjct: 696 VPTDMATTNSTVKLTYRNTGTFFGIHVTADPFTLSYSQLTLASGDLNKFYQARSSRRTVS 755
Query: 540 VAVMGNKIPLYGSGASLSSTTGMPTV-PVPLNLNFVLRSRAYVLGKLVKPKYYKRIQCSI 364
V VMGNK+PLYG G +L++ G ++ PVP+ L + SRAYVLG LVKPK+ + I+C +
Sbjct: 756 VGVMGNKVPLYGGGPTLTAGKGSGSMAPVPMILRTTVHSRAYVLGALVKPKFTRAIECKV 815
Query: 363 TLDPKKLSAPIPLKHSCTY 307
++P KL+ PI L SC Y
Sbjct: 816 LMNPAKLNKPISLDKSCIY 834
>ref|NP_181730.1| unknown protein; protein id: At2g41990.1 [Arabidopsis thaliana]
gi|25408769|pir||F84848 hypothetical protein At2g41990
[imported] - Arabidopsis thaliana
gi|1871184|gb|AAB63544.1| unknown protein [Arabidopsis
thaliana]
Length = 297
Score = 205 bits (521), Expect = 1e-51
Identities = 128/307 (41%), Positives = 173/307 (55%), Gaps = 8/307 (2%)
Frame = -3
Query: 1209 MHAKTDSEVTSISASSPARSPPR---RPLYFVQSPSRDSHDGEKTVTTSFHSTPVLSPNA 1039
MHAKTDSE TSI A+ A SPPR RPLY+VQSPS +HD EK SF S L +
Sbjct: 1 MHAKTDSEATSIDAA--ALSPPRSAIRPLYYVQSPS--NHDVEKM---SFGSGCSLMGSP 53
Query: 1038 SPPHSSSSTRFSAPHRKKDNHHHHHSLKPWKQID-----VIEEEGLLQGEDRDRTLSRRC 874
+ PH + + +L +K I + + + G D D
Sbjct: 54 THPHYYHCSPIHHSRESSTSRFSDRALLSYKSIRERRRYINDGDDKTDGGDDDDPFRNVR 113
Query: 873 YFLIFLLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMN 694
++ LL + LFT+FSLILWGAS+ PK+ +K + + +QAG+D +GV TDM+S+N
Sbjct: 114 LYVWLLLSVIFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDMLSLN 173
Query: 693 STLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIP 514
ST++ YRN TFF VHV ++PL L YS +++++G M +F R V V G++IP
Sbjct: 174 STVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQGHQIP 233
Query: 513 LYGSGASLSSTTGMPTVPVPLNLNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKKLSAP 334
LYG G S + T+ +PLNL VL S+AY+LG+LV K+Y RI CS TLD L
Sbjct: 234 LYG-GVSFH----LDTLSLPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDANHLPKS 288
Query: 333 IPLKHSC 313
I L SC
Sbjct: 289 ISLLRSC 295
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,141,837,794
Number of Sequences: 1393205
Number of extensions: 27942702
Number of successful extensions: 126781
Number of sequences better than 10.0: 111
Number of HSP's better than 10.0 without gapping: 97870
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 120699
length of database: 448,689,247
effective HSP length: 126
effective length of database: 273,145,417
effective search space used: 79758461764
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
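The search-space figures in this footer can be cross-checked against the reported Expect values. The sketch below assumes the usual BLAST relation E = (effective search space) x 2^(-bit score), and assumes that for a translated query the effective query length is the nucleotide length divided by 3 (rounded down) minus the effective HSP length. Both assumptions reproduce the numbers printed in this report, but the function names are illustrative and the rounding of bit scores means the result only agrees to the leading digit.

```python
# Figures copied from the BLASTX report above.
QUERY_NT = 1256            # "(1256 letters)"
DB_LETTERS = 448_689_247   # "length of database"
DB_SEQS = 1_393_205        # "Number of Sequences"
HSP_LEN = 126              # "effective HSP length"


def effective_db_length(letters, seqs, hsp_len):
    """Database letters minus one HSP length per database sequence."""
    return letters - seqs * hsp_len


def effective_query_length(query_nt, hsp_len):
    """Assumed rule for a translated query: codons minus the HSP length."""
    return query_nt // 3 - hsp_len


def expect(bit_score, search_space):
    """Expected number of chance hits scoring at least bit_score."""
    return search_space * 2.0 ** (-bit_score)


db_eff = effective_db_length(DB_LETTERS, DB_SEQS, HSP_LEN)
q_eff = effective_query_length(QUERY_NT, HSP_LEN)
space = db_eff * q_eff

print(db_eff)  # 273145417, the reported effective length of database
print(space)   # 79758461764, the reported effective search space

# Bit scores and Expect values as listed in the hit summary.
for bits, reported in [(348, "1e-94"), (343, "4e-93"), (341, "2e-92")]:
    print(bits, reported, f"{expect(bits, space):.1e}")
```

The computed values agree with the reported ones after rounding to one significant digit; small differences are expected because the report prints bit scores as integers.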
EST assemble image
     clone         accession   position
1    SPD042a06_f   BP047307    1     581
2    SPD029g01_f   BP046321    62    632
3    MF058g11_f    BP031380    479   969
4    MF017c04_f    BP029136    777   1317
Lotus japonicus
Kazusa DNA Research Institute