KMC019605A_c01
[Fasta Sequence]
[Nr Search]
[EST assemble image]
Fasta Sequence
>KMC019605A_C01 KMC019605A_c01
attgccccaccagatgcaGGGTGCTGTTGCACTCAGAAGATAGATCTGGTGGTGGATGAA
GCCTCACATGATCCCTATTATTGTTAGAGAAGTCCAGCATGTCGCTACTAAAACTTGTCA
CACAGGATTTTGGAGACCAACAAGACTTTGCTGAGTGAAGTTCTTCGTTTCCATGCCCAT
ACACATAGCTGTTTCCCGAGTTTTCTTGTTTAACATCAATAAAGGAAGCATTTGGAGCCT
GAGTTAGCATTTGATTCTGAAACTGGGCCATAGCACCCTTATCTTCTTCAGCCACAACTC
CACTCATGAGTAGCTGGCTCCATGACTCAGGAACCTCTTGATTTTCTGGCCAAGAAGGAA
AGGGAAGAGAAGAAGAAGAAGAAGGGGGATATGGGGTGAGAAAGTTGGAAGGAGTGGGAA
GGAAAGGAGGAGCTTGTGGTTGTGGAGATGGTGGCCTCATGCTATTGATATTGATATTCC
ACCAGTTAGGGTTCCCAGCCATCATTTGTTGCACTGGGGAGCTTTGAAGAACACCTCTAT
TCATGCCTTGTTGTTCTTCTTCTTCGCTTTGAG
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC019605A_C01 KMC019605A_c01
(573 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_194639.1| putative protein; protein id: At4g29100.1, supp... 103 2e-21
emb|CAC14433.1| putative protein [Brassica napus] 98 6e-20
ref|NP_179599.1| unknown protein; protein id: At2g20090.1 [Arabi... 76 3e-13
gb|AAL92370.2| similar to Dictyostelium discoideum (Slime mold).... 40 0.021
ref|XP_289489.1| hypothetical protein XP_289489 [Mus musculus] 40 0.027
>ref|NP_194639.1| putative protein; protein id: At4g29100.1, supported by cDNA:
gi_19698938 [Arabidopsis thaliana]
gi|7485742|pir||T08965 hypothetical protein F19B15.130 -
Arabidopsis thaliana gi|4972056|emb|CAB43924.1| putative
protein [Arabidopsis thaliana]
gi|7269808|emb|CAB79668.1| putative protein [Arabidopsis
thaliana] gi|19698939|gb|AAL91205.1| putative protein
[Arabidopsis thaliana]
gi|22711852|gb|AAM10966.2|AF488634_1 putative bHLH
transcription factor [Arabidopsis thaliana]
gi|23197826|gb|AAN15440.1| putative protein [Arabidopsis
thaliana]
Length = 407
Score = 103 bits (257), Expect = 2e-21
Identities = 89/242 (36%), Positives = 118/242 (47%), Gaps = 68/242 (28%)
Frame = -3
Query: 544 MNRGVLQSSPVQQMM-AGNPNWWNININSMRPPSP-----QPQAPPFLPTPSNFL----- 398
MNRGVL+SSPVQQ+M AGNPNWWN++ MRPP P Q PP + +N+L
Sbjct: 1 MNRGVLESSPVQQLMAAGNPNWWNVS-GGMRPPPPLMGHQQAPLPPHMTPNNNYLRPRMM 59
Query: 397 -TPYP-------PSSSSSLPFPSWPENQEV----------PESW--SQLLMSGVVAEED- 281
TP+P SSSSS PS P N + PESW SQLL+ G++ E+
Sbjct: 60 PTPFPHFLPSPATSSSSSSSSPSLPNNPNLSSWLESNDLPPESWSLSQLLLGGLMMGEEE 119
Query: 280 ---------------------KGAMAQFQNQMLT-QAPNASFIDVKQE---NSGNSYVYG 176
K + ++ Q+L+ Q + +D+KQE N+ N YV
Sbjct: 120 RLEMMNHHNHHDEQQHHGFQGKIRLENWEEQVLSHQQASMVAVDIKQEGNINNNNGYVIS 179
Query: 175 HGNEELHSAKSCWSPKSCVTSFSSD--------MLDF-SNNNRDHVR--LHPPPDLSSEC 29
N + KSC + + + S+D MLDF SN+N H+ H PPD SSEC
Sbjct: 180 SPNSPPN--KSCVTTTTTTSLNSNDDNINNNNNMLDFSSNHNGLHLSEGRHTPPDRSSEC 237
Query: 28 NS 23
NS
Sbjct: 238 NS 239
>emb|CAC14433.1| putative protein [Brassica napus]
Length = 389
Score = 98.2 bits (243), Expect = 6e-20
Identities = 83/226 (36%), Positives = 113/226 (49%), Gaps = 52/226 (23%)
Frame = -3
Query: 544 MNRGVLQSSPVQQMM-AGNPNWWNININSMRPPSP-----QPQAPPFLPTPSNFLTP--Y 389
MNRG L+SSPVQQ+M AGNPNWWN++ S RPP P Q PP + +N+L P
Sbjct: 1 MNRGALESSPVQQLMVAGNPNWWNVS-GSTRPPPPLMGHQQGPLPPQMTPNNNYLRPRMM 59
Query: 388 PPSSSSSL----PFPSWPENQEV-PESW--SQLLMSGVVAEED----------------- 281
SSS SL SW E+ ++ PESW SQLL+ G++ E+
Sbjct: 60 MTSSSPSLLDNPSLSSWLESNDLPPESWSLSQLLLGGLMMGEEERLEIMNHHSHHDEQHH 119
Query: 280 -----KGAMAQFQNQMLT-QAPNASFIDVKQE---NSGNSYVYGHGNEELHSAKSCWSPK 128
K + ++ Q+L Q + +D+KQE N+ N Y+ N + KSC +
Sbjct: 120 HSFQGKMRLENWEEQVLRHQQASMGVVDIKQESNINNNNGYLISSPNSPPN--KSCVTTT 177
Query: 127 SCVTSFSSD-------MLDFSNN----NRDHVRLHPPPDLSSECNS 23
+ + S+D ML FS+N N +R H PPD SSECNS
Sbjct: 178 TTTSLNSNDNTNNNNNMLGFSSNHNGLNLSEIR-HTPPDRSSECNS 222
>ref|NP_179599.1| unknown protein; protein id: At2g20090.1 [Arabidopsis thaliana]
gi|25411964|pir||H84584 hypothetical protein At2g20090
[imported] - Arabidopsis thaliana
gi|4580464|gb|AAD24388.1| unknown protein [Arabidopsis
thaliana]
Length = 219
Score = 75.9 bits (185), Expect = 3e-13
Identities = 70/197 (35%), Positives = 96/197 (48%), Gaps = 34/197 (17%)
Frame = -3
Query: 544 MNRGVLQSSPVQQMMA-GNPNWWNININSMRPPSP-----QPQAPPFLPT--PSNFLTPY 389
MNRGVL+SSPVQ + A GNPNWWN +RPP+P P F+P+ P+ F +P
Sbjct: 1 MNRGVLESSPVQHLTAAGNPNWWNNVSRGLRPPTPLMSHEPPSTTAFIPSLLPNFFSSPT 60
Query: 388 PPSSSS-SLP-------FPSWPENQEVP--ESW--SQLLMSGVV--AEEDKGAMAQFQNQ 251
SSSS S P F SW E ++P + W SQLL+ G++ EE M +Q
Sbjct: 61 SSSSSSPSFPPPNSNPNFSSWLEMSDLPLDQPWSLSQLLLGGLMMGEEEKMEMMNHHHHQ 120
Query: 250 MLTQAPNASFI------------DVKQENSGNSYVYGHGNEELHSAKSCWSPKSCVTSFS 107
Q+ A I +KQE+S N+ YG + S+ + KSC T +
Sbjct: 121 NQHQSYQAKRIQNWEEQVLRHQASMKQESSNNN-SYG-----IMSSPNSPPNKSCATIIN 174
Query: 106 SDMLDFSNNNRDHVRLH 56
++ NNN H L+
Sbjct: 175 TNE---DNNNNIHSGLN 188
>gb|AAL92370.2| similar to Dictyostelium discoideum (Slime mold). TRFA
Length = 673
Score = 40.0 bits (92), Expect = 0.021
Identities = 39/160 (24%), Positives = 55/160 (34%), Gaps = 25/160 (15%)
Frame = -3
Query: 571 QSEEEEQQGMNRGVLQSSPVQQMMAGNPNW----------------WNININSMRPPSPQ 440
Q +++++Q + + P QQ NPN+ N N + PQ
Sbjct: 202 QQQDQQKQHDQQQHQEQQPQQQQFNNNPNFNGNTTNNSNQFMNGQNIQFNNNDIHQSQPQ 261
Query: 439 PQAPPF-LPTPSNFLTPYPPSSSSSLPFPSWPENQEVPESWSQLLMS----GVVAEEDKG 275
PQ+ P P P P P P P P+ Q P+ QL S G +
Sbjct: 262 PQSQPQPQPQPQPQPQPQPQPQPQPQPQPQQPQPQPQPQQQQQLQFSSNNNGTFNNTNNY 321
Query: 274 AMAQF----QNQMLTQAPNASFIDVKQENSGNSYVYGHGN 167
F N N+ FI+ N+ NSY GN
Sbjct: 322 NNGSFNNNNNNNNNNNNNNSGFINSSNGNNFNSYNNNSGN 361
>ref|XP_289489.1| hypothetical protein XP_289489 [Mus musculus]
Length = 330
Score = 39.7 bits (91), Expect = 0.027
Identities = 23/59 (38%), Positives = 29/59 (48%), Gaps = 1/59 (1%)
Frame = -3
Query: 490 PNWWNININSMRPPSPQPQAP-PFLPTPSNFLTPYPPSSSSSLPFPSWPENQEVPESWS 317
P W++ RPP P PQ P P LP S + P+ + LP SW Q PE+WS
Sbjct: 272 PQPWDV-----RPPQPLPQPPSPLLPRTSAL--DWSPNPPAPLPSLSWVVTQSSPEAWS 323
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 555,271,291
Number of Sequences: 1393205
Number of extensions: 13627375
Number of successful extensions: 86195
Number of sequences better than 10.0: 563
Number of HSP's better than 10.0 without gapping: 60073
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 79970
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21243732558
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
EST assemble image
|
|
|
|
clone |
accession |
position |
1 |
MFB034g01_f |
BP036517 |
1 |
532 |
2 |
MFB077g01_f |
BP039642 |
19 |
573 |
|
Lotus japonicus
Kazusa DNA Research Institute