KMC000785A_c01
[Fasta Sequence]
[Nr Search]
[EST assemble image]
Fasta Sequence
>KMC000785A_C01 KMC000785A_c01
agagaaaaagggttttcgttcgctctcactgttgttctcttctcttctcttctctgcact
ctttcatcatcatccatggcTCAACGCGCCGCCGCTACCTCCGATTCTCCCCTCCGTGTC
GCCGCCGCACCTCCATTGCAGATCTCCAAAGATGTCACAGGCTCTGACAATCCTATTCCA
CTTTCACCACAGTGGCTTCTGCCAAAGCCCGTGGAGAGTAAACCTGGATCAGGAAATGTG
GGAAACAATGTGATCTCAATTCCACCATATGGCACCCACTCAGAGACTGTTAAGACATCT
GGTAATGGAGAGGATGGGCATGATGTCCAAAAGAGAAAGGATGTGTTTAGGCCATCCATG
CTGGACTCAGAAAGTGGACGTCGTGATCGTTGGCGTGATGAGGAGAGAGAAACTAAGTCT
GCCACACGTAAAGATCGTTGGAGGGACGGGGACAAAGACCTAGGTGATTCTCGGAAAGTG
GATCGATGGGCAGATAGTCTTCCTTCAAAGAACTTAGGAGAAGCACGTCGTGGTGCATCT
GATAGTCATAGACGGAATGATTCAGGAAACCGAGAAACTAACTTTGATCAGCGACGTGAG
AGTAAGTGGAACACACGTTGGGGTCCTGACAATAAGGAGCCAGAAGGTCCTCGTGAAAAA
TGGAGcgattctgcaaaagatggtgatatacatctggacaaaggcattgtctcacattcc
aagtcttggaaaggatgagaaggag
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000785A_C01 KMC000785A_c01
(745 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_199109.1| putative protein; protein id: At5g42950.1 [Arab... 235 4e-61
dbj|BAB90655.1| B1033B05.5 [Oryza sativa (japonica cultivar-grou... 102 6e-21
ref|NP_173840.1| unknown protein; protein id: At1g24300.1 [Arabi... 85 1e-15
ref|NP_174063.1| unknown protein; protein id: At1g27430.1 [Arabi... 83 4e-15
pir||F86399 protein F17L21.22 [imported] - Arabidopsis thaliana ... 83 4e-15
>ref|NP_199109.1| putative protein; protein id: At5g42950.1 [Arabidopsis thaliana]
gi|9758584|dbj|BAB09197.1|
gb|AAC80623.1~gene_id:MBD2.15~similar to unknown protein
[Arabidopsis thaliana]
Length = 1714
Score = 235 bits (600), Expect = 4e-61
Identities = 116/192 (60%), Positives = 140/192 (72%)
Frame = +1
Query: 130 PPLQISKDVTGSDNPIPLSPQWLLPKPVESKPGSGNVGNNVISIPPYGTHSETVKTSGNG 309
PP QI KD+ GSDN IPLSPQWLL KP E+K G G N YG HS+ V+T+GNG
Sbjct: 21 PPHQIFKDIQGSDNAIPLSPQWLLSKPGENKTGMGTGDPN-----QYGNHSDVVRTTGNG 75
Query: 310 EDGHDVQKRKDVFRPSMLDSESGRRDRWRDEERETKSATRKDRWRDGDKDLGDSRKVDRW 489
E+ D K+KDVFRPS+LD+ESGRRDRWRDEER+T S+ R DRWR+GDKD GD++KVDRW
Sbjct: 76 EETLDNLKKKDVFRPSLLDAESGRRDRWRDEERDTLSSVRNDRWRNGDKDSGDNKKVDRW 135
Query: 490 ADSLPSKNLGEARRGASDSHRRNDSGNRETNFDQRRESKWNTRWGPDNKEPEGPREKWSD 669
+ P GE RRG +D R DSGN++ +QRRESKWN+RWGPD+KE E PR KW +
Sbjct: 136 DNVAP--KFGEQRRGPND--RWTDSGNKDAAPEQRRESKWNSRWGPDDKEAEIPRNKWDE 191
Query: 670 SAKDGDIHLDKG 705
KDG+I +KG
Sbjct: 192 PGKDGEIIREKG 203
>dbj|BAB90655.1| B1033B05.5 [Oryza sativa (japonica cultivar-group)]
gi|20161927|dbj|BAB90838.1| P0592G05.24 [Oryza sativa
(japonica cultivar-group)]
Length = 1530
Score = 102 bits (254), Expect = 6e-21
Identities = 71/200 (35%), Positives = 110/200 (54%)
Frame = +1
Query: 103 DSPLRVAAAPPLQISKDVTGSDNPIPLSPQWLLPKPVESKPGSGNVGNNVISIPPYGTHS 282
D AA + SKD DN IPLSPQWL KP ++K +G+ + P +
Sbjct: 23 DDMAAAAAIGFVDESKDQQHLDNSIPLSPQWLYAKPTDAKI----LGHGSLLDP---SEK 75
Query: 283 ETVKTSGNGEDGHDVQKRKDVFRPSMLDSESGRRDRWRDEERETKSATRKDRWRDGDKDL 462
E G + ++R++VF D++S R W +EERET R++R ++ D+D+
Sbjct: 76 EVRMPEGAADKK---ERRRNVF-----DADSSLR--WLEEERETSLPGRRERKKEVDRDM 125
Query: 463 GDSRKVDRWADSLPSKNLGEARRGASDSHRRNDSGNRETNFDQRRESKWNTRWGPDNKEP 642
+SRK DR +D++ ++ G++R A S R ND R + + RR+ KW++RWGPD+KE
Sbjct: 126 -ESRKNDRRSDNVSVRDGGDSR--APPSERWNDGSTRGSGNEGRRDGKWSSRWGPDDKEK 182
Query: 643 EGPREKWSDSAKDGDIHLDK 702
+ EK D+ KD + H +K
Sbjct: 183 DSRSEKKLDAEKD-ESHAEK 201
>ref|NP_173840.1| unknown protein; protein id: At1g24300.1 [Arabidopsis thaliana]
gi|7486371|pir||T00661 hypothetical protein F3I6.24 -
Arabidopsis thaliana gi|2829883|gb|AAC00591.1| Unknown
protein [Arabidopsis thaliana]
Length = 1417
Score = 84.7 bits (208), Expect = 1e-15
Identities = 62/175 (35%), Positives = 85/175 (48%), Gaps = 9/175 (5%)
Frame = +1
Query: 163 SDNPIPLSPQWLLPKPVESKPGSGNVGNNVISIPPYGTHSETVKTSGNGEDGH------- 321
SDN IPLSPQWL K ESK S T GN D +
Sbjct: 26 SDNSIPLSPQWLYTKSSESK---------------MDVRSPTPMPMGNPSDPNLKDAWRL 70
Query: 322 DVQKRKDVFRPSMLDSESGRRDRWRDEERETK--SATRKDRWRDGDKDLGDSRKVDRWAD 495
D + K ++ + ++E+ RR WR+EERET A + DR RK +R D
Sbjct: 71 DAPEDKKDWKKIVSENETNRR--WREEERETGLLGARKVDR-----------RKTERRID 117
Query: 496 SLPSKNLGEARRGASDSHRRNDSGNRETNFDQRRESKWNTRWGPDNKEPEGPREK 660
++ S+ GE + A+ S R ND +R + RR++KW++RWGPD+KE E EK
Sbjct: 118 NVSSRETGEVKTTAA-SDRWNDVNSRAAVHEPRRDNKWSSRWGPDDKEKEARCEK 171
>ref|NP_174063.1| unknown protein; protein id: At1g27430.1 [Arabidopsis thaliana]
Length = 1453
Score = 83.2 bits (204), Expect = 4e-15
Identities = 59/173 (34%), Positives = 85/173 (49%), Gaps = 7/173 (4%)
Frame = +1
Query: 163 SDNPIPLSPQWLLPKPVESKPGSGNVGNNVISIPPYGTHSETVKTSGNGEDGHDVQKRKD 342
SDN IPLSPQWL K E K S T GN D + KD
Sbjct: 26 SDNSIPLSPQWLYTKSSEYK---------------MDVRSPTPVPMGNPSDPNP----KD 66
Query: 343 VFRPSMLDSESGRRDRWRDEERETKSATRKDRWRDGDKDLG-------DSRKVDRWADSL 501
+R LD+ ++D W+ E +++ R WR+ +++ G D RK +R DS+
Sbjct: 67 AWR---LDAPEDKKD-WKKIVHENETSRR---WREEERETGLLGARKVDRRKTERRIDSV 119
Query: 502 PSKNLGEARRGASDSHRRNDSGNRETNFDQRRESKWNTRWGPDNKEPEGPREK 660
S+ G+ + A+ S R ND +R + RR++KW++RWGPD+KE E EK
Sbjct: 120 SSRETGDIKNAAA-SDRWNDVNSRAAVHEPRRDNKWSSRWGPDDKEKEARCEK 171
>pir||F86399 protein F17L21.22 [imported] - Arabidopsis thaliana
gi|9802542|gb|AAF99744.1|AC004557_23 F17L21.22
[Arabidopsis thaliana]
Length = 1475
Score = 83.2 bits (204), Expect = 4e-15
Identities = 59/173 (34%), Positives = 85/173 (49%), Gaps = 7/173 (4%)
Frame = +1
Query: 163 SDNPIPLSPQWLLPKPVESKPGSGNVGNNVISIPPYGTHSETVKTSGNGEDGHDVQKRKD 342
SDN IPLSPQWL K E K S T GN D + KD
Sbjct: 26 SDNSIPLSPQWLYTKSSEYK---------------MDVRSPTPVPMGNPSDPNP----KD 66
Query: 343 VFRPSMLDSESGRRDRWRDEERETKSATRKDRWRDGDKDLG-------DSRKVDRWADSL 501
+R LD+ ++D W+ E +++ R WR+ +++ G D RK +R DS+
Sbjct: 67 AWR---LDAPEDKKD-WKKIVHENETSRR---WREEERETGLLGARKVDRRKTERRIDSV 119
Query: 502 PSKNLGEARRGASDSHRRNDSGNRETNFDQRRESKWNTRWGPDNKEPEGPREK 660
S+ G+ + A+ S R ND +R + RR++KW++RWGPD+KE E EK
Sbjct: 120 SSRETGDIKNAAA-SDRWNDVNSRAAVHEPRRDNKWSSRWGPDDKEKEARCEK 171
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.311 0.130 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 724,368,250
Number of Sequences: 1393205
Number of extensions: 18085235
Number of successful extensions: 57946
Number of sequences better than 10.0: 513
Number of HSP's better than 10.0 without gapping: 49493
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 55575
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 35751090169
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
EST assemble image
|
|
|
|
clone |
accession |
position |
1 |
GENLf036c09 |
BP064228 |
1 |
554 |
2 |
GENLf081d11 |
BP066753 |
397 |
745 |
|
Lotus japonicus
Kazusa DNA Research Institute