KMC009409A_c01
[Fasta Sequence]
[Nr Search]
[EST assemble image]
Fasta Sequence
>KMC009409A_C01 KMC009409A_c01
atacttggaattaaaGCCCCAAATGGTTGCCAAACGGCACTTACAACATGTGAGACTAAA
TCTCAGGGAGTGCTACAACAAATGCACTAAAACATAAGAAAATGAAACTCAAACATGCAC
ACTCAGCTGACTCAATCAGCATAAATCATGAAGTAAAACCTTAAAAGGCCAGACAGAAAC
CTCTCAAAACCCAAATTAACAGAAAGCTAAATAACTTCTAAGCAGTTAAAATATATCTTG
TTTGATCGGTGCTCCCAAAGCAACAACACAACCGGCTCGTGCCGGTGTTGCAAGTAGTGC
AGGCTTCAGCGGCAACAAAGGGGGGTCATCCAACAGATCTGACAACAAAGAGAAGGGAGA
AGCTCCCGCAGATCCCAAGCAAAACAGCAACAAAGGTTGCAAGGACGGGGGAAGCGGCAA
AATAAGGAGGCGACCGAGGTGGGGGCAGTGGCGGAAATGGGTGGCGCAGCGGTTGGACCA
GTAGCAGCAGCGGCGGCGGTGTTTACAGGTGATCGGAAGCAGAGGAGGATGAACATGTTC
A
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC009409A_C01 KMC009409A_c01
(541 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB75628.1| hypothetical protein L8325.01 [Leishmania major] 38 0.004
ref|NP_180614.1| putative glycine-rich protein; protein id: At2g... 32 0.006
sp|Q9R0I7|ZAP3_MOUSE Nuclear protein ZAP3 gi|6016842|dbj|BAA8518... 30 0.055
ref|XP_126978.3| ZAP3 protein [Mus musculus] 30 0.055
emb|CAD21122.1| related to lariat-debranching enzyme [Neurospora... 38 0.090
>emb|CAB75628.1| hypothetical protein L8325.01 [Leishmania major]
Length = 664
Score = 37.7 bits (86), Expect(2) = 0.004
Identities = 29/69 (42%), Positives = 36/69 (52%), Gaps = 11/69 (15%)
Frame = +2
Query: 263 TTQPARAGVASS----AGFSGNKGGSSNR----SDNKEKGEAPADPKQNSN---KGCKDG 409
T +PA G A+S AG SG G S+ SDN+E G P+Q S+ +G K G
Sbjct: 359 TVRPAAKGPAASSSPAAGASGVLGTSAQGRGYGSDNEEAGNNAYMPQQPSSAVPRGAKAG 418
Query: 410 GSGKIRRRP 436
GSG RRP
Sbjct: 419 GSGAASRRP 427
Score = 23.5 bits (49), Expect(2) = 0.004
Identities = 12/29 (41%), Positives = 14/29 (47%), Gaps = 4/29 (13%)
Frame = +3
Query: 426 GGDRGGGSGGNGWRS----GWTSSSSGGG 500
GG GGG G N W + G +S GG
Sbjct: 451 GGGGGGGGGMNDWANAYPPGMVPPTSTGG 479
>ref|NP_180614.1| putative glycine-rich protein; protein id: At2g30560.1 [Arabidopsis
thaliana] gi|25296180|pir||H84709 probable glycine-rich
protein [imported] - Arabidopsis thaliana
gi|1946363|gb|AAB63081.1| putative glycine-rich protein
[Arabidopsis thaliana]
Length = 171
Score = 32.0 bits (71), Expect(2) = 0.006
Identities = 17/33 (51%), Positives = 18/33 (54%)
Frame = +3
Query: 408 GEAAK*GGDRGGGSGGNGWRSGWTSSSSGGGVY 506
G K GG GGG GG G +SG SGGG Y
Sbjct: 110 GGGGKNGGGCGGGGGGKGGKSG---GGSGGGGY 139
Score = 28.9 bits (63), Expect(2) = 0.006
Identities = 18/50 (36%), Positives = 21/50 (42%), Gaps = 5/50 (10%)
Frame = +2
Query: 284 GVASSAGFSGNKGG-----SSNRSDNKEKGEAPADPKQNSNKGCKDGGSG 418
G +G G GG SNRS + +DPK S G K GG G
Sbjct: 38 GGGGKSGGGGGGGGYMVAPGSNRSSYISRDNFESDPKGGSGGGGKGGGGG 87
Score = 28.1 bits (61), Expect(2) = 1.7
Identities = 16/33 (48%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Frame = +3
Query: 405 TGEAAK*GGDRGGGSGGN-GWRSGWTSSSSGGG 500
+G K GG GG SGG G +SG SGGG
Sbjct: 77 SGGGGKGGGGGGGISGGGAGGKSGCGGGKSGGG 109
Score = 24.3 bits (51), Expect(2) = 1.7
Identities = 14/45 (31%), Positives = 14/45 (31%)
Frame = +2
Query: 284 GVASSAGFSGNKGGSSNRSDNKEKGEAPADPKQNSNKGCKDGGSG 418
G S G KGG S G K G K GG G
Sbjct: 3 GKGGSGSGGGGKGGGGGGSGGGRGGGGGGGAKGGCGGGGKSGGGG 47
>sp|Q9R0I7|ZAP3_MOUSE Nuclear protein ZAP3 gi|6016842|dbj|BAA85182.1| nuclear protein ZAP
[Mus musculus]
Length = 1386
Score = 29.6 bits (65), Expect(2) = 0.055
Identities = 19/43 (44%), Positives = 21/43 (48%)
Frame = -2
Query: 411 PPSLQPLLLFCLGSAGASPFSLLSDLLDDPPLLPLKPALLATP 283
PP L P L SAG+ P L PP+LPL P ATP
Sbjct: 551 PPVLPPP---ALSSAGSPPVLPPPALPGGPPILPLPPLSSATP 590
Score = 27.7 bits (60), Expect(2) = 0.055
Identities = 12/30 (40%), Positives = 15/30 (50%)
Frame = -1
Query: 499 PPPLLLLVQPLRHPFPPLPPPRSPPYFAAS 410
PPP++ P P P +PPP PP S
Sbjct: 507 PPPVMPPALPTSIPPPGMPPPVMPPSLPTS 536
>ref|XP_126978.3| ZAP3 protein [Mus musculus]
Length = 1386
Score = 29.6 bits (65), Expect(2) = 0.055
Identities = 19/43 (44%), Positives = 21/43 (48%)
Frame = -2
Query: 411 PPSLQPLLLFCLGSAGASPFSLLSDLLDDPPLLPLKPALLATP 283
PP L P L SAG+ P L PP+LPL P ATP
Sbjct: 551 PPVLPPP---ALSSAGSPPVLPPPALPGGPPILPLPPLSSATP 590
Score = 27.7 bits (60), Expect(2) = 0.055
Identities = 12/30 (40%), Positives = 15/30 (50%)
Frame = -1
Query: 499 PPPLLLLVQPLRHPFPPLPPPRSPPYFAAS 410
PPP++ P P P +PPP PP S
Sbjct: 507 PPPVMPPALPTSIPPPGMPPPVMPPSLPTS 536
>emb|CAD21122.1| related to lariat-debranching enzyme [Neurospora crassa]
gi|28917935|gb|EAA27620.1| hypothetical protein
[Neurospora crassa]
Length = 617
Score = 37.7 bits (86), Expect = 0.090
Identities = 19/51 (37%), Positives = 23/51 (44%)
Frame = +3
Query: 348 KRREKLPQIPSKTATKVARTGEAAK*GGDRGGGSGGNGWRSGWTSSSSGGG 500
+R + Q P A + G + GG GGG GG GWR G GGG
Sbjct: 551 EREGRRVQGPRPMAERFGGRGGGHRGGGRGGGGGGGGGWRGGRGGRGGGGG 601
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 505,101,617
Number of Sequences: 1393205
Number of extensions: 12291288
Number of successful extensions: 125586
Number of sequences better than 10.0: 420
Number of HSP's better than 10.0 without gapping: 64852
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 108820
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18462123008
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
EST assemble image
|
|
|
|
clone |
accession |
position |
1 |
MFB084e04_f |
BP040151 |
1 |
541 |
2 |
MFB057b08_f |
BP038110 |
16 |
445 |
|
Lotus japonicus
Kazusa DNA Research Institute