KMC006542A_c01
[Fasta Sequence]
[Nr Search]
[EST assemble image]
Fasta Sequence
>KMC006542A_C01 KMC006542A_c01
gcatagagcacaacatcaaattcgacggtaagtcataagatttcaacagaccaaaagatc
gcacttaggaacataaaagaGTTTTAAAAATGTGCTGAGTACTGCATATCAAAACACTGG
TATCATGGGTCCATCCAAACAACAATCCCAGAACATGGTTTATGAAAAAGAAAGTTGAAG
CATTATCAGCAACCCCTTCAATTTCAAAGCTCATAGAGAAGTCCAAATTATATCAACGAT
AGCCTCCTGATGATCCCCTTCGTTCATACGGTCCAGAACGATCACGATGGTATCGATCAC
TTCCAGAATCTCCATGACTACCACCATCCCTGCTGCTGCGCCCACCAGAAGAACGATCTG
CATTTCTATCAGGACCATAGCCACCGCCACTGnTTCGACTTTCCCGGCCACCATACCTTC
CCCCTCCTCGTTCCCCTTCACTAGGACACTCCCTAGCAAAATGACCAGGCTTACCACACT
TAAAGCATTCACCTCCATTAGATCCACGACTACCTCCATAGCCTCGGCTACGATCACGAT
CGCCACGATCATCACGATCACGACCACGCTCTCTGTAGCGATCACCATCATCCCTATCCC
TAGATGATCCTTGTGGCTGAGCTTTATCAACAGTAATAGTACGCCCATCTAAATCCATCC
CATTCATAGCATCAATAGCCTCGTCCATTGCTTTCTTGTCATCAAATGTGACAAATCCAA
ACCCCCGAGAGCGTCCAGAGAACTTGTCAACAACAACCTTTGCTTCCGTAAGCTTGCCAA
ACTTTTCAAATGCATCCTTTAACTTTCTATCAGATGTTGACCAGGcaggccaccaataaa
acagcgatactcgtccacatctgacatcttaaacgattcggaggtggagttgttggagaa
aaggc
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC006542A_C01 KMC006542A_c01
(905 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
pir||JC4817 RNA-binding protein RZ-1 - wood tobacco gi|1395193|d... 263 4e-70
ref|NP_189273.1| glycine-rich RNA-binding protein; protein id: A... 220 6e-58
gb|AAL09710.1| AT3g26420/F20C19_14 [Arabidopsis thaliana] gi|196... 191 2e-49
gb|AAK01176.1| RNA-binding protein [Triticum aestivum] 195 9e-49
ref|NP_196048.1| glycine-rich RNA-binding protein; protein id: A... 129 8e-29
>pir||JC4817 RNA-binding protein RZ-1 - wood tobacco gi|1395193|dbj|BAA12064.1|
RNA-binding protein RZ-1 [Nicotiana sylvestris]
gi|1435062|dbj|BAA06012.1| RNA binding protein, RZ-1
[Nicotiana sylvestris]
Length = 209
Score = 263 bits (673), Expect(2) = 4e-70
Identities = 142/200 (71%), Positives = 156/200 (78%), Gaps = 3/200 (1%)
Frame = -2
Query: 826 AWSTSDRKLKDAFEKFGKLTEAKVVVDKFSGRSRGFGFVTFDDKKAMDEAIDAMNGMDLD 647
+WSTSDR LKDAFEKFG L +AKVV+DKFSGRSRGFGFVTFD+K+AM++AI+AMNG+DLD
Sbjct: 14 SWSTSDRGLKDAFEKFGNLVDAKVVLDKFSGRSRGFGFVTFDEKRAMEDAIEAMNGVDLD 73
Query: 646 GRTITVDKAQP-QGSSRDRDDGDRYRERGRDRDDRGDRDR-SRGYGGSRGS-NGGECFKC 476
GR ITVDKAQP +GS RD D DR R+R RDR DRDR SR YGG RGS GG+CF C
Sbjct: 74 GRDITVDKAQPDKGSGRD-FDSDRPRDRDRDRGRDRDRDRGSRDYGGGRGSGGGGDCFNC 132
Query: 475 GKPGHFARECPSEGERGGGRYGGRESRXSGGGYGPDRNADRSSGGRSSRDGGSHGDSGSD 296
GKPGHFARECPSEG R GGRYGG GYGPDRN DR G RS RDGG G G +
Sbjct: 133 GKPGHFARECPSEGGR-GGRYGGGGGGSRSSGYGPDRNGDR-YGSRSGRDGGGRG--GGE 188
Query: 295 RYHRDRSGPYERRGSSGGYR 236
R+ RDRSGPYERR SSGG R
Sbjct: 189 RFSRDRSGPYERR-SSGGSR 207
Score = 24.6 bits (52), Expect(2) = 4e-70
Identities = 9/10 (90%), Positives = 9/10 (90%)
Frame = -3
Query: 855 DEYRCFIGGL 826
DEYRCFIG L
Sbjct: 4 DEYRCFIGNL 13
>ref|NP_189273.1| glycine-rich RNA-binding protein; protein id: At3g26420.1,
supported by cDNA: gi_15451065, supported by cDNA:
gi_15982738, supported by cDNA: gi_15983476, supported
by cDNA: gi_18377411 [Arabidopsis thaliana]
gi|9294301|dbj|BAB02203.1| contains similarity to
RNA-binding protein~gene_id:F20C19.15 [Arabidopsis
thaliana] gi|15451066|gb|AAK96804.1| Unknown protein
[Arabidopsis thaliana]
gi|15983477|gb|AAL11606.1|AF424613_1 AT3g26420/F20C19_14
[Arabidopsis thaliana] gi|18377412|gb|AAL66872.1|
unknown protein [Arabidopsis thaliana]
Length = 245
Score = 220 bits (560), Expect(2) = 6e-58
Identities = 133/243 (54%), Positives = 152/243 (61%), Gaps = 49/243 (20%)
Frame = -2
Query: 826 AWSTSDRKLKDAFEKFGKLTEAKVVVDKFSGRSRGFGFVTFDDKKAMDEAIDAMNGMDLD 647
AW+TSDR L+DAFEK+G L EAKVV+DKFSGRSRGFGF+TFD+KKAMDEAI AMNGMDLD
Sbjct: 15 AWTTSDRGLRDAFEKYGHLVEAKVVLDKFSGRSRGFGFITFDEKKAMDEAIAAMNGMDLD 74
Query: 646 GRTITVDKAQP-QGSSRDRDDGDRYRERGRDRDDRGDRDRSRGYGGSRGSNGGECFKCGK 470
GRTITVDKAQP QG + +DGDR R+RG DRDRSR GG RG GG+CFKCGK
Sbjct: 75 GRTITVDKAQPHQGGAGRDNDGDRGRDRGY------DRDRSRPSGG-RG--GGDCFKCGK 125
Query: 469 PGHFARECPSEGERGGG-----------------------------RYGGRESRXS---- 389
PGHFARECPSE R GG RYG ++ R S
Sbjct: 126 PGHFARECPSESSRDGGGRFSSKDDRYSSKDDRYGAKDDRYGAKEDRYGAKDDRYSSKDD 185
Query: 388 -------------GGG--YGPDRNADRSSGGRSSRDGGSHGDSGSDRYHRDRSGPYERRG 254
GGG YGPDR+ +R +GGR SRDGGS G G +R+ R PY+R
Sbjct: 186 RYSSKDDRYGSRDGGGSRYGPDRSGER-AGGR-SRDGGSRGAPGGERHSR---APYDRPR 240
Query: 253 SSG 245
+ G
Sbjct: 241 AGG 243
Score = 27.3 bits (59), Expect(2) = 6e-58
Identities = 11/14 (78%), Positives = 12/14 (85%)
Frame = -3
Query: 867 MSDVDEYRCFIGGL 826
MS+ EYRCFIGGL
Sbjct: 1 MSEDPEYRCFIGGL 14
>gb|AAL09710.1| AT3g26420/F20C19_14 [Arabidopsis thaliana]
gi|19699180|gb|AAL90956.1| AT3g26420/F20C19_14
[Arabidopsis thaliana]
Length = 148
Score = 191 bits (486), Expect(2) = 2e-49
Identities = 100/134 (74%), Positives = 110/134 (81%), Gaps = 1/134 (0%)
Frame = -2
Query: 826 AWSTSDRKLKDAFEKFGKLTEAKVVVDKFSGRSRGFGFVTFDDKKAMDEAIDAMNGMDLD 647
AW+TSDR L+DAFEK+G L EAKVV+DKFSGRSRGFGF+TFD+KKAMDEAI AMNGMDLD
Sbjct: 15 AWTTSDRGLRDAFEKYGHLVEAKVVLDKFSGRSRGFGFITFDEKKAMDEAIAAMNGMDLD 74
Query: 646 GRTITVDKAQP-QGSSRDRDDGDRYRERGRDRDDRGDRDRSRGYGGSRGSNGGECFKCGK 470
GRTITVDKAQP QG + +DGDR R+RG DRDRSR GG RG GG+CFKCGK
Sbjct: 75 GRTITVDKAQPHQGGAGRDNDGDRGRDRGY------DRDRSRPSGG-RG--GGDCFKCGK 125
Query: 469 PGHFARECPSEGER 428
PGHFARECPSE R
Sbjct: 126 PGHFARECPSESSR 139
Score = 27.3 bits (59), Expect(2) = 2e-49
Identities = 11/14 (78%), Positives = 12/14 (85%)
Frame = -3
Query: 867 MSDVDEYRCFIGGL 826
MS+ EYRCFIGGL
Sbjct: 1 MSEDPEYRCFIGGL 14
>gb|AAK01176.1| RNA-binding protein [Triticum aestivum]
Length = 183
Score = 195 bits (495), Expect = 9e-49
Identities = 111/181 (61%), Positives = 128/181 (70%)
Frame = -2
Query: 868 DVRCGRVSLFYWWPAWSTSDRKLKDAFEKFGKLTEAKVVVDKFSGRSRGFGFVTFDDKKA 689
D RC SL +W+T+D LKDAF KFG++TE KVV+DKFSGRSRGFGFVTFDDKKA
Sbjct: 6 DYRCFVGSL-----SWNTTDVDLKDAFGKFGRVTETKVVLDKFSGRSRGFGFVTFDDKKA 60
Query: 688 MDEAIDAMNGMDLDGRTITVDKAQPQGSSRDRDDGDRYRERGRDRDDRGDRDRSRGYGGS 509
M+EA++AMNG+DLDGR ITV++AQPQGS R+R DGDR G DR G RD G GG
Sbjct: 61 MEEAVEAMNGIDLDGRNITVERAQPQGSGRNR-DGDRDYRGGGDRYG-GGRDFGGGRGGG 118
Query: 508 RGSNGGECFKCGKPGHFARECPSEGERGGGRYGGRESRXSGGGYGPDRNADRSSGGRSSR 329
RG GG+C+KCGKPGHFARECPS GG RYG R+ R S DR + R SSR
Sbjct: 119 RG-GGGDCYKCGKPGHFARECPSGD--GGDRYGSRDDRYSS---RDDRYSSRDD-RYSSR 171
Query: 328 D 326
D
Sbjct: 172 D 172
>ref|NP_196048.1| glycine-rich RNA-binding protein; protein id: At5g04280.1,
supported by cDNA: gi_20260151 [Arabidopsis thaliana]
gi|20260152|gb|AAM12974.1| RNA-binding protein-like
[Arabidopsis thaliana] gi|21387121|gb|AAM47964.1|
RNA-binding protein-like [Arabidopsis thaliana]
Length = 310
Score = 129 bits (323), Expect = 8e-29
Identities = 73/162 (45%), Positives = 101/162 (62%), Gaps = 8/162 (4%)
Frame = -2
Query: 814 SDRKLKDAFEKFGKLTEAKVVVDKFSGRSRGFGFVTFDDKKAMDEAIDAMNGMDLDGRTI 635
+DR L+ AF +FG + + ++++++ +GRSRGFGF+TF D++AMDE+I M+G D R I
Sbjct: 19 TDRDLERAFSRFGDILDCQIMLERDTGRSRGFGFITFADRRAMDESIREMHGRDFGDRVI 78
Query: 634 TVDKAQPQGSSRDRDDGDRYRERGRDRDDRGDRDRSRGYGGSRGSNGG-----ECFKCGK 470
+V++A+P+ RDDG+ + RG D G +G G G GG ECFKCG+
Sbjct: 79 SVNRAEPK---LGRDDGESHGSRG--GRDSGYSIAGKGSFGGGGGGGGRVGEDECFKCGR 133
Query: 469 PGHFARECPSEGERGGGRYGGRESRXS--GGGYG-PDRNADR 353
GH+AR+CPS G GG GG SR S GG G DR ADR
Sbjct: 134 VGHWARDCPSAGGGRGGPVGGFSSRASAYGGSDGRVDRYADR 175
Score = 42.7 bits (99), Expect = 0.007
Identities = 43/133 (32%), Positives = 54/133 (40%), Gaps = 17/133 (12%)
Frame = -2
Query: 589 DGDRYRERGRDRDDRGDRDRSRGYGGSRGSNGGECFKCGKPGHFARE---CPSEGERGGG 419
D DRY +R R DDR D + YG + E + +A + P++ GG
Sbjct: 174 DRDRYVDRERYIDDR--YDGAARYGARDRFDSREAYI--PRDRYASDRYAAPADRFAGGD 229
Query: 418 RYGGRESRXSGGGY----------GPDRNADRSSGGRSS---RDGGSHGDSGSDRYHRDR 278
RY R G Y P +DR GGR+ R GG G R R R
Sbjct: 230 RYSRGSDRYPPGSYDKARSFERDIAPSAGSDRYGGGRAGGPIRGGGEEG-----RGFRSR 284
Query: 277 SG-PYERRGSSGG 242
+G PYER SGG
Sbjct: 285 AGAPYERPSRSGG 297
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 828,348,622
Number of Sequences: 1393205
Number of extensions: 21094577
Number of successful extensions: 170876
Number of sequences better than 10.0: 6701
Number of HSP's better than 10.0 without gapping: 88832
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 130780
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 49363855696
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
EST assemble image
|
|
|
|
clone |
accession |
position |
1 |
GENLf066c02 |
BP065903 |
1 |
506 |
2 |
MF097c07_f |
BP033342 |
421 |
906 |
|
Lotus japonicus
Kazusa DNA Research Institute