
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0079c.14
(171 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAW30015.1| At2g15695 [Arabidopsis thaliana] gi|54606848|gb|A... 166 2e-40
pir||B84532 hypothetical protein At2g15690 [imported] - Arabidop... 166 2e-40
gb|AAP68297.1| At5g44250 [Arabidopsis thaliana] gi|21555160|gb|A... 155 5e-37
dbj|BAD35195.1| unknown protein [Oryza sativa (japonica cultivar... 148 5e-35
gb|EAL28413.1| GA18044-PA [Drosophila pseudoobscura] 36 0.37
ref|XP_644620.1| hypothetical protein DDB0203357 [Dictyostelium ... 36 0.48
ref|XP_482664.1| hypothetical protein [Oryza sativa (japonica cu... 35 1.1
ref|XP_638058.1| hypothetical protein DDB0186498 [Dictyostelium ... 35 1.1
ref|XP_392885.2| PREDICTED: similar to ENSANGP00000006433 [Apis ... 35 1.1
gb|AAN14142.1| CG9983-PC, isoform C [Drosophila melanogaster] gi... 33 2.4
gb|AAF56800.2| CG9983-PA, isoform A [Drosophila melanogaster] gi... 33 2.4
ref|XP_233001.3| PREDICTED: similar to Exosome component 3 [Ratt... 33 2.4
gb|AAN14144.1| CG9983-PF, isoform F [Drosophila melanogaster] gi... 33 2.4
gb|AAN14141.1| CG9983-PE, isoform E [Drosophila melanogaster] gi... 33 2.4
gb|EAA75460.1| hypothetical protein FG05224.1 [Gibberella zeae P... 33 4.1
emb|CAD20810.1| Keratin 13 [Oncorhynchus mykiss] 32 5.4
gb|AAG23220.1| glycine-rich RNA-binding protein [Sorghum bicolor] 32 5.4
ref|XP_321405.2| ENSANGP00000008532 [Anopheles gambiae str. PEST... 32 9.1
gb|EAL93089.1| cytochrome P450, putative [Aspergillus fumigatus ... 32 9.1
ref|XP_345647.2| PREDICTED: similar to growth differentiation fa... 32 9.1
>gb|AAW30015.1| At2g15695 [Arabidopsis thaliana] gi|54606848|gb|AAV34772.1|
At2g15695 [Arabidopsis thaliana]
gi|20197706|gb|AAM15216.1| unknown protein [Arabidopsis
thaliana] gi|18397898|ref|NP_565378.1| expressed protein
[Arabidopsis thaliana]
Length = 420
Score = 166 bits (420), Expect = 2e-40
Identities = 80/165 (48%), Positives = 109/165 (65%), Gaps = 11/165 (6%)
Query: 17 VRGGNIYWGRKHATDFR-----------GIVVIFAWVSVPQTLLQDFVNLYSSLGWNSLV 65
+ GG +YWG+K + G+VVIF W S+ + L +FV+LYSSLGWNSLV
Sbjct: 2 IGGGRVYWGKKLDKEMEDAAVVDGGGSNGVVVIFVWSSINENQLMNFVDLYSSLGWNSLV 61
Query: 66 CYAHYLSAFRDESTVPLAFCVVDELIEELKTKSCPVVFAAFSAGSKACLYKVLQLIDGRC 125
C A +L+A E + LAF ++ EL+EELK++ CPV+F AFS KAC+YKVLQ+I C
Sbjct: 62 CRADFLTAVYPEMALSLAFHLLSELVEELKSRPCPVIFLAFSGAPKACMYKVLQVIMDDC 121
Query: 126 ETPHCLHNYHLLRNCVSGHIYDSGPLDVTSDFGFRFALHPSMAKV 170
E + L+R C+SGH+YDSGPLD TSD +FALHP++ ++
Sbjct: 122 EAQIHPDDSQLVRTCLSGHVYDSGPLDFTSDLNVKFALHPTIRRM 166
>pir||B84532 hypothetical protein At2g15690 [imported] - Arabidopsis thaliana
Length = 989
Score = 166 bits (420), Expect = 2e-40
Identities = 80/165 (48%), Positives = 109/165 (65%), Gaps = 11/165 (6%)
Query: 17 VRGGNIYWGRKHATDFR-----------GIVVIFAWVSVPQTLLQDFVNLYSSLGWNSLV 65
+ GG +YWG+K + G+VVIF W S+ + L +FV+LYSSLGWNSLV
Sbjct: 2 IGGGRVYWGKKLDKEMEDAAVVDGGGSNGVVVIFVWSSINENQLMNFVDLYSSLGWNSLV 61
Query: 66 CYAHYLSAFRDESTVPLAFCVVDELIEELKTKSCPVVFAAFSAGSKACLYKVLQLIDGRC 125
C A +L+A E + LAF ++ EL+EELK++ CPV+F AFS KAC+YKVLQ+I C
Sbjct: 62 CRADFLTAVYPEMALSLAFHLLSELVEELKSRPCPVIFLAFSGAPKACMYKVLQVIMDDC 121
Query: 126 ETPHCLHNYHLLRNCVSGHIYDSGPLDVTSDFGFRFALHPSMAKV 170
E + L+R C+SGH+YDSGPLD TSD +FALHP++ ++
Sbjct: 122 EAQIHPDDSQLVRTCLSGHVYDSGPLDFTSDLNVKFALHPTIRRM 166
>gb|AAP68297.1| At5g44250 [Arabidopsis thaliana] gi|21555160|gb|AAM63792.1| unknown
[Arabidopsis thaliana] gi|9759526|dbj|BAB10992.1|
unnamed protein product [Arabidopsis thaliana]
gi|15241450|ref|NP_199238.1| expressed protein
[Arabidopsis thaliana] gi|15450862|gb|AAK96702.1|
Unknown protein [Arabidopsis thaliana]
Length = 403
Score = 155 bits (391), Expect = 5e-37
Identities = 76/154 (49%), Positives = 106/154 (68%), Gaps = 2/154 (1%)
Query: 19 GGNIYWGRKHAT--DFRGIVVIFAWVSVPQTLLQDFVNLYSSLGWNSLVCYAHYLSAFRD 76
GGN YW +K+ + IVV+FAW+S + L++ V+LYSSL W+SLVC++ +L+ F
Sbjct: 5 GGNYYWRKKNNNGGESEAIVVVFAWMSSEERNLKNHVDLYSSLLWDSLVCHSQFLNMFLP 64
Query: 77 ESTVPLAFCVVDELIEELKTKSCPVVFAAFSAGSKACLYKVLQLIDGRCETPHCLHNYHL 136
+ LA VV EL++ELK K P+VFA+FS G AC+YKVLQ+++G CET + L
Sbjct: 65 DKAADLASNVVSELVKELKAKPVPLVFASFSGGPNACMYKVLQILEGTCETGLNPDDCRL 124
Query: 137 LRNCVSGHIYDSGPLDVTSDFGFRFALHPSMAKV 170
+RNC+SG IYDS P+D TSD G R A+HP+ K+
Sbjct: 125 VRNCISGFIYDSCPVDFTSDLGARLAVHPTTLKM 158
>dbj|BAD35195.1| unknown protein [Oryza sativa (japonica cultivar-group)]
Length = 405
Score = 148 bits (374), Expect = 5e-35
Identities = 72/156 (46%), Positives = 95/156 (60%), Gaps = 7/156 (4%)
Query: 14 GCAVRGGNIYWG----RKHATDFRGIVVIFAWVSVPQTLLQDFVNLYSSLGWNSLVCYAH 69
GC GG YW A RG+VV+F WV + L+ FV LY+SLGW LVC+
Sbjct: 4 GC---GGRFYWAPAPPSPSAAGARGVVVVFGWVWSDEAQLRPFVELYASLGWRCLVCHPD 60
Query: 70 YLSAFRDESTVPLAFCVVDELIEELKTKSCPVVFAAFSAGSKACLYKVLQLIDGRCETPH 129
++ + E LA V+ EL++E K K P VFA+FS GSK C+YKV+QL+DG CE
Sbjct: 61 LVALYLSEKAASLASGVISELVKEFKVKPLPTVFASFSGGSKGCMYKVIQLLDGNCEGDA 120
Query: 130 CLHNYHLLRNCVSGHIYDSGPLDVTSDFGFRFALHP 165
+ +Y L+RNC+ G IYDSGP+D SD G +F +P
Sbjct: 121 TMKDYRLVRNCICGQIYDSGPVDFFSDVGTQFLQNP 156
>gb|EAL28413.1| GA18044-PA [Drosophila pseudoobscura]
Length = 772
Score = 36.2 bits (82), Expect = 0.37
Identities = 43/155 (27%), Positives = 59/155 (37%), Gaps = 22/155 (14%)
Query: 2 RGAESEGGGGGGGCAVRGGNI---YWGRK---------HATDFRGIVVIFAWV-SVPQTL 48
RGA S GGGGGGG V G +I W RK IV IF+W+ S
Sbjct: 364 RGASSGGGGGGGGGTVPGKSIGPPPWNRKGPFRCGPYFDRLPDEAIVRIFSWLDSCELCT 423
Query: 49 LQDFVNLYSSLGWNSLV--CYAHYLSAFRDESTVPLAFCVVDELIEELKTKSCPVVFAAF 106
+ + + W ++ C + T+ + F +L + +CP V
Sbjct: 424 VARVCRRFEQVAWRPVLWKCITLRGEHLNGDKTLKMIF---RQLCGQSCNGACPEVERVM 480
Query: 107 SAGSKACLYKVLQLIDGRCETPHCLHNYHLLRNCV 141
A K LQL+ RC P H L+ CV
Sbjct: 481 LADGCRISDKGLQLLTRRC--PELTHLQ--LQTCV 511
>ref|XP_644620.1| hypothetical protein DDB0203357 [Dictyostelium discoideum]
gi|66822551|ref|XP_644630.1| hypothetical protein
DDB0217163 [Dictyostelium discoideum]
gi|60472754|gb|EAL70704.1| hypothetical protein
DDB0217163 [Dictyostelium discoideum]
gi|60472736|gb|EAL70686.1| hypothetical protein
DDB0203357 [Dictyostelium discoideum]
Length = 354
Score = 35.8 bits (81), Expect = 0.48
Identities = 26/126 (20%), Positives = 56/126 (43%), Gaps = 8/126 (6%)
Query: 35 IVVIFAWVSVPQTLLQDFVNLYSSLGWNSLVCYAHYLSAFRDESTVPLAFCVVDELIEEL 94
+ +I W+ L + LY G+ ++ YL F + LA+ ++ L+ E
Sbjct: 91 MAIIVGWIKSNPKHLNKYSKLYLDNGFVTISFSPSYLCHFFPKKMKELAYNFLEFLVSEN 150
Query: 95 KTKSCPVVFAAFSAGSKACLYKVLQLIDGRCETPHCLHNYHLLRNCVSGHIYDSGPLDVT 154
+ + P++F FS G+ ++ +L++ + + L + G I+DS P ++
Sbjct: 151 EKIARPIIFQVFS-GNMVFQSEIFKLLNEEIK-------FKKLIPFIKGQIFDSCPSKIS 202
Query: 155 SDFGFR 160
+ F+
Sbjct: 203 EEQAFQ 208
>ref|XP_482664.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
gi|42409449|dbj|BAD09806.1| hypothetical protein [Oryza
sativa (japonica cultivar-group)]
gi|42408340|dbj|BAD09493.1| hypothetical protein [Oryza
sativa (japonica cultivar-group)]
Length = 330
Score = 34.7 bits (78), Expect = 1.1
Identities = 13/21 (61%), Positives = 15/21 (70%)
Query: 4 AESEGGGGGGGCAVRGGNIYW 24
A S+GGGGGGGC GG Y+
Sbjct: 47 ASSDGGGGGGGCRSNGGRFYF 67
>ref|XP_638058.1| hypothetical protein DDB0186498 [Dictyostelium discoideum]
gi|60466509|gb|EAL64561.1| hypothetical protein
DDB0186498 [Dictyostelium discoideum]
Length = 305
Score = 34.7 bits (78), Expect = 1.1
Identities = 27/120 (22%), Positives = 52/120 (42%), Gaps = 9/120 (7%)
Query: 33 RGIVVIFAWVSVPQTLLQDFVNLYSSLGWNSLVCYAHYLSAFRDESTVPLAFCVVDELIE 92
R I ++ W+ Q LL ++NL++S G+N+ A Y A ++ ++
Sbjct: 60 RPISIVLGWMGSTQKLLLKYINLWTSRGFNTFSYRADYFETLAILGLRLKAMHMLKQIST 119
Query: 93 ELKTK-SC-PVVFAAFSAGSKACLYKVLQLIDGRCETPHCLHNYHLLRNCVSGHIYDSGP 150
LK + +C ++F FS G + +++ + E Y + + G + DS P
Sbjct: 120 YLKERPNCDTIIFHIFSNGGGFLYWALIEFMLANDE-------YKFIHPMIKGVVMDSLP 172
>ref|XP_392885.2| PREDICTED: similar to ENSANGP00000006433 [Apis mellifera]
Length = 1685
Score = 34.7 bits (78), Expect = 1.1
Identities = 17/55 (30%), Positives = 30/55 (53%), Gaps = 1/55 (1%)
Query: 60 GWNSLVCYAHYLSAFRDESTVPLAFCVVDELIEELKTKSCPVVFAAFSAGSKACL 114
GW ++ H+ ++ RDE+ V LAF + ++I EL + ++ +F K CL
Sbjct: 1137 GWKNIFSVFHHAASDRDEAVVELAFSMTGKIINELYAEDFSIMVDSFQDAVK-CL 1190
>gb|AAN14142.1| CG9983-PC, isoform C [Drosophila melanogaster]
gi|7301687|gb|AAF56801.1| CG9983-PB, isoform B
[Drosophila melanogaster] gi|24650836|ref|NP_733251.1|
CG9983-PC, isoform C [Drosophila melanogaster]
gi|17738267|ref|NP_524543.1| CG9983-PB, isoform B
[Drosophila melanogaster] gi|133253|sp|P07909|ROA1_DROME
Heterogeneous nuclear ribonucleoprotein A1 (hnRNP core
protein A1-A) (PEN repeat clone P9)
gi|908757|gb|AAA70426.1| unknown protein
gi|157654|gb|AAA28624.1| nulcear ribonucleoprotein
Length = 365
Score = 33.5 bits (75), Expect = 2.4
Identities = 15/29 (51%), Positives = 16/29 (54%)
Query: 3 GAESEGGGGGGGCAVRGGNIYWGRKHATD 31
G S GGGGGGG GGN WG + D
Sbjct: 257 GNNSFGGGGGGGGGYGGGNNSWGNNNPWD 285
>gb|AAF56800.2| CG9983-PA, isoform A [Drosophila melanogaster]
gi|24650831|ref|NP_733249.1| CG9983-PA, isoform A
[Drosophila melanogaster] gi|157652|gb|AAA28622.1|
nuclear ribonucleoprotein
Length = 364
Score = 33.5 bits (75), Expect = 2.4
Identities = 15/29 (51%), Positives = 16/29 (54%)
Query: 3 GAESEGGGGGGGCAVRGGNIYWGRKHATD 31
G S GGGGGGG GGN WG + D
Sbjct: 256 GNNSFGGGGGGGGGYGGGNNSWGNNNPWD 284
>ref|XP_233001.3| PREDICTED: similar to Exosome component 3 [Rattus norvegicus]
Length = 287
Score = 33.5 bits (75), Expect = 2.4
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 1 MRGAESEGGGGGGGCAVRGGNIYW 24
+R E GGGGGGG GG +YW
Sbjct: 90 LRHKEPSGGGGGGGGGGGGGGVYW 113
>gb|AAN14144.1| CG9983-PF, isoform F [Drosophila melanogaster]
gi|23172512|gb|AAN14143.1| CG9983-PD, isoform D
[Drosophila melanogaster] gi|24650840|ref|NP_733253.1|
CG9983-PF, isoform F [Drosophila melanogaster]
gi|24650838|ref|NP_733252.1| CG9983-PD, isoform D
[Drosophila melanogaster] gi|16769554|gb|AAL28996.1|
LD38464p [Drosophila melanogaster]
gi|157651|gb|AAA28621.1| nuclear ribonucleoprotein
Length = 361
Score = 33.5 bits (75), Expect = 2.4
Identities = 15/29 (51%), Positives = 16/29 (54%)
Query: 3 GAESEGGGGGGGCAVRGGNIYWGRKHATD 31
G S GGGGGGG GGN WG + D
Sbjct: 253 GNNSFGGGGGGGGGYGGGNNSWGNNNPWD 281
>gb|AAN14141.1| CG9983-PE, isoform E [Drosophila melanogaster]
gi|24650833|ref|NP_733250.1| CG9983-PE, isoform E
[Drosophila melanogaster] gi|157653|gb|AAA28623.1|
nuclear ribonucleoprotein
Length = 360
Score = 33.5 bits (75), Expect = 2.4
Identities = 15/29 (51%), Positives = 16/29 (54%)
Query: 3 GAESEGGGGGGGCAVRGGNIYWGRKHATD 31
G S GGGGGGG GGN WG + D
Sbjct: 252 GNNSFGGGGGGGGGYGGGNNSWGNNNPWD 280
>gb|EAA75460.1| hypothetical protein FG05224.1 [Gibberella zeae PH-1]
gi|46121691|ref|XP_385400.1| hypothetical protein
FG05224.1 [Gibberella zeae PH-1]
Length = 657
Score = 32.7 bits (73), Expect = 4.1
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 4/35 (11%)
Query: 2 RGAESEGGGGGGGCA----VRGGNIYWGRKHATDF 32
+G + EGGGGGGG + + GG IY+G A +F
Sbjct: 11 QGEQREGGGGGGGFSFNKLILGGAIYFGLNAAMNF 45
>emb|CAD20810.1| Keratin 13 [Oncorhynchus mykiss]
Length = 492
Score = 32.3 bits (72), Expect = 5.4
Identities = 13/20 (65%), Positives = 16/20 (80%)
Query: 6 SEGGGGGGGCAVRGGNIYWG 25
S GGGGGGG A+R G++Y G
Sbjct: 41 SMGGGGGGGGAMRSGSVYGG 60
>gb|AAG23220.1| glycine-rich RNA-binding protein [Sorghum bicolor]
Length = 170
Score = 32.3 bits (72), Expect = 5.4
Identities = 15/31 (48%), Positives = 18/31 (57%)
Query: 4 AESEGGGGGGGCAVRGGNIYWGRKHATDFRG 34
A+S GGGGGGG GG Y GR+ + G
Sbjct: 84 AQSRGGGGGGGGYGGGGGGYGGRREGGGYGG 114
>ref|XP_321405.2| ENSANGP00000008532 [Anopheles gambiae str. PEST]
gi|55233667|gb|EAA00899.2| ENSANGP00000008532
[Anopheles gambiae str. PEST]
Length = 929
Score = 31.6 bits (70), Expect = 9.1
Identities = 24/84 (28%), Positives = 39/84 (45%), Gaps = 2/84 (2%)
Query: 1 MRGAESEGGGGGGGCAVRGGNIYWGRKHATDFRGI-VVIFAWVSVPQTLLQDFVNLYSSL 59
M G +S GG GGG + G +W KH +R + +V+F +S+P V+ S++
Sbjct: 1 MTGYDSLGGNGGGPSSTGGTVCHW-LKHMKLYRVVLIVVFCLLSLPFVFYNLLVHEESNV 59
Query: 60 GWNSLVCYAHYLSAFRDESTVPLA 83
+ + L F D S + A
Sbjct: 60 QHSDIHRTRSQLFTFEDVSPLKAA 83
>gb|EAL93089.1| cytochrome P450, putative [Aspergillus fumigatus Af293]
Length = 525
Score = 31.6 bits (70), Expect = 9.1
Identities = 19/64 (29%), Positives = 31/64 (47%)
Query: 71 LSAFRDESTVPLAFCVVDELIEELKTKSCPVVFAAFSAGSKACLYKVLQLIDGRCETPHC 130
L A RDES P A + E + K+ F AFSAG+++C+ + + ++
Sbjct: 429 LVAHRDESVFPQADQYIPERWLGEEGKALQPYFVAFSAGARSCIGRNISYLEQTKAIATL 488
Query: 131 LHNY 134
+H Y
Sbjct: 489 VHRY 492
>ref|XP_345647.2| PREDICTED: similar to growth differentiation factor 7 [Rattus
norvegicus]
Length = 544
Score = 31.6 bits (70), Expect = 9.1
Identities = 15/27 (55%), Positives = 16/27 (58%)
Query: 2 RGAESEGGGGGGGCAVRGGNIYWGRKH 28
RGA+ GGGGGGG GG GR H
Sbjct: 409 RGAQGSGGGGGGGGGGGGGGGGAGRGH 435
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.325 0.141 0.461
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 321,786,635
Number of Sequences: 2540612
Number of extensions: 13560332
Number of successful extensions: 93647
Number of sequences better than 10.0: 20
Number of HSP's better than 10.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 93565
Number of HSP's gapped (non-prelim): 39
length of query: 171
length of database: 863,360,394
effective HSP length: 119
effective length of query: 52
effective length of database: 561,027,566
effective search space: 29173433432
effective search space used: 29173433432
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 70 (31.6 bits)
Lotus: description of TM0079c.14