
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC137839.10 - phase: 0
(502 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAM65660.1| Contains similarity to RNA-binding protein from A... 327 4e-88
gb|AAN28868.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|1840... 327 7e-88
gb|AAF78422.1| Contains similarity to RNA-binding protein from A... 327 7e-88
dbj|BAD45380.1| hydroxyproline-rich glycoprotein-like [Oryza sat... 310 7e-83
dbj|BAD43958.1| unknown protein [Arabidopsis thaliana] gi|519705... 144 8e-33
gb|AAO25082.1| AT02511p [Drosophila melanogaster] gi|45445512|gb... 61 7e-08
ref|NP_573332.2| CG7282-PA [Drosophila melanogaster] gi|22832530... 55 5e-06
emb|CAH65227.1| hypothetical protein [Gallus gallus] gi|61098374... 55 7e-06
gb|AAC48170.1| Hypothetical protein T17H7.1 [Caenorhabditis eleg... 54 2e-05
emb|CAB45385.1| trithorax homologue 2 [Homo sapiens] gi|12643900... 52 3e-05
gb|AAM50992.1| RE35358p [Drosophila melanogaster] gi|7302093|gb|... 52 3e-05
ref|XP_640657.1| hypothetical protein DDB0205563 [Dictyostelium ... 52 3e-05
emb|CAA50796.1| GCR 1 protein [Drosophila melanogaster] gi|10790... 52 3e-05
dbj|BAB03282.1| EBNA-1 [Cynomolgus Epstein-Barr Virus Si-IIA] 52 4e-05
ref|ZP_00354276.1| hypothetical protein Krad07002061 [Kineococcu... 52 4e-05
ref|NP_083550.1| RIKEN cDNA 2610014H22 [Mus musculus] gi|5674418... 52 6e-05
emb|CAF91750.1| unnamed protein product [Tetraodon nigroviridis] 52 6e-05
ref|XP_468448.1| putative fibrillarin [Oryza sativa (japonica cu... 51 7e-05
gb|EAA64790.1| hypothetical protein AN1670.2 [Aspergillus nidula... 51 7e-05
gb|AAC14119.1| AUT1 [Schistosoma mansoni] 51 1e-04
>gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis
thaliana gi|2129727 and contains RNA recognition
PF|00076 domain
Length = 523
Score = 327 bits (839), Expect = 4e-88
Identities = 226/560 (40%), Positives = 294/560 (52%), Gaps = 95/560 (16%)
Query: 1 MRGTIGVRLQNS---TISNATRQTLVPFST-SSGFGGGGGDGRGGGRGRGGSGTVTFNFG 56
MR IG R N TI++ +QT PF T S+ D G GRGRG
Sbjct: 1 MRSAIGRRFSNPNGFTIASLVKQT--PFLTQSTSHFSSSSDSSGRGRGRGS--------- 49
Query: 57 EKAAPGNPNPTPNVNESKPDATDSPIPPG------AGRGHGRGGTVP-DFPSFSFSSFMS 109
G P + P+ PG G GHGRG + D S +F+SF+
Sbjct: 50 -----GEDGGFPTAGRGQFGVNREPVVPGREPSSAGGYGHGRGRPIQSDSISPAFTSFVK 104
Query: 110 SIQQPGTGRGRGRGRGFD-------------PLPPQFENDSVPKKPVFIKREDNVSQTDA 156
S P GRGRG G D P PPQ + ++ +++ SQ
Sbjct: 105 S-DSPSIGRGRG-SVGSDTVSPFAAEPPRQSPPPPQQQQSQSQQQRSQPQQQQPRSQPQQ 162
Query: 157 --NDFSPPKNPVFTRSEDVR-----PVEPIDLSGDSES-DNRFVMTVPKVLPGGGRGRGK 208
ND S +PVF + ++++ P P G ++ DN F + G GRGK
Sbjct: 163 QPNDESQG-SPVFVKLQEMQDATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGK 221
Query: 209 PLEEAAQ---------EAPQAPVVNRHIRVRQTPADAESDNVPR---------RQPMNRF 250
PL E+A P P + ++ +Q A D P+ R+ +
Sbjct: 222 PLVESAPIRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSEL 281
Query: 251 VRDDGDGS------GRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRGGFKRYGDDRTSHQ 304
R + +GS GRGRGRGRG ARGRG RGRGG G R D + +
Sbjct: 282 SRGEAEGSSVGGRGGRGRGRGRG----ARGRG-RGRGGDGWRDDKK-------------E 323
Query: 305 DIARSNADGLYVGDNADGEKLAKKLGPEIMDQITEAYEEIIERVLPSPLQDEYVEAMDIN 364
+ A ++ GD+ADGEK A+K+GPE+M + E +EEI E+ LPS D ++A D N
Sbjct: 324 EEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTN 383
Query: 365 CAIEFEPEYAV-EF-DNPDIDEKEPIALRDALEKMKPFLMTYEGIRSQEEWEEVIEELMQ 422
IE EPEY + +F NPDIDEK P++LR+ LEK+KPF++ YEGI+ QEEWEE I E M
Sbjct: 384 LMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMT 443
Query: 423 RVPLLKKIVDHYSGPDRVTAKKQQEELERVAKTLPTSAPSSVKEFTNRAVVSLQSNPGWG 482
+ PL+K+IVDHYSGPDRVTAKKQ EEL+R+A TLP SAP SVK F +RA ++L+SNPGWG
Sbjct: 444 QAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWG 503
Query: 483 FDKKCQFMDKLVFEVSQHHK 502
FDKK QFMDKLV EVSQ +K
Sbjct: 504 FDKKYQFMDKLVLEVSQSYK 523
>gb|AAN28868.1| At1g53640/F22G10.8 [Arabidopsis thaliana]
gi|18404554|ref|NP_564639.1| hydroxyproline-rich
glycoprotein family protein [Arabidopsis thaliana]
gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8
[Arabidopsis thaliana] gi|12324041|gb|AAG51990.1|
unknown protein; 43598-45751 [Arabidopsis thaliana]
gi|25405656|pir||E96576 unknown protein, 43598-45751
[imported] - Arabidopsis thaliana
Length = 523
Score = 327 bits (837), Expect = 7e-88
Identities = 229/564 (40%), Positives = 295/564 (51%), Gaps = 103/564 (18%)
Query: 1 MRGTIGVRLQNS---TISNATRQTLVPFST-----------SSGFGGGGGDGRGGGRGRG 46
MR IG R N TI++ +QT PF T SSG G G G G GG
Sbjct: 1 MRSAIGRRFSNPNGFTIASLVKQT--PFLTQSTSHFSSSSDSSGRGRGRGSGEDGGFPAA 58
Query: 47 GSGTVTFNFGEKAAPGNPNPTPNVNESKPDATDSPIPPGAGRGHGRGGTVP-DFPSFSFS 105
G G FG P P P+ G GHGRG + D S +F+
Sbjct: 59 GRG----QFGVNREPVVPGREPS--------------SAGGYGHGRGRPIQSDSISPAFT 100
Query: 106 SFMSSIQQPGTGRGRGRGRGFD-------------PLPPQFENDSVPKKPVFIKREDNVS 152
SF+ S P GRGRG G D P PPQ + ++ +++ S
Sbjct: 101 SFVKS-DSPSIGRGRG-SVGSDTVSPFAAEPPRQSPPPPQQQQSQSQQQRSQPQQQQPRS 158
Query: 153 QTDA--NDFSPPKNPVFTRSEDVR-----PVEPIDLSGDSES-DNRFVMTVPKVLPGGGR 204
Q ND S +PVF + ++++ P P G ++ DN F + G
Sbjct: 159 QPQQQPNDESQG-SPVFVKLQEMQDATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGA 217
Query: 205 GRGKPLEEAAQ---------EAPQAPVVNRHIRVRQTPADAESDNVPR---------RQP 246
GRGKPL E+A P P + ++ +Q A D P+ R+
Sbjct: 218 GRGKPLVESAPIRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRA 277
Query: 247 MNRFVRDDGDGS------GRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRGGFKRYGDDR 300
+ R + +GS GRGRGRGRG ARGRG RGRGG G R D +
Sbjct: 278 RSELSRGEAEGSSVGGRGGRGRGRGRG----ARGRG-RGRGGDGWRDDKK---------- 322
Query: 301 TSHQDIARSNADGLYVGDNADGEKLAKKLGPEIMDQITEAYEEIIERVLPSPLQDEYVEA 360
++ A ++ GD+ADGEK A+K+GPE+M + E +EEI E+ LPS D ++A
Sbjct: 323 ---EEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDA 379
Query: 361 MDINCAIEFEPEYAV-EF-DNPDIDEKEPIALRDALEKMKPFLMTYEGIRSQEEWEEVIE 418
D N IE EPEY + +F NPDIDEK P++LR+ LEK+KPF++ YEGI+ QEEWEE I
Sbjct: 380 YDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAIN 439
Query: 419 ELMQRVPLLKKIVDHYSGPDRVTAKKQQEELERVAKTLPTSAPSSVKEFTNRAVVSLQSN 478
E M + PL+K+IVDHYSGPDRVTAKKQ EEL+R+A TLP SAP SVK F +RA ++L+SN
Sbjct: 440 EAMTQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSN 499
Query: 479 PGWGFDKKCQFMDKLVFEVSQHHK 502
PGWGFDKK QFMDKLV EVSQ +K
Sbjct: 500 PGWGFDKKYQFMDKLVLEVSQSYK 523
>gb|AAF78422.1| Contains similarity to RNA-binding protein from Arabidopsis
thaliana gi|2129727 and contains RNA recognition
PF|00076 domain. ESTs gb|H37317, gb|F14415, gb|AA651290
come from this gene
Length = 829
Score = 327 bits (837), Expect = 7e-88
Identities = 229/564 (40%), Positives = 295/564 (51%), Gaps = 103/564 (18%)
Query: 1 MRGTIGVRLQNS---TISNATRQTLVPFST-----------SSGFGGGGGDGRGGGRGRG 46
MR IG R N TI++ +QT PF T SSG G G G G GG
Sbjct: 307 MRSAIGRRFSNPNGFTIASLVKQT--PFLTQSTSHFSSSSDSSGRGRGRGSGEDGGFPAA 364
Query: 47 GSGTVTFNFGEKAAPGNPNPTPNVNESKPDATDSPIPPGAGRGHGRGGTVP-DFPSFSFS 105
G G FG P P P+ G GHGRG + D S +F+
Sbjct: 365 GRG----QFGVNREPVVPGREPS--------------SAGGYGHGRGRPIQSDSISPAFT 406
Query: 106 SFMSSIQQPGTGRGRGRGRGFD-------------PLPPQFENDSVPKKPVFIKREDNVS 152
SF+ S P GRGRG G D P PPQ + ++ +++ S
Sbjct: 407 SFVKS-DSPSIGRGRG-SVGSDTVSPFAAEPPRQSPPPPQQQQSQSQQQRSQPQQQQPRS 464
Query: 153 QTDA--NDFSPPKNPVFTRSEDVR-----PVEPIDLSGDSES-DNRFVMTVPKVLPGGGR 204
Q ND S +PVF + ++++ P P G ++ DN F + G
Sbjct: 465 QPQQQPNDESQG-SPVFVKLQEMQDATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGA 523
Query: 205 GRGKPLEEAAQ---------EAPQAPVVNRHIRVRQTPADAESDNVPR---------RQP 246
GRGKPL E+A P P + ++ +Q A D P+ R+
Sbjct: 524 GRGKPLVESAPIRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRA 583
Query: 247 MNRFVRDDGDGS------GRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRGGFKRYGDDR 300
+ R + +GS GRGRGRGRG ARGRG RGRGG G R D +
Sbjct: 584 RSELSRGEAEGSSVGGRGGRGRGRGRG----ARGRG-RGRGGDGWRDDKK---------- 628
Query: 301 TSHQDIARSNADGLYVGDNADGEKLAKKLGPEIMDQITEAYEEIIERVLPSPLQDEYVEA 360
++ A ++ GD+ADGEK A+K+GPE+M + E +EEI E+ LPS D ++A
Sbjct: 629 ---EEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDA 685
Query: 361 MDINCAIEFEPEYAV-EF-DNPDIDEKEPIALRDALEKMKPFLMTYEGIRSQEEWEEVIE 418
D N IE EPEY + +F NPDIDEK P++LR+ LEK+KPF++ YEGI+ QEEWEE I
Sbjct: 686 YDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAIN 745
Query: 419 ELMQRVPLLKKIVDHYSGPDRVTAKKQQEELERVAKTLPTSAPSSVKEFTNRAVVSLQSN 478
E M + PL+K+IVDHYSGPDRVTAKKQ EEL+R+A TLP SAP SVK F +RA ++L+SN
Sbjct: 746 EAMTQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSN 805
Query: 479 PGWGFDKKCQFMDKLVFEVSQHHK 502
PGWGFDKK QFMDKLV EVSQ +K
Sbjct: 806 PGWGFDKKYQFMDKLVLEVSQSYK 829
>dbj|BAD45380.1| hydroxyproline-rich glycoprotein-like [Oryza sativa (japonica
cultivar-group)]
Length = 436
Score = 310 bits (794), Expect = 7e-83
Identities = 200/507 (39%), Positives = 265/507 (51%), Gaps = 82/507 (16%)
Query: 7 VRLQNSTISNATRQTLVPFSTSSGFGGGGGDGRGGGRGRGGSGTVTFNFGEKAAPGNPNP 66
+R + + A R +P S S+ F G G G GRGRG + + APG+P P
Sbjct: 1 MRAIGAAAAAARRHAHLPTSYSAAFSSFSGIGGGAGRGRGRG--LPPSATPPRAPGSPVP 58
Query: 67 TPNVNESKPDATDSPIPPGAGRGHGRGGTVPDFPSFSFSSFMSSIQQPGTGRGRGRGRGF 126
+ + D SP P G GRG +P +SS PG GRGRG
Sbjct: 59 DDD-DGGGADPFSSPAPIGRGRGE---AVIPS---------VSSPPLPGAGRGRGS---- 101
Query: 127 DPLPPQFENDSVPKKPVFIKREDNVSQTDANDFSPPKNPVFTRSEDVRPVEPIDLSGDSE 186
PP + PK+PV K D + ++ PP P
Sbjct: 102 ---PPPL-GEVAPKQPVPAKLFDAPAAEASSSEPPPPPP--------------------- 136
Query: 187 SDNRFVMTVPKVLPGGGRGRGKP-LEEAAQEAPQAPVVNRHIRVRQTPADAESDNVPRRQ 245
P+ LP G GRG P +++ E PQ NR IR R+ A S P
Sbjct: 137 ---------PRTLPSAGAGRGVPRMQQPPVEMPQEE--NRFIRRREEKKKAASAARPAPS 185
Query: 246 PMNRFVRDD----------GDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRGGFKR 295
+ +D G G G GRGR R RG RGRG GGRG +R
Sbjct: 186 GQPKLSPEDAVKRAMELLGGGGDDDGGRGGRGRGARGRERG-RGRGRDGGRG------RR 238
Query: 296 YGDDRTSHQDIARSNADGLYVGDNADGEKLAKKLGPEIMDQITEAYEEIIERVLPSPLQD 355
D H G+Y+GDNADG++L K+LG + M EA++E + LP P QD
Sbjct: 239 SADMEEKH---------GIYLGDNADGDRLQKRLGEDKMKIFNEAFDEAADNALPDPKQD 289
Query: 356 EYVEAMDINCAIEFEPEYAVEFDNPDIDEKEPIALRDALEKMKPFLMTYEGIRSQEEWEE 415
Y+EA N IEFEPEY V F+NPDI+EK P++L D L+K+KPF++ YEGI++QEEWEE
Sbjct: 290 AYLEACHTNNMIEFEPEYHVNFNNPDIEEKPPMSLEDMLQKVKPFIVAYEGIQNQEEWEE 349
Query: 416 VIEELMQRVPLLKKIVDHYSGPDRVTAKKQQEELERVAKTLPTSAPSSVKEFTNRAVVSL 475
++++M R P +K+++D YSGPD VTAK+Q+EEL+RVA TLP + PSSVK FT++ ++SL
Sbjct: 350 AVKDVMARAPHMKELIDMYSGPDVVTAKQQEEELQRVANTLPGNIPSSVKRFTDKTLLSL 409
Query: 476 QSNPGWGFDKKCQFMDKLVFEVSQHHK 502
++NPGWGFDKKCQFMDK EVS+ +K
Sbjct: 410 KNNPGWGFDKKCQFMDKFAREVSELYK 436
>dbj|BAD43958.1| unknown protein [Arabidopsis thaliana] gi|51970502|dbj|BAD43943.1|
unknown protein [Arabidopsis thaliana]
Length = 417
Score = 144 bits (362), Expect = 8e-33
Identities = 139/432 (32%), Positives = 186/432 (42%), Gaps = 101/432 (23%)
Query: 1 MRGTIGVRLQNS---TISNATRQTLVPFST-----------SSGFGGGGGDGRGGGRGRG 46
MR IG R N TI++ +QT PF T SSG G G G G GG
Sbjct: 1 MRSAIGRRFSNPNGFTIASLVKQT--PFLTQSTSHFSSSSDSSGRGRGRGSGEDGGFPAA 58
Query: 47 GSGTVTFNFGEKAAPGNPNPTPNVNESKPDATDSPIPPGAGRGHGRGGTVP-DFPSFSFS 105
G G FG P P P+ G GHGRG + D S +F+
Sbjct: 59 GRG----QFGVNREPVVPGREPS--------------SAGGYGHGRGRPIQSDSISPAFT 100
Query: 106 SFMSSIQQPGTGRGRGRGRGFD-------------PLPPQFENDSVPKKPVFIKREDNVS 152
SF+ S P GRGRG G D P PPQ + ++ +++ S
Sbjct: 101 SFVKS-DSPSIGRGRG-SVGSDTVSPFAAEPPRQSPPPPQQQQSQSQQQRSQPQQQQPRS 158
Query: 153 QTDA--NDFSPPKNPVFTRSEDVR-----PVEPIDLSGDSES-DNRFVMTVPKVLPGGGR 204
Q ND S +PVF + ++++ P P G ++ DN F + G
Sbjct: 159 QPQQQPNDESQG-SPVFVKLQEMQDATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGA 217
Query: 205 GRGKPLEEAAQ---------EAPQAPVVNRHIRVRQTPADAESDNVPR---------RQP 246
GRGKPL E+A P P + ++ +Q A D P+ R+
Sbjct: 218 GRGKPLVESAPIRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRA 277
Query: 247 MNRFVRDDGDGS------GRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRGGFKRYGDDR 300
+ R + +GS GRGRGRGRG ARGRG RGRGG G R D +
Sbjct: 278 RSELSRGEAEGSSVGGRGGRGRGRGRG----ARGRG-RGRGGDGWRDDKK---------- 322
Query: 301 TSHQDIARSNADGLYVGDNADGEKLAKKLGPEIMDQITEAYEEIIERVLPSPLQDEYVEA 360
++ A ++ GD+ADGEK A+K+GPE+M + E +EEI E+ LPS D ++A
Sbjct: 323 ---EEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDA 379
Query: 361 MDINCAIEFEPE 372
D N IE EPE
Sbjct: 380 YDTNLMIECEPE 391
>gb|AAO25082.1| AT02511p [Drosophila melanogaster] gi|45445512|gb|AAS64829.1|
CG15920-PB, isoform B [Drosophila melanogaster]
gi|45552671|ref|NP_995860.1| CG15920-PB, isoform B
[Drosophila melanogaster]
Length = 575
Score = 61.2 bits (147), Expect = 7e-08
Identities = 82/328 (25%), Positives = 111/328 (33%), Gaps = 78/328 (23%)
Query: 24 PFSTSSGFGGGGGDGRGG---------GRGRGGSGTVTFNFGEKAAPGNPNPTPNVNESK 74
P T GGG G+G GG G+G+GG G P + P N+
Sbjct: 229 PSDTYGAPGGGNGNGSGGRPSSSYGAPGQGQGGFG---------GRPSDSYGAPGQNQKP 279
Query: 75 PDATDSPIPPGAGRGHGRGGTVPDFPSFSFSSFMS--------SIQQPGTGRGRGRGRGF 126
D+ +P G G+G GG PS S+ + S S P +G G G G
Sbjct: 280 SDSYGAP-----GSGNGNGGR----PSSSYGAPGSGPGGRPSDSYGPPASGSGAGGAGGS 330
Query: 127 DPLPPQFENDSVPKKPVFIKREDNVS-QTDANDFSPPKNPVFTRSEDVRPVEPIDLSGDS 185
P ++ND V + + + DAND S P P + +L D
Sbjct: 331 GPGGADYDNDIVEYEADQQGYRPQIRYEGDANDGSGPSGPGGPGGQ--------NLGADG 382
Query: 186 ESDNRFVMTVPKVLPGGGRGRGKPLEEAAQEAPQAPVVNRHIRVRQTPADAESDNVPRRQ 245
S R PG G G G + Q + + R D + +
Sbjct: 383 YSSGR---------PGNGNGNGNGGYSGGRPGGQDLGPSGYSGGRPGGQDLGAGGYSNGK 433
Query: 246 PMNRFVRDDGDGSGRGRGRGRGRDVYARGR-------------GDRGRGGRGGRGDGR-- 290
P + + G GR G+ GRD Y+ GR G G G GG GR
Sbjct: 434 PGGQDLGPGGYSGGRPGGQDLGRDGYSGGRPGGQDLGASGYSNGRPGGNGNGGSDGGRVI 493
Query: 291 ----------GGFKRYGDDRTSHQDIAR 308
GG + Y R QD+ R
Sbjct: 494 IGGRVIGGQDGGDQGYSGGRPGGQDLGR 521
Score = 50.4 bits (119), Expect = 1e-04
Identities = 82/322 (25%), Positives = 108/322 (33%), Gaps = 65/322 (20%)
Query: 26 STSSGFGGGGGDGRG----GGRGRGGSGTVTFNFGEKAAPGNPNPTPNVNESKPDATDSP 81
S+S G GGG GR G G G G + +G GN P+ + P +
Sbjct: 169 SSSYGAPGGGNGGRPSDTYGAPGGGNGGRPSDTYGAPGG-GNNGGRPSSSYGAPGGGNGG 227
Query: 82 IP------PGAGRGHGRGGTVPDFPSFSFSSFMSSIQQPGTGRGRGRGRGFDPLPPQFEN 135
P PG G G+G GG PS S+ + PG G+G GR D +N
Sbjct: 228 RPSDTYGAPGGGNGNGSGGR----PSSSYGA-------PGQGQGGFGGRPSDSYGAPGQN 276
Query: 136 DSVPKKPVFIKREDNVSQTDANDFSPPKNPVFTRSEDVRPVEPIDLSGDSESDNRFVMTV 195
Q ++ + P + RP G
Sbjct: 277 -----------------QKPSDSYGAPGSG---NGNGGRPSSSYGAPGSGPGGRPSDSYG 316
Query: 196 PKVLPGGGRGRGKPLEEAAQEAPQAPVVNRHIRVRQTPADAESDNVPRRQPMNRFVRDDG 255
P P G G G A P + I + E+D R P R+ D
Sbjct: 317 P---PASGSGAGG----AGGSGPGGADYDNDI------VEYEADQQGYR-PQIRYEGDAN 362
Query: 256 DGSGRGR-----GRGRGRDVYARGRGDRGRG-GRGGRGDGRGGFKRYGDDRTSHQDIARS 309
DGSG G+ G D Y+ GR G G G GG GR G + G S R
Sbjct: 363 DGSGPSGPGGPGGQNLGADGYSSGRPGNGNGNGNGGYSGGRPGGQDLGPSGYSG---GRP 419
Query: 310 NADGLYVGDNADGEKLAKKLGP 331
L G ++G+ + LGP
Sbjct: 420 GGQDLGAGGYSNGKPGGQDLGP 441
>ref|NP_573332.2| CG7282-PA [Drosophila melanogaster] gi|22832530|gb|AAF48895.2|
CG7282-PA [Drosophila melanogaster]
Length = 1868
Score = 55.1 bits (131), Expect = 5e-06
Identities = 75/283 (26%), Positives = 111/283 (38%), Gaps = 43/283 (15%)
Query: 15 SNATRQTLVPFSTSSGFGGGGGDGRGGGRGRGGSGTVTFNFGEKAAPGNPNPTPNVNESK 74
S +T+ S +SG GGGGG G GGG G GG G G G + P V++ +
Sbjct: 141 SKQKTRTISTSSANSGSGGGGGGGGGGGGGGGGGG------GSLLVQG--SQPPGVSDKQ 192
Query: 75 PDATD-SPIPPGAGRGHGRGGTVPDFPSFSFS--------SFMSSIQQ--PGTGRGRGRG 123
P S + P +G G P PS S S S +S++ + P G RGRG
Sbjct: 193 PGKDGCSKMSPSSGNSTG-----PGAPSLSGSLGGASSTPSLLSTVVKTPPTGGAKRGRG 247
Query: 124 RGFDPLPPQFENDSVPKKPVFIKREDNVSQTDANDFSPPKNP--VFTRSEDVRPVEPIDL 181
R D +PP+ S + SQ K P V + R V +
Sbjct: 248 RS-DSMPPRSTTPSSVVAHSGRTKSPAASQPQLQQ-QMKKRPTRVVPGTTTPRRVSDASM 305
Query: 182 SGDSESDNRFVMTVP-----KVLPGGGR----GRGKPLEEAAQEAPQAPVV--NRHIRVR 230
+ +S+SD+ + P K P G+ G+G+ A+ AP A +
Sbjct: 306 ASESDSDSDEPVRRPKRQSAKDKPQAGKAQPPGKGRLASSASSTAPAAHPSDDSEEDEEE 365
Query: 231 QTPADAESDNVPRRQPMNRFVRDDGDGSGR----GRGRGRGRD 269
+ P+ A + + ++Q +R G R G +GRD
Sbjct: 366 EEPSAARAASSKQQQQQASSLRGSRAGGNRAMSSGAASAKGRD 408
>emb|CAH65227.1| hypothetical protein [Gallus gallus]
gi|61098374|ref|NP_001012935.1| similar to WASP
interacting protein [Gallus gallus]
Length = 494
Score = 54.7 bits (130), Expect = 7e-06
Identities = 57/201 (28%), Positives = 69/201 (33%), Gaps = 44/201 (21%)
Query: 64 PNPTPNVNESKPDATDSPIP----PGAGRGHGRGG--------------TVPDFPSFSFS 105
P P P+V SKP+AT P+P P A HGRG T P FP +
Sbjct: 162 PPPRPDVG-SKPEATPPPVPSTPRPIASSLHGRGSAPAPVLNRQPSLGPTPPPFPGSRAA 220
Query: 106 SFMSSIQQPGTGRGRGRGRGFDPLPPQFENDSVPKKPVFIKREDNVSQTDANDFSPPKNP 165
S++QPG G G PLPP V +KP T + D + P P
Sbjct: 221 GSAGSLRQPGPGPATPYS-GRPPLPPTPGRSPVDEKPPPPPPPSGHRPTASRDMALPPPP 279
Query: 166 VFTRSEDV------------------RPVEPIDLSGDSESD------NRFVMTVPKVLPG 201
V RP P G + SD R + VP P
Sbjct: 280 PQNSKPPVPASPRPPLGVPAPPPPPSRPGPPPVPPGPASSDEMPRLPQRNLSLVPPAAPS 339
Query: 202 GGRGRGKPLEEAAQEAPQAPV 222
G GR PL E P P+
Sbjct: 340 SGSGRSGPLPPPPSERPPPPI 360
>gb|AAC48170.1| Hypothetical protein T17H7.1 [Caenorhabditis elegans]
gi|17555122|ref|NP_497250.1| keratin-like protein
precursor family member (72.1 kD) (3B359)
[Caenorhabditis elegans] gi|7507912|pir||T28899
hypothetical protein T17H7.1 - Caenorhabditis elegans
Length = 682
Score = 53.5 bits (127), Expect = 2e-05
Identities = 84/289 (29%), Positives = 101/289 (34%), Gaps = 56/289 (19%)
Query: 10 QNSTISNATRQTLVPFSTSSGFGGGGGDGRGGGRGRGGSGTVTFNFGEKAAPGNPNPTPN 69
+NS S++ Q F G GG GG G+G G GG G P
Sbjct: 286 ENSQHSDSNSQ----FDFPRGPGGRGGRGQGPDFGPGGQG-----------GRGQGPDFG 330
Query: 70 VNESKPDATDSPIPPGAGRGHGRGGTVPDF-PSFSFSSFMSSIQQPGTGRGRGRGRGFDP 128
+ P S P G G G G+G PDF P F S PG GRG+G F P
Sbjct: 331 PQDDFPGRRGSGGPGGRG-GRGQG---PDFEPQDDFPGRRGS-GGPGRRGGRGQGPDFGP 385
Query: 129 LPPQFENDSVPKKPVFIKREDNVSQTDANDFSPPKNPVFTRSEDVRPVEPIDLSGDSESD 188
D P + + DF P + + D P + D SG S
Sbjct: 386 ------QDDFPGRRGSGGPGGRGGRGQGPDFGPGRQGGRGQGPDFGPQD--DFSGRRGSG 437
Query: 189 NRFVMTVPKVLPGGGRGRGKPLEEAAQEAPQAPVVNRHIRVRQTPADAESDNVPRRQ--- 245
PGG GRG QE P Q P D+ P R+
Sbjct: 438 G----------PGGRGGRG-------QEPDFGP--GGQGGRGQGPDFGPQDDFPGRRGSG 478
Query: 246 -PMNRFVRDDGDGSGRGRGRGRGRDVYARGR----GDRGRGGRGGRGDG 289
P R R G G G GRG+D + + G RG GG GGRG G
Sbjct: 479 GPEGRDGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGSGGPGGRGQG 527
Score = 50.4 bits (119), Expect = 1e-04
Identities = 77/281 (27%), Positives = 95/281 (33%), Gaps = 43/281 (15%)
Query: 30 GFGGGGGDGRGGGRGRGGSGTVTFNFGEKAAPGNPNPTPNVNESKPD---ATDSPIPPGA 86
G G GG G GGRG+G +F + G P + PD D P G+
Sbjct: 337 GRRGSGGPGGRGGRGQGPDFEPQDDFPGRRGSGGPGRRGGRGQG-PDFGPQDDFPGRRGS 395
Query: 87 GRGHGRGGTV--PDFPSFSFSSFMSSIQQPGTGRGRGRGRGFDPLPPQFENDSVPKKPVF 144
G GRGG PDF PG GRG+G F P D +
Sbjct: 396 GGPGGRGGRGQGPDFG-------------PGRQGGRGQGPDFGP------QDDFSGRRGS 436
Query: 145 IKREDNVSQTDANDFSPPKNPVFTRSEDVRPVEPID----LSGDSESDNRFVMTVPKVLP 200
+ DF P + D P + G D R P P
Sbjct: 437 GGPGGRGGRGQEPDFGPGGQGGRGQGPDFGPQDDFPGRRGSGGPEGRDGRG--QGPDFGP 494
Query: 201 GGGRGRGKPLEEAAQEA-PQAPVVNRHIRVRQTPADAESDNVPRRQ----PMNRFVRDDG 255
G GRG+ + +Q+A P Q P D+ P R+ P R R G
Sbjct: 495 GSQGGRGQDSDSGSQDAFPGRRGSGGPGGRGQGPDFGPQDDFPGRRGSGGPEGRDGRGQG 554
Query: 256 DGSGRGRGRGRGRD-------VYARGRGDRGRGGRGGRGDG 289
G G GRG+D + RG G GG GGRG G
Sbjct: 555 PDFGPGSQGGRGQDSDSGSQDAFPGRRGPGGPGGLGGRGQG 595
Score = 39.7 bits (91), Expect = 0.22
Identities = 65/255 (25%), Positives = 91/255 (35%), Gaps = 54/255 (21%)
Query: 108 MSSIQQPGTGR-------GRGRGRGFDPLPPQFENDSVPKKPVFIKREDNVSQTDANDFS 160
M+ QQ G+ + GRG G GF P ++ + + T+ N FS
Sbjct: 210 MNRFQQTGSQQNFQSRRGGRGDGPGFVPGTQDNNQRGSGERGQRQNFGPSDNLTNGNQFS 269
Query: 161 PPKNPVFTRSEDVRPVEPIDLSGDSESDNRFVMTVPKVLPG-GGRGRGKPLEEAAQEAPQ 219
+ F R + + S S+S+++F P+ G GGRG+G Q
Sbjct: 270 KKQ---FARGPSSMNSDLSENSQHSDSNSQF--DFPRGPGGRGGRGQGPDFGPGGQGGRG 324
Query: 220 APVVNRHIRVRQTPADAESDNVPRRQ----PMNRFVRDDGD---------------GSGR 260
Q P D+ P R+ P R R G G GR
Sbjct: 325 -----------QGPDFGPQDDFPGRRGSGGPGGRGGRGQGPDFEPQDDFPGRRGSGGPGR 373
Query: 261 GRGRGRG-----RDVYARGRGDRGRGGRGGRGD------GRGGFKRYGDDRTSHQDIARS 309
GRG+G +D + RG G GGRGGRG GR G + G D D +
Sbjct: 374 RGGRGQGPDFGPQDDFPGRRGSGGPGGRGGRGQGPDFGPGRQGGRGQGPDFGPQDDFSGR 433
Query: 310 NADGLYVGDNADGEK 324
G G G++
Sbjct: 434 RGSGGPGGRGGRGQE 448
>emb|CAB45385.1| trithorax homologue 2 [Homo sapiens]
gi|12643900|sp|Q9UMN6|MLL4_HUMAN Myeloid/lymphoid or
mixed-lineage leukemia protein 4 (Trithorax homolog 2)
gi|7662046|ref|NP_055542.1| myeloid/lymphoid or
mixed-lineage leukemia 4 [Homo sapiens]
Length = 2715
Score = 52.4 bits (124), Expect = 3e-05
Identities = 82/306 (26%), Positives = 103/306 (32%), Gaps = 77/306 (25%)
Query: 32 GGGGGDGRGG-GRGRGGSGTVTFNFGEKAAPGNPNPTPN------------VNESKPDAT 78
G GGG GRGG G G G PG P + + +
Sbjct: 26 GAGGGGGRGGRGNGAERVRVALRRGGGATGPGGAEPGEDTALLRLLGLRRGLRRLRRLWA 85
Query: 79 DSPIPPGAGRGHGRG-----GTVPD------------FPSFSF------SSFMSSI--QQ 113
+ G GRG GRG G VP+ F F SS S++ Q+
Sbjct: 86 GPRVQRGRGRGRGRGWGPSRGCVPEEESSDGESDEEEFQGFHSDEDVAPSSLRSALRSQR 145
Query: 114 PGTGRGRGRGRGFDPLPPQFENDSVPKKPVFIKREDNVSQTDANDFSPPKNPVFTRSED- 172
RGRGR PLPP D P +PPK P R E+
Sbjct: 146 GRAPRGRGRKHKTTPLPPPRLADVAP--------------------TPPKTPARKRGEEG 185
Query: 173 ----VRPVEPIDLSGDSESDNRFVMTVPKVLPGGGRGR--GKPLEEAAQEAPQAPVVNRH 226
V+ + + + R P P RGR G+P ++ Q VV
Sbjct: 186 TERMVQALTELLRRAQAPQAPRSRACEPST-PRRSRGRPPGRPAGPCRRK--QQAVVVAE 242
Query: 227 IRVRQTPADAESDNVPRRQPMNRFVRDDGDGSGRGRGRGRGRDVYARGRGDRGRGGRGGR 286
V + VP + + +G G G G R RG G RGGRGGR
Sbjct: 243 AAVTIPKPEPPPPVVPVKHQTGSWKCKEGPGPGPGTPR--------RG-GQSSRGGRGGR 293
Query: 287 GDGRGG 292
G GRGG
Sbjct: 294 GRGRGG 299
Score = 35.4 bits (80), Expect = 4.2
Identities = 20/35 (57%), Positives = 21/35 (59%), Gaps = 1/35 (2%)
Query: 255 GDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDG 289
G GS G G RGR R RG G GGRGGRG+G
Sbjct: 6 GGGSCPGPGSARGR-FPGRPRGAGGGGGRGGRGNG 39
>gb|AAM50992.1| RE35358p [Drosophila melanogaster] gi|7302093|gb|AAF57194.1|
CG2150-PA [Drosophila melanogaster]
gi|24651743|ref|NP_651892.1| CG2150-PA [Drosophila
melanogaster]
Length = 188
Score = 52.4 bits (124), Expect = 3e-05
Identities = 37/101 (36%), Positives = 45/101 (43%), Gaps = 4/101 (3%)
Query: 30 GFGGGGGDGRGGGRGRGGSGTVTFNFGEKAAPG---NPNPTPNVNESKPDATDSPIPPGA 86
GFGGG G G GGG G GG G + G + + G +P V + + G
Sbjct: 40 GFGGGFGGGLGGGGGGGGGGYQAVSGGFQTSEGQNVDPQLLEQVRQILLNEESKQGGGGG 99
Query: 87 GRGHGRGGTVPDFPSFSFSSFMSSIQQPGTGRGRGRGRGFD 127
G G G GG P PS S+ +S PG G G GR G D
Sbjct: 100 GGGGGGGGGYPSGPSSSYGPPSTSYGAPGIG-GGGRVVGID 139
>ref|XP_640657.1| hypothetical protein DDB0205563 [Dictyostelium discoideum]
gi|60468684|gb|EAL66686.1| hypothetical protein
DDB0205563 [Dictyostelium discoideum]
Length = 369
Score = 52.4 bits (124), Expect = 3e-05
Identities = 58/201 (28%), Positives = 82/201 (39%), Gaps = 37/201 (18%)
Query: 138 VPKKPVFIKREDNVSQTDANDFSPPKNPVFTRSE-DVRPVEPIDLS---------GDSES 187
+ KK FIK +S+T + +P P SE +P P D S G S
Sbjct: 101 ISKKSAFIKI--TLSKTPLDTTNPGYQPPLPESETQKKPTRPRDYSPRRTRGGFRGGSSG 158
Query: 188 DNRFVMTVPKVLPGGGRGRGKPLEEAAQEAPQAPVVNRHIRVRQTPADAESDNVP----- 242
R GG RGRG+ + + + T + + S P
Sbjct: 159 GFRGGSG------GGFRGRGRGGGFRGSSSHPSTATDNTAATTTTTSPSTSTTSPTTTTT 212
Query: 243 --------RRQPMNRFVRDDGDGSGRGRG----RGRGRDVYARGRGDRGRGGRGGRGDGR 290
++Q ++ D GRGRG RGRGR + RGRG RGRGG GRG GR
Sbjct: 213 TTAPVEQQQQQQQQQYQPSDSGFRGRGRGGFRGRGRGRGGF-RGRG-RGRGGFRGRGRGR 270
Query: 291 GGFKRYGDDRTSHQDIARSNA 311
GGF+ +++Q+ S++
Sbjct: 271 GGFRGGSPLNSNYQESESSSS 291
Score = 42.0 bits (97), Expect = 0.045
Identities = 47/170 (27%), Positives = 59/170 (34%), Gaps = 23/170 (13%)
Query: 27 TSSGFGGGGGD---GRGGGRGRGGSGTVTFNFGEKAAPGNPNPTPNVNESKPDATDSPIP 83
+S GF GG G GRG G G GS + + A +P+ + + P T +
Sbjct: 156 SSGGFRGGSGGGFRGRGRGGGFRGSSSHPSTATDNTAATTTTTSPSTSTTSPTTTTTTTA 215
Query: 84 P----------------GAGRGHGRGGTVPDFPSFSFSSFMSSIQQPGTGRGRGRGR-GF 126
P RG GRGG F + G RGRGRGR GF
Sbjct: 216 PVEQQQQQQQQQYQPSDSGFRGRGRGGF--RGRGRGRGGFRGRGRGRGGFRGRGRGRGGF 273
Query: 127 DPLPPQFENDSVPKKPVFIKREDNVSQTDANDFSP-PKNPVFTRSEDVRP 175
P N + N + T A + SP P T D RP
Sbjct: 274 RGGSPLNSNYQESESSSSTTNTTNTATTSATNNSPNPTTITTTEGNDFRP 323
Score = 37.7 bits (86), Expect = 0.85
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 13/76 (17%)
Query: 12 STISNATRQTLVPFSTSSGF----GGGGGDGRGGGRGRGGSGTVTFNFGEKAAPGNPNPT 67
S +N+ T + + + F G GG GRGG RGRGG+ +++ +P+ +
Sbjct: 302 SATNNSPNPTTITTTEGNDFRPRRGRGGFRGRGGFRGRGGN---------RSSSNSPSSS 352
Query: 68 PNVNESKPDATDSPIP 83
PN + +T SP P
Sbjct: 353 PNPTTTTTTSTTSPTP 368
>emb|CAA50796.1| GCR 1 protein [Drosophila melanogaster] gi|1079077|pir||S49192 GCR
1 protein - fruit fly (Drosophila melanogaster)
Length = 188
Score = 52.4 bits (124), Expect = 3e-05
Identities = 37/101 (36%), Positives = 45/101 (43%), Gaps = 4/101 (3%)
Query: 30 GFGGGGGDGRGGGRGRGGSGTVTFNFGEKAAPG---NPNPTPNVNESKPDATDSPIPPGA 86
GFGGG G G GGG G GG G + G + + G +P V + + G
Sbjct: 40 GFGGGFGGGLGGGGGGGGGGYQAVSGGFQTSEGQNVDPQLLEQVRQILLNEESKQGGGGG 99
Query: 87 GRGHGRGGTVPDFPSFSFSSFMSSIQQPGTGRGRGRGRGFD 127
G G G GG P PS S+ +S PG G G GR G D
Sbjct: 100 GGGGGGGGGYPSGPSSSYGPPSTSYGAPGIG-GGGRVVGID 139
>dbj|BAB03282.1| EBNA-1 [Cynomolgus Epstein-Barr Virus Si-IIA]
Length = 588
Score = 52.0 bits (123), Expect = 4e-05
Identities = 69/249 (27%), Positives = 87/249 (34%), Gaps = 73/249 (29%)
Query: 29 SGFGGGGGDGRGGGRGRGGSGTVTFNFGEKAAPGNPNPTPNVNESKPDATDSPIPPGAGR 88
SG GG GG G GGGRGRGGS + G + G GR
Sbjct: 228 SGAGGAGGSGGGGGRGRGGSRGRGGSRGRGGSRGRGG-------------------SRGR 268
Query: 89 GHGRGGTVPDFPSFSFSSFMSSIQQPGTGRGRGRGRGFDPLPPQFENDSVPKKPVFIKRE 148
G GRGG+ + G GRGRGRGRG P + + P+ P
Sbjct: 269 GRGRGGS----------------RGRGRGRGRGRGRGQGPR----QGEKRPRSPSGSSSS 308
Query: 149 DNVSQTDANDFSPPKNPVFTRSEDVRPVEPIDLSGDS----ESDNRFVMTVPKVLPGGGR 204
+ S++ ++ R+ LS S D +TVP GR
Sbjct: 309 GSSSRSSSSG----------RASSGGSSSGGTLSNGSFYGFPGDRPLTVTVPG--SALGR 356
Query: 205 GRGKPLEEAAQEAPQAPVVNRHIRVRQTPADAESDNVPRRQPMNRFVRDDGDGSGR---- 260
RG + E P A + Q P E D P P + G G G+
Sbjct: 357 YRGTDGTDGGDEPPGA--------MEQGP---EED--PGEGPSRQHTTSGGRGGGKKGGW 403
Query: 261 -GRGRGRGR 268
GR RG GR
Sbjct: 404 FGRHRGEGR 412
Score = 48.9 bits (115), Expect = 4e-04
Identities = 28/47 (59%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 255 GDGSGRGRGRGRGRDVYARGR-GDRGRGGRGGRGDGRGGFKRYGDDR 300
G G GRGRG RGR +RGR G RGRGG GRG GRGG + G R
Sbjct: 237 GGGGGRGRGGSRGRG-GSRGRGGSRGRGGSRGRGRGRGGSRGRGRGR 282
Score = 43.1 bits (100), Expect = 0.020
Identities = 26/47 (55%), Positives = 27/47 (57%), Gaps = 4/47 (8%)
Query: 255 GDGSGRGRGRGRGRDVYARGRG-DRGRGGRGGRGDGRGGFKRYGDDR 300
G G RGRG RGR RGRG RGRG GRG GRG R G+ R
Sbjct: 255 GRGGSRGRGGSRGR---GRGRGGSRGRGRGRGRGRGRGQGPRQGEKR 298
Score = 42.4 bits (98), Expect = 0.035
Identities = 24/37 (64%), Positives = 25/37 (66%), Gaps = 2/37 (5%)
Query: 255 GDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRG 291
G G RGRG RGR +RGRG RGRGG GRG GRG
Sbjct: 249 GRGGSRGRGGSRGRGG-SRGRG-RGRGGSRGRGRGRG 283
Score = 42.0 bits (97), Expect = 0.045
Identities = 27/58 (46%), Positives = 30/58 (51%), Gaps = 4/58 (6%)
Query: 257 GSGRGRGRGRGRDVYARGRG-DRGRGGRGGRGDGRGGFKRYGDDRTSHQDIARSNADG 313
GSG G GRGRG +RGRG RGRGG GRG RG + G R + R G
Sbjct: 235 GSGGGGGRGRGG---SRGRGGSRGRGGSRGRGGSRGRGRGRGGSRGRGRGRGRGRGRG 289
Score = 39.3 bits (90), Expect = 0.29
Identities = 24/49 (48%), Positives = 24/49 (48%), Gaps = 7/49 (14%)
Query: 252 RDDGDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRGGFKRYGDDR 300
R G G GRGRGRGR RG G G GG G G R GDDR
Sbjct: 41 RGRGGSRGHGRGRGRGR---GRGGGQGGTVASGGSGSG----PRLGDDR 82
Score = 38.9 bits (89), Expect = 0.38
Identities = 31/99 (31%), Positives = 43/99 (43%), Gaps = 5/99 (5%)
Query: 26 STSSGFGGGGGDGRGGGRGRGGSGTVTFNFGEKAAPGNPNPTPNVNESKPDATDSPIPPG 85
S G G GG GRG GRGRG GEK P +P+ + + S ++ G
Sbjct: 265 SRGRGRGRGGSRGRGRGRGRGRGRGQGPRQGEK-RPRSPSGSSSSGSSSRSSSSGRASSG 323
Query: 86 AGRGHGRGGTVPDFPSFSFSSFMS-SIQQPGTGRGRGRG 123
G GGT+ + + F ++ PG+ GR RG
Sbjct: 324 ---GSSSGGTLSNGSFYGFPGDRPLTVTVPGSALGRYRG 359
Score = 37.7 bits (86), Expect = 0.85
Identities = 22/48 (45%), Positives = 25/48 (51%), Gaps = 2/48 (4%)
Query: 255 GDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGD--GRGGFKRYGDDR 300
G G+G G G G + G G RGRGG GRG GRGG + G R
Sbjct: 219 GSGAGGAGGSGAGGAGGSGGGGGRGRGGSRGRGGSRGRGGSRGRGGSR 266
Score = 35.0 bits (79), Expect = 5.5
Identities = 26/78 (33%), Positives = 31/78 (39%), Gaps = 10/78 (12%)
Query: 27 TSSGFGGGGGDGRGG----------GRGRGGSGTVTFNFGEKAAPGNPNPTPNVNESKPD 76
++SG GGGG GRGG GRGRGG T G + + +P
Sbjct: 31 STSGSGGGGTRGRGGSRGHGRGRGRGRGRGGGQGGTVASGGSGSGPRLGDDRRPDGQRPS 90
Query: 77 ATDSPIPPGAGRGHGRGG 94
S I G G G GG
Sbjct: 91 KRRSCIGCRGGAGGGSGG 108
>ref|ZP_00354276.1| hypothetical protein Krad07002061 [Kineococcus radiotolerans
SRS30216]
Length = 784
Score = 52.0 bits (123), Expect = 4e-05
Identities = 33/69 (47%), Positives = 36/69 (51%), Gaps = 4/69 (5%)
Query: 257 GSGRGRGRGRGRDVYA---RGRGDRGRGGRGGRGDGRG-GFKRYGDDRTSHQDIARSNAD 312
G GR RG GR RD RGRGDRGRG R R GRG G +R DDR D+ R
Sbjct: 114 GGGRQRGHGRDRDHGGRGHRGRGDRGRGDRSRRDGGRGDGSRRDRDDRGRRGDLDRGGRR 173
Query: 313 GLYVGDNAD 321
G + D D
Sbjct: 174 GDGLDDGLD 182
Score = 34.3 bits (77), Expect = 9.4
Identities = 22/69 (31%), Positives = 28/69 (39%)
Query: 253 DDGDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRGGFKRYGDDRTSHQDIARSNAD 312
D+G+ RG GRG G GRG RG G G G+G D + +
Sbjct: 192 DEGELRLRGAGRGDGGRDGGGGRGGRGAGAERDEGGGQGDGGDDPDGGVGRAAAGQRHGH 251
Query: 313 GLYVGDNAD 321
G VG+ D
Sbjct: 252 GPAVGEQPD 260
>ref|NP_083550.1| RIKEN cDNA 2610014H22 [Mus musculus] gi|56744180|dbj|BAD81031.1|
mixed lineage leukemia 2 [Mus musculus]
Length = 2713
Score = 51.6 bits (122), Expect = 6e-05
Identities = 78/300 (26%), Positives = 109/300 (36%), Gaps = 64/300 (21%)
Query: 32 GGGGGDGRGG-GRGRGGSGTVTFNFGEKAAPGNPNPTPN------------VNESKPDAT 78
G GGG GRGG G G G A PG P + + +
Sbjct: 26 GCGGGGGRGGRGNGAERVRVALRRGGGAAGPGGAEPGEDTALLRLLGLRRGLRRLRRLWA 85
Query: 79 DSPIPPGAGRGHGRG-----GTVPD------------FPSFSF------SSFMSSI--QQ 113
+ + G GRG GRG G +P+ F F SS S++ Q+
Sbjct: 86 GARVQRGRGRGRGRGWGPNRGCMPEEESSDGESEEEEFQGFHSDEDVAPSSLRSALRSQR 145
Query: 114 PGTGRGRGRGRGFDPLPPQFENDS-VPKKPVFIKREDNVSQTDANDFSPPKNPVFTRSED 172
RGRGR PLPP+ + + VP K KR + ++ + + RS+
Sbjct: 146 GRAPRGRGRKHKTTPLPPRLADVTPVPPKAPTRKRGEEGTERMVQALTE----LLRRSQA 201
Query: 173 VRPVEPIDLSGDSESDNRFVMTVPKVLPGGGRGRGKPLEEAAQEAPQAPVVNRHIRVRQT 232
+P + + + R GR G+P ++ Q VV V
Sbjct: 202 PQPPRSRARAREPSTPRR----------SRGRPPGRPAGPCRKK--QQAVVLAEAAVTIP 249
Query: 233 PADAESDNVPRRQPMNRFVRDDGDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDGRGG 292
+ VP + + +G G G G + RG G GRGGRGGRG GRGG
Sbjct: 250 KPEPPPPVVPVKNKAGSWKCKEGPGPGPGTPK--------RG-GQPGRGGRGGRGRGRGG 300
>emb|CAF91750.1| unnamed protein product [Tetraodon nigroviridis]
Length = 512
Score = 51.6 bits (122), Expect = 6e-05
Identities = 29/57 (50%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 245 QPMNRFVRDDGDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDG-RGGFKRYGDDR 300
+P RF RD G+ S G RGRGR Y G +RGR G G G G RGG+K YG R
Sbjct: 108 RPGPRFGRDQGERSDGGGYRGRGRGGYDSGGYERGRRGPPGMGGGDRGGYKNYGGSR 164
>ref|XP_468448.1| putative fibrillarin [Oryza sativa (japonica cultivar-group)]
gi|48716271|dbj|BAD22886.1| putative fibrillarin [Oryza
sativa (japonica cultivar-group)]
gi|48716513|dbj|BAD23118.1| putative fibrillarin [Oryza
sativa (japonica cultivar-group)]
Length = 306
Score = 51.2 bits (121), Expect = 7e-05
Identities = 30/60 (50%), Positives = 32/60 (53%), Gaps = 7/60 (11%)
Query: 242 PRRQPMNRFVRDDGDGSGRGRGRGRGRDVYARGRGDRGRGG-------RGGRGDGRGGFK 294
PR + R R DG G G GRG GR + GRG RGRGG GGRG GRGG K
Sbjct: 4 PRGRGFGRGGRGDGGGRSGGGGRGFGRGGDSGGRGGRGRGGGRTPRGRGGGRGGGRGGMK 63
Score = 36.2 bits (82), Expect = 2.5
Identities = 20/35 (57%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 255 GDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDG 289
GD GRG GRGRG RGRG GGRGG G
Sbjct: 32 GDSGGRG-GRGRGGGRTPRGRGGGRGGGRGGMKGG 65
Score = 34.3 bits (77), Expect = 9.4
Identities = 23/40 (57%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Query: 260 RGRGRGRGRDVYARGR-GDRGRG-GRGGRGDGRGGFKRYG 297
RGRG GRG GR G GRG GRGG GRGG R G
Sbjct: 5 RGRGFGRGGRGDGGGRSGGGGRGFGRGGDSGGRGGRGRGG 44
>gb|EAA64790.1| hypothetical protein AN1670.2 [Aspergillus nidulans FGSC A4]
gi|67522427|ref|XP_659274.1| hypothetical protein
AN1670_2 [Aspergillus nidulans FGSC A4]
gi|49087830|ref|XP_405807.1| hypothetical protein
AN1670.2 [Aspergillus nidulans FGSC A4]
Length = 653
Score = 51.2 bits (121), Expect = 7e-05
Identities = 71/279 (25%), Positives = 94/279 (33%), Gaps = 53/279 (18%)
Query: 23 VPFSTSSGFGGGGGDGRGGGRGRGGS-GTVTFNFGEKAAPGNPNPTPNV----NESKPDA 77
+ + S G G G G G +G G S TV +APG V + P
Sbjct: 141 IQLTADSCLGTGAGGGGPGTQGPGQSTATVPVVSPSPSAPGGGTGGSEVPGQSTPTVPIV 200
Query: 78 TDSPIPPGAGRGHGRGGTVPDFPSFSFSSFMSSIQQPGTGRGRGRGRGFDPLPPQFENDS 137
T SP PG G G G G P + + S PG G G G S
Sbjct: 201 TPSPSVPGGGAGGGPGPQGPGQSTATVPVVSPSPSAPGGGAGGSEVPG----------QS 250
Query: 138 VPKKPVFIKREDNVSQTDANDFSPPKNPVFTRSEDVRPVEPIDLSGDSESDNRFVMTVPK 197
P P+ + ++ A P ++ P P +G SE+ + T+P
Sbjct: 251 TPTVPI-VTPSPSIPGGGAGG-GPGQSTATVPVVSPSPSVPGGGAGGSEAPGQSTATIPV 308
Query: 198 V-------LPGGGRGRGKPLEEAAQEAPQAPVVNRHIRVRQTPADAESDNVPRRQPMNRF 250
V +PGGG G G +Q PV + S +VP
Sbjct: 309 VSPSPSPSVPGGGAGGG-----PSQSTETVPVT------------SPSPSVP-------- 343
Query: 251 VRDDGDGSGRGRGRGRGRDVYARGRGDRGRGGRGGRGDG 289
G G+G G G G A G G G G GG+ G
Sbjct: 344 ----GGGAGSGEGPGTSPSPSAPGGGAGGGGEGGGQSTG 378
>gb|AAC14119.1| AUT1 [Schistosoma mansoni]
Length = 335
Score = 50.8 bits (120), Expect = 1e-04
Identities = 66/248 (26%), Positives = 97/248 (38%), Gaps = 47/248 (18%)
Query: 201 GGGRGRGKPLEEAAQEAPQAPVVNRHIRVRQTPADAESD-NVPRRQPMNRFVRDDGDGSG 259
G GRG + + + P+ I + P D+ SD N PR
Sbjct: 121 GSGRGTPRGMRVGRGQGPR-------IAPTEAPQDSVSDLNAPRGSSFEP---------- 163
Query: 260 RGRGRGRGRDVYARGRG---DRGRGGRGGRGDGRGGFKRYG--DDRTSHQDIARSNADGL 314
RGRGRGRGR ++ RGRG + R G R G ++YG D + QD+ DGL
Sbjct: 164 RGRGRGRGRGMFGRGRGMPFNSNRDFENQDGPDRQGPRQYGRRDGNWNSQDV-----DGL 218
Query: 315 YVGDNADGEKLAKKLGP--EIMDQITEAYEEIIERVLPSPLQDEYVEAMDINCAIEFEPE 372
+ ++ D E++ + E+ DQ A E E V+ + +E EP+
Sbjct: 219 IMPESGDSEQVVRFADDRNEVEDQPEHATAENEEGVV-----------VGTETPVEEEPK 267
Query: 373 -YAVEFDNPDIDEKEPIAL--RDALEKM---KPFLMTYEGIRSQEEWEEVIEELMQRVPL 426
Y +E +P L L K K R +E E + E+ +R
Sbjct: 268 SYTLEGYKAMRQSSKPAVLLNNKGLRKANDGKDVFANMVAHRKLQEVSEDVYEVEERKTS 327
Query: 427 LKKIVDHY 434
L+ VD Y
Sbjct: 328 LRASVDRY 335
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.313 0.137 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,037,528,021
Number of Sequences: 2540612
Number of extensions: 59276547
Number of successful extensions: 499107
Number of sequences better than 10.0: 4163
Number of HSP's better than 10.0 without gapping: 1333
Number of HSP's successfully gapped in prelim test: 3117
Number of HSP's that attempted gapping in prelim test: 344709
Number of HSP's gapped (non-prelim): 65260
length of query: 502
length of database: 863,360,394
effective HSP length: 132
effective length of query: 370
effective length of database: 527,999,610
effective search space: 195359855700
effective search space used: 195359855700
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 77 (34.3 bits)
Medicago: description of AC137839.10