
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0103.5
(275 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q9QRB7 Gag protein [Human immunodeficiency virus 1] 57 4e-07
UniRef100_Q6GL15 MGC69190 protein [Xenopus tropicalis] 57 5e-07
UniRef100_Q65XV7 Hypothetical protein P0016H04.11 [Oryza sativa] 57 5e-07
UniRef100_Q6BPQ4 Similar to CA2534|IPF12793 Candida albicans IPF... 57 5e-07
UniRef100_O93085 Gag protein [Human immunodeficiency virus 1] 57 7e-07
UniRef100_Q7PNE6 ENSANGP00000011651 [Anopheles gambiae str. PEST] 57 7e-07
UniRef100_Q9YNY7 Gag protein [Human immunodeficiency virus 1] 56 9e-07
UniRef100_P89686 Gag polyprotein [Feline immunodeficiency virus] 56 9e-07
UniRef100_UPI000042EAEE UPI000042EAEE UniRef100 entry 56 1e-06
UniRef100_Q6VG41 Gag protein [Chimpanzee immunodeficiency virus)] 56 1e-06
UniRef100_Q66QG0 Gag protein [Human immunodeficiency virus 1] 55 2e-06
UniRef100_Q9W6Q5 Cellular nucleic acid binding protein [Bufo are... 55 2e-06
UniRef100_Q91594 Cellular nucleic acid binding protein [Xenopus ... 55 2e-06
UniRef100_P70000 Cellular nucleic acid binding protein [Xenopus ... 55 2e-06
UniRef100_Q7JQ89 CnjB protein [Tetrahymena thermophila] 55 2e-06
UniRef100_Q6FNZ3 Similar to sp|P53849 Saccharomyces cerevisiae Y... 55 2e-06
UniRef100_Q6XD84 SR RNA-binding protein [Trypanosoma cruzi] 55 2e-06
UniRef100_Q97594 P24/NCp7 [Human immunodeficiency virus 1] 55 3e-06
UniRef100_Q8AC43 Gag polyprotein [Human immunodeficiency virus 1] 55 3e-06
UniRef100_Q6YAI6 Gag-pol fusion polyprotein [Human immunodeficie... 55 3e-06
>UniRef100_Q9QRB7 Gag protein [Human immunodeficiency virus 1]
Length = 506
Score = 57.4 bits (137), Expect = 4e-07
Identities = 33/102 (32%), Positives = 52/102 (50%), Gaps = 12/102 (11%)
Query: 171 HHQRLMATAMYQQRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKT-KE 229
H R++A AM Q TG M + G+F N+ + +CF C +EGH+A NCK ++
Sbjct: 358 HKARVLAEAMSQVT----TGNANIMMQRGNFQNRKKII--KCFNCGKEGHIAKNCKAPRK 411
Query: 230 SLCFNCNQPGHFSQDCMALRGESSGNV-----GKGKQLAPEE 266
C+ C + GH +DC + G + GK ++L E+
Sbjct: 412 RGCWKCGREGHQMKDCTERQANFLGKIWSSPQGKARELPSEQ 453
>UniRef100_Q6GL15 MGC69190 protein [Xenopus tropicalis]
Length = 138
Score = 57.0 bits (136), Expect = 5e-07
Identities = 27/76 (35%), Positives = 39/76 (50%), Gaps = 4/76 (5%)
Query: 199 GHFANKCSVTGWRCFKCNREGHLAVNCKT----KESLCFNCNQPGHFSQDCMALRGESSG 254
GH A C + C+ C R GH+A +CK +E C+NC +PGH ++DC +
Sbjct: 61 GHLAKDCDLQEDACYNCGRGGHIAKDCKEPRKEREQCCYNCGKPGHLARDCDHADEQKCY 120
Query: 255 NVGKGKQLAPEERIHA 270
+ G+ LA E I A
Sbjct: 121 SCGEFGHLARECTIEA 136
Score = 51.6 bits (122), Expect = 2e-05
Identities = 26/66 (39%), Positives = 35/66 (52%), Gaps = 4/66 (6%)
Query: 212 CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRGESSG---NVGKGKQLAPEERI 268
C++C GHLA +C +E C+NC + GH ++DC R E N GK LA +
Sbjct: 54 CYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPRKEREQCCYNCGKPGHLA-RDCD 112
Query: 269 HAKEGK 274
HA E K
Sbjct: 113 HADEQK 118
>UniRef100_Q65XV7 Hypothetical protein P0016H04.11 [Oryza sativa]
Length = 951
Score = 57.0 bits (136), Expect = 5e-07
Identities = 28/62 (45%), Positives = 36/62 (57%), Gaps = 9/62 (14%)
Query: 199 GHFANKC---SVTGWRCFKCNREGHLAVNC---KTKESLCFNCNQPGHFSQDCMALRGES 252
GH+A C S G CFKC + GH + +C T S CF C QPGHF++DC G+S
Sbjct: 868 GHYARDCPGQSTGGLECFKCKQPGHFSRDCPVQSTGGSECFKCKQPGHFARDC---PGQS 924
Query: 253 SG 254
+G
Sbjct: 925 TG 926
Score = 50.1 bits (118), Expect = 6e-05
Identities = 24/63 (38%), Positives = 33/63 (52%), Gaps = 5/63 (7%)
Query: 186 MNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKTKES---LCFNCNQPGHFS 242
M T AGG G++ + C+KC + GH A +C + + CF C QPGHFS
Sbjct: 837 MQETSAGGS--SMGNYNSIAGNGSSECYKCKQPGHYARDCPGQSTGGLECFKCKQPGHFS 894
Query: 243 QDC 245
+DC
Sbjct: 895 RDC 897
>UniRef100_Q6BPQ4 Similar to CA2534|IPF12793 Candida albicans IPF12793 [Debaryomyces
hansenii]
Length = 172
Score = 57.0 bits (136), Expect = 5e-07
Identities = 21/36 (58%), Positives = 26/36 (71%)
Query: 212 CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMA 247
C+KC GHLA NC+ ++ LC+NC QPGH S DC A
Sbjct: 9 CYKCGEAGHLADNCQQEQRLCYNCRQPGHESNDCPA 44
Score = 46.2 bits (108), Expect = 0.001
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 196 RKFGHFANKC----SVTGWRCFKCNREGHLAVNCKTKE--SLCFNCNQPGHFSQDCMALR 249
R+ GH +N C T +C+ C GH+ +C T+ + C+NC Q GH S++C
Sbjct: 33 RQPGHESNDCPAPKQATQKQCYSCGDLGHVQSDCPTQSQGAKCYNCGQFGHISKNCTEAG 92
Query: 250 GESSGNVGKGKQLAP 264
ES+ K P
Sbjct: 93 NESAKKPASSKLQKP 107
Score = 45.4 bits (106), Expect = 0.002
Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 10/60 (16%)
Query: 192 GGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKTKES------LCFNCNQPGHFSQDC 245
GGP H+A C +C+ C + GH++ +C + C+NC + GH S++C
Sbjct: 115 GGP----NHYARDCQAGVVKCYACGKTGHISKDCNSASGGEFTSKTCYNCGKSGHISKEC 170
Score = 42.7 bits (99), Expect = 0.010
Identities = 23/90 (25%), Positives = 37/90 (40%), Gaps = 21/90 (23%)
Query: 197 KFGHFANKCSVTG----------------WRCFKCNREGHLAVNCKTKESLCFNCNQPGH 240
+FGH + C+ G C+KC H A +C+ C+ C + GH
Sbjct: 80 QFGHISKNCTEAGNESAKKPASSKLQKPATTCYKCGGPNHYARDCQAGVVKCYACGKTGH 139
Query: 241 FSQDCMALRG-----ESSGNVGKGKQLAPE 265
S+DC + G ++ N GK ++ E
Sbjct: 140 ISKDCNSASGGEFTSKTCYNCGKSGHISKE 169
Score = 39.3 bits (90), Expect = 0.11
Identities = 15/51 (29%), Positives = 24/51 (46%), Gaps = 4/51 (7%)
Query: 199 GHFANKCSVTGWRCFKCNREGHLAVNC----KTKESLCFNCNQPGHFSQDC 245
GH A+ C C+ C + GH + +C + + C++C GH DC
Sbjct: 16 GHLADNCQQEQRLCYNCRQPGHESNDCPAPKQATQKQCYSCGDLGHVQSDC 66
>UniRef100_O93085 Gag protein [Human immunodeficiency virus 1]
Length = 505
Score = 56.6 bits (135), Expect = 7e-07
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 3/87 (3%)
Query: 171 HHQRLMATAMYQQRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKT-KE 229
H R++A AM Q T + M + G+F N+ + +CF C +EGH+A NC+ ++
Sbjct: 358 HKARILAEAMSQVTNSAITNSATIMMQRGNFRNQRKIV--KCFNCGKEGHIAKNCRAPRK 415
Query: 230 SLCFNCNQPGHFSQDCMALRGESSGNV 256
C+ C + GH +DC + G +
Sbjct: 416 KGCWKCGKEGHLMKDCTERQANFLGKI 442
>UniRef100_Q7PNE6 ENSANGP00000011651 [Anopheles gambiae str. PEST]
Length = 153
Score = 56.6 bits (135), Expect = 7e-07
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 197 KFGHFANKCSVTGWRCFKCNREGHLAVNCKTK--ESLCFNCNQPGHFSQDC 245
+ GHFA C RC++CN GH+A +C +S C+NCNQ GH +++C
Sbjct: 51 QMGHFARDCKEDLDRCYRCNGSGHIARDCSLSPDDSCCYNCNQSGHLARNC 101
Score = 51.2 bits (121), Expect = 3e-05
Identities = 23/54 (42%), Positives = 31/54 (56%), Gaps = 7/54 (12%)
Query: 199 GHFANKCSVT--GWRCFKCNREGHLAVNCKTKES-----LCFNCNQPGHFSQDC 245
GH A CS++ C+ CN+ GHLA NC K C+NCN+ GH S++C
Sbjct: 73 GHIARDCSLSPDDSCCYNCNQSGHLARNCPEKSDRDMNVSCYNCNKSGHISRNC 126
Score = 51.2 bits (121), Expect = 3e-05
Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 5/57 (8%)
Query: 199 GHFANKCSVTGWR-----CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRG 250
GH A C R C+ CN+ GH++ NC + + C++C + GH S+DC +G
Sbjct: 95 GHLARNCPEKSDRDMNVSCYNCNKSGHISRNCPSGDKSCYSCGKIGHLSRDCTENKG 151
Score = 48.9 bits (115), Expect = 1e-04
Identities = 23/58 (39%), Positives = 30/58 (51%), Gaps = 10/58 (17%)
Query: 190 GAGGPM--RKFGHFANKCSVTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDC 245
G GGP R FG +C+KCN+ GH A +CK C+ CN GH ++DC
Sbjct: 30 GVGGPRDRRDFGRRE--------KCYKCNQMGHFARDCKEDLDRCYRCNGSGHIARDC 79
Score = 46.2 bits (108), Expect = 0.001
Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 24/79 (30%)
Query: 212 CFKCNREGHLAVNCKT-------------------KESLCFNCNQPGHFSQDC-----MA 247
CFKC+R GH A +C+ + C+ CNQ GHF++DC
Sbjct: 7 CFKCDRPGHYARDCQNVGGGGGRGVGGPRDRRDFGRREKCYKCNQMGHFARDCKEDLDRC 66
Query: 248 LRGESSGNVGKGKQLAPEE 266
R SG++ + L+P++
Sbjct: 67 YRCNGSGHIARDCSLSPDD 85
>UniRef100_Q9YNY7 Gag protein [Human immunodeficiency virus 1]
Length = 115
Score = 56.2 bits (134), Expect = 9e-07
Identities = 32/95 (33%), Positives = 48/95 (49%), Gaps = 8/95 (8%)
Query: 171 HHQRLMATAMYQQRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKT-KE 229
H R++A AM Q TGA M + G+F N+ +CF C +EGHLA NC+ ++
Sbjct: 27 HKARVLAEAMSQT-----TGAANIMMQRGNFRNQRKSV--KCFNCGKEGHLARNCRAPRK 79
Query: 230 SLCFNCNQPGHFSQDCMALRGESSGNVGKGKQLAP 264
C+ C + GH +DC + G+ G + P
Sbjct: 80 KGCWKCGKEGHQMKDCTERQANFLGSSGPSHKERP 114
>UniRef100_P89686 Gag polyprotein [Feline immunodeficiency virus]
Length = 498
Score = 56.2 bits (134), Expect = 9e-07
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 8/100 (8%)
Query: 171 HHQRLMATAMYQQRGMNRT--GAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKTK 228
+ ++A A+ QQ + + G G P N+ +CF C + GHLA NC+
Sbjct: 373 YKMNMLAQALQQQSQVCQVQQGRGKPQGN-----NRRPGQSLKCFNCGKPGHLARNCRAP 427
Query: 229 ESLCFNCNQPGHFSQDCMALRGESSGNVGKGKQLAPEERI 268
C C + GH + DC ++G+ GN KG+ AP +++
Sbjct: 428 RK-CNKCGKAGHIATDCWDMQGKQQGNWQKGRAAAPIKQV 466
>UniRef100_UPI000042EAEE UPI000042EAEE UniRef100 entry
Length = 1641
Score = 55.8 bits (133), Expect = 1e-06
Identities = 26/69 (37%), Positives = 33/69 (47%), Gaps = 8/69 (11%)
Query: 197 KFGHFANKCSVTGWR-----CFKCNREGHLAVNCKTK---ESLCFNCNQPGHFSQDCMAL 248
K GH A C TG+ CF+C + GH+A C CF C QPGHF+++C
Sbjct: 661 KTGHIARMCPDTGYSGSPNDCFRCQQPGHMARECPNTFGGGDACFKCGQPGHFARECPGA 720
Query: 249 RGESSGNVG 257
G G G
Sbjct: 721 YGGGGGGFG 729
Score = 37.7 bits (86), Expect = 0.33
Identities = 15/42 (35%), Positives = 21/42 (49%), Gaps = 5/42 (11%)
Query: 209 GWRCFKCNREGHLAVNC-----KTKESLCFNCNQPGHFSQDC 245
G C C + GH+A C + CF C QPGH +++C
Sbjct: 653 GGECHHCGKTGHIARMCPDTGYSGSPNDCFRCQQPGHMAREC 694
>UniRef100_Q6VG41 Gag protein [Chimpanzee immunodeficiency virus)]
Length = 510
Score = 55.8 bits (133), Expect = 1e-06
Identities = 32/100 (32%), Positives = 47/100 (47%), Gaps = 13/100 (13%)
Query: 171 HHQRLMATAM---YQQRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKT 227
H R++A AM MN A GP ++ G +CF C + GH+A NCK
Sbjct: 362 HKSRILAEAMAGAIANMPMNMVQARGPPQRKGQP---------KCFNCGKFGHMAKNCKA 412
Query: 228 KESL-CFNCNQPGHFSQDCMALRGESSGNVGKGKQLAPEE 266
+ C+NC QPGH ++DC ++ G G P++
Sbjct: 413 PQRRKCYNCGQPGHLAKDCPQPPKQNKGVNFLGNPFGPKK 452
>UniRef100_Q66QG0 Gag protein [Human immunodeficiency virus 1]
Length = 496
Score = 55.5 bits (132), Expect = 2e-06
Identities = 35/98 (35%), Positives = 49/98 (49%), Gaps = 21/98 (21%)
Query: 164 MAEATREHHQRLMA--TAMYQQRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHL 221
+AEA HQ L TA++ QRG N G GP++ CF C +EGHL
Sbjct: 362 LAEAMATAHQDLKGGYTAVFMQRGQN-PGRKGPIK---------------CFNCGKEGHL 405
Query: 222 AVNCKT-KESLCFNCNQPGHFSQDCMALRGESSGNVGK 258
A NC+ ++ C+ C Q GH +DC G+ + +GK
Sbjct: 406 ARNCRAPRKKGCWKCGQEGHQMKDCR--NGKQANFLGK 441
>UniRef100_Q9W6Q5 Cellular nucleic acid binding protein [Bufo arenarum]
Length = 178
Score = 55.5 bits (132), Expect = 2e-06
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 4/51 (7%)
Query: 199 GHFANKCSVTGWRCFKCNREGHLAVNC----KTKESLCFNCNQPGHFSQDC 245
GH A C + C+ C R GH+A +C K +E C+NC +PGH ++DC
Sbjct: 62 GHLAKDCDLQEDACYNCGRGGHIAKDCKEPRKEREQCCYNCGKPGHLARDC 112
Score = 51.6 bits (122), Expect = 2e-05
Identities = 26/66 (39%), Positives = 35/66 (52%), Gaps = 4/66 (6%)
Query: 212 CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRGESSG---NVGKGKQLAPEERI 268
C++C GHLA +C +E C+NC + GH ++DC R E N GK LA +
Sbjct: 55 CYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPRKEREQCCYNCGKPGHLA-RDCE 113
Query: 269 HAKEGK 274
HA E K
Sbjct: 114 HADEQK 119
Score = 51.2 bits (121), Expect = 3e-05
Identities = 20/50 (40%), Positives = 31/50 (62%), Gaps = 3/50 (6%)
Query: 197 KFGHFANKCSVTGWRCFKCNREGHLAVNC-KTKESLCFNCNQPGHFSQDC 245
+FGH C T +C++C GH+A+NC KT E C+ C + GH +++C
Sbjct: 125 EFGHIQKDC--TKVKCYRCGDTGHVAINCSKTSEVNCYRCGESGHLAREC 172
Score = 37.4 bits (85), Expect = 0.43
Identities = 21/75 (28%), Positives = 31/75 (41%), Gaps = 3/75 (4%)
Query: 197 KFGHFANKCS-VTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRGESSGN 255
K GH A C +C+ C GH+ +C + C+ C GH + +C +
Sbjct: 104 KPGHLARDCEHADEQKCYSCGEFGHIQKDCTKVK--CYRCGDTGHVAINCSKTSEVNCYR 161
Query: 256 VGKGKQLAPEERIHA 270
G+ LA E I A
Sbjct: 162 CGESGHLARECTIEA 176
>UniRef100_Q91594 Cellular nucleic acid binding protein [Xenopus laevis]
Length = 168
Score = 55.5 bits (132), Expect = 2e-06
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 4/51 (7%)
Query: 199 GHFANKCSVTGWRCFKCNREGHLAVNCKT----KESLCFNCNQPGHFSQDC 245
GH A C + C+ C R GH+A +CK +E C+NC +PGH ++DC
Sbjct: 52 GHLAKDCDLQEDACYNCGRGGHIAKDCKEPRKEREQCCYNCGKPGHLARDC 102
Score = 53.5 bits (127), Expect = 6e-06
Identities = 29/82 (35%), Positives = 40/82 (48%), Gaps = 7/82 (8%)
Query: 185 GMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFSQD 244
G R GG R G F++ + C++C GHLA +C +E C+NC + GH ++D
Sbjct: 22 GGGRGRGGGRGRGRGGFSSSRDI----CYRCGESGHLAKDCDLQEDACYNCGRGGHIAKD 77
Query: 245 CMALRGESSG---NVGKGKQLA 263
C R E N GK LA
Sbjct: 78 CKEPRKEREQCCYNCGKPGHLA 99
Score = 52.0 bits (123), Expect = 2e-05
Identities = 20/50 (40%), Positives = 31/50 (62%), Gaps = 3/50 (6%)
Query: 197 KFGHFANKCSVTGWRCFKCNREGHLAVNC-KTKESLCFNCNQPGHFSQDC 245
+FGH C T +C++C GH+A+NC KT E C+ C + GH +++C
Sbjct: 115 EFGHIQKDC--TKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 162
Score = 42.4 bits (98), Expect = 0.013
Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 20/80 (25%)
Query: 212 CFKCNREGHLAVNCKT-------------------KESLCFNCNQPGHFSQDCMALRGES 252
CFKC R GH A C T +C+ C + GH ++DC L+ ++
Sbjct: 6 CFKCGRTGHWARECPTGGGRGRGGGRGRGRGGFSSSRDICYRCGESGHLAKDC-DLQEDA 64
Query: 253 SGNVGKGKQLAPEERIHAKE 272
N G+G +A + + KE
Sbjct: 65 CYNCGRGGHIAKDCKEPRKE 84
Score = 39.3 bits (90), Expect = 0.11
Identities = 22/75 (29%), Positives = 32/75 (42%), Gaps = 3/75 (4%)
Query: 197 KFGHFANKCS-VTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRGESSGN 255
K GH A C RC+ C GH+ +C + C+ C + GH + +C +
Sbjct: 94 KPGHLARDCDHADEHRCYSCGEFGHIQKDCTKVK--CYRCGETGHVAINCSKTSEVNCYR 151
Query: 256 VGKGKQLAPEERIHA 270
G+ LA E I A
Sbjct: 152 CGESGHLARECTIEA 166
>UniRef100_P70000 Cellular nucleic acid binding protein [Xenopus laevis]
Length = 178
Score = 55.5 bits (132), Expect = 2e-06
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 4/51 (7%)
Query: 199 GHFANKCSVTGWRCFKCNREGHLAVNC----KTKESLCFNCNQPGHFSQDC 245
GH A C + C+ C R GH+A +C K +E C+NC +PGH ++DC
Sbjct: 62 GHLAKDCDLQEDACYNCGRGGHIAKDCKEPRKEREQCCYNCGKPGHLARDC 112
Score = 51.6 bits (122), Expect = 2e-05
Identities = 26/66 (39%), Positives = 35/66 (52%), Gaps = 4/66 (6%)
Query: 212 CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRGESSG---NVGKGKQLAPEERI 268
C++C GHLA +C +E C+NC + GH ++DC R E N GK LA +
Sbjct: 55 CYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPRKEREQCCYNCGKPGHLA-RDCE 113
Query: 269 HAKEGK 274
HA E K
Sbjct: 114 HADEQK 119
Score = 51.2 bits (121), Expect = 3e-05
Identities = 20/50 (40%), Positives = 31/50 (62%), Gaps = 3/50 (6%)
Query: 197 KFGHFANKCSVTGWRCFKCNREGHLAVNC-KTKESLCFNCNQPGHFSQDC 245
+FGH C T +C++C GH+A+NC KT E C+ C + GH +++C
Sbjct: 125 EFGHIQKDC--TKVKCYRCGDTGHVAINCSKTSEVNCYRCGESGHLAREC 172
Score = 37.4 bits (85), Expect = 0.43
Identities = 21/75 (28%), Positives = 31/75 (41%), Gaps = 3/75 (4%)
Query: 197 KFGHFANKCS-VTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRGESSGN 255
K GH A C +C+ C GH+ +C + C+ C GH + +C +
Sbjct: 104 KPGHLARDCEHADEQKCYSCGEFGHIQKDCTKVK--CYRCGDTGHVAINCSKTSEVNCYR 161
Query: 256 VGKGKQLAPEERIHA 270
G+ LA E I A
Sbjct: 162 CGESGHLARECTIEA 176
>UniRef100_Q7JQ89 CnjB protein [Tetrahymena thermophila]
Length = 1748
Score = 55.5 bits (132), Expect = 2e-06
Identities = 28/73 (38%), Positives = 39/73 (53%), Gaps = 10/73 (13%)
Query: 183 QRGMNRTGAGGPMRKFGHFANKCSVTGWR-------CFKCNREGHLAVNC---KTKESLC 232
Q G N+ K GH A C+ + CFKCN+EGH++ +C + K+S C
Sbjct: 1442 QNGGNKGKGCFKCGKVGHMAKDCTEPQQQGRKQSGACFKCNQEGHMSKDCPNQQQKKSGC 1501
Query: 233 FNCNQPGHFSQDC 245
F C + GHFS+DC
Sbjct: 1502 FKCGEEGHFSKDC 1514
Score = 52.4 bits (124), Expect = 1e-05
Identities = 27/73 (36%), Positives = 38/73 (51%), Gaps = 9/73 (12%)
Query: 182 QQRGMNRTGAGGPMRKFGHFANKCSVTGWR-----CFKCNREGHLAVNCKTKESL----C 232
QQ+ R GA + GH + C + CFKC +EGH++ +C ++ C
Sbjct: 1520 QQQQKPRGGACFKCGEEGHISKDCPNPQKQQQKNTCFKCKQEGHISKDCPNSQNSGGNKC 1579
Query: 233 FNCNQPGHFSQDC 245
FNCNQ GH S+DC
Sbjct: 1580 FNCNQEGHMSKDC 1592
Score = 50.1 bits (118), Expect = 6e-05
Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 7/60 (11%)
Query: 199 GHFANKCSVT----GWRCFKCNREGHLAVNCKT---KESLCFNCNQPGHFSQDCMALRGE 251
GH + C + G +CF CN+EGH++ +C K+ CFNC + GH S++C R E
Sbjct: 1562 GHISKDCPNSQNSGGNKCFNCNQEGHMSKDCPNPSQKKKGCFNCGEEGHQSRECTKERKE 1621
Score = 48.1 bits (113), Expect = 2e-04
Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 7/53 (13%)
Query: 209 GWRCFKCNREGHLAVNC-------KTKESLCFNCNQPGHFSQDCMALRGESSG 254
G CFKC + GH+A +C + + CF CNQ GH S+DC + + SG
Sbjct: 1448 GKGCFKCGKVGHMAKDCTEPQQQGRKQSGACFKCNQEGHMSKDCPNQQQKKSG 1500
>UniRef100_Q6FNZ3 Similar to sp|P53849 Saccharomyces cerevisiae YNL255c GIS2 [Candida
glabrata]
Length = 155
Score = 55.5 bits (132), Expect = 2e-06
Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 6/59 (10%)
Query: 192 GGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKT--KESLCFNCNQPGHFSQDCMAL 248
GGP H A C T +C+ C R GH++ +C E +C+NCN+ GH S+DC L
Sbjct: 101 GGP----NHVARDCMQTDTKCYSCGRFGHVSRDCPNGPNEKVCYNCNETGHISRDCPVL 155
Score = 47.8 bits (112), Expect = 3e-04
Identities = 18/34 (52%), Positives = 24/34 (69%), Gaps = 1/34 (2%)
Query: 212 CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDC 245
C+ C + GHLA +C + E LC+NCNQPGH +C
Sbjct: 6 CYVCGKIGHLADDCDS-ERLCYNCNQPGHVQSEC 38
Score = 46.6 bits (109), Expect = 7e-04
Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 5/75 (6%)
Query: 197 KFGHFANKCSVTGWRCFKCNREGHLAVNCKTKESL----CFNCNQPGHFSQDCMALRGES 252
K GH A+ C + C+ CN+ GH+ C ++ C+NC + GH +C R +
Sbjct: 11 KIGHLADDCD-SERLCYNCNQPGHVQSECTMPRTVEHKQCYNCGETGHVKSECSIQRCFN 69
Query: 253 SGNVGKGKQLAPEER 267
G + PE R
Sbjct: 70 CNQTGHVSRECPEPR 84
Score = 46.2 bits (108), Expect = 0.001
Identities = 20/58 (34%), Positives = 30/58 (51%), Gaps = 12/58 (20%)
Query: 199 GHFANKCSVTGWRCFKCNREGHLAVNCK----------TKESLCFNCNQPGHFSQDCM 246
GH ++CS+ RCF CN+ GH++ C +K C+ C P H ++DCM
Sbjct: 56 GHVKSECSIQ--RCFNCNQTGHVSRECPEPRKGRFGAASKNVSCYKCGGPNHVARDCM 111
Score = 45.8 bits (107), Expect = 0.001
Identities = 24/66 (36%), Positives = 34/66 (51%), Gaps = 10/66 (15%)
Query: 199 GHFANKCS----VTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALR----G 250
GH ++C+ V +C+ C GH+ C + CFNCNQ GH S++C R G
Sbjct: 32 GHVQSECTMPRTVEHKQCYNCGETGHVKSECSIQR--CFNCNQTGHVSRECPEPRKGRFG 89
Query: 251 ESSGNV 256
+S NV
Sbjct: 90 AASKNV 95
Score = 40.4 bits (93), Expect = 0.050
Identities = 13/34 (38%), Positives = 22/34 (64%)
Query: 212 CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDC 245
C+KC H+A +C ++ C++C + GH S+DC
Sbjct: 97 CYKCGGPNHVARDCMQTDTKCYSCGRFGHVSRDC 130
>UniRef100_Q6XD84 SR RNA-binding protein [Trypanosoma cruzi]
Length = 413
Score = 55.1 bits (131), Expect = 2e-06
Identities = 20/35 (57%), Positives = 26/35 (74%)
Query: 211 RCFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDC 245
RCFKC+REGH+A+ C+ E C NC + GH S+DC
Sbjct: 272 RCFKCDREGHVALQCRAVEPYCRNCGRNGHLSRDC 306
>UniRef100_Q97594 P24/NCp7 [Human immunodeficiency virus 1]
Length = 77
Score = 54.7 bits (130), Expect = 3e-06
Identities = 31/77 (40%), Positives = 44/77 (56%), Gaps = 10/77 (12%)
Query: 171 HHQRLMATAMYQQRGMNRTGAGGPMRKFGHFAN-KCSVTGWRCFKCNREGHLAVNCKT-K 228
H R++A AM Q TG+ M + G+F N K SV +CF C REGH+A NC+ +
Sbjct: 9 HKARVLAEAMSQA-----TGSANIMMQRGNFRNQKKSV---KCFNCGREGHIAKNCRAPR 60
Query: 229 ESLCFNCNQPGHFSQDC 245
+ C+ C + GH +DC
Sbjct: 61 KKGCWKCGKEGHXMKDC 77
>UniRef100_Q8AC43 Gag polyprotein [Human immunodeficiency virus 1]
Length = 498
Score = 54.7 bits (130), Expect = 3e-06
Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 171 HHQRLMATAMYQQRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKT-KE 229
H R++A AM Q TG G M + G+F + + +CF C +EGHLA NCK ++
Sbjct: 356 HKARVLAEAMSQVT----TGNAGVMMQRGNFKGQRRII--KCFNCGKEGHLARNCKAPRK 409
Query: 230 SLCFNCNQPGHFSQDCMALRGESSGNV 256
C+ C + GH +DC + G +
Sbjct: 410 KGCWKCGREGHQMKDCTERQANFLGKI 436
>UniRef100_Q6YAI6 Gag-pol fusion polyprotein [Human immunodeficiency virus 1]
Length = 575
Score = 54.7 bits (130), Expect = 3e-06
Identities = 33/102 (32%), Positives = 51/102 (49%), Gaps = 13/102 (12%)
Query: 171 HHQRLMATAMYQQRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKT-KE 229
H R++A AM Q TG+ M + G+F N+ +CF C +EGH+A NCK ++
Sbjct: 5 HKARVLAEAMSQV-----TGSATIMMQRGNFRNQRKTV--KCFNCGKEGHIARNCKAPRK 57
Query: 230 SLCFNCNQPGHFSQDCMA-----LRGESSGNVGKGKQLAPEE 266
C+ C Q GH +DC R + G+ + +PE+
Sbjct: 58 KGCWKCGQEGHQMKDCTGRQANFFRENLAFPQGEARXFSPEQ 99
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.324 0.136 0.427
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 445,795,608
Number of Sequences: 2790947
Number of extensions: 16931749
Number of successful extensions: 60598
Number of sequences better than 10.0: 3431
Number of HSP's better than 10.0 without gapping: 193
Number of HSP's successfully gapped in prelim test: 3282
Number of HSP's that attempted gapping in prelim test: 51609
Number of HSP's gapped (non-prelim): 8721
length of query: 275
length of database: 848,049,833
effective HSP length: 125
effective length of query: 150
effective length of database: 499,181,458
effective search space: 74877218700
effective search space used: 74877218700
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 74 (33.1 bits)
Lotus: description of TM0103.5