
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC134822.5 - phase: 0
(195 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from tran... 102 4e-22
COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contain... 50 4e-06
Z624_HUMAN (Q9P2J8) Zinc finger protein 624 37 0.022
GRP2_NICSY (P27484) Glycine-rich protein 2 35 0.14
GR2B_ARATH (Q38896) Glycine-rich protein 2b (AtGRP2b) 35 0.14
GAG_MMTVC (P11284) Gag polyprotein [Contains: Protein p10; Phosp... 34 0.25
GAG_SIVAI (Q02843) Gag polyprotein [Contains: Core protein p17; ... 33 0.42
Y328_MYCGE (Q49419) Hypothetical protein MG328 33 0.55
YO61_CAEEL (P34600) Hypothetical protein ZK1098.1 in chromosome III 32 0.94
RK2_PSINU (Q8WHY1) Chloroplast 50S ribosomal protein L2 32 1.2
HD_MOUSE (P42859) Huntingtin (Huntington's disease protein homol... 32 1.2
YVCE_BACSU (P40767) Hypothetical protein yvcE (PSPA2) 31 1.6
GAG_SRV2 (P51516) Gag polyprotein (Core polyprotein) [Contains: ... 31 1.6
XPC_MOUSE (P51612) DNA-repair protein complementing XP-C cells h... 31 2.1
RL2_HELPY (P56030) 50S ribosomal protein L2 31 2.1
RL2_HELPJ (Q9ZJR6) 50S ribosomal protein L2 31 2.1
P3K2_DICDI (P54674) Phosphatidylinositol 3-kinase 2 (EC 2.7.1.13... 31 2.1
MEDB_GIALA (Q08014) Median body protein 31 2.1
LSM4_CAEEL (Q19952) Probable U6 snRNA-associated Sm-like protein... 31 2.1
LON_MYCGE (P47481) ATP-dependent protease La (EC 3.4.21.53) 31 2.1
>POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from
transposon TNT 1-94 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1328
Score = 102 bits (255), Expect = 4e-22
Identities = 68/201 (33%), Positives = 106/201 (51%), Gaps = 22/201 (10%)
Query: 7 YKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCS 66
Y +K+L ++ LK+QLY+ M E + L FN ++ LAN+ V +E+EDKA+LLL S
Sbjct: 92 YMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNS 151
Query: 67 LPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNG---- 122
LP S+++ TIL+GK T L+++ SAL L K + EN ++ G G
Sbjct: 152 LPSSYDNLATTILHGKT-TIELKDVTSAL----LLNEKMRKKPENQGQALITEGRGRSYQ 206
Query: 123 ------GGRGNRGSSKSGNKERYK-CFKCHKFGHFKRDFSEDNENFAQVVSEEYEDAGAL 175
G G RG SK+ +K R + C+ C++ GHFKRD + + ++ +D A
Sbjct: 207 RSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAA 266
Query: 176 VVS------CWEDDEGEVSHL 190
+V + ++E E HL
Sbjct: 267 MVQNNDNVVLFINEEEECMHL 287
>COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contains:
Copia VLP protein; Copia protease (EC 3.4.23.-)]
Length = 1409
Score = 49.7 bits (117), Expect = 4e-06
Identities = 39/159 (24%), Positives = 71/159 (44%), Gaps = 14/159 (8%)
Query: 7 YKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCS 66
Y+ KSLA + L+++L S K+ S+ F++++ +L +E+ DK LL +
Sbjct: 88 YERKSLASQLALRKRLLSLKLSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLIT 147
Query: 67 LPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLC---VSRGNGG 123
LP ++ I E TL +++ L +++ D N+ S+ + V N
Sbjct: 148 LPSCYDGIITAIETLSEENLTLAFVKNRLLDQEIKIKND--HNDTSKKVMNAIVHNNNNT 205
Query: 124 GRGNRGSS---------KSGNKERYKCFKCHKFGHFKRD 153
+ N + K +K + KC C + GH K+D
Sbjct: 206 YKNNLFKNRVTKPKKIFKGNSKYKVKCHHCGREGHIKKD 244
>Z624_HUMAN (Q9P2J8) Zinc finger protein 624
Length = 739
Score = 37.4 bits (85), Expect = 0.022
Identities = 40/160 (25%), Positives = 75/160 (46%), Gaps = 11/160 (6%)
Query: 2 KVLQKYKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDE-DKA 60
K K T++ A + L Q+ K+ E+ ++ K D + ++ N E+ +
Sbjct: 4 KPATKNATRTKAISEDLSQEAILEKLTENGLWDSRMEGLWKWNDRILRLQNNQENHLSQR 63
Query: 61 LLLLCSLPKSFEHFK-DTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSE-GLCVS 118
++ L P S F+ ++IL + G T EE+ S +T++ +++L ++ G +
Sbjct: 64 IIPLKKTPTSQRGFRFESILIPEPGIAT-EELHSRCQTQEENFTENLNLITDTHLGKIIC 122
Query: 119 RGNGGGRGNRGSS------KSGNKER-YKCFKCHKFGHFK 151
+ G + R +S KS NKE+ YKC C K H++
Sbjct: 123 KEMKGSKAIRQTSELTLGKKSNNKEKPYKCSTCEKAFHYR 162
>GRP2_NICSY (P27484) Glycine-rich protein 2
Length = 214
Score = 34.7 bits (78), Expect = 0.14
Identities = 17/37 (45%), Positives = 19/37 (50%), Gaps = 5/37 (13%)
Query: 120 GNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSE 156
G GGG G GS CFKC + GHF RD S+
Sbjct: 143 GGGGGYGGGGSGGGSG-----CFKCGESGHFARDCSQ 174
Score = 31.6 bits (70), Expect = 1.2
Identities = 14/36 (38%), Positives = 18/36 (49%)
Query: 118 SRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
S+ GGG G R G C+KC + GHF R+
Sbjct: 173 SQSGGGGGGGRFGGGGGGGGGGGCYKCGEDGHFARE 208
>GR2B_ARATH (Q38896) Glycine-rich protein 2b (AtGRP2b)
Length = 201
Score = 34.7 bits (78), Expect = 0.14
Identities = 15/36 (41%), Positives = 17/36 (46%)
Query: 118 SRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
S G GGGR G G C+ C + GHF RD
Sbjct: 159 SGGGGGGRYGSGGGGGGGGGGLSCYSCGESGHFARD 194
Score = 33.1 bits (74), Expect = 0.42
Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 1/42 (2%)
Query: 122 GGGRGNRGSS-KSGNKERYKCFKCHKFGHFKRDFSEDNENFA 162
GGG G RGS + G CFKC + GH R+ S+ ++
Sbjct: 118 GGGYGGRGSGGRGGGGGDNSCFKCGEPGHMARECSQGGGGYS 159
>GAG_MMTVC (P11284) Gag polyprotein [Contains: Protein p10;
Phosphorylated protein pp21; Protein p3; Protein p8;
Major core protein p27; Nucleic acid binding protein
p14]
Length = 591
Score = 33.9 bits (76), Expect = 0.25
Identities = 17/41 (41%), Positives = 21/41 (50%), Gaps = 6/41 (14%)
Query: 117 VSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSED 157
V + GGG+G +GS CF C K GH KRD E+
Sbjct: 509 VKQTYGGGKGGQGSKGP------VCFSCGKTGHIKRDCKEE 543
>GAG_SIVAI (Q02843) Gag polyprotein [Contains: Core protein p17;
Core protein p24; Core protein p15]
Length = 513
Score = 33.1 bits (74), Expect = 0.42
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 121 NGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
NG G K G + KCF C KFGH +R+
Sbjct: 372 NGQNMVQVGPQKKGPRGPLKCFNCGKFGHMQRE 404
Score = 30.0 bits (66), Expect = 3.6
Identities = 11/22 (50%), Positives = 12/22 (54%)
Query: 140 KCFKCHKFGHFKRDFSEDNENF 161
KCFKC K GH +D NF
Sbjct: 412 KCFKCGKIGHMAKDCKNGQANF 433
>Y328_MYCGE (Q49419) Hypothetical protein MG328
Length = 756
Score = 32.7 bits (73), Expect = 0.55
Identities = 30/160 (18%), Positives = 67/160 (41%), Gaps = 30/160 (18%)
Query: 28 LESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTT 87
++++ + +++AE + ++ +N+ N L +L + + ++ ++ T
Sbjct: 493 IQNQQLQDKIAELEEWNEEKSNLNTNQ---------LVNLQQQLKDSQNLFNVAQDKLAT 543
Query: 88 LEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKF 147
LEE+ AL K +L +ENS L + + E + +
Sbjct: 544 LEEVNLALNEKINDLEDELSGSENSNNLL-------------AKLQADHEILQ----ESY 586
Query: 148 GHFKRDFSEDNENFAQVVSEEYEDAGALVVSCWEDDEGEV 187
G K DF + +N +E+Y+D ++S +E+ E+
Sbjct: 587 GKLKTDFEKLKKNKLNDANEQYQD----LLSAFEETNSEL 622
>YO61_CAEEL (P34600) Hypothetical protein ZK1098.1 in chromosome III
Length = 724
Score = 32.0 bits (71), Expect = 0.94
Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 8/98 (8%)
Query: 19 KQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCSLPK---SFEHFK 75
K+ L + LES+ L E N D + ++DKA L + + + +F H+K
Sbjct: 572 KESLRRKRRLESE-FRNLLKEHNVDKDSEWTVIKPKIEKDKAYLAMENDDERETAFNHYK 630
Query: 76 DTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSE 113
+ G GTT EI + KK K K+ R++ NSE
Sbjct: 631 N----GTSGTTAGSEILEKKKKKKDKKKKNKRSDNNSE 664
>RK2_PSINU (Q8WHY1) Chloroplast 50S ribosomal protein L2
Length = 277
Score = 31.6 bits (70), Expect = 1.2
Identities = 15/39 (38%), Positives = 21/39 (53%)
Query: 86 TTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGG 124
+T EEI K+LT +K + N+ G+ SR GGG
Sbjct: 19 STFEEISKYKPKKRLTLNKHSKKGRNNRGIITSRHRGGG 57
>HD_MOUSE (P42859) Huntingtin (Huntington's disease protein homolog)
(HD protein)
Length = 3119
Score = 31.6 bits (70), Expect = 1.2
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 8/74 (10%)
Query: 40 FNKILDDLANIEVNMEDEDKALLLL-------CSLPKSFEHFKDTILYGKEGTTTLEEIQ 92
+N +L+ L + V ME+E LL+L C +P + KDT L G G T +E++
Sbjct: 267 YNWLLNVLLGLLVPMEEEHSTLLILGVLLTLRCLVPLLQQQVKDTSLKGSFGVTR-KEME 325
Query: 93 SALRTKKLTKSKDL 106
+ T++L + +L
Sbjct: 326 VSPSTEQLVQVYEL 339
>YVCE_BACSU (P40767) Hypothetical protein yvcE (PSPA2)
Length = 473
Score = 31.2 bits (69), Expect = 1.6
Identities = 29/137 (21%), Positives = 57/137 (41%), Gaps = 5/137 (3%)
Query: 1 MKVLQKYKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIE---VNMEDE 57
++ +QK K L + L + + + +K+ISE +E +++ + AN E ++ E
Sbjct: 199 LETMQKDLDKQLNEKDKLFDEAKASQKKTAKAISELKSEASELANQKANTEAEQARIKKE 258
Query: 58 DKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCV 117
+A L + + D T T E +++ S D +N +S
Sbjct: 259 QEAAAALIKKQEEAQKASDETQTDDSQTATTESSKASSSDDSSDNSSDNSSNGSSNS--S 316
Query: 118 SRGNGGGRGNRGSSKSG 134
S G+ + + +S SG
Sbjct: 317 SNGSSSKKSSGSNSNSG 333
>GAG_SRV2 (P51516) Gag polyprotein (Core polyprotein) [Contains:
Core protein p10; Core phosphoprotein p18; Core protein
p12; Core protein p27; Core protein p14; Core protein
p4]
Length = 654
Score = 31.2 bits (69), Expect = 1.6
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 135 NKERYKCFKCHKFGHFKRD 153
NK+R CFKC K GHF +D
Sbjct: 540 NKDRGGCFKCGKKGHFAKD 558
>XPC_MOUSE (P51612) DNA-repair protein complementing XP-C cells
homolog (Xeroderma pigmentosum group C complementing
protein homolog) (p125)
Length = 930
Score = 30.8 bits (68), Expect = 2.1
Identities = 21/78 (26%), Positives = 34/78 (42%), Gaps = 4/78 (5%)
Query: 63 LLCSLPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNG 122
L ++ K + K+T + G G++ L K T + SEG RG
Sbjct: 320 LKSAVTKGRKSSKETSVEGPGGSSELSSNSPESHNKPTTSRRIKEEETLSEG----RGKA 375
Query: 123 GGRGNRGSSKSGNKERYK 140
RG RG+ +G+++R K
Sbjct: 376 TARGKRGTGTAGSRQRRK 393
>RL2_HELPY (P56030) 50S ribosomal protein L2
Length = 276
Score = 30.8 bits (68), Expect = 2.1
Identities = 26/83 (31%), Positives = 37/83 (44%), Gaps = 2/83 (2%)
Query: 96 RTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFS 155
R + SKD+ A + +GL GR N G S +KER K ++ FKR+
Sbjct: 14 RFMSVLDSKDITAKSSVKGLLTKLKATAGRNNNGRITSRHKER-GAKKLYRIIDFKRNKY 72
Query: 156 EDNENFAQVVSEEYEDAG-ALVV 177
A + + Y +A ALVV
Sbjct: 73 NIEGKVAAIEYDPYRNARIALVV 95
>RL2_HELPJ (Q9ZJR6) 50S ribosomal protein L2
Length = 276
Score = 30.8 bits (68), Expect = 2.1
Identities = 26/83 (31%), Positives = 37/83 (44%), Gaps = 2/83 (2%)
Query: 96 RTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFS 155
R + SKD+ A + +GL GR N G S +KER K ++ FKR+
Sbjct: 14 RFMSVLDSKDITAKSSVKGLLTKLKATAGRNNNGRITSRHKER-GAKKLYRIIDFKRNKY 72
Query: 156 EDNENFAQVVSEEYEDAG-ALVV 177
A + + Y +A ALVV
Sbjct: 73 NIEGKVAAIEYDPYRNARIALVV 95
>P3K2_DICDI (P54674) Phosphatidylinositol 3-kinase 2 (EC 2.7.1.137)
(PI3-kinase) (PtdIns-3-kinase) (PI3K)
Length = 1858
Score = 30.8 bits (68), Expect = 2.1
Identities = 26/129 (20%), Positives = 56/129 (43%), Gaps = 12/129 (9%)
Query: 7 YKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCS 66
Y + + Q ++ L + +L +SE+++ +KIL+ + +D D+ L + S
Sbjct: 917 YIRQCVGKNQTVELSLTNNSILSLNQVSEKVSFIDKILETS-----DFDDYDEDLDSINS 971
Query: 67 LPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRG 126
SF+ K +I ++ ++IQ+ + K+ K EN + + N
Sbjct: 972 --NSFDDLKQSIQQQQQ-----QQIQTVINIKETNKENKDSNKENKDSSSNNNNNNNNNN 1024
Query: 127 NRGSSKSGN 135
N ++ + N
Sbjct: 1025 NNNNNNNNN 1033
>MEDB_GIALA (Q08014) Median body protein
Length = 857
Score = 30.8 bits (68), Expect = 2.1
Identities = 20/81 (24%), Positives = 39/81 (47%)
Query: 31 KSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEE 90
K +S++L EF I+DD +++ +++ D + L + ++ +D + K T+T
Sbjct: 740 KEMSDKLIEFEMIMDDNRRLKLQVKELDLKTANMEKLYEEYKKLEDQLKATKAMTSTGMG 799
Query: 91 IQSALRTKKLTKSKDLRANEN 111
+ +A TKS L N
Sbjct: 800 VSAASPAFYKTKSMRLTQQNN 820
>LSM4_CAEEL (Q19952) Probable U6 snRNA-associated Sm-like protein
LSm4
Length = 123
Score = 30.8 bits (68), Expect = 2.1
Identities = 20/65 (30%), Positives = 28/65 (42%), Gaps = 3/65 (4%)
Query: 74 FKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKS 133
FK + Y + T I + T+ ++R + E SRG GGGRG RG +
Sbjct: 55 FKMSEAYVRGSTIKYLRIPETVVDLVKTEVNEVRRQQQREQ---SRGRGGGRGGRGGHRG 111
Query: 134 GNKER 138
G R
Sbjct: 112 GGGNR 116
>LON_MYCGE (P47481) ATP-dependent protease La (EC 3.4.21.53)
Length = 795
Score = 30.8 bits (68), Expect = 2.1
Identities = 21/92 (22%), Positives = 43/92 (45%)
Query: 14 HRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCSLPKSFEH 73
+ QL +QQ + + + I E++ +K D++A+I +++ + + EH
Sbjct: 241 NEQLSRQQRDFYLREKLRIIREEIGISSKKEDEVASIRKKLDENPYPEAIKKRILSELEH 300
Query: 74 FKDTILYGKEGTTTLEEIQSALRTKKLTKSKD 105
++++ +E T T I + L KSKD
Sbjct: 301 YENSSSSSQESTLTKTYIDTLLNLPWWQKSKD 332
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.313 0.132 0.368
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,083,768
Number of Sequences: 164201
Number of extensions: 995890
Number of successful extensions: 3541
Number of sequences better than 10.0: 57
Number of HSP's better than 10.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 42
Number of HSP's that attempted gapping in prelim test: 3493
Number of HSP's gapped (non-prelim): 85
length of query: 195
length of database: 59,974,054
effective HSP length: 104
effective length of query: 91
effective length of database: 42,897,150
effective search space: 3903640650
effective search space used: 3903640650
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 63 (28.9 bits)
Medicago: description of AC134822.5