
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC122169.4 - phase: 0
(165 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_O80669 Expressed protein [Arabidopsis thaliana] 75 8e-13
UniRef100_Q9LDH1 SigA binding protein [Arabidopsis thaliana] 67 2e-10
UniRef100_Q8S202 Hypothetical protein P0468B07.12 [Oryza sativa] 62 4e-09
UniRef100_Q75KH8 Hypothetical protein OJ1057_G07.4 [Oryza sativa] 61 1e-08
UniRef100_Q8LIG1 Hypothetical protein OJ1200_C08.126 [Oryza sativa] 45 7e-04
UniRef100_Q7XPJ9 OSJNBa0087O24.11 protein [Oryza sativa] 40 0.017
UniRef100_Q9SHH3 F20D23.16 protein [Arabidopsis thaliana] 39 0.064
UniRef100_O48522 Hypothetical protein At2g42140 [Arabidopsis tha... 37 0.14
UniRef100_Q7RFV7 Hypothetical protein [Plasmodium yoelii yoelii] 37 0.24
UniRef100_UPI00003C2631 UPI00003C2631 UniRef100 entry 36 0.32
UniRef100_Q8MMZ9 Hypothetical protein [Dictyostelium discoideum] 36 0.42
UniRef100_Q98R04 LIPOPROTEIN [Mycoplasma pulmonis] 35 0.71
UniRef100_UPI00003A95FA UPI00003A95FA UniRef100 entry 35 0.93
UniRef100_Q5ZK36 Hypothetical protein [Gallus gallus] 35 0.93
UniRef100_Q6DK77 Serine/arginine repetitive matrix 1 [Xenopus tr... 35 0.93
UniRef100_Q63ZQ5 Hypothetical protein [Xenopus laevis] 35 0.93
UniRef100_Q9M2P8 Hypothetical protein T10K17.210 [Arabidopsis th... 35 0.93
UniRef100_Q9M035 Hypothetical protein T10O8_80 [Arabidopsis thal... 34 1.2
UniRef100_UPI00001D1239 UPI00001D1239 UniRef100 entry 34 1.6
UniRef100_O31655 YkrI protein [Bacillus subtilis] 34 1.6
>UniRef100_O80669 Expressed protein [Arabidopsis thaliana]
Length = 141
Score = 74.7 bits (182), Expect = 8e-13
Identities = 46/104 (44%), Positives = 65/104 (62%), Gaps = 8/104 (7%)
Query: 10 SSSVISTVQQKTPTKLTK-PKKKKNNNNTYNKPIKVVYISNPMKVKTSASEFRALVQELT 68
SSS + Q+K+ + T+ P K+K + T +KPIKV YISNPM+V+T S+FR LVQELT
Sbjct: 4 SSSTLLINQRKSSSSPTRIPPKQKRKSTTTHKPIKVRYISNPMRVETCPSKFRELVQELT 63
Query: 69 GQ-YAESPPNPSRFQEFDVNDSGTDQGGCENMMDCDKSDQTVVG 111
GQ A+ PP+P+ F D++ CE+ M+ + D V G
Sbjct: 64 GQDAADLPPSPTTFTAVDLHRP------CESEMNLEPLDGEVRG 101
>UniRef100_Q9LDH1 SigA binding protein [Arabidopsis thaliana]
Length = 151
Score = 67.0 bits (162), Expect = 2e-10
Identities = 41/76 (53%), Positives = 52/76 (67%), Gaps = 3/76 (3%)
Query: 5 STSSTSSSVISTVQQK-TPTKLTKPKKKKNNNNTYNKPIKVVYISNPMKVKTSASEFRAL 63
S+SST + S ++K +P PK+KK +T NKPIKV YISNPM+V+T AS+FR L
Sbjct: 3 SSSSTFLTTTSLDKKKPSPVSRKSPKQKKKTTST-NKPIKVRYISNPMRVQTCASKFREL 61
Query: 64 VQELTGQYA-ESPPNP 78
VQELTGQ A + P P
Sbjct: 62 VQELTGQDAVDLQPEP 77
>UniRef100_Q8S202 Hypothetical protein P0468B07.12 [Oryza sativa]
Length = 184
Score = 62.4 bits (150), Expect = 4e-09
Identities = 36/58 (62%), Positives = 41/58 (70%), Gaps = 1/58 (1%)
Query: 40 KPIKVVYISNPMKVKTSASEFRALVQELTGQYAESPPNPSRFQEFDVNDSGTDQGGCE 97
KPIKVVYISNPM+VKTSA+ FRALVQELTG+ A+ P S D +D G GG E
Sbjct: 37 KPIKVVYISNPMRVKTSAAGFRALVQELTGRNAD-PSKYSPRASADDDDGGGGGGGGE 93
>UniRef100_Q75KH8 Hypothetical protein OJ1057_G07.4 [Oryza sativa]
Length = 216
Score = 60.8 bits (146), Expect = 1e-08
Identities = 45/129 (34%), Positives = 68/129 (51%), Gaps = 17/129 (13%)
Query: 40 KPIKVVYISNPMKVKTSASEFRALVQELTGQYAESPPNPSRFQEFDVNDSGTDQGGCENM 99
KPIKVVYISNPM+V+TSA+ FRALVQELTG+ A+ PS++ +G D GG +
Sbjct: 89 KPIKVVYISNPMRVRTSAAGFRALVQELTGRNAD----PSKYS--PRASAGDDGGGATAL 142
Query: 100 MD---CDKSDQTVVGV-----PSLVDPDDKGKPSEAGSSNESFDEDVLLMPEMMDNIWDL 151
D +D G P+ D+G G ++ D+D + +++D + +
Sbjct: 143 PDTGAASDADALEAGAAPGRHPAETATFDEGGGGGGGGYDD--DDDDVFRSQLLDTSYSV 200
Query: 152 L-PTSAFYE 159
P + Y+
Sbjct: 201 FSPPTLLYD 209
>UniRef100_Q8LIG1 Hypothetical protein OJ1200_C08.126 [Oryza sativa]
Length = 137
Score = 45.1 bits (105), Expect = 7e-04
Identities = 22/56 (39%), Positives = 34/56 (60%)
Query: 17 VQQKTPTKLTKPKKKKNNNNTYNKPIKVVYISNPMKVKTSASEFRALVQELTGQYA 72
+ + P K K +K ++VVYI++PMK+ S EFRA+VQELTG+++
Sbjct: 14 ISKPPPAKKGKAAARKYKPPQCPGAVRVVYIASPMKLTASPEEFRAVVQELTGRHS 69
>UniRef100_Q7XPJ9 OSJNBa0087O24.11 protein [Oryza sativa]
Length = 151
Score = 40.4 bits (93), Expect = 0.017
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 40 KPIKVVYISNPMKVKTSASEFRALVQELTGQYAESPPNPSRFQEFDVNDSGTDQGG 95
+ IK+V++ P +KT A FR LVQ LTG+ A P + Q D D+ D+GG
Sbjct: 26 RKIKIVHVLAPEVIKTDARHFRDLVQRLTGKPAADGPAAASSQP-DPCDTAGDEGG 80
>UniRef100_Q9SHH3 F20D23.16 protein [Arabidopsis thaliana]
Length = 89
Score = 38.5 bits (88), Expect = 0.064
Identities = 36/110 (32%), Positives = 47/110 (42%), Gaps = 26/110 (23%)
Query: 42 IKVVYISNPMKVKTSASEFRALVQELTGQYAESPPNPSRFQEFDVNDSGTDQGGCENMMD 101
+KVV+I N V+T A F+ +VQELTG+ A P F SG GG +
Sbjct: 1 MKVVFI-NTQYVQTDARSFKTIVQELTGKNAVVADGPYEF-------SGQGYGGKD---- 48
Query: 102 CDKSDQTVVGVPSLVDPDDKGKPSEAGSSNESFDEDVLLMPEM--MDNIW 149
S Q GV GK +E G FD MP + + N+W
Sbjct: 49 ---SSQRFCGV---------GKEAERGVETTEFDSFFREMPPVGELYNLW 86
>UniRef100_O48522 Hypothetical protein At2g42140 [Arabidopsis thaliana]
Length = 172
Score = 37.4 bits (85), Expect = 0.14
Identities = 36/137 (26%), Positives = 57/137 (41%), Gaps = 21/137 (15%)
Query: 15 STVQQKTPTKLTKPKKKKNNNNTYNKP-IKVVYISNPMKVKTSASEFRALVQELTGQ--- 70
+T QK + T K++ T +KP I++++I P +KT + FR +VQ LTG+
Sbjct: 5 ATTVQKRRSLPTIAMHKQSRTLTKSKPKIRIIHIFAPEIIKTDVANFREIVQNLTGKQDH 64
Query: 71 ----------YAESPPNPSRFQEFDVNDSGTDQGGCENMMDCDKSDQTVVGVPSLVDPDD 120
+P + +V+D G C N SD+ G+ S+ +
Sbjct: 65 HHHDLPHQKGLKRNPRSRRSHDHHEVHDMNKSHGFCIN------SDEEEEGMVSMT-WNG 117
Query: 121 KGKPSEAGSSNESFDED 137
G S G N D D
Sbjct: 118 NGDESSGGFLNGLGDLD 134
>UniRef100_Q7RFV7 Hypothetical protein [Plasmodium yoelii yoelii]
Length = 1048
Score = 36.6 bits (83), Expect = 0.24
Identities = 33/129 (25%), Positives = 57/129 (43%), Gaps = 19/129 (14%)
Query: 3 SISTSST---SSSVISTVQQKTPTKLTKPKKKKNNNNTYNKPIKVVYISNP-------MK 52
SIS++S+ SS + + K + + + NNN YNK KV+ I +
Sbjct: 783 SISSNSSFLSDSSFFTNGDNENERKRKRSENESPNNNFYNKKNKVIEIKKRRPTVIDFLN 842
Query: 53 VKTSASEFRALVQELTGQYAESPPN--------PSRFQEFDVNDSGTDQGGC-ENMMDCD 103
K + + V+ + +Y E+ P+ S E D+NDS + C N+++ +
Sbjct: 843 EKWNKNSETQNVRSYSKEYEENEPHENSCNTNLESIDSETDINDSEIESSECSSNIIENE 902
Query: 104 KSDQTVVGV 112
K D+T V
Sbjct: 903 KEDKTEFSV 911
>UniRef100_UPI00003C2631 UPI00003C2631 UniRef100 entry
Length = 439
Score = 36.2 bits (82), Expect = 0.32
Identities = 31/133 (23%), Positives = 49/133 (36%), Gaps = 3/133 (2%)
Query: 1 MDSISTSSTSSSVISTVQQKTPTKLT--KPKKKK-NNNNTYNKPIKVVYISNPMKVKTSA 57
+D +S+ + S S+ S V P+ L KP K+K + T KP K S T A
Sbjct: 113 LDDLSSIAPSPSIASEVSTSKPSSLKAKKPSKQKADGTETKPKPKKPKKKSTTGAATTDA 172
Query: 58 SEFRALVQELTGQYAESPPNPSRFQEFDVNDSGTDQGGCENMMDCDKSDQTVVGVPSLVD 117
+ F + Q A + P P + + +D + + P
Sbjct: 173 TAFSSSAQPSNVTAASTAPKPKKTSKNKASDKDKSKEEAHQQPTSTAASPAPFATPVSSS 232
Query: 118 PDDKGKPSEAGSS 130
+ K K +AG S
Sbjct: 233 KESKPKAKKAGGS 245
>UniRef100_Q8MMZ9 Hypothetical protein [Dictyostelium discoideum]
Length = 2625
Score = 35.8 bits (81), Expect = 0.42
Identities = 18/87 (20%), Positives = 39/87 (44%)
Query: 1 MDSISTSSTSSSVISTVQQKTPTKLTKPKKKKNNNNTYNKPIKVVYISNPMKVKTSASEF 60
+ S S +S+SSS++S + + KL N+N +Y+ P+++ + + +
Sbjct: 515 ISSNSPNSSSSSIVSNLSSNSLNKLVNSNTNNNSNQSYSPPLQIQQQQQQQQQQQQQQQQ 574
Query: 61 RALVQELTGQYAESPPNPSRFQEFDVN 87
+ Q+ Q + + EF+ N
Sbjct: 575 QQQQQQQQQQQQQQQQQQQQINEFNCN 601
>UniRef100_Q98R04 LIPOPROTEIN [Mycoplasma pulmonis]
Length = 773
Score = 35.0 bits (79), Expect = 0.71
Identities = 34/139 (24%), Positives = 55/139 (39%), Gaps = 10/139 (7%)
Query: 1 MDSISTSSTSSSVISTVQQKTPTKLTKPKKKKNNNNTYNKPIKVVYISNPMKVKTSASEF 60
M +S +T S+ K TK T P+ NNT ++ NP KT ++
Sbjct: 21 MALVSCVTTKSNNQIDPSAKQNTKQTSPQSAPKENNTNTNRNSIISPQNPDSSKTPETQV 80
Query: 61 RALVQELTGQYAESPPNPSRFQEFDVNDSGTDQGGCE--NMMDCDKSDQTVVGVPSLVDP 118
E + +SP +P ++ D+G G + + D + DQ VP P
Sbjct: 81 PPTKPEDQNKEPQSPKDP------EIKDNGQKNEGSKAPEIKDMSQKDQ-APQVPQ-KQP 132
Query: 119 DDKGKPSEAGSSNESFDED 137
+D KP +S D++
Sbjct: 133 EDPKKPETQKPPVKSEDQN 151
>UniRef100_UPI00003A95FA UPI00003A95FA UniRef100 entry
Length = 384
Score = 34.7 bits (78), Expect = 0.93
Identities = 33/111 (29%), Positives = 51/111 (45%), Gaps = 12/111 (10%)
Query: 5 STSSTSSSVISTVQQKTPTKLTKPK---KKKNNNNTYNKPIKVVYISNPMKVKTSASEFR 61
ST +SS++ ST+ Q + T + K KNNN + ++ S+ + +S S
Sbjct: 239 STGYSSSALASTLTQTLSSSSTDSRSGRKSKNNNKSSSQQSSSSSSSSSL---SSCSSSS 295
Query: 62 ALVQELTGQYAESPPNPSRFQ---EFDVNDSG---TDQGGCENMMDCDKSD 106
AL QEL+ Q A P + S Q +D N+ +Q M+ CD D
Sbjct: 296 ALAQELSQQTAVIPESDSNSQVDWTYDPNEPRYCICNQVSYGEMVGCDNQD 346
>UniRef100_Q5ZK36 Hypothetical protein [Gallus gallus]
Length = 417
Score = 34.7 bits (78), Expect = 0.93
Identities = 33/111 (29%), Positives = 51/111 (45%), Gaps = 12/111 (10%)
Query: 5 STSSTSSSVISTVQQKTPTKLTKPK---KKKNNNNTYNKPIKVVYISNPMKVKTSASEFR 61
ST +SS++ ST+ Q + T + K KNNN + ++ S+ + +S S
Sbjct: 272 STGYSSSALASTLTQTLSSSSTDSRSGRKSKNNNKSSSQQSSSSSSSSSL---SSCSSSS 328
Query: 62 ALVQELTGQYAESPPNPSRFQ---EFDVNDSG---TDQGGCENMMDCDKSD 106
AL QEL+ Q A P + S Q +D N+ +Q M+ CD D
Sbjct: 329 ALAQELSQQTAVIPESDSNSQVDWTYDPNEPRYCICNQVSYGEMVGCDNQD 379
>UniRef100_Q6DK77 Serine/arginine repetitive matrix 1 [Xenopus tropicalis]
Length = 874
Score = 34.7 bits (78), Expect = 0.93
Identities = 38/152 (25%), Positives = 69/152 (45%), Gaps = 27/152 (17%)
Query: 5 STSSTSSSVISTVQQKTPTKLTK--PKKKKNNNNTYNKPIKVVY---------------- 46
S+SS+SSS +K P +++ P+K + ++ + + P + Y
Sbjct: 351 SSSSSSSSRSPPPPKKQPKRISSSPPRKPRRSSPSSSPPPRRRYRPSPTPPPKRRSPSPT 410
Query: 47 -ISNPMKVKTSASEFRA-LVQELTGQYAESPPNPSRFQEFDVNDSGTDQGGCENMMDCDK 104
S P K++ S S R +++ + +SP PS+ ++ D+++S D+GG M D
Sbjct: 411 QSSRPRKIRGSVSPARPPVIKHKVIEKRKSPSPPSKPRKRDLSESDDDKGG--RMAPPDS 468
Query: 105 SDQTVVGVPSLVDPDDKGKPSEAGSSNESFDE 136
Q ++ SE+GSS+ S DE
Sbjct: 469 VQQR-----RQYRRQNQQSSSESGSSSSSEDE 495
>UniRef100_Q63ZQ5 Hypothetical protein [Xenopus laevis]
Length = 796
Score = 34.7 bits (78), Expect = 0.93
Identities = 37/152 (24%), Positives = 68/152 (44%), Gaps = 27/152 (17%)
Query: 5 STSSTSSSVISTVQQKTPTKLTK--PKKKKNNNNTYNKPIKVVY---------------- 46
S+SS+S S +K P K++ P+K + ++ + + P + Y
Sbjct: 349 SSSSSSCSRSPPAPKKQPKKISSSPPRKPRRSSPSSSPPPRRRYRPSPTPPPKRRSPSPT 408
Query: 47 -ISNPMKVKTSASEFRA-LVQELTGQYAESPPNPSRFQEFDVNDSGTDQGGCENMMDCDK 104
S P KV+ S S R +++ + +SP PS+ ++ D+++S D+GG D +
Sbjct: 409 QSSRPRKVRGSVSPARPPVIKHKVVEKRKSPSPPSKPRKRDLSESDDDKGGKTAPADSVQ 468
Query: 105 SDQTVVGVPSLVDPDDKGKPSEAGSSNESFDE 136
+ ++ SE+GSS+ S DE
Sbjct: 469 QRRQ-------YRRQNQQSSSESGSSSSSEDE 493
>UniRef100_Q9M2P8 Hypothetical protein T10K17.210 [Arabidopsis thaliana]
Length = 175
Score = 34.7 bits (78), Expect = 0.93
Identities = 22/70 (31%), Positives = 33/70 (46%), Gaps = 10/70 (14%)
Query: 1 MDSISTSSTSSSVISTVQQKTPTKLTKPKKKKNNNNTYNKPIKVVYISNPMKVKTSASEF 60
M++ SS+ S K +TK K K I++++I P +KT + F
Sbjct: 1 MEATIFEKRRSSLSSIAVHKQSYSITKSKPK----------IRIIHIFAPEIIKTDVANF 50
Query: 61 RALVQELTGQ 70
R LVQ LTG+
Sbjct: 51 RELVQSLTGK 60
>UniRef100_Q9M035 Hypothetical protein T10O8_80 [Arabidopsis thaliana]
Length = 427
Score = 34.3 bits (77), Expect = 1.2
Identities = 25/74 (33%), Positives = 33/74 (43%), Gaps = 8/74 (10%)
Query: 22 PTKLTKPKKKKNNNNTYNKPIKVVYISNPMKVKTSASEFRAL-------VQELTGQYAES 74
P K+ K KKK N N N KV + +P K K+S+ + L E G+ +ES
Sbjct: 228 PRKVAK-KKKSENRNDVNGAKKVRWFLSPSKSKSSSEKTALLGESKNIRADEFIGKESES 286
Query: 75 PPNPSRFQEFDVND 88
P N S V D
Sbjct: 287 PENKSNASPVSVLD 300
>UniRef100_UPI00001D1239 UPI00001D1239 UniRef100 entry
Length = 1053
Score = 33.9 bits (76), Expect = 1.6
Identities = 31/124 (25%), Positives = 55/124 (44%), Gaps = 9/124 (7%)
Query: 7 SSTSSSVISTVQQKTPTKLTKPKKKKNNNNTYNKPIKVVYISNPMKVKTSASEFRALVQE 66
+ST + + +P+ L KK N P + +S KV++ + + Q+
Sbjct: 492 TSTVKTHLGLKTSFSPSLLDSGKK----NLLSTAPFSSISVSRSEKVRSKSEDNALFTQQ 547
Query: 67 LTGQ----YAESPPNPSRFQEFDVNDSGTDQGGCENMMDCDKSDQTVVGVPSLVDPDDKG 122
G ++S P+ +V+DS +QGG ++M D SD+ +V + SL K
Sbjct: 548 SAGSEVKVISQSFPSKQILTSKNVSDSVDEQGGADHMKDA-VSDKHLVPLKSLGGKASKR 606
Query: 123 KPSE 126
K +E
Sbjct: 607 KRTE 610
>UniRef100_O31655 YkrI protein [Bacillus subtilis]
Length = 381
Score = 33.9 bits (76), Expect = 1.6
Identities = 37/148 (25%), Positives = 51/148 (34%), Gaps = 24/148 (16%)
Query: 13 VISTVQQKTPTKLTKPKKKKNNNNTYNKPIKVVYISNPMKVKTSASEFRALVQELTGQYA 72
+ISTV + T K KK N+ K K Y ++ E TG Y
Sbjct: 159 LISTVYENTEDNTYKKAVKKQLNDVTEK-YKTTYRMESLESDMQTREKAKKEGVSTGSYI 217
Query: 73 ES--------------PPNPSRFQEFDVN-DSGTDQ--------GGCENMMDCDKSDQTV 109
+S P+ Q+ D N D TDQ G E + D D DQ
Sbjct: 218 KSNEKNDNKDIKDDSSKPSGEEDQKSDENEDENTDQTDTQDSKQGDNEQLNDADSGDQKE 277
Query: 110 VGVPSLVDPDDKGKPSEAGSSNESFDED 137
+D DK K + N + ++D
Sbjct: 278 EKADDQIDDSDKDKKIKESDENTNTEKD 305
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.306 0.125 0.346
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 293,357,567
Number of Sequences: 2790947
Number of extensions: 12640750
Number of successful extensions: 51534
Number of sequences better than 10.0: 98
Number of HSP's better than 10.0 without gapping: 21
Number of HSP's successfully gapped in prelim test: 78
Number of HSP's that attempted gapping in prelim test: 51457
Number of HSP's gapped (non-prelim): 142
length of query: 165
length of database: 848,049,833
effective HSP length: 117
effective length of query: 48
effective length of database: 521,509,034
effective search space: 25032433632
effective search space used: 25032433632
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 70 (31.6 bits)
Medicago: description of AC122169.4