
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC140720.11 - phase: 0
(139 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
                                                                   Score   E
Sequences producing significant alignments:                       (bits) Value
UniRef100_Q9FJW1 Arabidopsis thaliana genomic DNA, chromosome 5,...  171   3e-42
UniRef100_Q6H599 Hypothetical protein P0027G10.62 [Oryza sativa]     102   2e-21
UniRef100_Q9FM06 Arabidopsis thaliana genomic DNA, chromosome 5,...  100   5e-21
UniRef100_Q9FGA2 Arabidopsis thaliana genomic DNA, chromosome 5,...   97   6e-20
UniRef100_Q67V82 Hypothetical protein OSJNBa0080E19.35 [Oryza sa...   97   1e-19
UniRef100_Q7G6R7 Hypothetical protein OSJNAb0023M11.8 [Oryza sat...   96   1e-19
UniRef100_Q8H341 Hypothetical protein OSJNBa0077M12.102 [Oryza s...   96   2e-19
UniRef100_Q9ZUJ2 T2K10.6 protein [Arabidopsis thaliana]               80   1e-14
UniRef100_Q8LEP3 Hypothetical protein [Arabidopsis thaliana]          80   1e-14
UniRef100_Q9XIJ9 T10O24.15 [Arabidopsis thaliana]                     77   1e-13
UniRef100_Q7XJ59 At1g10530 [Arabidopsis thaliana]                     77   1e-13
UniRef100_Q9LP48 F28N24.12 protein [Arabidopsis thaliana]             37   0.12
UniRef100_Q681T1 Hypothetical protein At1g29195 [Arabidopsis tha...   36   0.21
UniRef100_Q8DMF0 Tll0168 protein [Synechococcus elongatus]            33   1.0
UniRef100_UPI0000326E94 UPI0000326E94 UniRef100 entry                 33   1.4
UniRef100_UPI000032848C UPI000032848C UniRef100 entry                 33   1.8
UniRef100_UPI000029BC64 UPI000029BC64 UniRef100 entry                 33   1.8
UniRef100_Q6D8B3 Putative integrase [Erwinia carotovora]              33   1.8
UniRef100_Q94D86 Hypothetical protein P0439E11.30 [Oryza sativa]      33   1.8
UniRef100_UPI000034BD8F UPI000034BD8F UniRef100 entry                 32   2.3
>UniRef100_Q9FJW1 Arabidopsis thaliana genomic DNA, chromosome 5, TAC clone:K9I9
[Arabidopsis thaliana]
Length = 182
Score = 171 bits (433), Expect = 3e-42
Identities = 85/118 (72%), Positives = 99/118 (83%), Gaps = 4/118 (3%)
Query: 1 MGNCQAAELATVVIHRPG-NKVERIYWSVSAHEVMNSNPGHYVALVVSSPTLKSENGMPL 59
MGNCQAAE ATV+IH P NKVERIYWSV+A ++M SNPGHYVA+VV+SPT+K+E G+PL
Sbjct: 1 MGNCQAAEAATVLIHHPAENKVERIYWSVTASDIMKSNPGHYVAVVVTSPTMKNEKGLPL 60
Query: 60 KHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGKLLKESGNRGI---QMKHR 114
K LKLLRPDDTLLIG VYRL+SFE+VL EFA+KKC KLGKLLKE G + + KHR
Sbjct: 61 KQLKLLRPDDTLLIGHVYRLVSFEEVLNEFATKKCVKLGKLLKEGGGLDLTKKKTKHR 118
>UniRef100_Q6H599 Hypothetical protein P0027G10.62 [Oryza sativa]
Length = 208
Score = 102 bits (253), Expect = 2e-21
Identities = 58/142 (40%), Positives = 79/142 (54%), Gaps = 27/142 (19%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVV--------SSPTLK 52
MGNCQAAE A VVI PG KVER+YW +A +VM +NPGHYVALV+ +S
Sbjct: 1 MGNCQAAEAAAVVIQHPGGKVERLYWPTTAADVMRANPGHYVALVILRISADKAASAAAA 60
Query: 53 SEN-------------GMPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGK 99
+N G + +KLL+P DTLL+GQVYRLI+ ++V K ++K K+ +
Sbjct: 61 GDNKTNAGGATGGGGGGAKITRVKLLKPKDTLLLGQVYRLITSQEVTKALRARKNEKMRR 120
Query: 100 LLKESGNRGIQMKHRDFRAPNP 121
I+ +H R +P
Sbjct: 121 C------EAIRQQHEQLRRGDP 136
>UniRef100_Q9FM06 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MQB2
[Arabidopsis thaliana]
Length = 161
Score = 100 bits (250), Expect = 5e-21
Identities = 52/100 (52%), Positives = 70/100 (70%), Gaps = 1/100 (1%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTLKSENGMPLK 60
MGNCQAAE AT VI +P K R Y +V+A EV+ S+PGH+VAL++SS + + +
Sbjct: 1 MGNCQAAEAATTVIQQPDGKSVRFYCTVNASEVIKSHPGHHVALLLSS-AVPHGGSLRVT 59
Query: 61 HLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGKL 100
+KLLRP D LL+G VYRLIS E+V+K +KK GK+ K+
Sbjct: 60 RIKLLRPSDNLLLGHVYRLISSEEVMKGIRAKKSGKMKKI 99
>UniRef100_Q9FGA2 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MPF21
[Arabidopsis thaliana]
Length = 159
Score = 97.4 bits (241), Expect = 6e-20
Identities = 54/112 (48%), Positives = 73/112 (64%), Gaps = 5/112 (4%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTLKSE---NGM 57
MGNCQA + A VVI P K E++ VSA VM NPGH V+L++S+ L S +G
Sbjct: 1 MGNCQAVDTARVVIQHPNGKEEKLSCPVSASYVMKMNPGHCVSLLISTTALSSASSGHGG 60
Query: 58 PLK--HLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGKLLKESGNR 107
PL+ +KLLRP DTL++G VYRLI+ ++V+K +KKC KL K K S ++
Sbjct: 61 PLRLTRIKLLRPTDTLVLGHVYRLITTKEVMKGLMAKKCSKLKKESKGSDDK 112
>UniRef100_Q67V82 Hypothetical protein OSJNBa0080E19.35 [Oryza sativa]
Length = 189
Score = 96.7 bits (239), Expect = 1e-19
Identities = 51/112 (45%), Positives = 69/112 (61%), Gaps = 19/112 (16%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVV-------------S 47
MGNCQAAE ATVV+ PG +VER+YW+ +A EVM +NPGHYVALV
Sbjct: 1 MGNCQAAEAATVVVQHPGGRVERLYWATTAAEVMRANPGHYVALVTLRVAEEKRPPPPPP 60
Query: 48 SPTLKSE------NGMPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKK 93
P ++E + + +KLL+P DTLL+GQ YRLI+ ++V + +KK
Sbjct: 61 PPPARAERRGTGTGTVRVTRVKLLKPRDTLLLGQAYRLITVDEVTRALQAKK 112
>UniRef100_Q7G6R7 Hypothetical protein OSJNAb0023M11.8 [Oryza sativa]
Length = 218
Score = 96.3 bits (238), Expect = 1e-19
Identities = 63/156 (40%), Positives = 83/156 (52%), Gaps = 32/156 (20%)
Query: 1 MGNCQAAELATVVIHRP-----------------GNKVERIYWSVSAHEVMNSNPGHYVA 43
MGNCQAA+ A VVI P G +VER Y +VSA VM +NPGHYVA
Sbjct: 1 MGNCQAADAAAVVIQHPSSSSSSSSSSGNGGGGGGGRVERAYGAVSAAAVMAANPGHYVA 60
Query: 44 LVV-------SSPTLKSENGMPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGK 96
VV ++ + + LKLLRPDDTL++G VYRL++FEDVLK+F SK+
Sbjct: 61 EVVRPVATAPAATAATASAPAARRRLKLLRPDDTLVLGGVYRLVTFEDVLKQFVSKRNAT 120
Query: 97 LGKLL------KESGNRGIQMKHRDFRAPNPSPVKV 126
+ + + G+R + HR A +P KV
Sbjct: 121 MSRATIATAAEDDDGHR--RQGHRGGEAAAAAPAKV 154
>UniRef100_Q8H341 Hypothetical protein OSJNBa0077M12.102 [Oryza sativa]
Length = 189
Score = 95.5 bits (236), Expect = 2e-19
Identities = 50/120 (41%), Positives = 70/120 (57%), Gaps = 23/120 (19%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTLKSENG---- 56
MGNCQAAE+A VVI PG KVER+YW +A +VM SNPGHYVALV+ + S G
Sbjct: 1 MGNCQAAEVAAVVIQHPGGKVERLYWPATAADVMRSNPGHYVALVLLRVSASSSGGGGGG 60
Query: 57 -------------------MPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKL 97
+ +KLL+P +TLL+G+VYRL++ ++V K +++ K+
Sbjct: 61 KAEHSAVGAAVGDESGGAAAKITKIKLLKPKETLLLGKVYRLVTSQEVTKALQARRQEKM 120
>UniRef100_Q9ZUJ2 T2K10.6 protein [Arabidopsis thaliana]
Length = 173
Score = 80.1 bits (196), Expect = 1e-14
Identities = 44/116 (37%), Positives = 64/116 (54%), Gaps = 12/116 (10%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTL--------- 51
MGNCQA + A +V+ P K++R Y VS E+M PGHYV+L++ P
Sbjct: 1 MGNCQAVDAAALVLQHPDGKIDRYYGPVSVSEIMRMYPGHYVSLIIPLPEKNIPATTTTT 60
Query: 52 --KSENG-MPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGKLLKES 104
KSE + +KLLRP + L++G YRLI+ ++V+K +KK K K E+
Sbjct: 61 DDKSERKVVRFTRVKLLRPTENLVLGHAYRLITSQEVMKVLRAKKYAKTKKHQSET 116
>UniRef100_Q8LEP3 Hypothetical protein [Arabidopsis thaliana]
Length = 173
Score = 80.1 bits (196), Expect = 1e-14
Identities = 44/116 (37%), Positives = 64/116 (54%), Gaps = 12/116 (10%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTL--------- 51
MGNCQA + A +V+ P K++R Y VS E+M PGHYV+L++ P
Sbjct: 1 MGNCQAVDAAALVLQHPDGKIDRYYGPVSVSEIMRMYPGHYVSLIIPLPEKNIPATTTTT 60
Query: 52 --KSENG-MPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGKLLKES 104
KSE + +KLLRP + L++G YRLI+ ++V+K +KK K K E+
Sbjct: 61 DDKSERKVVRFTRVKLLRPTENLVLGHAYRLITSQEVMKVLRAKKYAKTKKHQSET 116
>UniRef100_Q9XIJ9 T10O24.15 [Arabidopsis thaliana]
Length = 206
Score = 76.6 bits (187), Expect = 1e-13
Identities = 41/113 (36%), Positives = 58/113 (51%), Gaps = 14/113 (12%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVV-------------- 46
MGNCQA A +V+ PG ++R Y SVS EVM PGHYV+L++
Sbjct: 41 MGNCQAVNAAVLVLQHPGGIIDRYYSSVSVTEVMAMYPGHYVSLIIPLSEEEEKNIPATE 100
Query: 47 SSPTLKSENGMPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGK 99
K + ++LLRP + L++G YRLI+ ++V+K KK K K
Sbjct: 101 KGDDKKQRKAVRFTRVQLLRPTENLVLGHAYRLITSQEVMKVLREKKSAKTKK 153
>UniRef100_Q7XJ59 At1g10530 [Arabidopsis thaliana]
Length = 166
Score = 76.6 bits (187), Expect = 1e-13
Identities = 41/113 (36%), Positives = 58/113 (51%), Gaps = 14/113 (12%)
Query: 1 MGNCQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVV-------------- 46
MGNCQA A +V+ PG ++R Y SVS EVM PGHYV+L++
Sbjct: 1 MGNCQAVNAAVLVLQHPGGIIDRYYSSVSVTEVMAMYPGHYVSLIIPLSEEEEKNIPATE 60
Query: 47 SSPTLKSENGMPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGK 99
K + ++LLRP + L++G YRLI+ ++V+K KK K K
Sbjct: 61 KGDDKKQRKAVRFTRVQLLRPTENLVLGHAYRLITSQEVMKVLREKKSAKTKK 113
>UniRef100_Q9LP48 F28N24.12 protein [Arabidopsis thaliana]
Length = 193
Score = 36.6 bits (83), Expect = 0.12
Identities = 30/104 (28%), Positives = 50/104 (47%), Gaps = 10/104 (9%)
Query: 13 VIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTLKSENGMPLKHLK--LLRPDDT 70
++H G+ VE I +++A E+M ++P H V SSPT + + K ++ P+
Sbjct: 23 IVHSNGH-VEEISGTITASEIMKAHPKH-VLKKPSSPTSDHDERDVISATKIVIVPPEAE 80
Query: 71 LLIGQVYRLISFEDVLKEFASKKCGKLGKLLKESGNRGIQMKHR 114
L G++Y L + S KC GK+ +E N +K R
Sbjct: 81 LQRGKIYFL------MPATKSDKCAGGGKIRREKSNANAVVKKR 118
>UniRef100_Q681T1 Hypothetical protein At1g29195 [Arabidopsis thaliana]
Length = 189
Score = 35.8 bits (81), Expect = 0.21
Identities = 30/104 (28%), Positives = 49/104 (46%), Gaps = 10/104 (9%)
Query: 13 VIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTLKSENGMPLKHLK--LLRPDDT 70
++H G+ VE I +++A E+M ++P H V SSPT + + K ++ P+
Sbjct: 19 IVHSNGH-VEEISGTITASEIMKAHPKH-VLKKPSSPTSDHDERDVISATKIVIVPPEAE 76
Query: 71 LLIGQVYRLISFEDVLKEFASKKCGKLGKLLKESGNRGIQMKHR 114
L G++Y L + S KC GK+ +E N K R
Sbjct: 77 LQRGKIYFL------MPATKSDKCAGGGKIRREKSNANAVAKKR 114
>UniRef100_Q8DMF0 Tll0168 protein [Synechococcus elongatus]
Length = 134
Score = 33.5 bits (75), Expect = 1.0
Identities = 14/45 (31%), Positives = 25/45 (55%)
Query: 55 NGMPLKHLKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGK 99
NG PL+H LL +D + +G + + D+ +E + +C +L K
Sbjct: 80 NGFPLQHSTLLHHEDVIGVGTTLLVFYYPDMFREISLDECPELTK 124
>UniRef100_UPI0000326E94 UPI0000326E94 UniRef100 entry
Length = 187
Score = 33.1 bits (74), Expect = 1.4
Identities = 23/83 (27%), Positives = 43/83 (51%), Gaps = 16/83 (19%)
Query: 62 LKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGKLLKESGNRGIQMKH-------R 114
+K++ D L+ GQ+ L+ E+V + K+G+++K R +++K+ R
Sbjct: 83 IKIINSDKKLVNGQIMNLVG-ENV-------QISKIGQIIKSIYPR-VKIKYENSTSDNR 133
Query: 115 DFRAPNPSPVKVIKFHVNVNVTE 137
D+RA N K+IKF N + +
Sbjct: 134 DYRASNKKAKKIIKFRPNYKIVD 156
>UniRef100_UPI000032848C UPI000032848C UniRef100 entry
Length = 206
Score = 32.7 bits (73), Expect = 1.8
Identities = 21/65 (32%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 49 PTLKSENGMPLKHL-KLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGK--LLKESG 105
P + + N + K L K+L+ LI Q RL FE+V EF + G++ + + K++G
Sbjct: 93 PIIATLNDLDEKSLIKILKEPKNSLIKQYQRLFEFENVELEFKDEALGEIARKAISKKTG 152
Query: 106 NRGIQ 110
RG++
Sbjct: 153 ARGLR 157
>UniRef100_UPI000029BC64 UPI000029BC64 UniRef100 entry
Length = 1307
Score = 32.7 bits (73), Expect = 1.8
Identities = 29/115 (25%), Positives = 49/115 (42%), Gaps = 12/115 (10%)
Query: 29 SAHEVMNSNPGHYVALVVSSPTLKSENGMPLKHLKLLRPDDTLLIGQV-----YRLISFE 83
S ++ +NP H V + EN P K+ R + T + Y+ SF+
Sbjct: 901 STKTLVEANPEHLVEVWTQLSQPADENWDPAGTKKMWRCESTRSHTTIAKYAQYQAASFQ 960
Query: 84 DVLKEFASKKCGKLGKLLKESGNRGIQMKHRDFRAPNPSPVKVIKFHVNVNVTEK 138
+ L+E KK K ++ + I K R P+K IKF N++V+++
Sbjct: 961 ESLREENEKKALKEPSDVEPASAESIVRKRR-------GPLKHIKFGTNIDVSDE 1008
>UniRef100_Q6D8B3 Putative integrase [Erwinia carotovora]
Length = 654
Score = 32.7 bits (73), Expect = 1.8
Identities = 25/93 (26%), Positives = 42/93 (44%), Gaps = 2/93 (2%)
Query: 12 VVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTLKSENGMPLKHLKLLRPDDTL 71
V+ H+ + + R + +AHE H V +++ TL NGM L +
Sbjct: 410 VLAHKKADNIFRRHGYQNAHERKIYFHSHQVRHLLN--TLAQRNGMTEYELAKWSGRANI 467
Query: 72 LIGQVYRLISFEDVLKEFASKKCGKLGKLLKES 104
+VY +S E++LKE+ S K L+ E+
Sbjct: 468 KQNRVYNHVSDEEILKEYESIKLSATNYLISEA 500
>UniRef100_Q94D86 Hypothetical protein P0439E11.30 [Oryza sativa]
Length = 171
Score = 32.7 bits (73), Expect = 1.8
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 6/77 (7%)
Query: 4 CQAAELATVVIHRPGNKVERIYWSVSAHEVMNSNPGHYVALVVSSPTLKSENGMPLKHLK 63
C A ++ +V H G+ V+ V+A V+ ++P H + SS + G P K L
Sbjct: 15 CGALDVVRIV-HLSGH-VDEFSCPVTAGAVLAAHPNHTLTTTWSSAGV----GCPTKKLV 68
Query: 64 LLRPDDTLLIGQVYRLI 80
++ PD L G++Y LI
Sbjct: 69 IVSPDSELKRGRIYFLI 85
>UniRef100_UPI000034BD8F UPI000034BD8F UniRef100 entry
Length = 259
Score = 32.3 bits (72), Expect = 2.3
Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 62 LKLLRPDDTLLIGQVYRLISFEDVLKEFASKKCGKLGK--LLKESGNRGIQ 110
+K+L+ LI Q RL FEDV+ EF + ++ + K++G RG++
Sbjct: 160 IKILKEPKNSLIKQYKRLFEFEDVILEFKDEAIAEIASKAISKKTGARGLR 210
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.318 0.135 0.394
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 245,519,428
Number of Sequences: 2790947
Number of extensions: 9682447
Number of successful extensions: 19407
Number of sequences better than 10.0: 45
Number of HSP's better than 10.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 19371
Number of HSP's gapped (non-prelim): 52
length of query: 139
length of database: 848,049,833
effective HSP length: 115
effective length of query: 24
effective length of database: 527,090,928
effective search space: 12650182272
effective search space used: 12650182272
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 67 (30.4 bits)
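
The figures in the statistics block above can be cross-checked by hand. The following is a minimal Python sketch, not part of the BLAST output. It assumes the usual Karlin-Altschul relations (bit score = (Lambda*S - ln K) / ln 2, and E = search space * 2^-bits) and assumes that the query and every database sequence are shortened by the effective HSP length. The function names are hypothetical; the constants are copied from the report, using the gapped Lambda and K.

```python
import math

# Values copied from the statistics block of the report above.
QUERY_LEN = 139            # "length of query"
DB_LETTERS = 848_049_833   # "length of database"
DB_SEQS = 2_790_947        # "Number of Sequences"
HSP_LEN = 115              # "effective HSP length"
LAMBDA_GAPPED = 0.267      # gapped Lambda
K_GAPPED = 0.0410          # gapped K


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumption: the query and each database sequence are shortened by the
    effective HSP length before the two lengths are multiplied.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Expected number of chance hits: search_space * 2**(-bit score)."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
    )
    print(eff_q, eff_db, space)
    # Raw score 433 belongs to the first hit (UniRef100_Q9FJW1).
    print(round(bit_score(433), 1), f"{expect_value(433, space):.0e}")
```

With the values from this report, the first function reproduces the effective query length (24), effective database length (527,090,928) and effective search space (12,650,182,272) listed above. The first hit's raw score of 433 comes out near 171 bits with an E-value on the order of 3e-42. Small deviations are expected because Lambda and K are printed in rounded form.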
Medicago: description of AC140720.11