
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0074.12
(369 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q9SQW9 Putative retroelement pol polyprotein [Arabidop... 46 0.001
UniRef100_Q7XPJ7 OSJNBa0087O24.13 protein [Oryza sativa] 43 0.012
UniRef100_Q94GG1 Putative polyprotein [Oryza sativa] 43 0.016
UniRef100_Q6CBR5 Similarity [Yarrowia lipolytica] 43 0.016
UniRef100_Q8LAQ8 Hypothetical protein [Arabidopsis thaliana] 42 0.021
UniRef100_Q6L3V6 Hypothetical protein [Solanum demissum] 42 0.027
UniRef100_Q8LLX2 Putative retroelement [Oryza sativa] 42 0.035
UniRef100_Q8S653 Putative retroelement [Oryza sativa] 42 0.035
UniRef100_Q8H7X6 Putative retroelement [Oryza sativa] 39 0.23
UniRef100_Q9FWC7 Putative plant disease resistance polyprotein [... 38 0.51
UniRef100_Q9S9R5 F28J9.14 [Arabidopsis thaliana] 37 0.67
UniRef100_Q9FYC9 Hypothetical protein F22J12_60 [Arabidopsis tha... 37 0.67
UniRef100_Q7XZZ8 Putative polyprotein [Oryza sativa] 37 0.67
UniRef100_UPI00004345B9 UPI00004345B9 UniRef100 entry 37 0.87
UniRef100_UPI00004345B8 UPI00004345B8 UniRef100 entry 37 0.87
UniRef100_Q852F5 Putative polyprotein [Oryza sativa] 37 0.87
UniRef100_Q6C914 Similarities with tr|Q9XE85 Sorghum bicolor Pol... 37 0.87
UniRef100_UPI00004193CA UPI00004193CA UniRef100 entry 37 1.1
UniRef100_Q84ZV5 Polyprotein [Glycine max] 37 1.1
UniRef100_Q7XEK0 Hypothetical protein [Oryza sativa] 37 1.1
>UniRef100_Q9SQW9 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1661
Score = 46.2 bits (108), Expect = 0.001
Identities = 37/144 (25%), Positives = 64/144 (43%), Gaps = 17/144 (11%)
Query: 22 FYSPPSQPNTSPYYCHNVPSYYPQTHSSLQLSEFNGENPEYWLFMADCYFLQNPYPWETR 81
F + P ++P+ P + ++ + G N + WLF + FL N E +
Sbjct: 207 FQTQTFPPQSAPHQ----PRFEAAPRRTVDYPAYEGGNADDWLFRLEQCFLSNRTLEEEK 262
Query: 82 FQLLAIHLGGKALTWFRLS--HQQIHSWEEFKVAFRLYFVYNRPNGFSTTTVKYGFEKSS 139
+ L G ++TW+R S +QI++W EF+ F L F RP+ S+
Sbjct: 263 LEKAVSCLTGASVTWWRCSKDREQIYTWREFQEKFMLRF---RPSRGSSAV-------DH 312
Query: 140 LVEIQRSSEQVSPHRVVFSETTLE 163
L+ + R + V +R F E T++
Sbjct: 313 LLNV-RQTGTVEEYRERFEELTVD 335
>UniRef100_Q7XPJ7 OSJNBa0087O24.13 protein [Oryza sativa]
Length = 1311
Score = 43.1 bits (100), Expect = 0.012
Identities = 31/97 (31%), Positives = 42/97 (42%), Gaps = 8/97 (8%)
Query: 23 YSPPS--QPNTSPY---YCHNVPSYYPQTHS-SLQLSEFNGENPEYWLFMADCYFLQNPY 76
+ PP Q N P+ Y H S P S ++ F G+ PE W+ A+ YF
Sbjct: 257 FHPPHHHQYNPEPHHTNYAHRPHSADPAKRSRNVDFPTFEGDYPESWIRKAEKYFSLYQT 316
Query: 77 PWETRFQLLAIHLGGKALTWFRLSHQQIH--SWEEFK 111
P E + L +H+ G+A W S SW EFK
Sbjct: 317 PEEDKVLLAEVHISGRADQWIESSAVPTASLSWPEFK 353
>UniRef100_Q94GG1 Putative polyprotein [Oryza sativa]
Length = 869
Score = 42.7 bits (99), Expect = 0.016
Identities = 21/73 (28%), Positives = 38/73 (51%), Gaps = 4/73 (5%)
Query: 53 SEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSHQQIH---SWEE 109
++F+G+NP+ W ++ YF P+ET +H G A W + +++++H +W E
Sbjct: 96 AKFDGDNPKLWKTNSEKYFSMYQVPYETWSSFATLHFIGNAALWLQ-TYEELHCVENWSE 154
Query: 110 FKVAFRLYFVYNR 122
VA F +R
Sbjct: 155 LSVAVHSKFGKDR 167
>UniRef100_Q6CBR5 Similarity [Yarrowia lipolytica]
Length = 1136
Score = 42.7 bits (99), Expect = 0.016
Identities = 34/96 (35%), Positives = 45/96 (46%), Gaps = 11/96 (11%)
Query: 138 SSLVEIQRSSEQVSPHRVVFSETTLEDGAQLPFEASSDVHLLPEVVSRVELGSAETEIES 197
S++ SS V+ V SE T A +P EA+S PEV S E S
Sbjct: 654 SNVTSSANSSSDVTSSADVSSEVTTS--ADIPSEATSSAETTPEVTSSAEATS------- 704
Query: 198 HESSTSED-SSLITTPVDSTPYATRSDAITPDVNKT 232
E +TS D SS +T ++TP AT S TP+V +
Sbjct: 705 -EVTTSADISSEVTASAEATPEATSSAEATPEVTSS 739
>UniRef100_Q8LAQ8 Hypothetical protein [Arabidopsis thaliana]
Length = 272
Score = 42.4 bits (98), Expect = 0.021
Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 1/89 (1%)
Query: 24 SPPSQPNTSPYYCHNVPSYYPQTHSSLQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQ 83
S ++ N+ + H + Y +++ F G N WL A+ YF + + Q
Sbjct: 88 STAAEDNSIDTHPHRMSRYIQLMKPKIEMPVFEGPNVNSWLTRAERYFEFGSFTNAEKIQ 147
Query: 84 LLAIHLGGKALTWFRLSHQQ-IHSWEEFK 111
L+ + + G AL WF L ++ W +FK
Sbjct: 148 LVYMSVEGPALCWFNLENRNPFVDWNDFK 176
>UniRef100_Q6L3V6 Hypothetical protein [Solanum demissum]
Length = 490
Score = 42.0 bits (97), Expect = 0.027
Identities = 61/276 (22%), Positives = 112/276 (40%), Gaps = 28/276 (10%)
Query: 59 NPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFR--LSHQQIHSWEEFKVAFRL 116
NP+ W+F A+ YF + L ++L G+AL WFR ++Q W+ FK L
Sbjct: 14 NPDDWIFRAERYFAYLGFLENDWIPLPFLYLDGEALDWFRWMYRNKQFWDWKHFKEKLSL 73
Query: 117 YFVYNRPNGFSTTTVKYGFEKSSLVEIQRSSEQVSPHRVVFSETTLEDGAQLPFEASSDV 176
F + TT + S + I ++V + V S QL +S
Sbjct: 74 CF-------RARTTPESALAHSQITTILTLLQKVQDNFSVMSN-------QLDNTHNSIS 119
Query: 177 HLLPEVVSRVELGSAETEIESHESSTSEDSSLITTPVDSTPYATRSDAITPDVNKTFVLK 236
++ + V+ + E I+S + SE+ + + D + + P++ T +
Sbjct: 120 NITGLSPTIVQEAAIEDAIDSATTRVSEE--IASALGDGQSSIANNFEVHPEMFDTLI-- 175
Query: 237 TTDLVLRNPRQVFGNLLEGKFVQKNNEAQFEKHFDKLLMQFHNSSTTVDVQKIGVPCPKV 296
T+L+L V G+ G Q +E+ + + L+ +F + V +G+ P
Sbjct: 176 DTNLLLAGTTFVIGDEHSGHVDQVFDESSHQ--IEDLVHEFGMAFDLVADNLVGLQIP-- 231
Query: 297 FEKMLVTRNTRICVDNTFEKLLETYINLLLDWQRMR 332
+ TR+ + ++ LET N++ D R
Sbjct: 232 ---LQATRSVEPPLVEE-DECLETKTNIVFDGSLQR 263
>UniRef100_Q8LLX2 Putative retroelement [Oryza sativa]
Length = 813
Score = 41.6 bits (96), Expect = 0.035
Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 4/67 (5%)
Query: 55 FNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSHQQIH---SWEEFK 111
F+GENP+ W A+ YF +ET Q +H G A W + +++++H SW E
Sbjct: 117 FSGENPKLWKKNAEKYFGMYNVAYETWAQFATLHFTGNAALWLQ-TYEELHSVESWAELC 175
Query: 112 VAFRLYF 118
VA F
Sbjct: 176 VAVNSKF 182
>UniRef100_Q8S653 Putative retroelement [Oryza sativa]
Length = 1043
Score = 41.6 bits (96), Expect = 0.035
Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 4/67 (5%)
Query: 55 FNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSHQQIH---SWEEFK 111
F+GENP+ W A+ YF +ET Q +H G A W + +++++H SW E
Sbjct: 117 FSGENPKLWKKNAEKYFGMYNVAYETWAQFATLHFTGNAALWLQ-TYEELHSVESWAELC 175
Query: 112 VAFRLYF 118
VA F
Sbjct: 176 VAVNSKF 182
>UniRef100_Q8H7X6 Putative retroelement [Oryza sativa]
Length = 1021
Score = 38.9 bits (89), Expect = 0.23
Identities = 26/84 (30%), Positives = 39/84 (45%), Gaps = 16/84 (19%)
Query: 35 YCHNVPSYYPQTHSSLQLSEFNGENPEYWLFMADCYF-LQNPYP--WETRFQLLAIHLGG 91
Y H VP L +F+G +PE W + YF + N +P W ++ I+ G
Sbjct: 226 YNHRVPK--------LDFPKFDGTDPEDWRMRCEHYFDVNNTFPGLW---VRIATIYFSG 274
Query: 92 KALTWFR--LSHQQIHSWEEFKVA 113
+A +W R +H + WE F VA
Sbjct: 275 RAASWLRSTKAHVRFPIWENFCVA 298
>UniRef100_Q9FWC7 Putative plant disease resistance polyprotein [Oryza sativa]
Length = 894
Score = 37.7 bits (86), Expect = 0.51
Identities = 16/71 (22%), Positives = 32/71 (44%), Gaps = 2/71 (2%)
Query: 50 LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSH--QQIHSW 107
+ F+G + WL M + YF + + +H+ G A W+ +++SW
Sbjct: 122 MDFPRFDGSDVRIWLNMCETYFDMYQITQNFKVSAVVLHMSGNAAQWYHSYKLVNEVNSW 181
Query: 108 EEFKVAFRLYF 118
++F++A F
Sbjct: 182 DQFRMAVATEF 192
>UniRef100_Q9S9R5 F28J9.14 [Arabidopsis thaliana]
Length = 525
Score = 37.4 bits (85), Expect = 0.67
Identities = 32/120 (26%), Positives = 51/120 (41%), Gaps = 9/120 (7%)
Query: 50 LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFR-LSHQQI---- 104
L FNG+ + WL + + +F P E + L++IH G A W + +SH +
Sbjct: 139 LDFPRFNGDKIQEWLLLVEQFFEIYQTPDEFKVCLVSIHFDGLANAWHQSISHSVMWEHV 198
Query: 105 -HSWEEFKVAFRLYFVYNRPNGFSTTTVKYGFEKSSLVEIQRSSEQVSPHRVVFSETTLE 163
H W +K+ ++ YN S + E + E E +S RV F E L+
Sbjct: 199 RHDWWSYKLLLQVR--YNEHVDDSIAKLTQLQETEGIEEYHARFELIST-RVNFGEDYLK 255
>UniRef100_Q9FYC9 Hypothetical protein F22J12_60 [Arabidopsis thaliana]
Length = 221
Score = 37.4 bits (85), Expect = 0.67
Identities = 29/120 (24%), Positives = 47/120 (39%), Gaps = 11/120 (9%)
Query: 10 SILSAQRDPRSIFYSPPSQPNTS-PYYCHNVPSYYPQTH----SSLQLSEFNGENPEYWL 64
S+ R + SP SQ S P + S Y + FNG+ + WL
Sbjct: 24 SMAPHDRSSSMVLGSPESQSGCSDPNHDVRSDSQYHYRRLRRLGKVDFPRFNGDGIKDWL 83
Query: 65 FMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSHQQI------HSWEEFKVAFRLYF 118
F + +FL + P E + + +IH TW + Q I H W +K+ ++ +
Sbjct: 84 FQIEQFFLIDHTPEELKVDIASIHFDDIDATWHQSIVQSIMWRHVRHDWWNYKLLLQVRY 143
>UniRef100_Q7XZZ8 Putative polyprotein [Oryza sativa]
Length = 1246
Score = 37.4 bits (85), Expect = 0.67
Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 4/70 (5%)
Query: 33 PYYCHNV-PSYYPQTH---SSLQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIH 88
P Y H V P+ Y + L+L+ FNGE+ WL + +F P + L + H
Sbjct: 279 PRYLHGVIPNPYTEMSLRSQRLELTLFNGEDAVGWLQQCEKFFEMTGTPVDQWVNLASGH 338
Query: 89 LGGKALTWFR 98
L G+A WFR
Sbjct: 339 LVGRAGKWFR 348
>UniRef100_UPI00004345B9 UPI00004345B9 UniRef100 entry
Length = 5078
Score = 37.0 bits (84), Expect = 0.87
Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 18/109 (16%)
Query: 132 KYGFEKSSLVEIQRSSEQVSPHRVVFSETTLEDGAQLPFEASSDVHLLPEVVSRVELGSA 191
K EK + +I++ E++ P + T D ++ P SS++H++ E A
Sbjct: 2290 KVDIEKHVVEKIEKVKEEIKPESIPIETKTAIDASKRP-SISSEIHVVTE---------A 2339
Query: 192 ETEIESHESSTSEDSSLITTPVDSTPYATRSDAITPDVNKTFVLKTTDL 240
ET++ S SE +D T TRS +I D+ + ++T +
Sbjct: 2340 ETKVIDKSKSPSE--------IDETEDKTRSPSIAGDIEDSKEVETASI 2380
>UniRef100_UPI00004345B8 UPI00004345B8 UniRef100 entry
Length = 4971
Score = 37.0 bits (84), Expect = 0.87
Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 18/109 (16%)
Query: 132 KYGFEKSSLVEIQRSSEQVSPHRVVFSETTLEDGAQLPFEASSDVHLLPEVVSRVELGSA 191
K EK + +I++ E++ P + T D ++ P SS++H++ E A
Sbjct: 1825 KVDIEKHVVEKIEKVKEEIKPESIPIETKTAIDASKRP-SISSEIHVVTE---------A 1874
Query: 192 ETEIESHESSTSEDSSLITTPVDSTPYATRSDAITPDVNKTFVLKTTDL 240
ET++ S SE +D T TRS +I D+ + ++T +
Sbjct: 1875 ETKVIDKSKSPSE--------IDETEDKTRSPSIAGDIEDSKEVETASI 1915
>UniRef100_Q852F5 Putative polyprotein [Oryza sativa]
Length = 1155
Score = 37.0 bits (84), Expect = 0.87
Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 2/71 (2%)
Query: 50 LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSH--QQIHSW 107
+ F+G + WL M + YF + +H+ G A W+ +++SW
Sbjct: 122 MDFPRFDGSDVRIWLDMCETYFDMYQITQNFKVSAAVLHMSGNAAQWYHSYKLVNEVNSW 181
Query: 108 EEFKVAFRLYF 118
++F++A F
Sbjct: 182 DQFRMAVATEF 192
>UniRef100_Q6C914 Similarities with tr|Q9XE85 Sorghum bicolor Polyprotein [Yarrowia
lipolytica]
Length = 678
Score = 37.0 bits (84), Expect = 0.87
Identities = 30/108 (27%), Positives = 47/108 (42%), Gaps = 10/108 (9%)
Query: 32 SPYYCHNVPSYYPQTHSSLQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGG 91
S Y H +P + T ++ +S+ N + WL + + P P R LL L G
Sbjct: 54 SIYPVHTIPIF---TGDAVLVSQ-NAQLAHDWLIAVENFLCTFPVPHHARPHLLGSRLFG 109
Query: 92 KALTWFRLSHQQ--IHSWEEFKVAFRLYFVYNRPNGFSTTTVKYGFEK 137
A W+R S + + +W EFK F Y+ F++ T + F K
Sbjct: 110 SAGLWWRQSMAKNILSNWHEFKSNFASYWCPE----FNSQTESHFFHK 153
>UniRef100_UPI00004193CA UPI00004193CA UniRef100 entry
Length = 1507
Score = 36.6 bits (83), Expect = 1.1
Identities = 31/119 (26%), Positives = 52/119 (43%), Gaps = 16/119 (13%)
Query: 119 VYNRPNGFSTTTVKYGFEKSSLVEIQRS------SEQVSPHRVVFSETTL-----EDGAQ 167
VY+ G + TTV F +S+ ++R S S H +F+E + E+
Sbjct: 1326 VYSSSPGSTETTV---FPRSTTTSVRREEPTTFHSRPASTHTTLFTEDSTTSGLTEESTA 1382
Query: 168 LPFEASSDVHLLPEVVSRVELGSAETEIESHESSTSEDSSLITTPVDSTPYATRSDAIT 226
P +S LP ++ +LG E +H S+ S ++L TP ST + ++ T
Sbjct: 1383 FPGSPASTQTGLPATLTTADLGLVEASTPTHSSTGSLHTTL--TPASSTSTGLQEESTT 1439
Score = 34.3 bits (77), Expect = 5.7
Identities = 28/105 (26%), Positives = 46/105 (43%), Gaps = 12/105 (11%)
Query: 127 STTTVKYGFEKSSLVEIQRSSEQVSPHRVVFSETTL-----EDGAQLPFEASSDVHLLPE 181
STTT +G E ++ S S H +F+E + E+ P +S LP
Sbjct: 506 STTTSVHGEEPTTF-----HSRPASTHTTLFTEDSTTSGLTEESTAFPGSPASTQTGLPA 560
Query: 182 VVSRVELGSAETEIESHESSTSEDSSLITTPVDSTPYATRSDAIT 226
++ +LG E +H S+ S ++L TP ST + ++ T
Sbjct: 561 TLTTADLGLVEASTPTHSSTGSLHTTL--TPASSTSAGLQEESTT 603
>UniRef100_Q84ZV5 Polyprotein [Glycine max]
Length = 1552
Score = 36.6 bits (83), Expect = 1.1
Identities = 19/71 (26%), Positives = 33/71 (45%), Gaps = 2/71 (2%)
Query: 50 LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRL--SHQQIHSW 107
L F+G+N W+F A+ +F P R + ++HL + W+++ + SW
Sbjct: 100 LDFPRFDGKNVMDWIFKAEQFFDYYATPDADRLIIASVHLDQDVVPWYQMLQKTEPFSSW 159
Query: 108 EEFKVAFRLYF 118
+ F A L F
Sbjct: 160 QAFTRALELDF 170
>UniRef100_Q7XEK0 Hypothetical protein [Oryza sativa]
Length = 1611
Score = 36.6 bits (83), Expect = 1.1
Identities = 18/49 (36%), Positives = 25/49 (50%)
Query: 50 LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFR 98
L++S F GE+P WL + +F P + L HL G+A WFR
Sbjct: 232 LEISLFTGEDPVDWLKQCEKFFEITGTPVDQWVNLAVAHLYGRAAKWFR 280
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.318 0.134 0.395
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 600,302,556
Number of Sequences: 2790947
Number of extensions: 24036212
Number of successful extensions: 70104
Number of sequences better than 10.0: 80
Number of HSP's better than 10.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 69
Number of HSP's that attempted gapping in prelim test: 70007
Number of HSP's gapped (non-prelim): 145
length of query: 369
length of database: 848,049,833
effective HSP length: 129
effective length of query: 240
effective length of database: 488,017,670
effective search space: 117124240800
effective search space used: 117124240800
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)
Lotus: description of TM0074.12