Miyakogusa Predicted Gene
- Lj0g3v0225769.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0225769.1 Non Characterized Hit- tr|D7U3Z2|D7U3Z2_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,32,0.000005,seg,NULL,CUFF.14696.1
(172 letters)
Database: trembl
41,451,118 sequences; 13,208,986,710 total letters
                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

I1K1K5_SOYBN (tr|I1K1K5) Uncharacterized protein OS=Glycine max ...   202   5e-50
G7KPZ4_MEDTR (tr|G7KPZ4) Pentatricopeptide repeat-containing pro...   190   1e-46
M5WUJ9_PRUPE (tr|M5WUJ9) Uncharacterized protein (Fragment) OS=P...   171   1e-40
D7SJ51_VITVI (tr|D7SJ51) Putative uncharacterized protein OS=Vit...   171   1e-40
A5B987_VITVI (tr|A5B987) Putative uncharacterized protein OS=Vit...   169   4e-40
B9IJM1_POPTR (tr|B9IJM1) Predicted protein OS=Populus trichocarp...   164   7e-39
B9S636_RICCO (tr|B9S636) Pentatricopeptide repeat-containing pro...   155   4e-36
K4BEG2_SOLLC (tr|K4BEG2) Uncharacterized protein OS=Solanum lyco...   147   2e-33
M4E9X2_BRARP (tr|M4E9X2) Uncharacterized protein OS=Brassica rap...   145   4e-33
M1AIH1_SOLTU (tr|M1AIH1) Uncharacterized protein OS=Solanum tube...   141   6e-32
D7MJ86_ARALL (tr|D7MJ86) Binding protein OS=Arabidopsis lyrata s...   134   1e-29
R0F1A0_9BRAS (tr|R0F1A0) Uncharacterized protein OS=Capsella rub...   133   2e-29
>I1K1K5_SOYBN (tr|I1K1K5) Uncharacterized protein OS=Glycine max PE=4 SV=2
Length = 626
Score = 202 bits (513), Expect = 5e-50, Method: Composition-based stats.
Identities = 113/171 (66%), Positives = 125/171 (73%), Gaps = 19/171 (11%)
Query: 1 MIRRPSISATTVPILSESFSNTTXXXXXXXXXXXXXXXXLVTIHDSQSKPISSPFYNLLP 60
MIRRPSI VP LS+S ++ L+ +HDS SK SPFYNLLP
Sbjct: 1 MIRRPSI---YVPTLSQSMFFSSSSS-------------LIALHDSHSKSTLSPFYNLLP 44
Query: 61 PTQNPHNIVNLISSLLKQKSFHLSLFQN---DIKGILPHMGTHEISRVLLRCQSDHSSAL 117
PTQNP+NIVNLISS+LK KS +LSL + DIKGILPHMG HEISR+LLRCQSDHSS L
Sbjct: 45 PTQNPNNIVNLISSILKHKSSNLSLLHSSNNDIKGILPHMGPHEISRILLRCQSDHSSVL 104
Query: 118 TFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLLCELIHLVEVEGV 168
TFFNWVKNDL + P + NYCVIVHILAWSRVFS AM LL ELI LVEVEGV
Sbjct: 105 TFFNWVKNDLNITPTLHNYCVIVHILAWSRVFSHAMNLLSELIQLVEVEGV 155
>G7KPZ4_MEDTR (tr|G7KPZ4) Pentatricopeptide repeat-containing protein OS=Medicago
truncatula GN=MTR_6g005000 PE=4 SV=1
Length = 846
Score = 190 bits (483), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 95/158 (60%), Positives = 113/158 (71%), Gaps = 1/158 (0%)
Query: 5 PSISATTVPILSESFSNTTXXXXXXXXXXXXXXXXLVTIHDSQSKPISSPFYNLLPPTQN 64
P++ T P +S S + + L+ +HDS S PISSPFYNLLPPT N
Sbjct: 11 PAMKGFTFPRISNSIYSLSDSIFTKYSSFSSSSSSLI-LHDSDSNPISSPFYNLLPPTHN 69
Query: 65 PHNIVNLISSLLKQKSFHLSLFQNDIKGILPHMGTHEISRVLLRCQSDHSSALTFFNWVK 124
P+NIVNLIS+ LKQKSFHLS FQ K ILPH+G HEISRVL+R QSD SSALTFFNWVK
Sbjct: 70 PNNIVNLISTALKQKSFHLSHFQTQFKTILPHLGAHEISRVLIRTQSDASSALTFFNWVK 129
Query: 125 NDLGVKPAVQNYCVIVHILAWSRVFSKAMKLLCELIHL 162
NDL ++QNYC+IVHIL W+++F +AMKLLCELI L
Sbjct: 130 NDLRFTLSLQNYCLIVHILGWNQIFDQAMKLLCELIQL 167
>M5WUJ9_PRUPE (tr|M5WUJ9) Uncharacterized protein (Fragment) OS=Prunus persica
GN=PRUPE_ppa017372mg PE=4 SV=1
Length = 601
Score = 171 bits (432), Expect = 1e-40, Method: Composition-based stats.
Identities = 81/124 (65%), Positives = 96/124 (77%)
Query: 40 LVTIHDSQSKPISSPFYNLLPPTQNPHNIVNLISSLLKQKSFHLSLFQNDIKGILPHMGT 99
L TI DS S +S+P Y+ LP TQNP+NIVNLI S LKQ + HLSL QNDIK + PH+G
Sbjct: 10 LQTIPDSHSISVSNPLYHFLPQTQNPNNIVNLICSSLKQGNAHLSLLQNDIKELFPHLGA 69
Query: 100 HEISRVLLRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLLCEL 159
EISRVLLR SD+SSAL FFNWVKN LG++P QNYC+++HILA S+ F +AMKLL EL
Sbjct: 70 QEISRVLLRFHSDYSSALVFFNWVKNGLGLRPTTQNYCIVIHILACSKKFPQAMKLLWEL 129
Query: 160 IHLV 163
I LV
Sbjct: 130 IELV 133
>D7SJ51_VITVI (tr|D7SJ51) Putative uncharacterized protein OS=Vitis vinifera
GN=VIT_17s0000g03060 PE=4 SV=1
Length = 660
Score = 171 bits (432), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 81/137 (59%), Positives = 105/137 (76%), Gaps = 4/137 (2%)
Query: 40 LVTIHDSQSKPISSPFYNLLPPTQNPHNIVNLISSLLKQKSFHLSLFQNDIKGILPHMGT 99
L T++ SQ+ IS+ Y+LLP TQNP+NIVNLI S LKQ + +L+L ++ +LPH+G
Sbjct: 41 LQTLNQSQTNSISNSLYHLLPQTQNPNNIVNLICSNLKQHNSNLALLHTEVNALLPHLGA 100
Query: 100 HEISRVLLRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLLCEL 159
EISRVLLRCQSD +AL+FFNWVKNDLG++P+ QNYC++VH LAWSR FS+AMK LCEL
Sbjct: 101 QEISRVLLRCQSDSFTALSFFNWVKNDLGLRPSTQNYCIVVHTLAWSRNFSQAMKFLCEL 160
Query: 160 IHLVE----VEGVFSNV 172
+ LV+ E VF N+
Sbjct: 161 VELVKDDLPSEDVFKNL 177
>A5B987_VITVI (tr|A5B987) Putative uncharacterized protein OS=Vitis vinifera
GN=VITISV_038361 PE=4 SV=1
Length = 676
Score = 169 bits (427), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 81/137 (59%), Positives = 105/137 (76%), Gaps = 4/137 (2%)
Query: 40 LVTIHDSQSKPISSPFYNLLPPTQNPHNIVNLISSLLKQKSFHLSLFQNDIKGILPHMGT 99
L T++ SQ+ IS+ Y+LLP TQNP+NIVNLI S LKQ + +L+L ++ +LPH+G
Sbjct: 86 LQTLNQSQTNSISNSLYHLLPQTQNPNNIVNLICSNLKQHNSNLALLHTEVNALLPHLGA 145
Query: 100 HEISRVLLRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLLCEL 159
EISRVLLRCQSD +AL+FFNWVKNDLG++P+ QNYC+IVH LAWSR FS+A+K LCEL
Sbjct: 146 QEISRVLLRCQSDSFTALSFFNWVKNDLGLRPSTQNYCIIVHTLAWSRNFSQAIKFLCEL 205
Query: 160 IHLVE----VEGVFSNV 172
+ LV+ E VF N+
Sbjct: 206 VELVKDDLPGEDVFKNL 222
>B9IJM1_POPTR (tr|B9IJM1) Predicted protein OS=Populus trichocarpa
GN=POPTRDRAFT_908237 PE=4 SV=1
Length = 626
Score = 164 bits (416), Expect = 7e-39, Method: Composition-based stats.
Identities = 80/151 (52%), Positives = 105/151 (69%), Gaps = 1/151 (0%)
Query: 15 LSESFSNTTXXXXXXXXXXXXXXXXLVTIHDSQSKPISSPFYNLLPPTQNPHNIVNLISS 74
+S++F++ L TI +S S +S+P Y+ LP QNP+N VNLI S
Sbjct: 11 ISKTFASNAQNHTLTKNLVPSSSSSLQTIQNSNSISLSNPLYSFLPENQNPNNFVNLIYS 70
Query: 75 LLKQKSFHLSLFQND-IKGILPHMGTHEISRVLLRCQSDHSSALTFFNWVKNDLGVKPAV 133
LK+ + L+L QND IKG++ H+G +EISRVLLRCQSD SALTFFNWVKNDLG+KP+
Sbjct: 71 SLKRDNTQLTLLQNDDIKGLIHHLGANEISRVLLRCQSDSVSALTFFNWVKNDLGLKPST 130
Query: 134 QNYCVIVHILAWSRVFSKAMKLLCELIHLVE 164
NYC+I+H+LAWS+ F +AMK L ELI LV+
Sbjct: 131 LNYCLILHVLAWSKEFEQAMKFLTELILLVK 161
>B9S636_RICCO (tr|B9S636) Pentatricopeptide repeat-containing protein, putative
OS=Ricinus communis GN=RCOM_1063960 PE=4 SV=1
Length = 623
Score = 155 bits (393), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 78/128 (60%), Positives = 100/128 (78%), Gaps = 3/128 (2%)
Query: 40 LVTIHDSQSKPISSPFYNLLPPTQNPHNIVNLISSLLKQKSFHLSLFQN---DIKGILPH 96
L+ I DS S+ +S+P Y+LLP TQNP+NIVN++ S LKQ + + S D+K +LPH
Sbjct: 29 LLAIPDSNSRSLSNPLYHLLPQTQNPNNIVNIVYSSLKQHNNNNSHLNLLQNDVKHLLPH 88
Query: 97 MGTHEISRVLLRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLL 156
+GT EISRVLLRCQSD SALTFF+WVKNDLG++P++QNYC +VHILAWS+ F +AMK L
Sbjct: 89 LGTDEISRVLLRCQSDSISALTFFSWVKNDLGLQPSIQNYCFLVHILAWSKEFKEAMKFL 148
Query: 157 CELIHLVE 164
ELI LV+
Sbjct: 149 TELIKLVK 156
>K4BEG2_SOLLC (tr|K4BEG2) Uncharacterized protein OS=Solanum lycopersicum
GN=Solyc03g007390.2 PE=4 SV=1
Length = 635
Score = 147 bits (370), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 71/126 (56%), Positives = 93/126 (73%), Gaps = 3/126 (2%)
Query: 40 LVTIHDSQSKPISSPFYNLLPPTQNPHNIVNLISSLLKQ-KSFHLSLFQNDIK--GILPH 96
++T ++ ++PIS+P +N LP TQNP N V+LI S LK + HLSL Q I+ G++ +
Sbjct: 38 ILTDYNGSARPISNPLHNFLPKTQNPQNTVSLICSALKNGNNDHLSLLQKTIRDNGLICY 97
Query: 97 MGTHEISRVLLRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLL 156
EISRVLLRCQSD SAL+FFNWVKNDLGV+P QNYC+++HIL W+R FS+AMK L
Sbjct: 98 FSDSEISRVLLRCQSDSFSALSFFNWVKNDLGVEPNTQNYCLVIHILTWTRNFSQAMKFL 157
Query: 157 CELIHL 162
EL+ L
Sbjct: 158 SELVDL 163
>M4E9X2_BRARP (tr|M4E9X2) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra025578 PE=4 SV=1
Length = 623
Score = 145 bits (367), Expect = 4e-33, Method: Composition-based stats.
Identities = 72/129 (55%), Positives = 92/129 (71%), Gaps = 3/129 (2%)
Query: 45 DSQSKPIS-SPFYNLLPPTQNPHNIVNLISSLLKQKSFHLSLFQNDIKGILPHMGTHEIS 103
D KPI+ +P YNLLP TQNP+ IV++I S L Q+ S N++K ++PH+G +IS
Sbjct: 39 DDPPKPITFTPLYNLLPNTQNPNRIVDVICSSLNQRDSLSSNLHNEVKSLIPHLGHRQIS 98
Query: 104 RVLLRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLLCELIHLV 163
RVLLR QSD + AL FFNWVK+DLG P V NYC+++H+LAWS+ F AM+ LCELI LV
Sbjct: 99 RVLLRFQSDATRALAFFNWVKSDLGKTPNVGNYCLLLHVLAWSKKFPLAMQFLCELIELV 158
Query: 164 --EVEGVFS 170
E E VFS
Sbjct: 159 VKEEEDVFS 167
>M1AIH1_SOLTU (tr|M1AIH1) Uncharacterized protein OS=Solanum tuberosum
GN=PGSC0003DMG400009102 PE=4 SV=1
Length = 629
Score = 141 bits (356), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 68/126 (53%), Positives = 92/126 (73%), Gaps = 3/126 (2%)
Query: 40 LVTIHDSQSKPISSPFYNLLPPTQNPHNIVNLISSLLKQKS-FHLSLFQNDIK--GILPH 96
++T + ++P S+P +N LP TQNP N V+LI S LK ++ HLSL Q I+ G++ +
Sbjct: 38 ILTDYSGSARPFSNPLHNFLPKTQNPQNTVSLICSALKNRNNDHLSLLQKTIQDNGLISY 97
Query: 97 MGTHEISRVLLRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLL 156
EISRVLLRCQSD SAL+FFNWVKNDLGV+P +N C+++HIL W+R FS+AMK L
Sbjct: 98 FSDSEISRVLLRCQSDSCSALSFFNWVKNDLGVEPNTRNCCLVIHILTWTRNFSQAMKFL 157
Query: 157 CELIHL 162
EL++L
Sbjct: 158 SELVNL 163
>D7MJ86_ARALL (tr|D7MJ86) Binding protein OS=Arabidopsis lyrata subsp. lyrata
GN=ARALYDRAFT_493948 PE=4 SV=1
Length = 608
Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/127 (55%), Positives = 89/127 (70%), Gaps = 5/127 (3%)
Query: 49 KPISSPFYNLLPPTQNPHNIVNLISSLLKQKSFHLSL--FQNDIKGILPHMGTHEISRVL 106
KPI +P YNLLP TQNP+ IV++I S L + + L ++++K ++PH+G EISRVL
Sbjct: 25 KPILNPLYNLLPQTQNPNKIVDVICSTLNHRDHSVLLPNLRDEVKSLIPHLGYPEISRVL 84
Query: 107 LRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLLCELIHLV--- 163
LR QSD S ALTFF WVK DLG +P V NYC+++HILA S+ F AM+ LCELI L
Sbjct: 85 LRFQSDASRALTFFKWVKFDLGKRPNVGNYCLLLHILASSKKFPLAMQFLCELIELTSKK 144
Query: 164 EVEGVFS 170
E E VFS
Sbjct: 145 EEEDVFS 151
>R0F1A0_9BRAS (tr|R0F1A0) Uncharacterized protein OS=Capsella rubella
GN=CARUB_v10006941mg PE=4 SV=1
Length = 615
Score = 133 bits (335), Expect = 2e-29, Method: Composition-based stats.
Identities = 68/121 (56%), Positives = 85/121 (70%), Gaps = 4/121 (3%)
Query: 49 KPISSPFYNLLPPTQNPHNIVNLISSLLKQKSFHLSLFQN---DIKGILPHMGTHEISRV 105
KPI +PFYNLLP TQNP+ IV++I S L + HL L N +++ ++PH+G EISRV
Sbjct: 28 KPIYNPFYNLLPQTQNPNKIVDVICSTLNHRD-HLVLLPNLRGEVETLIPHLGYPEISRV 86
Query: 106 LLRCQSDHSSALTFFNWVKNDLGVKPAVQNYCVIVHILAWSRVFSKAMKLLCELIHLVEV 165
LLR QSD S AL FF WVK DLG +P V NYC+++HILA S+ F AM+ LCELI L
Sbjct: 87 LLRFQSDASRALVFFKWVKFDLGKRPNVGNYCLLLHILASSKKFPLAMQYLCELIDLTSK 146
Query: 166 E 166
E
Sbjct: 147 E 147