Miyakogusa Predicted Gene - Lj4g3v2267350.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2267350.1 CUFF.50646.1
(165 letters)
Database: trembl
41,451,118 sequences; 13,208,986,710 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
G7JNW3_MEDTR (tr|G7JNW3) Pre-mRNA polyadenylation factor fip1 OS... 192 4e-47
K7MBA3_SOYBN (tr|K7MBA3) Uncharacterized protein OS=Glycine max ... 176 2e-42
K7LBM5_SOYBN (tr|K7LBM5) Uncharacterized protein OS=Glycine max ... 159 3e-37
A5BBK2_VITVI (tr|A5BBK2) Putative uncharacterized protein OS=Vit... 95 7e-18
D7SYL3_VITVI (tr|D7SYL3) Putative uncharacterized protein OS=Vit... 95 1e-17
B9RZQ4_RICCO (tr|B9RZQ4) Putative uncharacterized protein OS=Ric... 86 5e-15
B9HX18_POPTR (tr|B9HX18) Predicted protein OS=Populus trichocarp... 78 1e-12
K4DHU9_SOLLC (tr|K4DHU9) Uncharacterized protein OS=Solanum lyco... 69 6e-10
M1A050_SOLTU (tr|M1A050) Uncharacterized protein OS=Solanum tube... 68 1e-09
M1A049_SOLTU (tr|M1A049) Uncharacterized protein OS=Solanum tube... 67 2e-09
>G7JNW3_MEDTR (tr|G7JNW3) Pre-mRNA polyadenylation factor fip1 OS=Medicago
truncatula GN=MTR_4g122070 PE=4 SV=1
Length = 1110
Score = 192 bits (488), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 106/174 (60%), Positives = 128/174 (73%), Gaps = 13/174 (7%)
Query: 1 MDQGFTKKRTALLAFDDSHNNKTIKFDISKSVCENENQKWLENLPNKGQKESLDIEEGEI 60
+D+G +KR AL+ FDDS K IK D+SKS C ++N+K L+NL +KGQKE LD+EEGEI
Sbjct: 938 VDRGIAEKRKALVGFDDSRK-KAIKLDVSKSQCVDQNKKLLQNLSDKGQKEGLDVEEGEI 996
Query: 61 VTEEPHMEPSVSRRDGSEGAALTDNTVKKRKSQNGYNSELHTGNIASQKILDTLAKMEKR 120
VTEEP +E SVSRRD SEGA L +N VKK+ SQNG NSE N+ SQKILDTLAKMEKR
Sbjct: 997 VTEEPSVEVSVSRRDVSEGATLAEN-VKKKISQNGNNSEPQIDNLDSQKILDTLAKMEKR 1055
Query: 121 GERFKQPMNMIKEA-----------EKTLKLNADTAVNMSEIKQHRPARKRRWN 163
ERFKQP+ M KEA K+LKLN ++AV++ E+KQ RP RKRRWN
Sbjct: 1056 RERFKQPIGMNKEAVKQPISLNNEVVKSLKLNTNSAVDIGEMKQQRPVRKRRWN 1109
>K7MBA3_SOYBN (tr|K7MBA3) Uncharacterized protein OS=Glycine max PE=4 SV=1
Length = 1098
Score = 176 bits (446), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/164 (58%), Positives = 122/164 (74%), Gaps = 3/164 (1%)
Query: 1 MDQGFTKKRT-ALLAFDDSHNNKTIKFDISKSVCENENQKWLENLPNKGQKESLDIEEGE 59
++QG KKR A + FD+S+ N T KFD K +++KW++NL ++ QKES DIEEG+
Sbjct: 933 VNQGIAKKRKRASVGFDESNKN-TFKFDSPKYESNLKSKKWVQNLQDQAQKESSDIEEGQ 991
Query: 60 IVTEEPHMEP-SVSRRDGSEGAALTDNTVKKRKSQNGYNSELHTGNIASQKILDTLAKME 118
IV EEP+ME SVSRRD SEG A+TD+ KKR SQN +S+ + G SQ+ILD+LAKME
Sbjct: 992 IVAEEPYMEKVSVSRRDASEGPAVTDSVNKKRMSQNENSSDQYIGGYDSQRILDSLAKME 1051
Query: 119 KRGERFKQPMNMIKEAEKTLKLNADTAVNMSEIKQHRPARKRRW 162
KR ERFKQPM M KEAE++LKLN D+ V+ E+KQHRP RKRRW
Sbjct: 1052 KRRERFKQPMTMKKEAEESLKLNNDSIVDTGEMKQHRPTRKRRW 1095
>K7LBM5_SOYBN (tr|K7LBM5) Uncharacterized protein OS=Glycine max PE=4 SV=1
Length = 1094
Score = 159 bits (403), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 86/156 (55%), Positives = 111/156 (71%), Gaps = 7/156 (4%)
Query: 7 KKRTALLAFDDSHNNKTIKFDISKSVCENENQKWLENLPNKGQKESLDIEEGEIVTEEPH 66
K+R A + FD+S+ N + KFD K E++KW+++L ++ QKES +IEEG+ V EEP+
Sbjct: 943 KRRRAAVGFDESNKNAS-KFDTPKHKSNQESKKWVQDLQDQAQKESSEIEEGQFVAEEPY 1001
Query: 67 MEPSVSRRDGSEGAALTDNTVKKRKSQNGYNSELHTGNIASQKILDTLAKMEKRGERFKQ 126
ME + SEG A+TD KKR SQN +SE G SQ+ILD+LAKMEKR ERFKQ
Sbjct: 1002 ME------EASEGPAVTDGVNKKRMSQNENSSEQCIGGYDSQRILDSLAKMEKRRERFKQ 1055
Query: 127 PMNMIKEAEKTLKLNADTAVNMSEIKQHRPARKRRW 162
PM M KEAE++LKLN D+ V+ E+KQHRPARKRRW
Sbjct: 1056 PMTMKKEAEESLKLNDDSIVDKGEMKQHRPARKRRW 1091
>A5BBK2_VITVI (tr|A5BBK2) Putative uncharacterized protein OS=Vitis vinifera
GN=VITISV_011790 PE=4 SV=1
Length = 1338
Score = 95.1 bits (235), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 80/142 (56%), Gaps = 1/142 (0%)
Query: 22 KTIKFDISKSVCENENQKWLENLPNKGQKESLDIEEGEIVTEEPHMEPSVSRRDGSEGAA 81
K I+ D+ KS N+K L+ E+LDIEEG+I+ EE + + SV +D SE
Sbjct: 1196 KIIQPDL-KSESNWNNEKCLDKFLVTEHDEALDIEEGQIIPEEMNXDDSVETKDASESIT 1254
Query: 82 LTDNTVKKRKSQNGYNSELHTGNIASQKILDTLAKMEKRGERFKQPMNMIKEAEKTLKLN 141
+ N ++ + N N +Q+IL TLAKMEKR ERFK+P+ + KE +K K
Sbjct: 1255 PSRNVKRRLGNANAANGNKVVAECDNQRILQTLAKMEKRQERFKKPITLKKEPDKIPKPQ 1314
Query: 142 ADTAVNMSEIKQHRPARKRRWN 163
D V M+E Q RP RKRRWN
Sbjct: 1315 VDPIVEMAETMQQRPLRKRRWN 1336
>D7SYL3_VITVI (tr|D7SYL3) Putative uncharacterized protein OS=Vitis vinifera
GN=VIT_05s0077g01000 PE=4 SV=1
Length = 1300
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 80/142 (56%), Gaps = 1/142 (0%)
Query: 22 KTIKFDISKSVCENENQKWLENLPNKGQKESLDIEEGEIVTEEPHMEPSVSRRDGSEGAA 81
K I+ D+ KS N+K L+ E+LDIEEG+I+ EE + + SV +D SE
Sbjct: 1158 KIIQPDL-KSESNWNNEKCLDKFLVTEHDEALDIEEGQIIPEEMNEDDSVETKDASESIT 1216
Query: 82 LTDNTVKKRKSQNGYNSELHTGNIASQKILDTLAKMEKRGERFKQPMNMIKEAEKTLKLN 141
+ N ++ + N N +Q+IL TLAKMEKR ERFK+P+ + KE +K K
Sbjct: 1217 PSRNVKRRLGNANAANGNKVVAECDNQRILQTLAKMEKRQERFKKPITLKKEPDKIPKPQ 1276
Query: 142 ADTAVNMSEIKQHRPARKRRWN 163
D V M+E Q RP RKRRWN
Sbjct: 1277 VDPIVEMAETMQQRPLRKRRWN 1298
>B9RZQ4_RICCO (tr|B9RZQ4) Putative uncharacterized protein OS=Ricinus communis
GN=RCOM_1000520 PE=4 SV=1
Length = 1155
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/133 (39%), Positives = 75/133 (56%), Gaps = 16/133 (12%)
Query: 37 NQKWLENLPNKGQKESLDIEEGEIVTEEPHMEPSVSRRDGSEGAALTDNTVKKRKSQNGY 96
+++WL+ P Q LDIEEG+IV EEP + + + E +L R +N +
Sbjct: 1033 DERWLDKFPVSKQDGYLDIEEGQIVPEEPTIGNRLEEKQAPETVSLM------RSMKNAF 1086
Query: 97 NSELHTGNIAS-----QKILDTLAKMEKRGERFKQPMNMIKEAEKTLKLNADTAVNMSEI 151
H+GN+ + Q+IL++LAKMEKR ERFK P+ +E +K +K D + +
Sbjct: 1087 ----HSGNMTNKRYDDQQILESLAKMEKRRERFKDPIAFKREPDKPMK-PIDLIADAIKS 1141
Query: 152 KQHRPARKRRWND 164
KQ RPARKRRW D
Sbjct: 1142 KQERPARKRRWAD 1154
>B9HX18_POPTR (tr|B9HX18) Predicted protein OS=Populus trichocarpa
GN=POPTRDRAFT_769829 PE=4 SV=1
Length = 1253
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 89/164 (54%), Gaps = 6/164 (3%)
Query: 1 MDQGFTKKRTALLAFDDSHNNKTIKFDISKSVCENENQKWLENLPNKGQKESLDIEEGEI 60
MD F K+ F++S I+ D+ ++ +++ +KW+ E L+IE+G+I
Sbjct: 1091 MDLKFAKEPMCSKDFNESQTG--IQTDVLETGGDDK-EKWIGKSQVTEHNEKLNIEDGQI 1147
Query: 61 VTEEPHMEPSVSRRDGSEGAALTDNTVKKRK--SQNGYNSELHTGNIASQKILDTLAKME 118
+ EE ME ++++ + T N K R +N + + G + S++ILDT+AKME
Sbjct: 1148 MAEESSMESKLAKKCAFKSVVPTCNA-KNRNFLCENASSRNKNDGAVDSKRILDTIAKME 1206
Query: 119 KRGERFKQPMNMIKEAEKTLKLNADTAVNMSEIKQHRPARKRRW 162
KR ERFK P+ KE +KT + + ++ Q RPARKRRW
Sbjct: 1207 KRRERFKDPIAQKKELDKTSEPQVEVIIDTVPANQDRPARKRRW 1250
>K4DHU9_SOLLC (tr|K4DHU9) Uncharacterized protein OS=Solanum lycopersicum
GN=Solyc12g099280.1 PE=4 SV=1
Length = 1130
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 52/138 (37%), Positives = 71/138 (51%), Gaps = 28/138 (20%)
Query: 34 ENENQKWLENLPNKGQKESLDIEEGEIVTEEPHMEPSVSRRDGSEGAALTDNTVKKR--- 90
EN+ ++ L + Q+ESLDIEEG+I+ E + + VKKR
Sbjct: 1005 ENDKER-LAIFSDANQEESLDIEEGQIIEE------------------MNEKIVKKRITY 1045
Query: 91 --KSQNGYNSELHTG-NIASQ---KILDTLAKMEKRGERFKQPMNMIKEAEKTLKLNADT 144
KS+ G TG N+ Q KIL+ +AKMEKRGERFKQP+ + + + D+
Sbjct: 1046 SGKSEIGEMKNFATGKNVEGQGSPKILEIIAKMEKRGERFKQPIALKSDTKNISTPLVDS 1105
Query: 145 AVNMSEIKQHRPARKRRW 162
+E Q RPARKRRW
Sbjct: 1106 FAVSTEPMQPRPARKRRW 1123
>M1A050_SOLTU (tr|M1A050) Uncharacterized protein OS=Solanum tuberosum
GN=PGSC0003DMG400004621 PE=4 SV=1
Length = 184
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 65/132 (49%), Gaps = 28/132 (21%)
Query: 41 LENLPNKGQKESLDIEEGEIVTEEPHMEPSVSRRDGSEGAALTDNTVKKRKSQNGYNSEL 100
L+ + Q+ESLDIEEG+I+ E + + +KKR + +G +
Sbjct: 64 LDIFSDANQEESLDIEEGQIIEE------------------MNEKIIKKRITCSGKSQIS 105
Query: 101 HTGNIASQK----------ILDTLAKMEKRGERFKQPMNMIKEAEKTLKLNADTAVNMSE 150
N A K IL+ +AKMEKRGERFKQP+ + + + K D+ +E
Sbjct: 106 EMKNFAYDKNVEGQDNNPRILEIMAKMEKRGERFKQPIALKSDTKNVSKPLVDSFALSTE 165
Query: 151 IKQHRPARKRRW 162
Q RPARKRRW
Sbjct: 166 PMQPRPARKRRW 177
>M1A049_SOLTU (tr|M1A049) Uncharacterized protein OS=Solanum tuberosum
GN=PGSC0003DMG400004621 PE=4 SV=1
Length = 1130
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 70/132 (53%), Gaps = 28/132 (21%)
Query: 41 LENLPNKGQKESLDIEEGEIVTEEPHMEPSVSRRDGSEGAALTDNTVKKRKSQNGYN--S 98
L+ + Q+ESLDIEEG+I+ E + + +KKR + +G + S
Sbjct: 1010 LDIFSDANQEESLDIEEGQIIEE------------------MNEKIIKKRITCSGKSQIS 1051
Query: 99 EL----HTGNIASQ----KILDTLAKMEKRGERFKQPMNMIKEAEKTLKLNADTAVNMSE 150
E+ + N+ Q +IL+ +AKMEKRGERFKQP+ + + + K D+ +E
Sbjct: 1052 EMKNFAYDKNVEGQDNNPRILEIMAKMEKRGERFKQPIALKSDTKNVSKPLVDSFALSTE 1111
Query: 151 IKQHRPARKRRW 162
Q RPARKRRW
Sbjct: 1112 PMQPRPARKRRW 1123
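
The "Identities / Positives / Gaps" lines in the alignments above give each count over the alignment length, followed by a percentage. The sketch below is one possible way to pull those numbers out of such a line in Python. The names `STAT_RE`, `parse_alignment_stats` and `floor_percent` are hypothetical helpers introduced here for illustration and are not part of BLAST. The regular expression assumes the single-space layout seen in this report. Rounding down is an observation about the values printed here (for example 106/174 is shown as 60%), not a documented guarantee.

```python
import re

# Matches one "Name = count/length (pct%)" field on a statistics line.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Return {name: (count, length, reported_percent)} for one statistics line."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in STAT_RE.findall(line)
    }


def floor_percent(count, length):
    """Integer percentage rounded down (assumption: mirrors the values in this report)."""
    return count * 100 // length


# Statistics line copied from the first alignment (G7JNW3_MEDTR).
line = "Identities = 106/174 (60%), Positives = 128/174 (73%), Gaps = 13/174 (7%)"
stats = parse_alignment_stats(line)

for name, (count, length, reported) in stats.items():
    # Print the reported percentage next to the recomputed one for comparison.
    print(name, count, length, reported, floor_percent(count, length))
```

Applied to the other statistics lines in this report, the same function returns the raw counts, which avoids relying on the printed whole-number percentages when comparing hits.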