Miyakogusa Predicted Gene
- Lj4g3v1386900.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1386900.1 Non Characterized Hit- tr|I1MUQ6|I1MUQ6_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,77.81,0,INTEGRATOR
COMPLEX SUBUNIT 11,NULL; CLEAVAGE AND POLYADENYLATION SPECIFICITY
FACTOR,NULL; seg,NULL,CUFF.49095.1
(314 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments (each line ends with: Score in bits, E value):
AT4G11680.1 | Symbols: | Zinc finger, C3HC4 type (RING finger) ... 330 6e-91
AT1G12760.1 | Symbols: | Zinc finger, C3HC4 type (RING finger) ... 321 5e-88
AT1G63170.1 | Symbols: | Zinc finger, C3HC4 type (RING finger) ... 310 6e-85
AT1G12760.2 | Symbols: | Zinc finger, C3HC4 type (RING finger) ... 294 5e-80
AT3G61180.1 | Symbols: | RING/U-box superfamily protein | chr3:... 293 9e-80
AT1G68070.1 | Symbols: | Zinc finger, C3HC4 type (RING finger) ... 221 4e-58
AT2G01735.1 | Symbols: RIE1 | RING-finger protein for embryogene... 202 3e-52
AT1G80400.1 | Symbols: | RING/U-box superfamily protein | chr1:... 87 1e-17
AT4G32600.1 | Symbols: | RING/U-box superfamily protein | chr4:... 79 4e-15
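
The hit summary lines above all end with a bit score followed by an E-value, so they can be split from the right. The following is a minimal sketch under that assumption; `parse_hit_line` is a hypothetical helper name, not part of BLAST, and the handling of a bare `e-NNN` E-value is an assumption about how legacy BLAST prints very small values.

```python
def parse_hit_line(line):
    """Split one hit summary line into (sequence id, bit score, E-value).

    Assumes the line ends with two whitespace-separated fields,
    the bit score and the E-value, as in the table above.
    """
    head, bits, evalue = line.rsplit(None, 2)
    # The sequence identifier is everything before the first "|".
    seq_id = head.split("|", 1)[0].strip()
    # Assumption: a value printed as "e-105" means "1e-105".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return seq_id, float(bits), float(evalue)


if __name__ == "__main__":
    example = "AT4G11680.1 | Symbols: | Zinc finger, C3HC4 type (RING finger) ... 330 6e-91"
    print(parse_hit_line(example))
```

Splitting from the right avoids depending on the description text, which is truncated with `...` and may itself contain digits or `|` characters.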
>AT4G11680.1 | Symbols: | Zinc finger, C3HC4 type (RING finger)
family protein | chr4:7053737-7055516 REVERSE LENGTH=390
Length = 390
Score = 330 bits (847), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 175/314 (55%), Positives = 203/314 (64%), Gaps = 14/314 (4%)
Query: 4 TRTQEEPPDSGRRLNRRPSLRQAARFLRQASGRRMMREPSMVVRETAAEQLEERQSDWAY 63
T + +E + R RR LR+AARFLR A RRMMREPSM+VRETAAEQLEERQSDWAY
Sbjct: 46 TISVDEESNPIHRSARRQGLREAARFLRHAGSRRMMREPSMLVRETAAEQLEERQSDWAY 105
Query: 64 SKPVVVLDILWNXXXXXXXXXXXXXSRSETPSTPLRLWILGYALQXXXXXXXXXXEFXXX 123
SKPVV LDILWN SR E P+ PLR+W++GY +Q E+
Sbjct: 106 SKPVVFLDILWNLAFVAIGVAVLILSRDEKPNMPLRVWVVGYGIQCWLHMACVCVEYRRR 165
Query: 124 XXXXEQPDGEAVXXXXXXXXXXXXXXXNYANLGQLEDPG--TSMAKHLESANTMFSFIWW 181
DG Y +L QLED G ++ AKHLESANTMFSFIWW
Sbjct: 166 RRRRHPEDGGG-------SGLTNSSSQQYVSLAQLEDRGETSNPAKHLESANTMFSFIWW 218
Query: 182 VIGFYWVSADGQTLAQDSPQLYWLCIXXXXXXXXXXXXCIALACIIGIAVCCCLPCIIAL 241
+IGFYWVSA GQTL+ DSPQLYWLCI C+ALAC+IG+AVCCCLPCIIA+
Sbjct: 219 IIGFYWVSAGGQTLSSDSPQLYWLCIIFLGFDVFFVVFCVALACVIGLAVCCCLPCIIAI 278
Query: 242 LYAVADQEGASQEDIEQLSKFKFQRKSN-EKLAGDTEGPVGGIMTECHSDSPTEHMLSAE 300
LYAVADQEGAS+ DI+Q+ KF+F + N EKL+G GIMTEC +DSP E LS E
Sbjct: 279 LYAVADQEGASKNDIDQMPKFRFTKTGNVEKLSGKAR----GIMTECGTDSPIERSLSPE 334
Query: 301 DAECCICLSAYDDG 314
DAECCICL Y+DG
Sbjct: 335 DAECCICLCEYEDG 348
>AT1G12760.1 | Symbols: | Zinc finger, C3HC4 type (RING finger)
family protein | chr1:4348728-4350512 FORWARD LENGTH=408
Length = 408
Score = 321 bits (822), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 170/316 (53%), Positives = 199/316 (62%), Gaps = 16/316 (5%)
Query: 15 RRLNRRPSLRQAARFLRQASGRRMMREPSMVVRETAAEQLEERQSDWAYSKPVVVLDILW 74
RR RR LR+AARFL +AS R+MREPSM+VRE AAEQLEERQSDWAYSKPVVVLDI+W
Sbjct: 48 RRSVRRQGLREAARFLSRASSGRVMREPSMLVREAAAEQLEERQSDWAYSKPVVVLDIVW 107
Query: 75 NXXXXXXXXXXXXXSRSETPSTPLRLWILGYALQXXXXXXXXXXEFXXXXXXXEQPDGEA 134
N SR E P PLR+W+LGYALQ E+
Sbjct: 108 NLAFVSVATAILVMSRKEHPIMPLRVWLLGYALQCVLHMVCVCVEYRRRNRRRTNRTTTT 167
Query: 135 VXXXXX--------------XXXXXXXXXXNYANLGQLEDPGTSMAKHLESANTMFSFIW 180
+LG L+ +S+AKHLESANTMFSFIW
Sbjct: 168 TPPRSRSSSSSSSSSSLEEEALGSRRNSGVQDLSLGHLDTESSSVAKHLESANTMFSFIW 227
Query: 181 WVIGFYWVSADGQTLAQDSPQLYWLCIXXXXXXXXXXXXCIALACIIGIAVCCCLPCIIA 240
W+IGFYWVSA GQ LAQ+SP++YWL I C+ALAC+IGIAVCCCLPCIIA
Sbjct: 228 WIIGFYWVSAGGQELAQESPRIYWLSIVFLGFDVFFVVFCVALACVIGIAVCCCLPCIIA 287
Query: 241 LLYAVADQEGASQEDIEQLSKFKFQR--KSNEKLAGDTEGPVGGIMTECHSDSPTEHMLS 298
+LYAVADQEGAS+EDIEQL+KFKF++ +N+ + +G GIMTEC +DSP EH L
Sbjct: 288 VLYAVADQEGASKEDIEQLTKFKFRKLGDANKHTNDEAQGTTEGIMTECGTDSPIEHTLL 347
Query: 299 AEDAECCICLSAYDDG 314
EDAECCICLSAY+DG
Sbjct: 348 QEDAECCICLSAYEDG 363
>AT1G63170.1 | Symbols: | Zinc finger, C3HC4 type (RING finger)
family protein | chr1:23425574-23427073 FORWARD LENGTH=381
Length = 381
Score = 310 bits (795), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 169/306 (55%), Positives = 192/306 (62%), Gaps = 15/306 (4%)
Query: 23 LRQAARFLRQAS-GRRMMREPSMVVRETAAEQLEERQSDWAYSKPVVVLDILWNXXXXXX 81
LR+AAR LR AS GR MMREPSM+VRE AAEQLEERQSDWAYSKPVVVLD +WN
Sbjct: 29 LREAARLLRHASSGRMMMREPSMLVREAAAEQLEERQSDWAYSKPVVVLDFVWNLAFVVV 88
Query: 82 XXXXXXXSRSETPSTPLRLWILGYALQXXXXXXXXXXEFXXXXXXXEQPDGEAVXXXXXX 141
S E P+ PLR+WI+GY LQ E+ +
Sbjct: 89 ATAVLVLSSDENPNMPLRVWIIGYGLQCMMHMVCVCVEYRRRNSRRRRDLSPRSSSSSSS 148
Query: 142 XXXXX----------XXXXNYANLGQLEDPGTSMAKHLESANTMFSFIWWVIGFYWVSAD 191
Y LGQLE+ S AKHLESANTM SFIWWVIGFYWVS+
Sbjct: 149 SSSSMDEEEGLGLSRNSDERYLELGQLENENNSFAKHLESANTMISFIWWVIGFYWVSSG 208
Query: 192 GQTLAQDSPQLYWLCIXXXXXXXXXXXXCIALACIIGIAVCCCLPCIIALLYAVADQEGA 251
GQ LAQ SPQLYWLCI C+ALAC+IGIAVCCCLPCIIA+LYAVA+QEGA
Sbjct: 209 GQELAQGSPQLYWLCIVFLGFDVFFVVFCVALACVIGIAVCCCLPCIIAVLYAVAEQEGA 268
Query: 252 SQEDIEQLSKFKFQRKSNE-KLAGDTE---GPVGGIMTECHSDSPTEHMLSAEDAECCIC 307
S+EDI+QL+KFKF++ + K D E G GG+MTEC +DSP EH L EDAECCIC
Sbjct: 269 SKEDIDQLTKFKFRKVGDTMKHTVDEEQGQGDSGGVMTECGTDSPVEHALPHEDAECCIC 328
Query: 308 LSAYDD 313
LSAY+D
Sbjct: 329 LSAYED 334
>AT1G12760.2 | Symbols: | Zinc finger, C3HC4 type (RING finger)
family protein | chr1:4348941-4350512 FORWARD LENGTH=337
Length = 337
Score = 294 bits (753), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 156/292 (53%), Positives = 182/292 (62%), Gaps = 16/292 (5%)
Query: 39 MREPSMVVRETAAEQLEERQSDWAYSKPVVVLDILWNXXXXXXXXXXXXXSRSETPSTPL 98
MREPSM+VRE AAEQLEERQSDWAYSKPVVVLDI+WN SR E P PL
Sbjct: 1 MREPSMLVREAAAEQLEERQSDWAYSKPVVVLDIVWNLAFVSVATAILVMSRKEHPIMPL 60
Query: 99 RLWILGYALQXXXXXXXXXXEFXXXXXXXEQPDGEAVXXXXXXXXXXX------------ 146
R+W+LGYALQ E+
Sbjct: 61 RVWLLGYALQCVLHMVCVCVEYRRRNRRRTNRTTTTTPPRSRSSSSSSSSSSLEEEALGS 120
Query: 147 --XXXXNYANLGQLEDPGTSMAKHLESANTMFSFIWWVIGFYWVSADGQTLAQDSPQLYW 204
+LG L+ +S+AKHLESANTMFSFIWW+IGFYWVSA GQ LAQ+SP++YW
Sbjct: 121 RRNSGVQDLSLGHLDTESSSVAKHLESANTMFSFIWWIIGFYWVSAGGQELAQESPRIYW 180
Query: 205 LCIXXXXXXXXXXXXCIALACIIGIAVCCCLPCIIALLYAVADQEGASQEDIEQLSKFKF 264
L I C+ALAC+IGIAVCCCLPCIIA+LYAVADQEGAS+EDIEQL+KFKF
Sbjct: 181 LSIVFLGFDVFFVVFCVALACVIGIAVCCCLPCIIAVLYAVADQEGASKEDIEQLTKFKF 240
Query: 265 QR--KSNEKLAGDTEGPVGGIMTECHSDSPTEHMLSAEDAECCICLSAYDDG 314
++ +N+ + +G GIMTEC +DSP EH L EDAECCICLSAY+DG
Sbjct: 241 RKLGDANKHTNDEAQGTTEGIMTECGTDSPIEHTLLQEDAECCICLSAYEDG 292
>AT3G61180.1 | Symbols: | RING/U-box superfamily protein |
chr3:22645680-22647290 FORWARD LENGTH=379
Length = 379
Score = 293 bits (750), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 155/298 (52%), Positives = 185/298 (62%), Gaps = 3/298 (1%)
Query: 20 RPSLRQAARFLRQASGRRMM-REPSMVVRETAAEQLEERQSDWAYSKPVVVLDILWNXXX 78
+P A+R LR+AS RRMM REPS+ VRE AAEQLEERQS WAYSKP++VLDILWN
Sbjct: 36 QPIRGAASRLLRRASNRRMMLREPSVRVREVAAEQLEERQSQWAYSKPIIVLDILWNFLF 95
Query: 79 XXXXXXXXXXSRSETPSTPLRLWILGYALQXXXXXXXXXXEFXXXXXXXEQP-DGEAVXX 137
S E P PLRLWI+GY +Q E+ P GE
Sbjct: 96 VIVSIAILGFSSDEDPDVPLRLWIIGYNVQCLFHVGCVIAEYKRRRVANSPPPSGEDSSN 155
Query: 138 XXXXXXXXXXXXXNYANLGQLEDPGTSMAKHLESANTMFSFIWWVIGFYWVSADGQTLAQ 197
N +D GTS KHLESANTMFSF+WW+IGFYWV+AD + LAQ
Sbjct: 156 HESLSGSEDESDGYSINNTDDDDHGTSFTKHLESANTMFSFVWWIIGFYWVTADTEALAQ 215
Query: 198 DSPQLYWLCIXXXXXXXXXXXXCIALACIIGIAVCCCLPCIIALLYAVADQEGASQEDIE 257
SPQLYWLC+ C+A+A +IGIAVCCCLPCIIA+LYA+ADQEGA E+IE
Sbjct: 216 SSPQLYWLCVAFLAFDVMFVVICVAVASLIGIAVCCCLPCIIAILYALADQEGAPDEEIE 275
Query: 258 QLSKFKF-QRKSNEKLAGDTEGPVGGIMTECHSDSPTEHMLSAEDAECCICLSAYDDG 314
+L KFKF K++EK+ G+ GGIMT ++S TE ML +EDAEC ICL AY+DG
Sbjct: 276 RLLKFKFLTVKNSEKVNGEIRETQGGIMTGLDTESQTERMLLSEDAECSICLCAYEDG 333
>AT1G68070.1 | Symbols: | Zinc finger, C3HC4 type (RING finger)
family protein | chr1:25515412-25516767 REVERSE LENGTH=343
Length = 343
Score = 221 bits (564), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 124/299 (41%), Positives = 162/299 (54%), Gaps = 25/299 (8%)
Query: 19 RRPSLRQAARFLRQASGRRMMREPSMVVRETAAEQLEERQSDWAYSKPVVVLDILWNXXX 78
R+P + A L +ASGRR SMVVRETAA++LEER++DW YSKPVV LD+LWN
Sbjct: 26 RQPVI---AVLLNRASGRR---GASMVVRETAAQELEERRADWGYSKPVVALDMLWNTAF 79
Query: 79 XXXXXXXXXXSRSETPSTPLRLWILGYALQXXXXXXXXXXEFXXXXXXXEQPDGEAVXXX 138
+ E P+ P+R+WI GYA+Q EF D EA
Sbjct: 80 VLVAIVMLLVFKEEKPNVPIRIWICGYAIQCLVHVVLVWLEFRKRNARSRPGDLEAAQAT 139
Query: 139 XXXXXXXXXXXXNYANLGQLEDPGTSMAKHLESANTMFSFIWWVIGFYWVSADGQTLAQD 198
N + + D K ES NT+ SF+WW++GFYW+ + G L Q+
Sbjct: 140 ------------NQDSEDEDNDERFLSTKTCESMNTIISFVWWIVGFYWLVSGGDILLQN 187
Query: 199 SPQLYWLCIXXXXXXXXXXXXCIALACIIGIAVCCCLPCIIALLYAVADQEGASQEDIEQ 258
+ LYWL C+ LAC+IGIA+CCCLPCIIALLYAVA QEGAS+ D+
Sbjct: 188 ATHLYWLTFVFLAFDVFFAIFCVVLACLIGIALCCCLPCIIALLYAVAGQEGASEADLSI 247
Query: 259 LSKFKFQRKSNEKLAGDTEGPVGGIMTECHSDSP---TEHMLSAEDAECCICLSAYDDG 314
L K++F +N++ D GG M + S E +L EDA+CCICLS+Y+DG
Sbjct: 248 LPKYRFHTMNNDEKQSDG----GGKMIPVDAASENLGNERVLLPEDADCCICLSSYEDG 302
>AT2G01735.1 | Symbols: RIE1 | RING-finger protein for embryogenesis
| chr2:324499-325895 FORWARD LENGTH=359
Length = 359
Score = 202 bits (514), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/279 (41%), Positives = 144/279 (51%), Gaps = 15/279 (5%)
Query: 40 REPSMVVRETAAEQLEERQSDWAYSKPVVVLDILWNXXXXXXXXXXXXXSRSETPSTPLR 99
R PSM+VRETAA LEER+ DW YSKPVV DILWN + E P+ P+R
Sbjct: 50 RAPSMLVRETAARALEERRIDWGYSKPVVAADILWNAALVLASAVMLVGTVEERPNEPIR 109
Query: 100 LWILGYALQXXXXXXXXXXEFXXXXXXXEQPDGEAVXXXXXXXXXXXXXXXNYANLGQLE 159
+WI Y LQ E+ D E+ +Y
Sbjct: 110 VWICVYGLQCLFHVVLVWSEYWRRNSTRRARDLES------YDHEDYNIEYDYEQDSDDN 163
Query: 160 DPGTSMAKHLESANTMFSFIWWVIGFYWVSADGQTLAQDSPQLYWLCIXXXXXXXXXXXX 219
S K ES NT+ SFIWW+IGFYWV G L ++P LYWL +
Sbjct: 164 STTYSFVKRCESINTVISFIWWIIGFYWVVEGGDKLLGEAPNLYWLSVIFLAIDVFFAVF 223
Query: 220 CIALACIIGIAVCCCLPCIIALLYAVADQEGASQEDIEQLSKFKFQR-KSNEKLAGDTEG 278
C+ LAC++GIA+CCCLPCIIALLYAVA EG S+ ++ L +KF+ SNEK + G
Sbjct: 224 CVVLACLVGIALCCCLPCIIALLYAVAGTEGVSEAELGVLPLYKFKAFHSNEK---NITG 280
Query: 279 PVGGIMTECHSDS---PTEHMLSAEDAECCICLSAYDDG 314
P G M + TE L AEDA+CCICLS+Y+DG
Sbjct: 281 P--GKMVPIPINGLCLATERTLLAEDADCCICLSSYEDG 317
>AT1G80400.1 | Symbols: | RING/U-box superfamily protein |
chr1:30225864-30227360 FORWARD LENGTH=407
Length = 407
Score = 87.0 bits (214), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 77/154 (50%), Gaps = 16/154 (10%)
Query: 165 MAKHLESANTMFSFIWWVIGFYWVSADGQTLAQDSPQLYWLCIXXXXXXXXXXXXCI--A 222
+ H + A F +W+V+G W+ G + DSP+LY LCI CI A
Sbjct: 222 LVDHFKMAIDCFFAVWFVVGNVWIFG-GHSSPSDSPKLYRLCIAFLTFS------CIGYA 274
Query: 223 LACIIGIAVCCCLPCIIALL---YAVADQEGASQEDIEQLSKFKFQRKSNEKLAGDTEGP 279
+ I+ +CCCLPC+I++L + GA+ E I L ++F+ KS L EG
Sbjct: 275 MPFILCATICCCLPCLISVLGFRENFSQTRGATAEAINALPVYRFKSKSRNDLEFSEEGE 334
Query: 280 VGGIMTECHSDSPTEHMLSAEDAECCICLSAYDD 313
G ++ S + ++S EDA CCICL+ Y D
Sbjct: 335 GGFLLL----GSQKKRLISGEDASCCICLTRYGD 364
>AT4G32600.1 | Symbols: | RING/U-box superfamily protein |
chr4:15724010-15725737 FORWARD LENGTH=453
Length = 453
Score = 79.0 bits (193), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 99/251 (39%), Gaps = 36/251 (14%)
Query: 89 SRSETPSTPLRLWILGYALQXXXXXXXXXXEFXXXXXXXEQPDGEAVXXXXXXXXXXXXX 148
S+ E P PL WI+GYA + EQ G+
Sbjct: 134 SKHEHPRAPLFTWIVGYACGCVATLPLLYWRYYHSNQASEQDSGQHRPNLNVAAGPFAFS 193
Query: 149 XXNYANLGQLEDPGTS-----------------MAKHLESANTMFSFIWWVIGFYWVSAD 191
+ + TS + ++ + A F +W+V+G W+
Sbjct: 194 ISRTSEADGRQTNTTSSRGSRYPGFISAARLKVIVEYFKMALDCFFAVWFVVGNVWIFG- 252
Query: 192 GQTLAQDSPQLYWLCIXXXXXXXXXXXXCI--ALACIIGIAVCCCLPCIIALLYAVAD-- 247
G + A ++P LY LC+ CI A+ I+ +CCCLPCII++L D
Sbjct: 253 GHSSAAEAPNLYRLCLVFLTFS------CIGYAMPFILCTTICCCLPCIISILGYREDLT 306
Query: 248 -QEGASQEDIEQLSKFKFQRKSNEKLAGDTEGPV---GGIMTECHSDSPTEHMLSAEDAE 303
GA+ E I L KF+ K + GD G GG++ + + E +S EDA
Sbjct: 307 QPRGATPESINALPTHKFKLKKSRS-NGDDNGSSTSEGGVVA---AGTDNERAISGEDAV 362
Query: 304 CCICLSAYDDG 314
CCICL+ Y +
Sbjct: 363 CCICLAKYANN 373
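
Each alignment in this report carries a statistics line of the form `Identities = 175/314 (55%), Positives = 203/314 (64%), Gaps = 14/314 (4%)`. The sketch below extracts those counts; `parse_alignment_stats` is a hypothetical helper name introduced for illustration. In this report the printed percentages match the integer part of the ratio (for example 175/314 is shown as 55%), so the sketch compares against floor division. Treat that as an observation about these alignments, not a documented rule.

```python
import re

# Matches "Identities = 175/314 (55%)" and the Positives/Gaps equivalents.
FRACTION = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Return {name: (count, aligned_length, printed_percent)} for one stats line."""
    stats = {}
    for name, count, length, percent in FRACTION.findall(line):
        stats[name] = (int(count), int(length), int(percent))
    return stats


if __name__ == "__main__":
    line = "Identities = 175/314 (55%), Positives = 203/314 (64%), Gaps = 14/314 (4%)"
    for name, (count, length, percent) in parse_alignment_stats(line).items():
        # Floor division reproduces the percentages printed in this report.
        print(name, count, length, percent, count * 100 // length == percent)
```

Keeping the raw counts rather than the printed percentage lets downstream code recompute identity with whatever rounding it prefers.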