Miyakogusa Predicted Gene
- Lj6g3v1422230.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1422230.1 Non Characterized Hit- tr|I1MRG5|I1MRG5_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,47.15,0,zf-CW,Zinc
finger, CW-type; ZF_CW,Zinc finger, CW-type; ZINC ION BINDING,NULL;
ZINC FINGER CW-TYPE C,CUFF.59512.1
(1669 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

AT4G15730.1 | Symbols: | CW-type Zinc Finger | chr4:8951887-895...   238   4e-62
AT3G62900.1 | Symbols: | CW-type Zinc Finger | chr3:23248868-23...   222   2e-57
AT1G02990.3 | Symbols: | FUNCTIONS IN: molecular_function unkno...   206   1e-52
AT1G02990.2 | Symbols: | BEST Arabidopsis thaliana protein matc...   206   1e-52
AT1G02990.1 | Symbols: | BEST Arabidopsis thaliana protein matc...    67   1e-10
>AT4G15730.1 | Symbols: | CW-type Zinc Finger | chr4:8951887-8957214
REVERSE LENGTH=1059
Length = 1059
Score = 238 bits (606), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 121/248 (48%), Positives = 172/248 (69%), Gaps = 1/248 (0%)
Query: 1421 DTLDEAIKLKDRADHYKNSGFNFESNETYFQAGLKFLHGASRLESCHSESSKHGEMNHMQ 1480
D L EA KL+ AD +K+SGF +E E F+A L+FL GAS LE C +++ + G+M+H++
Sbjct: 812 DILQEAEKLRKLADCFKSSGFEYEYKEINFKAALRFLLGASVLEMCSTDNVEVGKMSHIE 871
Query: 1481 IYVTAAKLLKSCAHEYESRQEMATAALAYKCMEVAYMRVVYCKHSSINRDRHELQTTLQM 1540
Y TAAKL +SCAH+YE+ QEMA A LAYKC EVA MR+VY + ++ + +ELQ +QM
Sbjct: 872 AYHTAAKLSESCAHQYETSQEMAAATLAYKCTEVACMRLVYGRSLGLSGEWNELQKMVQM 931
Query: 1541 ISQGESPSSSASDVDNFNIQAAVDKTALPRVTNAHVVGNQVISAQTRPSLVKLLDFTQYM 1600
QGESPSSSASDVD+FN Q + K+A R +HV GN + A+++ + V LLDFT M
Sbjct: 932 TPQGESPSSSASDVDSFNHQGVIKKSAKTRRGLSHVAGNLLPVARSQLNFVPLLDFTGSM 991
Query: 1601 HFAMEASRKCESTFAAASNVIMEEARKRDCITCIRKIIDFSFQDVDELIRLVWNAANAIS 1660
+ AMEAS K ++ F A ++ EE + DCI+ I+K++DFSF DV+ LI+++ A +A+S
Sbjct: 992 NLAMEASAKSQNAFKAVTDT-SEERKHGDCISAIKKVVDFSFHDVEALIKMIEVAMDALS 1050
Query: 1661 HACLGGAR 1668
+ GG +
Sbjct: 1051 SSRFGGPK 1058
Score = 94.0 bits (232), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 125/264 (47%), Gaps = 26/264 (9%)
Query: 1 MEENTELEEGEACYHDDDDDKGINYLD-SLSYFDEKIQHLLGHFKKDFEAGLSAQN-LGP 58
M E+ ELEEGE + D+ + LD LSY D+K+Q++LGH +K F G A++ GP
Sbjct: 1 MGEDYELEEGEM---NCSSDEAVVDLDVDLSYIDKKVQNVLGHLQKGF--GEEARDRFGP 55
Query: 59 RFGDYGSFLPTYERSPCIPSHPKTPQRNHRSPKPPIKLH-KEAASHNRKEPSDVPPFARL 117
DYGSFLPTY+R P +PS ++ NH + L K + P+ R
Sbjct: 56 EIFDYGSFLPTYKRLPAVPSCQRSSLGNHAVQRISNSLPGKNVVQKFQSPPATSCKLVR- 114
Query: 118 GNASHNSHSFHDAIAPSVDDSVKSNGGISSNDVAGRFTLKDDSRAKTGNSAEQRTLKFRI 177
N ++ ++ V N G ++R + + ++ RI
Sbjct: 115 -NQDPQNYQTSGSLLAQAPGKVPINKG--------------NARTPANDLPHNKPIRVRI 159
Query: 178 KMNSNILVKKNAEIYXXXXXXXXXXXXMGNSPAESEGILP-VSQQKAEDSPTSIIQVMTS 236
KM S IL + A + S +S +LP S K +SP+ I+Q MT+
Sbjct: 160 KMGSEILSQSVAMVCKDLGLDGSPNSPPRISQDDSSRMLPHTSLGKTSESPSRILQEMTA 219
Query: 237 LTVPGGVLISPLHESLLNLIKTEK 260
++VP +L+SPL +SLL L+K +K
Sbjct: 220 ISVPEDLLMSPLPDSLL-LVKDKK 242
Score = 94.0 bits (232), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 48/124 (38%), Positives = 72/124 (58%), Gaps = 19/124 (15%)
Query: 623 WVACDNCEKWRLLPTGLKPEQLPEKWLCSMLNWLPGMNSCDISEDETTKAVQAFYQLPIS 682
W C++CEKWRLLP L E+LP+KWLCSM WLPGMN C +SE+ETT A+++F+
Sbjct: 419 WAQCESCEKWRLLPYDLNTEKLPDKWLCSMQTWLPGMNHCGVSEEETTNAIKSFHA---- 474
Query: 683 ECQNNMQTHATGTAI----GVSSADSLQFGLNHKKSISDVLPDGVKKKHVVKEKTMSGIN 738
+ H T + V +AD + ++ S LP+ ++KK V E G++
Sbjct: 475 -----SEGHGPDTGVKLLSDVRNADKI-----YQPLTSGSLPNPIEKKSNV-EDLSQGVS 523
Query: 739 NDVL 742
+++L
Sbjct: 524 SNIL 527
>AT3G62900.1 | Symbols: | CW-type Zinc Finger |
chr3:23248868-23254810 REVERSE LENGTH=1465
Length = 1465
Score = 222 bits (565), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 161/253 (63%), Gaps = 10/253 (3%)
Query: 1415 SSQTAGDTLDEAIKLKDRADHYKNSGFNFESNETYFQAGLKFLHGASRLESCHSESSKHG 1474
S+Q A +TL EA LK AD KNS N E E YFQA LKFLHGA LE +ES++ G
Sbjct: 1162 SAQAAHNTLKEAKDLKHTADRLKNSVSNLEHIELYFQACLKFLHGAFLLEMSSNESARQG 1221
Query: 1475 E--MNHMQIYVTAAKLLKSCAHEYESRQEMATAALAYKCMEVAYMRVVYCKHSSINRDRH 1532
E + M+IY + A L CAHEYE ++M AALAYKCMEVAYMRVV ++S NR R+
Sbjct: 1222 ETMVQSMKIYSSTANLCGFCAHEYEKSKDMGAAALAYKCMEVAYMRVVNSSYTSANRYRN 1281
Query: 1533 ELQTTLQMISQGESPSSSASDVDNFNIQAAVDKTALPR-VTNAHVVGNQVISAQTRPSLV 1591
ELQT+LQM+ GESPSSSASDVDN N AAVD+ R +++ V GN VISAQ R +L+
Sbjct: 1282 ELQTSLQMVPPGESPSSSASDVDNVNHPAAVDRVGTSRGISSPLVAGNHVISAQNRSNLL 1341
Query: 1592 KLLDFTQYMHFAMEASRKCESTFAAASNVIMEEARKRDCITCIRKIIDFSFQDVDELIRL 1651
+LL F Q ++ +M+ASRK A E ++ + I I+ +D++FQD++ L+
Sbjct: 1342 RLLQFAQDVNLSMDASRKSRVALTACIENSGEAQQQGEGIISIKSALDYNFQDMEGLL-- 1399
Query: 1652 VWNAANAISHACL 1664
+ IS CL
Sbjct: 1400 -----HGISGFCL 1407
Score = 135 bits (341), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/328 (32%), Positives = 154/328 (46%), Gaps = 35/328 (10%)
Query: 596 GLFPMVENNPAPETIPSVAAPLVIAEDWVACDNCEKWRLLPTGLKPEQLPEKWLCSMLNW 655
G PM+ + ++P A P++I E WVACD C KWRLLP G+ PE LPEKW+C+MLNW
Sbjct: 553 GPEPMLRKLGSDASLPK-ANPVIIQEHWVACDKCGKWRLLPFGVFPEDLPEKWMCTMLNW 611
Query: 656 LPGMNSCDISEDETTKAVQAFYQLPISECQNNMQTHATGTAIGVSSADSLQFGLNHKKSI 715
LPG+N C++ EDETTKA+ A YQ+P+ E Q +MQ++ +G + D KK
Sbjct: 612 LPGVNYCNVPEDETTKALYAMYQIPVPENQASMQSNPSGPKPQFTQGDD---NTKKKKKG 668
Query: 716 SDVLPDGVKKKHVVKEKTMSGINNDVLQFSNSAKINAQVSGNNRSLNDMNQQPADSNPMK 775
+ +G+ KE + N +Q S+ I N++ L D+ + + K
Sbjct: 669 FKKIDNGMD-----KEGARTAETNKTIQTSSRNGIQ-----NSQGLGDLAEDERQIHKQK 718
Query: 776 KMGSKQSSRFNNIVEEKHVPKQNDKQVNGGDRKHIKLKRKM--------DADHYGLGTPK 827
+ G +++ +E K N+K+ D + L +KM D YG G P
Sbjct: 719 EKGKA----VDHLSDESKSLKANNKRKT--DLESSMLAKKMKIESFLFPDESEYGNGRPT 772
Query: 828 KSKTENVCYADEKLDPSIG--LEKVGLSARNGGLPAQASGKDMRKYDE-----YCSSLDV 880
S + AD K P + + K A + G G RK E S +
Sbjct: 773 SSSGVPITSADIKPKPRVSSKMPKEEGGASDTGNSNSTGGIKKRKLRESHGSRIYSENEN 832
Query: 881 QDRLLVPVKKEGDQAQVSSGGGSLDVKN 908
+R V+KE + S G G L+ KN
Sbjct: 833 HERKKARVRKEEKEPSYSQGNGKLEKKN 860
Score = 116 bits (291), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/258 (36%), Positives = 130/258 (50%), Gaps = 41/258 (15%)
Query: 18 DDDKGINYLDSLSYF----DEKIQHLLGHFKKDFEAGLSAQNLGPRFGDYGSFLPTYERS 73
D D ++Y+ S S+ DEK+QH+LGHF+KDFE G+SA+NLG ++G YGSFLPTY+RS
Sbjct: 25 DPDNDLSYIVSTSFSAALRDEKLQHILGHFQKDFEGGVSAENLGAKYGGYGSFLPTYQRS 84
Query: 74 PCIPSHPKTPQRNHRSPKPPIKLHKEAASHNRKEPSDVPPFARLGNASHNSHSFHDAIAP 133
P + SHPKTP K +S + P+++ GNA+ +
Sbjct: 85 P-VWSHPKTPA-------------KPQSSTGTRSPNNL--LGESGNAASS---------- 118
Query: 134 SVDDSVKSNGGISSNDVAGRFTLKDDSRAKTGNS-------AEQRTLKFRIKMNSNILV- 185
SV KS S N + K S A+ ++ ++Q +LK RIKM + L
Sbjct: 119 SVPKKAKSGLASSGNPKKSVKSKKPSSSARMESATKKPCVFSKQNSLKLRIKMVPDGLST 178
Query: 186 -KKNAEIYXXXXXXXXXXXXM-GNSPAESEGILPVSQ-QKAEDSPTSIIQVMTSLTVPGG 242
K A IY + NS + SEG+ Q +SPTSI+ VMTSL V
Sbjct: 179 EKNAAAIYSGLGLDVSPSLSLDNNSLSGSEGMNEEPQGYSPTESPTSILNVMTSLPVDHC 238
Query: 243 VLISPLHESLLNLIKTEK 260
+SPL E L+ I+ EK
Sbjct: 239 QFLSPLSEDLIRFIEREK 256
>AT1G02990.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: CW-type Zinc Finger (TAIR:AT3G62900.1).
| chr1:681724-686996 REVERSE LENGTH=1278
Length = 1278
Score = 206 bits (524), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/248 (45%), Positives = 156/248 (62%), Gaps = 8/248 (3%)
Query: 1414 NSSQTAGDTLDEAIKLKDRADHYKNSGFNFESNETYFQAGLKFLHGASRLESCHSESSKH 1473
++SQTA +++ EA LK AD KN+ N ES YFQA LKFLHGAS LES + ++
Sbjct: 1035 STSQTASNSIKEATDLKHMADRLKNAVSNHESTGVYFQAALKFLHGASLLESSGTTIARS 1094
Query: 1474 GEMNHMQIYVTAAKLLKSCAHEYESRQEMATAALAYKCMEVAYMRVVYCKHSSINRDRHE 1533
+ IY + AKL + CAHEYE ++M AALAYKCMEVAY+R+ Y H +I R R+E
Sbjct: 1095 KD-----IYGSTAKLCEFCAHEYEKNKDMGAAALAYKCMEVAYLRITYTSHGNIRRCRYE 1149
Query: 1534 LQTTLQMISQGESPSSSASDVDNFNIQAAVDKTALPRV--TNAHVVGNQVISAQTRPSLV 1591
LQ LQ+I GESP S ASD +N N +K AL ++ V GN VIS+ SL
Sbjct: 1150 LQAALQVIPSGESP-SFASDGENSNHTLTAEKFALSNTVRSSPSVTGNHVISSGNNSSLS 1208
Query: 1592 KLLDFTQYMHFAMEASRKCESTFAAASNVIMEEARKRDCITCIRKIIDFSFQDVDELIRL 1651
+LL F++ +++AMEASRK + AAA E + ITCI++ +DF+FQD+++L+ +
Sbjct: 1209 QLLAFSKNVNYAMEASRKAQIALAAAKGKSFETRYSSNGITCIKRALDFNFQDMEKLLHV 1268
Query: 1652 VWNAANAI 1659
V A +I
Sbjct: 1269 VRLAMESI 1276
Score = 115 bits (288), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 124/245 (50%), Gaps = 35/245 (14%)
Query: 28 SLSYFDEKIQHLLGHFKKDFEAGLSAQNLGPRFGDYGSFLPTYERSPCIPSHPKTPQRNH 87
+LSY DEK++++LGHF+KDFE G+SA+NLG +FG YGSFL Y+RSP + S PKT
Sbjct: 47 ALSYIDEKLENVLGHFQKDFEGGVSAENLGAKFGGYGSFLSMYQRSP-VCSRPKT----- 100
Query: 88 RSPKPPIKLHKEAASHNRKEPSDVPPFARLGNASHNSHSFHDAIAPSVDDSVKSNGGISS 147
P ++ ++ N S VP + G+AS P+ D VK N + S
Sbjct: 101 ---SPEVQQNQLGGRSNCSASSLVPQLSISGSASK---------PPASDVLVKLNKFVKS 148
Query: 148 NDVAGRFTLKDDSRAKTGNSA--EQRTLKFRIKMNSNILVK-KNAEIYXXXXXXXXXXXX 204
+ + G K S AKT +SA +TL+FRIK+ S+ L KN +
Sbjct: 149 SHI-GTPDSKHMSDAKTSSSAPSNHKTLRFRIKVGSSDLSSLKNVSTFTKEGLNMLPSAS 207
Query: 205 MGNSPAESE-----GILPVSQQKAEDSPTSIIQVMTSLTVPGGVLISPLHESLLNLIKTE 259
N +E E GI DSPT I+ M S + L+SPL + L+ L E
Sbjct: 208 RVNCLSEVEQDLLNGIC--------DSPTKILMAMVSFPLHKDQLLSPLSDDLIQLGSKE 259
Query: 260 KVIED 264
K+++D
Sbjct: 260 KILKD 264
>AT1G02990.2 | Symbols: | BEST Arabidopsis thaliana protein match is:
CW-type Zinc Finger (TAIR:AT3G62900.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr1:681724-686884 REVERSE LENGTH=1238
Length = 1238
Score = 206 bits (524), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/248 (45%), Positives = 156/248 (62%), Gaps = 8/248 (3%)
Query: 1414 NSSQTAGDTLDEAIKLKDRADHYKNSGFNFESNETYFQAGLKFLHGASRLESCHSESSKH 1473
++SQTA +++ EA LK AD KN+ N ES YFQA LKFLHGAS LES + ++
Sbjct: 995 STSQTASNSIKEATDLKHMADRLKNAVSNHESTGVYFQAALKFLHGASLLESSGTTIARS 1054
Query: 1474 GEMNHMQIYVTAAKLLKSCAHEYESRQEMATAALAYKCMEVAYMRVVYCKHSSINRDRHE 1533
+ IY + AKL + CAHEYE ++M AALAYKCMEVAY+R+ Y H +I R R+E
Sbjct: 1055 KD-----IYGSTAKLCEFCAHEYEKNKDMGAAALAYKCMEVAYLRITYTSHGNIRRCRYE 1109
Query: 1534 LQTTLQMISQGESPSSSASDVDNFNIQAAVDKTALPRV--TNAHVVGNQVISAQTRPSLV 1591
LQ LQ+I GESP S ASD +N N +K AL ++ V GN VIS+ SL
Sbjct: 1110 LQAALQVIPSGESP-SFASDGENSNHTLTAEKFALSNTVRSSPSVTGNHVISSGNNSSLS 1168
Query: 1592 KLLDFTQYMHFAMEASRKCESTFAAASNVIMEEARKRDCITCIRKIIDFSFQDVDELIRL 1651
+LL F++ +++AMEASRK + AAA E + ITCI++ +DF+FQD+++L+ +
Sbjct: 1169 QLLAFSKNVNYAMEASRKAQIALAAAKGKSFETRYSSNGITCIKRALDFNFQDMEKLLHV 1228
Query: 1652 VWNAANAI 1659
V A +I
Sbjct: 1229 VRLAMESI 1236
Score = 67.0 bits (162), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/213 (31%), Positives = 95/213 (44%), Gaps = 35/213 (16%)
Query: 60 FGDYGSFLPTYERSPCIPSHPKTPQRNHRSPKPPIKLHKEAASHNRKEPSDVPPFARLGN 119
FG YGSFL Y+RSP + S PKT P ++ ++ N S VP + G+
Sbjct: 39 FGGYGSFLSMYQRSP-VCSRPKT--------SPEVQQNQLGGRSNCSASSLVPQLSISGS 89
Query: 120 ASHNSHSFHDAIAPSVDDSVKSNGGISSNDVAGRFTLKDDSRAKTGNSA--EQRTLKFRI 177
AS P+ D VK N + S+ + G K S AKT +SA +TL+FRI
Sbjct: 90 ASK---------PPASDVLVKLNKFVKSSHI-GTPDSKHMSDAKTSSSAPSNHKTLRFRI 139
Query: 178 KMNSNILVK-KNAEIYXXXXXXXXXXXXMGNSPAESE-----GILPVSQQKAEDSPTSII 231
K+ S+ L KN + N +E E GI DSPT I+
Sbjct: 140 KVGSSDLSSLKNVSTFTKEGLNMLPSASRVNCLSEVEQDLLNGIC--------DSPTKIL 191
Query: 232 QVMTSLTVPGGVLISPLHESLLNLIKTEKVIED 264
M S + L+SPL + L+ L EK+++D
Sbjct: 192 MAMVSFPLHKDQLLSPLSDDLIQLGSKEKILKD 224
>AT1G02990.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: CW-type Zinc Finger (TAIR:AT3G62900.1); Has 5847
Blast hits to 4410 proteins in 438 species: Archae - 17;
Bacteria - 452; Metazoa - 2463; Fungi - 354; Plants -
306; Viruses - 11; Other Eukaryotes - 2244 (source: NCBI
BLink). | chr1:683065-686884 REVERSE LENGTH=1069
Length = 1069
Score = 66.6 bits (161), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/213 (31%), Positives = 95/213 (44%), Gaps = 35/213 (16%)
Query: 60 FGDYGSFLPTYERSPCIPSHPKTPQRNHRSPKPPIKLHKEAASHNRKEPSDVPPFARLGN 119
FG YGSFL Y+RSP + S PKT P ++ ++ N S VP + G+
Sbjct: 39 FGGYGSFLSMYQRSP-VCSRPKT--------SPEVQQNQLGGRSNCSASSLVPQLSISGS 89
Query: 120 ASHNSHSFHDAIAPSVDDSVKSNGGISSNDVAGRFTLKDDSRAKTGNSA--EQRTLKFRI 177
AS P+ D VK N + S+ + G K S AKT +SA +TL+FRI
Sbjct: 90 ASK---------PPASDVLVKLNKFVKSSHI-GTPDSKHMSDAKTSSSAPSNHKTLRFRI 139
Query: 178 KMNSNILVK-KNAEIYXXXXXXXXXXXXMGNSPAESE-----GILPVSQQKAEDSPTSII 231
K+ S+ L KN + N +E E GI DSPT I+
Sbjct: 140 KVGSSDLSSLKNVSTFTKEGLNMLPSASRVNCLSEVEQDLLNGIC--------DSPTKIL 191
Query: 232 QVMTSLTVPGGVLISPLHESLLNLIKTEKVIED 264
M S + L+SPL + L+ L EK+++D
Sbjct: 192 MAMVSFPLHKDQLLSPLSDDLIQLGSKEKILKD 224
Score = 60.5 bits (145), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 46/75 (61%), Gaps = 5/75 (6%)
Query: 1414 NSSQTAGDTLDEAIKLKDRADHYKNSGFNFESNETYFQAGLKFLHGASRLESCHSESSKH 1473
++SQTA +++ EA LK AD KN+ N ES YFQA LKFLHGAS LES + ++
Sbjct: 995 STSQTASNSIKEATDLKHMADRLKNAVSNHESTGVYFQAALKFLHGASLLESSGTTIARS 1054
Query: 1474 GEMNHMQIYVTAAKL 1488
+ IY + AKL
Sbjct: 1055 KD-----IYGSTAKL 1064