Miyakogusa Predicted Gene
- Lj4g3v2434830.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2434830.1 Non Characterized Hit- tr|I1KRS8|I1KRS8_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,80,0,CLATHRIN
ASSEMBLY PROTEIN,NULL; GAT-like domain,NULL; ANTH,ANTH; seg,NULL; no
description,Clathrin a,CUFF.51046.1
(319 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT2G01600.1 | Symbols: | ENTH/ANTH/VHS superfamily protein | ch...   271   5e-73
AT1G14910.1 | Symbols: | ENTH/ANTH/VHS superfamily protein | ch...   268   3e-72
AT4G25940.1 | Symbols: | ENTH/ANTH/VHS superfamily protein | ch...   155   3e-38
AT5G35200.1 | Symbols: | ENTH/ANTH/VHS superfamily protein | ch...   141   5e-34
AT5G57200.1 | Symbols: | ENTH/ANTH/VHS superfamily protein | ch...    67   2e-11
AT2G25430.1 | Symbols: | epsin N-terminal homology (ENTH) domai...    58   8e-09
AT1G03050.1 | Symbols: | ENTH/ANTH/VHS superfamily protein | ch...    56   4e-08
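The summary lines above can be read programmatically. The following is a minimal sketch, not part of the BLAST output. It assumes each summary line ends with the bit score and the E-value as its last two whitespace-separated fields. The names parse_hit_line and filter_hits and the 1e-10 cutoff are hypothetical, chosen only for illustration.

```python
import re

# Assumed layout: subject ID, free-text description, bit score, E-value.
HIT_LINE = re.compile(
    r"^(?P<subject>\S+)\s.*\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_hit_line(line):
    """Return (subject, bits, evalue) for a summary line, or None if it does not fit."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    raw = match.group("evalue")
    # Legacy BLAST may print very small E-values as "e-100"; treat that as 1e-100.
    if raw.startswith("e"):
        raw = "1" + raw
    try:
        evalue = float(raw)
    except ValueError:
        return None
    return match.group("subject"), float(match.group("bits")), evalue


def filter_hits(lines, max_evalue=1e-10):
    """Keep parsed hits whose E-value is at or below max_evalue (illustrative cutoff)."""
    hits = (parse_hit_line(line) for line in lines)
    return [hit for hit in hits if hit is not None and hit[2] <= max_evalue]


if __name__ == "__main__":
    sample = [
        "AT2G01600.1 | Symbols: | ENTH/ANTH/VHS superfamily protein | ch... 271 5e-73",
        "AT2G25430.1 | Symbols: | epsin N-terminal homology (ENTH) domai... 58 8e-09",
    ]
    for subject, bits, evalue in filter_hits(sample):
        print(subject, bits, evalue)
```

With the illustrative cutoff of 1e-10, the five ENTH/ANTH/VHS hits from AT2G01600.1 through AT5G57200.1 are kept, and AT2G25430.1 and AT1G03050.1 are dropped.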
>AT2G01600.1 | Symbols: | ENTH/ANTH/VHS superfamily protein |
chr2:268975-272356 FORWARD LENGTH=571
Length = 571
Score = 271 bits (693), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 167/336 (49%), Positives = 193/336 (57%), Gaps = 30/336 (8%)
Query: 1 MPRHEAIKALEAYKRAGQQALNLSDFYEVCKGLELARNFQFPVLREPPQSFLTTMEEYIK 60
M +HEAI +LE YKRAGQQA +LSDFYE CKGLELARNFQFPVLREPPQSFLTTMEEYIK
Sbjct: 249 MAKHEAITSLEIYKRAGQQARSLSDFYEACKGLELARNFQFPVLREPPQSFLTTMEEYIK 308
Query: 61 EAPRVVTVPSEPMLQLTYRPDEVLAIEDTKSPEEEETSVPIDXXXXXXXXXXXXXXXXXX 120
EAPRVV VP+EP+L LTYRPD+ L EDT+ EE +P D
Sbjct: 309 EAPRVVDVPAEPLL-LTYRPDDGLTTEDTEPSHEEREMLPSDDVVVVSEETEPSPPPPPS 367
Query: 121 XXXX--FDTGDLLGLNDIAPNASAIEESNALALAIVPTENGTASTFNSGAAQTKDFDPTG 178
DT DL GLN AP+ S IE+ NALALAIV T+ A Q ++DPTG
Sbjct: 368 ANAQNFIDTDDLWGLNTGAPDTSVIEDQNALALAIVSTD---ADPPTPHFGQPNNYDPTG 424
Query: 179 WELALVSTPSTDISSVNERQLAGGLDSLTLNSLYDEGAYKAARQPVDGVPSAPNPFEVQD 238
WELALV+ PS+DIS+ ER+LAGGLD+LTL+SLYD+GAY A+++PV G P APNPF D
Sbjct: 425 WELALVTAPSSDISASTERKLAGGLDTLTLSSLYDDGAYIASQRPVYGAP-APNPFASHD 483
Query: 239 PFXXXXXXXXXXXXXXXXXXXXXXNPFGXXXXXXXXXXXXXXMLM-----NPANPFADGG 293
PF NPFG N +NPF D
Sbjct: 484 PF------ASSNGTAPPPQQQAVNNPFGAYQQTYQHQPQPTYQHQSNPPTNNSNPFGD-- 535
Query: 294 FGAFPANSISH----------PQXXXXXPFGSTGLL 319
FG FP N +S PF STGL+
Sbjct: 536 FGEFPVNPVSQQPNTSGYGDFSVNQHNNPFRSTGLI 571
>AT1G14910.1 | Symbols: | ENTH/ANTH/VHS superfamily protein |
chr1:5139928-5143571 REVERSE LENGTH=692
Length = 692
Score = 268 bits (686), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 167/312 (53%), Positives = 194/312 (62%), Gaps = 22/312 (7%)
Query: 1 MPRHEAIKALEAYKRAGQQALNLSDFYEVCKGLELARNFQFPVLREPPQSFLTTMEEYIK 60
MPRHEAIKALE YKRAG QA NLS FYEVCKGLELARNFQFPVLREPPQSFLTTMEEY++
Sbjct: 249 MPRHEAIKALEIYKRAGLQAGNLSAFYEVCKGLELARNFQFPVLREPPQSFLTTMEEYMR 308
Query: 61 EAPRVVTVPSEPMLQLTYRPDEVLAIEDTK-SPEEEETSVPIDXXXXXXXXXXXXXX--- 116
+AP++V V S P+L LTY PD+ L ED S EE ETS P D
Sbjct: 309 DAPQMVDVTSGPLL-LTYTPDDGLTSEDVGPSHEEHETSSPSDSAVVPSEETQLSSQSPP 367
Query: 117 XXXXXXXXFDTGDLLGLNDIAPNASAIEESNALALAIVPTENGTASTFNSGAAQTKDFDP 176
DT DLLGL+D P+ AI + NALALA+V + + +S F+ G Q +D DP
Sbjct: 368 SVETPQNFIDTDDLLGLHDDTPDPLAILDQNALALALV-SNDVDSSPFSFG--QARDLDP 424
Query: 177 TGWELALVSTPSTDISSVNERQLAGGLDSLTLNSLYDEGAYKAARQPVDGVPSAPNPFEV 236
+GWELALV+TPS DIS+ ERQLAGGLD+LTLNSLYD+GA +AA+QP GVP A NPFEV
Sbjct: 425 SGWELALVTTPSNDISAATERQLAGGLDTLTLNSLYDDGALRAAQQPAYGVP-ASNPFEV 483
Query: 237 QDPFXXXXXXXXXXXXXXXXXXXXXXNPFGXXXXXXXXXXXXXXMLM--NPANPFADGGF 294
QD F NPFG + + +PANPF D F
Sbjct: 484 QDLF---------AFSDSVSPPSAVNNPFGLYEPTYHQQEQQPQLQVAPSPANPFGD--F 532
Query: 295 GAFPANSISHPQ 306
G FP +S PQ
Sbjct: 533 GEFPIVPVSEPQ 544
>AT4G25940.1 | Symbols: | ENTH/ANTH/VHS superfamily protein |
chr4:13169792-13172700 REVERSE LENGTH=601
Length = 601
Score = 155 bits (392), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/267 (40%), Positives = 144/267 (53%), Gaps = 33/267 (12%)
Query: 1 MPRHEAIKALEAYKRAGQQALNLSDFYEVCKGLELARNFQFPVLREPPQSFLTTMEEYIK 60
M RH+A+KAL YKRAGQQA NL+DFYE CKGLELARNFQFP LR+PP SFL TME+YIK
Sbjct: 256 MSRHDAVKALNIYKRAGQQAENLADFYEYCKGLELARNFQFPTLRQPPPSFLATMEDYIK 315
Query: 61 EAPRVVTVPSE-------------PMLQLTYRPDEVLAIEDTKSPEEEETSVPIDXXXXX 107
EAP+ +V + + + +P+E ++ K E E P+
Sbjct: 316 EAPQSGSVQKKLEYQEKEEEEQEEEEAEHSVQPEEPAEADNQK--ENSEGDQPL--IEEE 371
Query: 108 XXXXXXXXXXXXXXXXXFDTGDLLGLNDIAPNASAIEESNALALAIVPTENGTASTFNSG 167
DT DLLGLN+I P A+ IE+ NALALAI P G + S
Sbjct: 372 EEDQEKIEEEDAKPSFLIDTDDLLGLNEINPKAAEIEDRNALALAIYPP--GHEAPGPSN 429
Query: 168 AAQTKDFDPTGWELALVSTPSTDISSVNER----QLAGGLDSLTLNSLYDEGAYKAARQ- 222
+ +GWELALV+ + + ++ +LAGG D+L L+SLY++ + + Q
Sbjct: 430 ILSLIETGGSGWELALVTPQNNNNNNNPRPAPNTKLAGGFDNLLLDSLYEDDSARRQIQL 489
Query: 223 --------PVDGVPSAPNPFEV-QDPF 240
+D + PNPF++ QDPF
Sbjct: 490 TNAGYGHGGIDTTAAPPNPFQMQQDPF 516
>AT5G35200.1 | Symbols: | ENTH/ANTH/VHS superfamily protein |
chr5:13462463-13465581 REVERSE LENGTH=544
Length = 544
Score = 141 bits (356), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 91/240 (37%), Positives = 131/240 (54%), Gaps = 25/240 (10%)
Query: 1 MPRHEAIKALEAYKRAGQQALNLSDFYEVCKGLELARNFQFPVLREPPQSFLTTMEEYIK 60
M R++A+KAL+ Y+RA +QA LS+F+EVCK + + R +F + +PP SFL MEEY+K
Sbjct: 242 MQRNDAVKALDMYRRAVKQAGRLSEFFEVCKSVNVGRGERFIKIEQPPTSFLQAMEEYVK 301
Query: 61 EAPRVVTVPSEPMLQLTYRPDEVLAIEDTKSPE--EEETSVPIDXXXXXXXXXXXXXXXX 118
EAP V E +++ P E+LAIE P+ EE+ + P
Sbjct: 302 EAPLAAGVKKEQVVEKLTAPKEILAIEYEIPPKVVEEKPASP-------------EPVKA 348
Query: 119 XXXXXXFDTGDLLGLNDIAPNASAIEESNALALAIVPT---ENGTASTFNSGAAQTKDFD 175
DLL ++D AP S +EE NALALAIVP + + + F +G +
Sbjct: 349 EAEKPVEKQPDLLSMDDPAPMVSELEEKNALALAIVPVSVEQPHSTTDFTNG-------N 401
Query: 176 PTGWELALVSTPSTDISSVNERQLAGGLDSLTLNSLYDEGAYKAARQPVDGVPSAPNPFE 235
TGWELALV+ PS++ + + +LAGGLD LTL+SLY++ + +Q P NP
Sbjct: 402 STGWELALVTAPSSNEGAAADSKLAGGLDKLTLDSLYEDAIRVSQQQNRSYNPWEQNPVH 461
>AT5G57200.1 | Symbols: | ENTH/ANTH/VHS superfamily protein |
chr5:23177696-23180601 FORWARD LENGTH=591
Length = 591
Score = 66.6 bits (161), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 75/135 (55%), Gaps = 21/135 (15%)
Query: 125 FDTGDLLGLNDIAPNASAIEESNALALAIVPTENGTASTFNSGAAQTKDFDPTGWELALV 184
DT DLLGL++I P A+ IE++NA +LAI P + T++ NS + +GWELALV
Sbjct: 378 IDTDDLLGLHEINPKAAEIEQNNAFSLAIYPPGHETSAPSNS--LSLIEAGGSGWELALV 435
Query: 185 STPSTDISSVNER-----QLAGGLDSLTLNSLYDEGAYKAARQPVD--------GVPSA- 230
+ + + ++ N R +L GG D+L L+SLY++ + Q + +P A
Sbjct: 436 TPQNNNNNNNNPRPVIATKLGGGFDNLLLDSLYEDDTARRQIQLTNAGYGFGATAIPGAL 495
Query: 231 ----PNPFEV-QDPF 240
PNPF V QDPF
Sbjct: 496 ASSNPNPFGVQQDPF 510
>AT2G25430.1 | Symbols: | epsin N-terminal homology (ENTH)
domain-containing protein / clathrin assembly
protein-related | chr2:10822716-10824677 FORWARD
LENGTH=653
Length = 653
Score = 58.2 bits (139), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/230 (25%), Positives = 95/230 (41%), Gaps = 26/230 (11%)
Query: 1 MPRHEAIKALEAYKRAGQQALNLSDFYEVCKGLELARNFQFPVLREPPQSFLTTMEEYIK 60
M + +KA +AY A +Q L FY CK +AR+ ++P ++ L T+EE+++
Sbjct: 315 MEYSDCVKAFDAYASAAKQIDELIAFYNWCKETGVARSSEYPEVQRITSKLLETLEEFVR 374
Query: 61 EAPRVVTVPSEPMLQLTYRPDEVLAIEDTKSPEEEETSVPIDXXXXXXXXXXXXXXXXXX 120
+ + P ++ + + + PE + +
Sbjct: 375 DRAKRGKSPERKEIEAP------PPVVEEEEPEPDMNEIKALPPPENYTPPPPPEPEPQP 428
Query: 121 XXXXFDTGDLLGLNDIAPNASAIEESNALALAIV---PTENGTASTFNSGAAQTKDFDPT 177
F T DL+ L + +A ++ N ALA+ P NG F+S + +P
Sbjct: 429 EKPQF-TEDLVNLRE--DEVTADDQGNKFALALFAGPPGNNGKWEAFSSNGVTSAWQNPA 485
Query: 178 G------WELALVSTPSTDISSVNERQ---LAGGLDSLTLNSLYDEGAYK 218
WELALV T S E+Q L GG D+L LN +YD+G +
Sbjct: 486 AEPGKADWELALVETTSN-----LEKQTAALGGGFDNLLLNGMYDQGMVR 530
>AT1G03050.1 | Symbols: | ENTH/ANTH/VHS superfamily protein |
chr1:707726-709860 FORWARD LENGTH=599
Length = 599
Score = 55.8 bits (133), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 97/223 (43%), Gaps = 10/223 (4%)
Query: 5 EAIKALEAYKRAGQQALNLSDFYEVCKGLELARNFQFPVLREPPQSFLTTMEEYIKEAPR 64
++IK + + R +Q L FY CK + +AR+ ++P + + Q L M+E+I++
Sbjct: 268 DSIKVYDIFCRVSKQFEELDQFYSWCKNMGIARSSEYPEIEKITQKKLDLMDEFIRDKSA 327
Query: 65 V-VTVPSEPMLQLTYRPDEVLAIEDTKSPEEEETSV-----PIDXXXXXXXXXXXXXXXX 118
+ T S+ + D+ E+ +E+ ++ P
Sbjct: 328 LEHTKQSKSVKSEADEDDDEARTEEVNEEQEDMNAIKALPEPPPKEEDDVKPEEEAKEEV 387
Query: 119 XXXXXXFDTGDLLGL-NDIAPNASAIEESNALALAIVPTENGTASTFNSGAAQTKDFDPT 177
+ GDLL L N A +S ALAL P +G+ S G KD D
Sbjct: 388 IIEKKQEEMGDLLDLGNTNGGEAGQAGDSLALALFDGPYASGSGSESGPGWEAFKD-DSA 446
Query: 178 GWELALVSTPSTDISSVNERQLAGGLDSLTLNSLYDEGAYKAA 220
WE ALV T +T++S + +L GG D L LN +Y GA AA
Sbjct: 447 DWETALVQT-ATNLSG-QKSELGGGFDMLLLNGMYQHGAVNAA 487