Miyakogusa Predicted Gene
- Lj0g3v0125289.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0125289.2 tr|G7L0K5|G7L0K5_MEDTR Nuclear pore complex
protein Nup205 OS=Medicago truncatula GN=MTR_7g099880
PE,84.86,0,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL;
DUF3414,Protein of unknown function DUF34,CUFF.7507.2
(1539 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G51200.1 | Symbols: | Protein of unknown function (DUF3414) ... 2035 0.0
>AT5G51200.1 | Symbols: | Protein of unknown function (DUF3414) |
chr5:20804926-20819408 FORWARD LENGTH=1838
Length = 1838
Score = 2035 bits (5273), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1002/1529 (65%), Positives = 1222/1529 (79%), Gaps = 35/1529 (2%)
Query: 1 MVSPKHLLSTLESVLLAPTPPTAAQRVELFHAIRSSLHSFQSLLSYPPPKPSDRSQVQSK 60
MVSPK L++ + S LL + PT QR+EL HAIR+S S Q+LLS+PPPKPSDR+QVQSK
Sbjct: 1 MVSPKDLVAIVHSSLLGTSRPTPTQRIELTHAIRNSFPSLQNLLSFPPPKPSDRAQVQSK 60
Query: 61 SIRLPDSPPISLDDQDVQIALKLSDDLHLNEVDCVRLLVSANQEWGLMGREPLEILRLAA 120
IRLPDS PISLDDQD+ I+LKLSD+LHLNE+D VRLLVS+NQEWGLMGR+PLEI RLA
Sbjct: 61 EIRLPDSLPISLDDQDIAISLKLSDELHLNEIDSVRLLVSSNQEWGLMGRDPLEIQRLAT 120
Query: 121 GLWYTERRDLITSLHLLLRAVVLDQGLEDDVLVDIQKYLEDLINSGLRQRLISLIKELNR 180
GLWYT RRDL ++L+ LLRAVVLD+GLE D++ DIQ LE+LI +GLRQRLI+LIKELNR
Sbjct: 121 GLWYTGRRDLTSTLYTLLRAVVLDEGLEPDLIADIQGLLEELIEAGLRQRLITLIKELNR 180
Query: 181 EEPSGFGGPQCERYIIDSRGSLVERLAVVSRERLILGHCLVLSILVVRTSPKDVKDIFSV 240
E+P+G GGP CERY+IDSRG+LVER AVV RERLILGHCLVLSILV R KDVKDI+ +
Sbjct: 181 EDPTGLGGPLCERYLIDSRGALVERRAVVQRERLILGHCLVLSILVDRPGSKDVKDIYYI 240
Query: 241 LKDSASEVSESIAPVKYQITFSLLFAVVIAFISDGLSTVPDKASVLSSNASFRHEFHELV 300
LKD+A++++E + QITFSLLF+++I F+SD +S + DK+S++S +ASFR +F ++V
Sbjct: 241 LKDNAAQLTEGNDTISSQITFSLLFSLIITFVSDAISRLSDKSSMISQDASFRTDFQDIV 300
Query: 301 MAAGNDPIVEGFVGGIRLAWAVRLMLIQDGVAARETVSSGSSNEMGYLSQCLEVIFSNNN 360
MA+G+DP +GF+GGIRLAWAV LMLI DG++ +T+S+ S+ +MG++ CLE IFS N
Sbjct: 301 MASGSDPTADGFIGGIRLAWAVHLMLIHDGISGMDTISTASTTDMGHICSCLESIFSKNV 360
Query: 361 FQFLLDKVLRTAAYQTEDEDMVYMYNAYLHKLITCFLSNPLARDKIKESKEKIMSVLSPY 420
FQFLLD VLRTAAYQ ++ED++Y+YNAYLHKL +CFLS+P+ARDK+KESK+ MSVL+ Y
Sbjct: 361 FQFLLDNVLRTAAYQNDEEDIIYIYNAYLHKLASCFLSHPIARDKVKESKDMAMSVLNSY 420
Query: 421 RSVGTHDFAQDSNSSSLHGTEMGSIPFNSILDFVSEIYQKEPELLSGNDVLWTFVNFAGE 480
R+ D + + S +PF S+++F KEPELLSGNDVLWTFVNFAGE
Sbjct: 421 RTSDPLDGSMQTEESD------RPLPFISLMEF------KEPELLSGNDVLWTFVNFAGE 468
Query: 481 DHTNFRTLVAFLNMLSTLASSQEGASKVYELLQNKSFRSIGWSTLFECLTIYDEKFKQSL 540
DHTNF+TLVAFL ML TLAS+QEGASKVYELL+ SFRSIGW TLF+C+ IYDEKFKQSL
Sbjct: 469 DHTNFKTLVAFLEMLCTLASTQEGASKVYELLRGTSFRSIGWPTLFDCIRIYDEKFKQSL 528
Query: 541 QTAGAMLPEIQEGDAKALVAYLNVLKKVVENGNPIEKKNWFPDIEPLFKLLSYENVPPYL 600
QTAGAM+PE EGDAKALVAYLNVL+KVVENGNP E+KNWFPDIEP FKLL YEN+PPYL
Sbjct: 529 QTAGAMMPEFLEGDAKALVAYLNVLQKVVENGNPTERKNWFPDIEPFFKLLGYENIPPYL 588
Query: 601 KGALRNAIATFIHVSPVLKDSIWTYLEQYDLPVVVGPDVQNTPSVGTQVYDMQFELNEIE 660
KGALR IA F++V P ++DSIW +LEQYDLPVVVG V + +QVYDMQFELNE+E
Sbjct: 589 KGALRKTIAAFVNVFPEMRDSIWAFLEQYDLPVVVGSQVGKSDQ-SSQVYDMQFELNEVE 647
Query: 661 ARREQYPSTISFLNLINALIAEERDLTDXXXXXXXXXXXXYDHVFGPFPQRAYTDTCEKW 720
ARREQYPSTISFLNLINALIA E+D+ D +RAY+D CEKW
Sbjct: 648 ARREQYPSTISFLNLINALIAGEKDVNDRG-------------------RRAYSDPCEKW 688
Query: 721 QLVGACLKHFHMILSMYDIKDEDYEGVVDQSRLSTTKESSPLQTQLPVLELMKDFMSGKT 780
QLV ACL+HFHMILSMYDI++ED +G + + E+S LQTQLP++EL+KDFMSGK
Sbjct: 689 QLVVACLQHFHMILSMYDIQEEDLDGFTEHPHFLVSLETSSLQTQLPIIELLKDFMSGKA 748
Query: 781 AFRNIMSILLLGVNSIIAERSSQIYGQHLENAVQLSLEIIILVLEKDLLLSDYWRPLYQP 840
+RN+M IL +GVNSII+ER S+ YG+ LE AVQLSLEI++LV EKDLL+SD WRPLYQP
Sbjct: 749 LYRNLMGILQVGVNSIISERLSKTYGKILEKAVQLSLEILLLVFEKDLLVSDVWRPLYQP 808
Query: 841 LDIILSHDHNQIVALLEYVRYDFQPKVQQSSIKIMSIF-SSRMVGLVQLLLKSNASNSLI 899
LDIILS DHNQI+ALLEYVRYD P++Q+SSIKIM+I SR+VGLV +L+K +A+NSLI
Sbjct: 809 LDIILSQDHNQIIALLEYVRYDSLPQIQRSSIKIMNILRCSRLVGLVPMLIKIDAANSLI 868
Query: 900 EDYAACLELRSEESQKVENN-NDSGILIMQLLIDNIGRPAPNITHLLLKFDIDTPVERTV 958
EDYAACLE R EE + VEN+ +D G+LIMQLL+DNI RPAP+ITHLLLKFD+D PVE TV
Sbjct: 869 EDYAACLEGRLEEGEVVENSCDDLGVLIMQLLVDNINRPAPSITHLLLKFDLDAPVEGTV 928
Query: 959 LQPKFYYSCMKVVLDILEKLSKPDVNALLHEFGFQLFYELCVDPLTCFPTMDLLSNKKYQ 1018
LQPKF+YSC+KV+L++LEKL PD+N LL EFGFQL EL +DPLT PTMDLLS+KKYQ
Sbjct: 929 LQPKFHYSCLKVILEMLEKLPNPDINFLLFEFGFQLLCELNLDPLTSGPTMDLLSSKKYQ 988
Query: 1019 FFVKHLDTIGIAPLPKRNSNQPLRISSLHQRAWILKLLAVELHAGDVSSSKHREACQTIL 1078
FF++HLDTIG+A LPKR+ +Q LRISSLHQRAW+LKLLA+ LH G SSS H EACQ+IL
Sbjct: 989 FFLQHLDTIGVATLPKRSGSQALRISSLHQRAWLLKLLAIALHTGSGSSSAHLEACQSIL 1048
Query: 1079 SHLYGQGITEIGGGQAISQFSLQGASENAAIRTVSKSKVLELLEIIQFRCPDSTTKLSNV 1138
SHL+G+ +TE S Q + A ++SKSK L LLEI+QFR PD++ +L +
Sbjct: 1049 SHLFGREVTEAANEPFSSSTYPQDGLDYAGTSSISKSKALALLEILQFRSPDASMQLPQI 1108
Query: 1139 VAGMKYDLLPEDILGNPGNSGNG-VYYYSERGDRLIDLASFHDKLWLKYNSAYQHGSNLG 1197
V+ +KYD L EDILGN S +G +YYYSERGDRLIDL+SF +KLW K +S + +
Sbjct: 1109 VSSLKYDSLVEDILGNRDTSVSGSIYYYSERGDRLIDLSSFSNKLWQKLHSGFPLVDSFP 1168
Query: 1198 SDIELNDVRETIQQLLRWGWKYNKNLEEQAAQLHMLTAWSQIVEVSASRRLTMLEDRSEI 1257
+ EL++VRETIQQLL+WGWKYN+NLEEQAAQLHML WSQIVEVSA RR++ L++RSEI
Sbjct: 1169 NVAELSEVRETIQQLLKWGWKYNRNLEEQAAQLHMLAGWSQIVEVSACRRISSLDNRSEI 1228
Query: 1258 LFQVXXXXXXXXXXXXXXXKMAFILSQVALTCMAKLRDERFMFPGSLSSDNITCIDLIVV 1317
L+++ KMAF+L+QVALTC+AKLRD+RF F G+LSSD +TC+D+++V
Sbjct: 1229 LYRILDASLSASASPDCSLKMAFVLTQVALTCIAKLRDDRFSFQGALSSDTVTCLDVMMV 1288
Query: 1318 KQLSNGACLTILFKLIMAIXXXXXXXXXXXXQYALLLSYFQYCLNVVDPDVPTSVLQFLL 1377
K LS GAC ++LFKL+MAI QYALLLSYFQYC +++ DVPTSV+QFLL
Sbjct: 1289 KHLSTGACHSVLFKLVMAILRHESSESLRRRQYALLLSYFQYCQHMIALDVPTSVVQFLL 1348
Query: 1378 LNEQDSEHIDLPKIDNEQAELARANFSTLRKEAQSILDLVVKDATHGSEPGKTIALYVLD 1437
LNEQD E +D+ KID EQA+LARANF ++KEAQ ILDLV+KDA+ GSE GKTI+LYVL+
Sbjct: 1349 LNEQDGEDLDIQKIDKEQADLARANFFIIKKEAQGILDLVIKDASQGSEFGKTISLYVLE 1408
Query: 1438 GLICIDHERYFLSQLQSRGFLRSCFTAISSVSNQDGGLSLDSLQRACTFEAELALLLRIS 1497
L+CIDHERYFLSQLQSRGF+RSC +IS++S QDG L+S QRACT EAELALLLRIS
Sbjct: 1409 ALVCIDHERYFLSQLQSRGFIRSCLGSISNISYQDGTHLLESQQRACTLEAELALLLRIS 1468
Query: 1498 HKYGKSGAQVLFSMGIFEHLASGRAINLQ 1526
HKYGKSG QVLFSMG EH+AS RAI+ +
Sbjct: 1469 HKYGKSGGQVLFSMGALEHIASCRAISFK 1497