Miyakogusa Predicted Gene
- Lj3g3v3336120.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v3336120.1 Non Chatacterized Hit- tr|I1LJM2|I1LJM2_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.37576 PE,78.59,0,no
description,NULL; coiled-coil,NULL; AARP2CN (NUC121) domain,AARP2CN;
seg,NULL; DUF663,Ribosome bi,CUFF.45627.1
(1022 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G06720.1 | Symbols: | P-loop containing nucleoside triphosph... 674 0.0
AT1G42440.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 82 2e-15
>AT1G06720.1 | Symbols: | P-loop containing nucleoside triphosphate
hydrolases superfamily protein | chr1:2060615-2067281
REVERSE LENGTH=1147
Length = 1147
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/596 (58%), Positives = 426/596 (71%), Gaps = 49/596 (8%)
Query: 379 GNVSKWKESLAERTLSQKTPSLMQLVYGESTANLISINKENXXXXXXXXXGD-FFEPIEE 437
GN+S+WK L E +K P+LMQ+VYG S+ IN+ + + FF+P E
Sbjct: 539 GNISQWKAPLKE-IARKKNPNLMQIVYGASSLATPLINENHDISDDDESDDEDFFKPKGE 597
Query: 438 VKKKNVRDGFNDGNVNTEDYSKCAQFMD----KRWDKKD-NEEIRNRFVTGNLAKAALRN 492
+ KN+ G + G VN+ED C++F++ K W +K+ E IR+RF TG+ +KAALR+
Sbjct: 598 -QHKNLGGGLDVGYVNSED---CSKFVNYGYLKNWKEKEVCESIRDRFTTGDWSKAALRD 653
Query: 493 GLPKATTEEENDDVYGDFEDLETGEKIENNQTDDTTHN-GDDLEAETRRLKKLALRAKFD 551
E E+D++YGDFEDLETGEK ++++ ++ N +D +AE
Sbjct: 654 KNLGTGGEGEDDELYGDFEDLETGEKHKSHENLESGANENEDEDAEV------------- 700
Query: 552 SRFXXXXXXXXXXXXXXXXAKFRRGQPNETSYFDKLKEEIELQKQMNIAELNDLDEATRL 611
R Q +E Y DKLKE E+ KQ N E NDLDE TR+
Sbjct: 701 ---------------VERDGNNPRSQADEPGYADKLKEAQEITKQRNELEYNDLDEETRI 745
Query: 612 EIEGFRTGTYVRLEVHDVPCEMVEYFDPYHPILIGGVGLGEENVGYMQARLKRHRWHKKV 671
E+ GFRTGTY+RLE+H+VP EMVE+FDP HPIL+GG+G GE+NVGYMQARLK+HRWHKKV
Sbjct: 746 ELAGFRTGTYLRLEIHNVPYEMVEFFDPCHPILVGGIGFGEDNVGYMQARLKKHRWHKKV 805
Query: 672 LKTRDPIIVSAGWRRYQTTPVYAIEDHNGRHRMLKYTPEHMHCLAMFYGPLAPPNTGIVA 731
LKTRDPIIVS GWRRYQT PV+AIED NGRHRMLKYTPEHMHCLA F+GPL PPNTG VA
Sbjct: 806 LKTRDPIIVSIGWRRYQTIPVFAIEDRNGRHRMLKYTPEHMHCLASFWGPLVPPNTGFVA 865
Query: 732 VQNLSNNQATFRITATAVVLEFNHAARIVKKIKLVGYPCKIFKKTALIKEMFTSDLEIAR 791
QNLSNNQA FRITAT+VVLEFNH ARIVKKIKLVG PCKI KKTA IK+MFTSDLEIAR
Sbjct: 866 FQNLSNNQAGFRITATSVVLEFNHQARIVKKIKLVGTPCKIKKKTAFIKDMFTSDLEIAR 925
Query: 792 FEGAAIRTVSGIRGQVKKAAKEEIGNQPKRKGGITKEGIARCTFEDKILMSDIVFLRAWT 851
FEG+++RTVSGIRGQVKKA K + N+ +EGIARCTFED+I MSD+VFLRAWT
Sbjct: 926 FEGSSVRTVSGIRGQVKKAGKNMLDNK-------AEEGIARCTFEDQIHMSDMVFLRAWT 978
Query: 852 QVEVPQFYNLLTTALQPRDQTWKGMRTVAELRRDHDLPVPVNKDSLYKKIERKPRKFNPL 911
VEVPQFYN LTTALQPRD+TW GM+T ELRR+ ++P+PVNKDSLYK IERK +KFNPL
Sbjct: 979 TVEVPQFYNPLTTALQPRDKTWNGMKTFGELRRELNIPIPVNKDSLYKAIERKQKKFNPL 1038
Query: 912 VVPRSLQERLPFASKPKETPKRNQPLLEQRRQKGVVMEPRERKVLALVQHIQLINH 967
+P+ L++ LPF SKPK PKR +P LE +R V+MEP+ERK ++Q QL+ H
Sbjct: 1039 QIPKRLEKDLPFMSKPKNIPKRKRPSLEDKR--AVIMEPKERKEHTIIQQFQLLQH 1092
Score = 424 bits (1089), Expect = e-118, Method: Compositional matrix adjust.
Identities = 209/346 (60%), Positives = 246/346 (71%), Gaps = 27/346 (7%)
Query: 1 METFEFLNILQVHGFPKVMGVLTHLDKFKDVKKLRKTKQRLKHRFWTEIYDGAKLFYLSG 60
METFEFLNI+QVHGFP+VMGVLTHLDKF DVKKLRKTK LKHRFWTEIY GAKLFYLSG
Sbjct: 163 METFEFLNIMQVHGFPRVMGVLTHLDKFNDVKKLRKTKHHLKHRFWTEIYHGAKLFYLSG 222
Query: 61 LIHGKYVKREVHNLARFISVMKFHPLSWRTSHPYVLADRFEDITPPEKVHSNNKCDRKIT 120
LIHGKY REVHNLARF+ V+K PL+WRT+HPYVL DR ED+TPPEKV + KCDR IT
Sbjct: 223 LIHGKYTPREVHNLARFVIVIKPQPLTWRTAHPYVLVDRLEDVTPPEKVQMDKKCDRNIT 282
Query: 121 LYGYLRGCNLKKGNKVHIAGVGDYSLAGVTALPDPCPLPSAAKKKGLRDKEKLFYAPMSG 180
++GYLRGCN KK KVHIAGVGD+ +AGVTAL DPCPLPSA KKKGLRD++KLFYAPMSG
Sbjct: 283 VFGYLRGCNFKKRMKVHIAGVGDFIVAGVTALTDPCPLPSAGKKKGLRDRDKLFYAPMSG 342
Query: 181 LGDLLYDKDAVYININDHFVQFSKVDDENSAITRKGNERDIGEVLVRSLQETKYPINEKL 240
+GDL+YDKDAVYININ H VQ+SK DD T KG RD+GE LV+SLQ TKY ++EKL
Sbjct: 343 IGDLVYDKDAVYININSHQVQYSKTDDGKGEPTNKGKGRDVGEDLVKSLQNTKYSVDEKL 402
Query: 241 ENSMINLFAQSSEALAEAQGTNMDVEQDGIIETVDYNEMXXXXXXXXXXXXXXXXMTGSG 300
+ + IN F + + A +E + D Y+ + GS
Sbjct: 403 DKTFINFFGKKTSASSETKLKAEDA----------YHSLPE----------------GSD 436
Query: 301 LPDQAEDDAAGDKSTNKDHLEEHIEFHNGRRRRRAIFGNDADQSDL 346
Q+ DD D N+ +++ E H GR RR+AIF D ++ D
Sbjct: 437 SESQSGDDEE-DIVGNESEMKQETEIHGGRLRRKAIFKTDLNEDDF 481
>AT1G42440.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: ribosome biogenesis; LOCATED IN: nucleus;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; CONTAINS InterPro DOMAIN/s: AARP2CN
(InterPro:IPR012948), Protein of unknown function DUF663
(InterPro:IPR007034); BEST Arabidopsis thaliana protein
match is: P-loop containing nucleoside triphosphate
hydrolases superfamily protein (TAIR:AT1G06720.1); Has
2741 Blast hits to 2088 proteins in 291 species: Archae
- 2; Bacteria - 131; Metazoa - 833; Fungi - 650; Plants
- 171; Viruses - 49; Other Eukaryotes - 905 (source:
NCBI BLink). | chr1:15895528-15899939 REVERSE LENGTH=793
Length = 793
Score = 81.6 bits (200), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/237 (23%), Positives = 112/237 (47%), Gaps = 20/237 (8%)
Query: 619 GTYVRLEVHDVP----CEMVEYFDPYHPILIGGVGLGEENVGYMQARLKRHRWHKKVLKT 674
G+YVRL + +VP ++ + PI+ G+ E + + +K++ ++ +KT
Sbjct: 556 GSYVRLHIKEVPLGAASKLSSLVNTTKPIIGFGLLQHESKMSVLHFSVKKYDGYEAPIKT 615
Query: 675 RDPIIVSAGWRRYQTTPVYAIEDHNG-RHRMLKYTPEHMHCLAMFYGPLAPPNTGIVAVQ 733
++ ++ G+R++ PV+A ++ + +H+M ++ LA YGP++ P +V ++
Sbjct: 616 KEELMFHVGFRQFIARPVFATDNFSSDKHKMERFLHPGCFSLASIYGPISFPPLPLVVLK 675
Query: 734 NLSNNQATFRITATAVVLEFNHAARIVKKIKLVGYPCKIFKKTALIKEMFTSDLEIARFE 793
+S I A + I+KKI L GYP ++ K A ++ MF + ++ F+
Sbjct: 676 -ISEGSDPPAIAALGSLKSVEPNKIILKKIILTGYPQRVSKMKASVRYMFHNPEDVKWFK 734
Query: 794 GAAIRTVSGIRGQVKKAAKEEIGNQPKRKGGITKEGIARCTFEDKILMSDIVFLRAW 850
+ + G RG+V KE +G G +C F + D+V + +
Sbjct: 735 PVEVWSKCGRRGRV----KEPVGTH----------GAMKCIFNGVVQQHDVVCMNLY 777
Score = 54.7 bits (130), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 67/154 (43%), Gaps = 14/154 (9%)
Query: 5 EFLNILQVHGFPKVMGVLTHLDKFKDVKKLRKTKQRLKHRFWTEIYDGAKLFYLSGLIHG 64
+ L++ + G P ++ L DVKK + K+ + +E + K FY +
Sbjct: 172 QCLSVFRSIGLPSTTVLIRDLPS--DVKKKNEMKKMCASQLASEFPEDCK-FYPAD---- 224
Query: 65 KYVKREVHNLARFISVMKFHPLSWRTSHPYVLADRFEDITPPEKVHSNNKCDRKITLYGY 124
+ E+H + WR+ Y++A + + E S+ KC + L GY
Sbjct: 225 --TRDELHKFMWLFKAQRLTVPHWRSQRSYIVARKAGMLVDDE---SSGKC--TLLLSGY 277
Query: 125 LRGCNLKKGNKVHIAGVGDYSLAGVTALPDPCPL 158
LR L VH++GVGD+ + + L DP PL
Sbjct: 278 LRARKLSVNQLVHVSGVGDFQFSKIEVLKDPFPL 311