Miyakogusa Predicted Gene
- Lj1g3v2372280.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2372280.1 Non Chatacterized Hit- tr|Q1PES0|Q1PES0_ARATH
Putative uncharacterized protein OS=Arabidopsis thalia,63.38,2e-19,
,CUFF.28954.1
(1009 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G08960.1 | Symbols: | ARM repeat superfamily protein | chr3:... 1526 0.0
AT1G26170.1 | Symbols: | ARM repeat superfamily protein | chr1:... 71 3e-12
AT3G59020.1 | Symbols: | ARM repeat superfamily protein | chr3:... 54 4e-07
AT3G59020.2 | Symbols: | ARM repeat superfamily protein | chr3:... 54 5e-07
>AT3G08960.1 | Symbols: | ARM repeat superfamily protein |
chr3:2730405-2736762 REVERSE LENGTH=1010
Length = 1010
Score = 1526 bits (3951), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 732/1010 (72%), Positives = 848/1010 (83%), Gaps = 1/1010 (0%)
Query: 1 MAVSTSDVAAMYSLLSNSMSADHRHRGPAEQALTQSESRPGFCSCLLEVITAKDLAAQVD 60
MA+S SD+ AMY+LL+NSMS D R PAE AL+ SESRPGFCSCL+EVI +KDL + VD
Sbjct: 1 MALSASDLPAMYTLLANSMSGDETVRRPAEAALSLSESRPGFCSCLMEVIASKDLVSHVD 60
Query: 61 VRLMATVYFKNSINRYWRHRRYSSGISNEEKIHLRQKLLMHLREENDQIALMLAVLISKI 120
VRLMA+VYFKNSINR+W+ RR S +SNEEK HLRQKLL HLREEN QIA MLAVLISKI
Sbjct: 61 VRLMASVYFKNSINRHWKSRRNSWSMSNEEKSHLRQKLLSHLREENYQIAEMLAVLISKI 120
Query: 121 ARIDYPKEWPDIFLILSQQLQSADILASHRIFMILFRTLKELSTKRLTADQRHFAEISSH 180
AR DYP+EWPD+F +L+QQL SAD+LASHRIF+ILFRTLKELSTKRLTADQ+ FAEISS
Sbjct: 121 ARFDYPREWPDLFSVLAQQLHSADVLASHRIFLILFRTLKELSTKRLTADQKTFAEISSQ 180
Query: 181 FFDYSWRLWQSDVQTILHGFSTLSQSFNSNAEDQHHD-LYLTCERWLLCSKIIRQLVISG 239
FFD+SW LWQ+DVQTILHGFST+ QS+ SN+ +QHHD L+LTCERW LC KI+RQL+ISG
Sbjct: 181 FFDFSWHLWQTDVQTILHGFSTMVQSYGSNSAEQHHDELFLTCERWFLCLKIVRQLIISG 240
Query: 240 FQSDSKCFQEVRPVKEVSPVLLSSIQSLLPYYSTFQKQYPKFWDFLKRTCTKLMKILVAF 299
F SD+ QE++PVKEVSP LL++ QS LPYYS+FQ + PKFW+F+K+ C KLMK+L A
Sbjct: 241 FLSDANNIQEIQPVKEVSPALLNAAQSFLPYYSSFQNRDPKFWEFVKKACVKLMKVLGAI 300
Query: 300 QGRHPYSFGDKFVLSSVMDFCLNRITDPEPYLLSFEQFLIQCMVMIKNILECKEYKPTLT 359
Q RHP+SFGDK L V+DFCLN+ITDPE LL FE F IQCMVM+K++LECKEYKP+ T
Sbjct: 301 QSRHPFSFGDKCALPVVVDFCLNKITDPEQALLPFEDFFIQCMVMVKSVLECKEYKPSRT 360
Query: 360 GRVMDENGVTLEQIKKNIXXXXXXXXXXXXPNERIVLLCNVLIQRYFVLTASDLEEWYRN 419
GRVMD+NG T EQ KKN PNERIVLLCNVL++RYFVLTASDLEEWY+N
Sbjct: 361 GRVMDDNGDTFEQRKKNASNTVGGIVSSLLPNERIVLLCNVLVRRYFVLTASDLEEWYQN 420
Query: 420 PESFHHEQDMVQWTEKLRPCAEALYIVLFENYSQLLCPVVVSLLQEAMNSCPTSVTEITP 479
PESFHHEQDM+QWTEKLRPCAEALY+VLFENYSQLL P+VVS+LQEAMN+CP SVTEITP
Sbjct: 421 PESFHHEQDMIQWTEKLRPCAEALYMVLFENYSQLLGPIVVSILQEAMNNCPPSVTEITP 480
Query: 480 XXXXXXXXXXXXXXXXXXXSNYLSFKDWFNGALSPELSNEHPNSRIIHRKVAVILGQWVS 539
SNYL+F+DWFNGALS ELSN+HPN RIIHRKVA+ILG WVS
Sbjct: 481 ALLLKDAAYAATAYVYYELSNYLNFRDWFNGALSLELSNDHPNRRIIHRKVAMILGHWVS 540
Query: 540 EIKDDTKRPVYCALIRLLQGNDLSVRLAACRSMCLHIEDANFSDKEFVDLLPNCWESCFK 599
EIKDDTKR VYCALI+LLQ NDL+V+LAA RS+CLH+EDANFS++ F+DLLP CW+SCFK
Sbjct: 541 EIKDDTKRAVYCALIKLLQDNDLAVKLAASRSLCLHVEDANFSEQSFLDLLPICWDSCFK 600
Query: 600 LFEEVQEFDSKVQILNLISILIGHVSEVIPFANKLVQFFQKVWEESSGESLLQIQLLVAL 659
+ E VQEFDSKVQILNLIS LIGHVSEVIP+A KLVQFFQKVWEESSGESLLQIQLLVAL
Sbjct: 601 MVEVVQEFDSKVQILNLISTLIGHVSEVIPYAQKLVQFFQKVWEESSGESLLQIQLLVAL 660
Query: 660 RNFVVALGYQSPICYNILLPILENGIDINSPDELNLLEDSMLLWEATLSQAPSMVPQLLS 719
RNFV+ALGYQSPICY+ILLPIL+ GIDINSPD LNLLEDSM LWE TLS AP MVPQLL+
Sbjct: 661 RNFVIALGYQSPICYSILLPILQKGIDINSPDSLNLLEDSMALWETTLSYAPMMVPQLLA 720
Query: 720 YFSRLVAIMERNFDHLQVAVNIIEDYIILGGNDFLSMHATNIAKILDLIVGNVSDKGLLS 779
F +V I+ER+FDHLQVAV+I++ YIIL G +FL+MHA+++AKILDLIVGNV+DKGLLS
Sbjct: 721 LFPYMVEIIERSFDHLQVAVSIMDSYIILDGGEFLNMHASSVAKILDLIVGNVNDKGLLS 780
Query: 780 VLPVIDILIQCFPMEVPPLISSALQKLIIVCLSGGDDRDPSKTSVKASSAAILARLLVMN 839
+LPVIDIL+QCFP+EVPPLISS LQKL+I+CLSGGDDRDPSKT+VK SSAAILAR+LVMN
Sbjct: 781 ILPVIDILVQCFPVEVPPLISSCLQKLVIICLSGGDDRDPSKTAVKVSSAAILARILVMN 840
Query: 840 TNSLAQLASDPSTSLLLQAASIPIQENILLCLVDIWVDKVDNVSSIQKKAIGLALSIILT 899
T LAQL SD S S+LLQ A +P+++NILLCL+DIW+DKVD+ S +Q+K GLALSIILT
Sbjct: 841 TTYLAQLTSDSSLSVLLQQAGVPVEDNILLCLIDIWLDKVDHASPMQQKTFGLALSIILT 900
Query: 900 LRLPQVLDKLDQILSVCTSVILGRSDDLTXXXXXXXXXXXXXXXXGSIPSKEFRKRQIKF 959
LR+PQVLDKLD ILS CTSVILG + DLT + PSKE RK QIK
Sbjct: 901 LRMPQVLDKLDLILSTCTSVILGENKDLTEEESSGDMSSSRSQGEETPPSKELRKSQIKV 960
Query: 960 SDSINQLSLEDSVRENLQTCAAIHGESFNAAMSSMHPSAFAQLKQALKMP 1009
SD I Q+SLE+S RENLQTC+ +HG++FN+A+S MHPSA AQ+KQALK+P
Sbjct: 961 SDPIYQMSLENSTRENLQTCSTLHGDAFNSAISRMHPSALAQVKQALKLP 1010
>AT1G26170.1 | Symbols: | ARM repeat superfamily protein |
chr1:9047539-9054438 REVERSE LENGTH=1022
Length = 1022
Score = 71.2 bits (173), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 68/121 (56%), Gaps = 5/121 (4%)
Query: 15 LSNSMSADHRHRGPAEQALTQSESRPGFCSCLLEVITAKDLAAQVDVRLMATVYFKNSIN 74
LS S+ + R AE +L Q+ +PGF S L V KDL+ + +R +A V K I
Sbjct: 14 LSASLDPNQNVRSFAETSLNQASLQPGFGSALCRVAANKDLS--LGLRQLAAVLLKQFIK 71
Query: 75 RYWRHRRYSSG---ISNEEKIHLRQKLLMHLREENDQIALMLAVLISKIARIDYPKEWPD 131
++WR + +S+EEK +R +LL L + + +I +++ IS IA D+P+EWP+
Sbjct: 72 KHWRENEEAFEYPLVSSEEKALIRGQLLGSLDDSHRKICTAISMDISSIATYDWPEEWPE 131
Query: 132 I 132
+
Sbjct: 132 L 132
>AT3G59020.1 | Symbols: | ARM repeat superfamily protein |
chr3:21810973-21817418 REVERSE LENGTH=1029
Length = 1029
Score = 54.3 bits (129), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 69/139 (49%), Gaps = 3/139 (2%)
Query: 18 SMSADHRHRGPAEQALTQSESRPGFCSCLLEVITAKDLAAQVDVRLMATVYFKNSINRYW 77
+ S + R AEQ+L Q + P +L++I D + + VR A+++FKN I ++W
Sbjct: 14 AFSPNPDERRAAEQSLNQLQHTPQHLIRILQIIV--DGGSDLSVRQSASIHFKNFIAKHW 71
Query: 78 R-HRRYSSGISNEEKIHLRQKLLMHLREENDQIALMLAVLISKIARIDYPKEWPDIFLIL 136
H + I +K +R ++L+ + + + + + + I DYP++WP++ +
Sbjct: 72 EPHSGDQNIILPSDKNVVRNQILVFVSQVPPILRVQMGECLKTIIYADYPEQWPELLDWV 131
Query: 137 SQQLQSADILASHRIFMIL 155
Q LQ + + + IL
Sbjct: 132 KQNLQKPQVYGALFVLRIL 150
>AT3G59020.2 | Symbols: | ARM repeat superfamily protein |
chr3:21810973-21817418 REVERSE LENGTH=1030
Length = 1030
Score = 54.3 bits (129), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 69/139 (49%), Gaps = 3/139 (2%)
Query: 18 SMSADHRHRGPAEQALTQSESRPGFCSCLLEVITAKDLAAQVDVRLMATVYFKNSINRYW 77
+ S + R AEQ+L Q + P +L++I D + + VR A+++FKN I ++W
Sbjct: 14 AFSPNPDERRAAEQSLNQLQHTPQHLIRILQIIV--DGGSDLSVRQSASIHFKNFIAKHW 71
Query: 78 R-HRRYSSGISNEEKIHLRQKLLMHLREENDQIALMLAVLISKIARIDYPKEWPDIFLIL 136
H + I +K +R ++L+ + + + + + + I DYP++WP++ +
Sbjct: 72 EPHSGDQNIILPSDKNVVRNQILVFVSQVPPILRVQMGECLKTIIYADYPEQWPELLDWV 131
Query: 137 SQQLQSADILASHRIFMIL 155
Q LQ + + + IL
Sbjct: 132 KQNLQKPQVYGALFVLRIL 150