Miyakogusa Predicted Gene
- Lj0g3v0258939.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0258939.1 Non Characterized Hit- tr|K3Z576|K3Z576_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si021694,33.72,2e-18,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.17174.1
(414 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                  Score      E
Sequences producing significant alignments:                       (bits)   Value

AT5G23490.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   280   2e-75
AT5G08440.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   221   1e-57
AT5G08440.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   190   2e-48
AT3G03560.1 | Symbols: | unknown protein; LOCATED IN: plasma me...    82   9e-16
>AT5G23490.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G08440.1); Has 202 Blast hits to 197 proteins
in 48 species: Archae - 0; Bacteria - 13; Metazoa - 25;
Fungi - 9; Plants - 109; Viruses - 0; Other Eukaryotes -
46 (source: NCBI BLink). | chr5:7919831-7926499 FORWARD
LENGTH=729
Length = 729
Score = 280 bits (715), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 188/467 (40%), Positives = 244/467 (52%), Gaps = 94/467 (20%)
Query: 1 MENGHDGKLAEKFSGLAINQQHGQQGVHDQSNLSSNH--NESLYQVMKAVESAEVTIKQQ 58
MENGH+ +LAE+FSGL G D S L N N++L+QV+KAVE+AE TIK
Sbjct: 1 MENGHEERLAERFSGL---------GFEDSSLLPENEFKNDNLFQVIKAVEAAETTIK-- 49
Query: 59 RRNEQVDQNSHPWKEQ-----VYGSYEARQSIPSSAISNTSNYSGSSEIN---------- 103
EQV++NS E Y++ +S+P + SN +++ S+ ++
Sbjct: 50 ---EQVEENSRLKAELQRSALELAKYKSDESLPQT--SNIGDHTNSTTVSRLVHQPVDWK 104
Query: 104 ------------GTLRVQPNE--------------------------RLPVENTGNSQLS 125
G L V P+ + ++ TG SQ
Sbjct: 105 PVVIKASDADSSGLLVVHPHVNANGEEATVSNRFESHSEETISNGTVKRAIDGTGPSQ-- 162
Query: 126 SPFTRSISPNRHLLGGDLDPQFNPPRQGLTPMAETNNSNTSLQQDLAIKVXXXXXXXXXX 185
F SISP R L G+ D F+ G P+ E N+S + +QDL KV
Sbjct: 163 --FDSSISPMRMRLEGEHDAHFSSSTHGSMPVGEVNHSGNAWKQDLIHKVQEQEQEISQL 220
Query: 186 XKHLADYAAKEAQIRNEKYVLDKRIAYMRVAFDQQQQDLVDAASKALSYRQDVIEENIRL 245
++L D + KEAQIRNEKYVL+KRIAYMR+AFDQQQQDLVDA+SKALSYRQ++IEENIRL
Sbjct: 221 RRYLTDCSVKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDASSKALSYRQEIIEENIRL 280
Query: 246 TYALQDAQQERSTFVSSLVPLLAEYSLQPNVLDAQSIVSNVKVLFKHXXXXXXXXXXXXX 305
TYALQ QQERSTFVS L+PLL+EYSLQP V DAQSIVSNVKVLFKH
Sbjct: 281 TYALQATQQERSTFVSYLLPLLSEYSLQPQVSDAQSIVSNVKVLFKHLQEKLLLTETKLK 340
Query: 306 XXXYQLTPWRSDMNQNHATAATQSPSHSIGAPLATSNKNGLELVPRHIYSQVKTQVSVDT 365
YQL PW+SD+ NH+ + +PS S G L S K+ + YS T +
Sbjct: 341 ESEYQLAPWQSDV--NHSNDSPLAPSRSAGVALTHSTKDSM-------YSHDHTAI---- 387
Query: 366 QAGTDWGMLGRHQSGLGGGVASNVDADDLERYSPL--ASRGILDLHI 410
DW + + Q G N DD +SPL + ++H+
Sbjct: 388 ----DWNLERQQQDEPGSSAVRNYHLDDSSTFSPLENSQSAAFEMHV 430
>AT5G08440.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G23490.1); Has 141 Blast
hits to 139 proteins in 35 species: Archae - 0; Bacteria
- 9; Metazoa - 21; Fungi - 6; Plants - 94; Viruses - 0;
Other Eukaryotes - 11 (source: NCBI BLink). |
chr5:2721037-2726970 FORWARD LENGTH=726
Length = 726
Score = 221 bits (562), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 129/264 (48%), Positives = 154/264 (58%), Gaps = 31/264 (11%)
Query: 136 RHLLGGDLDPQFNPPRQGLTPMAETNNSNTSLQQDLAIKVXXXXXXXXXXXKHLADYAAK 195
R LL GD D N L P+ E NNS T+ +Q+L KV K+LADY+ K
Sbjct: 184 RPLLEGDHDLHINSSSHELMPVGEVNNSGTAWKQELIHKVQEQDQEILRLRKYLADYSTK 243
Query: 196 EAQIRNEKYVLDKRIAYMRVAFDQQQQDLVDAASKALSYRQDVIEENIRLTYALQDAQQE 255
E QIRNEKYVL+KRIA+MR AFDQQQQDLVDAASKALSYRQ++IEENIRLTYALQ A+QE
Sbjct: 244 EVQIRNEKYVLEKRIAHMRSAFDQQQQDLVDAASKALSYRQEIIEENIRLTYALQAAEQE 303
Query: 256 RSTFVSSLVPLLAEYSLQPNVLDAQSIVSNVKVLFKHXXXXXXXXXXXXXXXXYQLTPWR 315
RS FVS L+PLL+EYSL P + D+QSIVS+VKVLF+H YQL PW+
Sbjct: 304 RSLFVSILLPLLSEYSLHPQISDSQSIVSSVKVLFRHLQEKLNVTETKLKETEYQLAPWQ 363
Query: 316 SDMNQNHATAATQSPSHSIGAPLATSNKNGLELVPRHIYSQVKTQVSVDTQAGTDWGMLG 375
SD+N H+ A+ SP +G L + S D++
Sbjct: 364 SDVN--HSNASPLSPYQPVGVGL---------------------RYSTDSEH-------- 392
Query: 376 RHQSGLGGGVASNVDADDLERYSP 399
HQ GG ASN D E SP
Sbjct: 393 HHQDRRGGSAASNYHLDGPESRSP 416
Score = 56.6 bits (135), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 63/106 (59%), Gaps = 10/106 (9%)
Query: 1 MENGHDGKLAEKFSGLAINQQHGQQGVHDQSNLSSNHNESLYQVMKAVESAEVTIKQQRR 60
M+NGH+ +LAE+FSG+ + + G S+ + N+SL+QV+KAVE+AE TIKQQ
Sbjct: 1 MDNGHEERLAERFSGVGLGESSG-------SHENDVKNDSLFQVIKAVEAAEATIKQQVE 53
Query: 61 NEQVDQNSHPWKEQVYGSYEARQSIP-SSAISNTSNYS--GSSEIN 103
+ + + Y++ +S+P +S + N SN + GSS ++
Sbjct: 54 ENNLLKAELQRRYLELAKYKSGESLPQTSDLGNHSNTTTGGSSPLH 99
>AT5G08440.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G23490.1). | chr5:2721037-2726970 FORWARD
LENGTH=772
Length = 772
Score = 190 bits (482), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/153 (64%), Positives = 115/153 (75%)
Query: 136 RHLLGGDLDPQFNPPRQGLTPMAETNNSNTSLQQDLAIKVXXXXXXXXXXXKHLADYAAK 195
R LL GD D N L P+ E NNS T+ +Q+L KV K+LADY+ K
Sbjct: 184 RPLLEGDHDLHINSSSHELMPVGEVNNSGTAWKQELIHKVQEQDQEILRLRKYLADYSTK 243
Query: 196 EAQIRNEKYVLDKRIAYMRVAFDQQQQDLVDAASKALSYRQDVIEENIRLTYALQDAQQE 255
E QIRNEKYVL+KRIA+MR AFDQQQQDLVDAASKALSYRQ++IEENIRLTYALQ A+QE
Sbjct: 244 EVQIRNEKYVLEKRIAHMRSAFDQQQQDLVDAASKALSYRQEIIEENIRLTYALQAAEQE 303
Query: 256 RSTFVSSLVPLLAEYSLQPNVLDAQSIVSNVKV 288
RS FVS L+PLL+EYSL P + D+QSIVS+VK+
Sbjct: 304 RSLFVSILLPLLSEYSLHPQISDSQSIVSSVKI 336
Score = 56.6 bits (135), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 63/106 (59%), Gaps = 10/106 (9%)
Query: 1 MENGHDGKLAEKFSGLAINQQHGQQGVHDQSNLSSNHNESLYQVMKAVESAEVTIKQQRR 60
M+NGH+ +LAE+FSG+ + + G S+ + N+SL+QV+KAVE+AE TIKQQ
Sbjct: 1 MDNGHEERLAERFSGVGLGESSG-------SHENDVKNDSLFQVIKAVEAAEATIKQQVE 53
Query: 61 NEQVDQNSHPWKEQVYGSYEARQSIP-SSAISNTSNYS--GSSEIN 103
+ + + Y++ +S+P +S + N SN + GSS ++
Sbjct: 54 ENNLLKAELQRRYLELAKYKSGESLPQTSDLGNHSNTTTGGSSPLH 99
>AT3G03560.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G23490.1);
Has 157 Blast hits to 146 proteins in 38 species: Archae
- 3; Bacteria - 14; Metazoa - 8; Fungi - 0; Plants -
120; Viruses - 0; Other Eukaryotes - 12 (source: NCBI
BLink). | chr3:853153-856486 REVERSE LENGTH=521
Length = 521
Score = 81.6 bits (200), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 77/133 (57%), Gaps = 5/133 (3%)
Query: 162 NSNTSLQQD-----LAIKVXXXXXXXXXXXKHLADYAAKEAQIRNEKYVLDKRIAYMRVA 216
++NT L QD L KV + +A K+ Q+ NEKY L+++ A +RVA
Sbjct: 27 DTNTKLIQDPEEMALYAKVRSQEEEIHSLQERIAAACLKDMQLLNEKYGLERKCADLRVA 86
Query: 217 FDQQQQDLVDAASKALSYRQDVIEENIRLTYALQDAQQERSTFVSSLVPLLAEYSLQPNV 276
D++Q + V +A L+ R+ +EEN++L + L+ + ER F++SL+ LLAEY + P V
Sbjct: 87 IDEKQNESVTSALNELARRKGDLEENLKLAHDLKVTEDERYIFMTSLLGLLAEYGVWPRV 146
Query: 277 LDAQSIVSNVKVL 289
+A +I S +K L
Sbjct: 147 ANATAISSGIKHL 159
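
The percentages on the "Identities / Positives / Gaps" lines above can be rechecked from the raw counts. The following is a minimal sketch, not part of the BLAST output: the names `STAT_RE`, `parse_stats`, and `truncated_percent` are hypothetical helpers, and the use of truncation rather than rounding is an assumption inferred from the values in this report (for example, 129/264 is 48.9% but is reported as 48%).

```python
import re

# Matches one "Name = count/length (pct%)" field of a BLAST statistics line.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, aligned_length, reported_percent)} for one line."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in STAT_RE.findall(line)
    }


def truncated_percent(count, total):
    """Integer percentage, truncated (assumption inferred from this report)."""
    return count * 100 // total


if __name__ == "__main__":
    # Statistics line copied from the first AT5G08440.1 alignment above.
    line = ("Identities = 129/264 (48%), Positives = 154/264 (58%), "
            "Gaps = 31/264 (11%)")
    for name, (count, total, reported) in parse_stats(line).items():
        recomputed = truncated_percent(count, total)
        status = "ok" if recomputed == reported else "mismatch"
        print(f"{name}: {count}/{total} -> {recomputed}% "
              f"(reported {reported}%) {status}")
```

A line without a Gaps field, such as the first AT5G08440.2 alignment, is handled the same way; the returned dictionary simply has no "Gaps" key.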