Miyakogusa Predicted Gene
- Lj3g3v0030810.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0030810.1 Non Characterized Hit- tr|D7LP21|D7LP21_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,37.44,1e-17,seg,NULL,
gene.Ljchr3_pseudomol_20120830.path1.gene68.1
(458 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
AT1G15780.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 234 1e-61
AT1G15770.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 213 3e-55
AT2G10440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 93 4e-19
AT2G10440.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 84 2e-16
AT1G15772.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 50 4e-06
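
The summary lines above each end in two whitespace-separated fields, the bit score and the E-value, preceded by the subject identifier and a truncated description. A minimal sketch of reading them in Python follows; the names `Hit` and `parse_hit_line` are hypothetical, and the handling of a bare `e-100` style E-value is an assumption about older BLAST output rather than something seen in this report.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' table."""
    subject_id: str
    bit_score: float
    e_value: float


def parse_hit_line(line: str) -> Hit:
    # The last two whitespace-separated fields are the bit score and E-value;
    # everything before them is the identifier plus a free-text description.
    rest, score, evalue = line.rstrip().rsplit(None, 2)
    # Assumption: some BLAST versions print tiny E-values as "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return Hit(rest.split()[0], float(score), float(evalue))


summary_lines = [
    "AT1G15780.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 234 1e-61",
    "AT1G15770.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 213 3e-55",
    "AT2G10440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 93 4e-19",
    "AT2G10440.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 84 2e-16",
    "AT1G15772.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 50 4e-06",
]
hits = [parse_hit_line(line) for line in summary_lines]
for hit in hits:
    print(hit.subject_id, hit.bit_score, hit.e_value)
```

Splitting from the right avoids having to interpret the description, which itself contains spaces, pipes and semicolons.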
>AT1G15780.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G10440.1); Has
103701 Blast hits to 43153 proteins in 1828 species:
Archae - 30; Bacteria - 7385; Metazoa - 38639; Fungi -
11531; Plants - 7727; Viruses - 307; Other Eukaryotes -
38082 (source: NCBI BLink). | chr1:5430446-5435921
REVERSE LENGTH=1335
Length = 1335
Score = 234 bits (597), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 127/216 (58%), Positives = 152/216 (70%), Gaps = 6/216 (2%)
Query: 234 ANIGHQHTGCAAVAAQSLAIGTSGISAPPESAEFTGPDGVHGNSFPPTLGKSTVSEQPLD 293
NI Q QSLAIGT GISA P EFT PDG NS T GK + +E P++
Sbjct: 989 GNIARQQATGMQGVVQSLAIGTPGISASPLLQEFTSPDGNILNSSTITSGKPSATELPIE 1048
Query: 294 RLIKAVKSLTPNALSTAVSDIGSVVSMNDRIVGSAPGNGSKAAAGADLVAMTNCRLQARN 353
RLI+AVKS++P ALS+AVSDIGSVVSM DRI GSAPGNGS+A+ G DLVAMT CRLQARN
Sbjct: 1049 RLIRAVKSISPQALSSAVSDIGSVVSMVDRIAGSAPGNGSRASVGEDLVAMTKCRLQARN 1108
Query: 354 FINQDGANGTRRMKRCTHATPLNDVSYAGCLNNSIKQL---EASDLDSTATCI-KKPRIQ 409
F+ Q+G T++MKR T A PL+ S G + ++ KQ E SDL+STAT KK R +
Sbjct: 1109 FMTQEGMMATKKMKRHTTAMPLSVASLGGSVGDNYKQFAGSETSDLESTATSDGKKARTE 1168
Query: 410 TNHALLEEIREVNQRLIDTVVDISDEE--IDPTAAA 443
T HALLEEI+E+NQRLIDTVV+ISD+E DP+ A
Sbjct: 1169 TEHALLEEIKEINQRLIDTVVEISDDEDAADPSEVA 1204
Score = 208 bits (530), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 119/203 (58%), Positives = 145/203 (71%), Gaps = 12/203 (5%)
Query: 21 DMWRRLQASG----SLTLPQSVLDQQKQLCQSQRPLPKISSRKKTSLDSTTQTGQPNGGD 76
D+ +RLQASG SL PQ+V+DQQ+QL QSQR LP++ S +SLDST QT NGGD
Sbjct: 536 DVQQRLQASGQVTGSLLPPQNVVDQQRQLYQSQRTLPEMPS---SSLDSTAQTESANGGD 592
Query: 77 WQEEVYQKIKTMKENYLPDLNELYQKIGTKLQQHDSLAQQSKSDKLEKYKLFKMMLERMI 136
WQEEVYQKIK+MKE YLPDLNE+YQ++ KLQQ DS+ QQ +SD+LEK + FK MLERMI
Sbjct: 593 WQEEVYQKIKSMKETYLPDLNEIYQRVAAKLQQ-DSMPQQQRSDQLEKLRQFKTMLERMI 651
Query: 137 SFLQVSKSNIAPSFKEKLGSYEKQIINFININRPRKGMSSLQHPGHLPPQHMHSMSQSHP 196
FL VSKSNI P+ K+K+ YEKQII F+N++RPRK + G LP M M Q
Sbjct: 652 QFLSVSKSNIMPALKDKVAYYEKQIIGFLNMHRPRKPV----QQGQLPQSQMQPMQQPQS 707
Query: 197 QVTQVQPLENQINSQMQTTNMQG 219
Q Q Q +NQ N QMQ+ +MQG
Sbjct: 708 QTVQDQSHDNQTNPQMQSMSMQG 730
>AT1G15770.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G15780.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr1:5426892-5428215 REVERSE LENGTH=293
Length = 293
Score = 213 bits (541), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/266 (46%), Positives = 159/266 (59%), Gaps = 21/266 (7%)
Query: 95 DLNELYQKIGTKLQQHDSLA-QQSKSDKLEKYKLFKMMLERMISFLQVSKSNIAPSFKEK 153
DLNE+YQ++ KLQQ DSL+ Q+ +SD+ EK K K +LE M+ FL +SKSNI P K+
Sbjct: 15 DLNEIYQRVAAKLQQEDSLSHQKQRSDQFEKLKRGKTVLEGMLRFLSLSKSNIKPDLKDS 74
Query: 154 LGSYEKQIINFININRPRKGMSSLQHPGHLPPQHMHSMSQSHPQVTQVQPLENQINSQMQ 213
+ + I+NF+N+ RK + LQ L + M Q Q Q Q ++Q QMQ
Sbjct: 75 MDYRKNNIMNFLNMQSLRKTVQKLQ----LTKSEIQPMQQPLSQTVQDQSHDDQTTLQMQ 130
Query: 214 TTNMQGXXXXXXXXXXXXHGANIGHQHTGCAAVAAQSLAIGTSGISAPPESAEFTGPDGV 273
+ +MQG G+ + G QSL IGT GISA P E T PDG
Sbjct: 131 SMSMQGA------------GSRVQQIRQG----VLQSLEIGTPGISASPLLPELTSPDGN 174
Query: 274 HGNSFPPTLGKSTVSEQPLDRLIKAVKSLTPNALSTAVSDIGSVVSMNDRIVGSAPGNGS 333
N T GKS+ +E P++RLI+A+KS++P ALS+AV DI SVVSM DRI GS PG GS
Sbjct: 175 IINPLTSTCGKSSATELPIERLIRAMKSISPQALSSAVCDIRSVVSMVDRIAGSVPGKGS 234
Query: 334 KAAAGADLVAMTNCRLQARNFINQDG 359
+A+ G DLVAMT C LQ RNF+ QDG
Sbjct: 235 RASFGVDLVAMTKCHLQERNFMTQDG 260
>AT2G10440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G15780.1); Has 8319 Blast hits to 5104 proteins
in 317 species: Archae - 0; Bacteria - 285; Metazoa -
1706; Fungi - 535; Plants - 320; Viruses - 18; Other
Eukaryotes - 5455 (source: NCBI BLink). |
chr2:4013752-4018046 REVERSE LENGTH=935
Length = 935
Score = 93.2 bits (230), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 100/187 (53%), Gaps = 13/187 (6%)
Query: 25 RLQASGSLTLPQSVLDQQKQLCQSQRPLPKISSRK--KTSLDSTTQTGQPNGGDWQEEVY 82
+ QA+ SL Q++ DQQ Q Q +R P I S DST +T N G+WQEE Y
Sbjct: 256 QFQAASSLRQTQNITDQQNQPQQLERANPSILIMNIIVASQDSTGKTVNVNAGNWQEETY 315
Query: 83 QKIKTMKENYLPDLNELYQKIGTKLQQHDSLAQQS-KSDKLEKYKLFKMMLERMISFLQV 141
QKIK +KE LP L+ ++Q++ KL++ +SL Q ++ +EK K K+ +E ++ FL V
Sbjct: 316 QKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQWIEKLKAGKLSMEHLMFFLNV 375
Query: 142 SKSNIAPSFKEKLGSYEKQIINFIN----INRPRKGMSSLQHPGHLPPQHMHSMSQSHPQ 197
+S+++ ++K YE I+ F + RP + Q G PP +QS PQ
Sbjct: 376 HRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQ-----QQQGQFPPSQTAMQTQS-PQ 429
Query: 198 VTQVQPL 204
V Q L
Sbjct: 430 VHVSQSL 436
Score = 76.3 bits (186), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 89/151 (58%), Gaps = 8/151 (5%)
Query: 287 VSEQPLDRLIKAVKSLTPNALSTAVSDIGSVVSMNDRI-VGSAPGNGSKAAAGADLVAMT 345
++E+P+DRLIKA ++ +P +L+ +VS+I SV+SM D I GS+A G DL T
Sbjct: 655 ITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSSGGSRAGLGEDLSERT 714
Query: 346 NCRLQARNFINQDGANGTRRMKRCTHATPLNDVSYAGCLNNSIKQLEASDLDSTATCIKK 405
RNF + N ++RMKR + P D+S + LE+ + +T++ +K
Sbjct: 715 ------RNFTTHEETNLSKRMKRSINIVP-PDMSSQIDSYEQLSSLESEVVSTTSSGLKV 767
Query: 406 PRIQTNHALLEEIREVNQRLIDTVVDISDEE 436
I +ALL+EI+E N RL++TVV+I DE+
Sbjct: 768 NNIAPGYALLQEIKETNGRLVETVVEICDED 798
>AT2G10440.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G15780.1);
Has 1628 Blast hits to 1350 proteins in 149 species:
Archae - 0; Bacteria - 39; Metazoa - 480; Fungi - 159;
Plants - 187; Viruses - 2; Other Eukaryotes - 761
(source: NCBI BLink). | chr2:4013752-4018046 REVERSE
LENGTH=845
Length = 845
Score = 84.3 bits (207), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/148 (36%), Positives = 82/148 (55%), Gaps = 11/148 (7%)
Query: 62 SLDSTTQTGQPNGGDWQEEVYQKIKTMKENYLPDLNELYQKIGTKLQQHDSL-AQQSKSD 120
S DST +T N G+WQEE YQKIK +KE LP L+ ++Q++ KL++ +SL Q ++
Sbjct: 215 SQDSTGKTVNVNAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQ 274
Query: 121 KLEKYKLFKMMLERMISFLQVSKSNIAPSFKEKLGSYEKQIINFIN----INRPRKGMSS 176
+EK K K+ +E ++ FL V +S+++ ++K YE I+ F + RP +
Sbjct: 275 WIEKLKAGKLSMEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQ---- 330
Query: 177 LQHPGHLPPQHMHSMSQSHPQVTQVQPL 204
Q G PP +QS PQV Q L
Sbjct: 331 -QQQGQFPPSQTAMQTQS-PQVHVSQSL 356
Score = 75.9 bits (185), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 89/151 (58%), Gaps = 8/151 (5%)
Query: 287 VSEQPLDRLIKAVKSLTPNALSTAVSDIGSVVSMNDRI-VGSAPGNGSKAAAGADLVAMT 345
++E+P+DRLIKA ++ +P +L+ +VS+I SV+SM D I GS+A G DL T
Sbjct: 575 ITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSSGGSRAGLGEDLSERT 634
Query: 346 NCRLQARNFINQDGANGTRRMKRCTHATPLNDVSYAGCLNNSIKQLEASDLDSTATCIKK 405
RNF + N ++RMKR + P D+S + LE+ + +T++ +K
Sbjct: 635 ------RNFTTHEETNLSKRMKRSINIVP-PDMSSQIDSYEQLSSLESEVVSTTSSGLKV 687
Query: 406 PRIQTNHALLEEIREVNQRLIDTVVDISDEE 436
I +ALL+EI+E N RL++TVV+I DE+
Sbjct: 688 NNIAPGYALLQEIKETNGRLVETVVEICDED 718
>AT1G15772.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G15780.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr1:5428877-5429232
REVERSE LENGTH=87
Length = 87
Score = 49.7 bits (117), Expect = 4e-06, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 29/37 (78%)
Query: 73 NGGDWQEEVYQKIKTMKENYLPDLNELYQKIGTKLQQ 109
NG DW+EEV+QK+++M+E YLP + ++YQ++ KL
Sbjct: 5 NGDDWREEVFQKLRSMRERYLPHVTDVYQRLADKLNH 41
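
Each alignment block in this report opens with a `Score = ... Expect = ...` line followed by an `Identities = ...` line. The sketch below shows one way these two lines might be read in Python; `parse_hsp_header` and the dictionary keys are hypothetical names. The whole-number percentages in the report match floor division of the fractions (for example 127/216 is reported as 58%), so the sketch uses floor division as well; that rounding rule is inferred from the numbers in this report, not taken from BLAST documentation.

```python
import re

# Patterns for the two header lines that open every alignment block.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = ([^,\s]+)")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def parse_hsp_header(text: str) -> dict:
    """Extract scores and identity/positive/gap counts from an HSP header."""
    match = SCORE_RE.search(text)
    if match is None:
        raise ValueError("no 'Score = ...' line found")
    stats = {
        "bit_score": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "e_value": float(match.group(3)),
    }
    for name, numerator, denominator in FRACTION_RE.findall(text):
        stats[name.lower()] = (int(numerator), int(denominator))
    # Ungapped alignments omit the Gaps field entirely (see the last hit).
    if "gaps" not in stats and "identities" in stats:
        stats["gaps"] = (0, stats["identities"][1])
    return stats


header = (
    "Score = 234 bits (597), Expect = 1e-61, Method: Compositional matrix adjust.\n"
    "Identities = 127/216 (58%), Positives = 152/216 (70%), Gaps = 6/216 (2%)\n"
)
stats = parse_hsp_header(header)
matches, length = stats["identities"]
percent_identity = 100 * matches // length  # floor, as the report appears to do
print(stats["bit_score"], stats["e_value"], percent_identity)
```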