Miyakogusa Predicted Gene
- Lj6g3v1421150.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1421150.1 Non Characterized Hit- tr|K3Y106|K3Y106_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si007868,30.91,2e-18,seg,NULL,CUFF.59478.1
(356 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                        (bits)   Value
AT4G20190.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    113   2e-25
AT1G30850.1 | Symbols: RSH4 | root hair specific 4 | chr1:109851...    94   1e-19
AT2G34910.1 | Symbols: | BEST Arabidopsis thaliana protein matc...     88   7e-18
AT5G44660.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     77   1e-14
>AT4G20190.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G44660.1); Has 271 Blast hits to 209 proteins
in 52 species: Archae - 0; Bacteria - 15; Metazoa - 63;
Fungi - 14; Plants - 48; Viruses - 3; Other Eukaryotes -
128 (source: NCBI BLink). | chr4:10906508-10907677
REVERSE LENGTH=389
Length = 389
Score = 113 bits (283), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/275 (35%), Positives = 128/275 (46%), Gaps = 76/275 (27%)
Query: 139 KSKSCGEARSLTPFEEFVSFDHWLSKTSAVEQVKLHHDGK------------------IS 180
+SKSCGE R+ TP + FD L K+ + HH G S
Sbjct: 127 RSKSCGEGRACTPS---LDFDMLLHKSRNAHHNQNHHRGFSSSNSKSLSHKSSGNNSFFS 183
Query: 181 KTEAVKES-----PKSNKSIKHMKTPENDGLKCSALCLYLPGFGGSKVKPVKTRKEGS-- 233
KTE+ K + ++KSI + DG KCSALCLYLPGF SK KPV++ ++G
Sbjct: 184 KTESNKSNRSNSNTANSKSINSFE----DGFKCSALCLYLPGF--SKGKPVRSSRKGDSS 237
Query: 234 -------------------KREAVMSRTVSLENFECGSWASAAMFHEIEGDSVNNYYFDL 274
+ AV+S SLE FECGSW S+AM ++ D + ++FDL
Sbjct: 238 FTRTTTMTSSQSMARTASIRDTAVLSARASLERFECGSWTSSAMIYDDNAD-LGGHFFDL 296
Query: 275 PIELMKHN-SASEECSPV----------------KGVLKNSSSRGSARKSDTSSPRHHVR 317
P EL+K +++ PV KGVLK S GS + SPR HVR
Sbjct: 297 PSELIKGGPGGNDQDDPVSAAFVFDKEPNLDKEIKGVLKTS---GSKSRRSMESPR-HVR 352
Query: 318 FTTXXXXXXXXXXXXXCITPRLRKARDDFNSFLAA 352
F+T ITPRL +A +DF+SFL A
Sbjct: 353 FSTSSPVSYPTSPTHS-ITPRLLQATEDFSSFLEA 386
>AT1G30850.1 | Symbols: RSH4 | root hair specific 4 |
chr1:10985116-10986018 FORWARD LENGTH=300
Length = 300
Score = 94.0 bits (232), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 99/210 (47%), Gaps = 39/210 (18%)
Query: 178 KISKTEAVKESPKSNKSIKHMKTPENDGLKCSALCLYLPGFGGSKVKPVKTRKEGSKRE- 236
K+S E+V KS + K + E+ KC+A CL LPGFG +K+ ++++ S +
Sbjct: 98 KLSHQESVIFMSKSRFAEKILYKEED--FKCNAFCLSLPGFGKNKLIRSSSKRQNSMEKK 155
Query: 237 ---------AVMSRTVSLENFECGSWASAAMFHEIEGDSVNNYYFDLPIELMKHNSAS-- 285
+ +S SLE FECGSWAS + G +FD P+E+ K NS
Sbjct: 156 MIRASSFTGSTVSVRASLEKFECGSWASTTALIQDNG----RLFFDFPVEMTKCNSRGGN 211
Query: 286 ------------------EECSPVKGVLKNSSSRGSARKSDTSSPRHHVRFTTXXXXXXX 327
E ++ VLK S+R R+S SSP+ VRF+T
Sbjct: 212 GGRDVQEPVTSGFLFDRETETLALRSVLKTRSTR-DHRRSAESSPQRRVRFSTSSSSASV 270
Query: 328 X--XXXXXCITPRLRKARDDFNSFLAAAQS 355
CITPRLRKARDDFN+FL A +
Sbjct: 271 SCPTSPRTCITPRLRKARDDFNTFLTAQNA 300
>AT2G34910.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: root hair specific 4 (TAIR:AT1G30850.1); Has 43
Blast hits to 43 proteins in 9 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 43;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr2:14726900-14727766 FORWARD LENGTH=288
Length = 288
Score = 88.2 bits (217), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 77/176 (43%), Gaps = 31/176 (17%)
Query: 202 ENDGLKCSALCLYLPGFGGSKVKPVKTR--------KEGSKREAVMSRTVSLENFECGSW 253
E + KC+A CL LPGFG V+ K+ K S + +S + SLE FECGSW
Sbjct: 116 EEENFKCNAFCLSLPGFGKRPVRSPKSEDSIKKKMIKASSFSNSTVSLSASLEKFECGSW 175
Query: 254 ASAAMFHEIEGDSVNNYYFDLPIELMKHNSASEECSPVKGVLKNSSSRGSA--------- 304
AS G Y DLP+E++K + PV GS
Sbjct: 176 ASTTALTRENG----RLYIDLPVEMIKCGGGDVQ-EPVSSGFFFDKETGSLALRSVLKKS 230
Query: 305 --------RKSDTSSPRHHVRFTTXXXXXXXXXXXXXCITPRLRKARDDFNSFLAA 352
R +SP+ VRF+T CITPRL KARDDFN+FLAA
Sbjct: 231 SSLSGRQLRDLAETSPQRRVRFSTTTSDSCPASPRT-CITPRLLKARDDFNTFLAA 285
>AT5G44660.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G20190.1); Has 944 Blast hits to 462 proteins
in 141 species: Archae - 2; Bacteria - 370; Metazoa -
161; Fungi - 102; Plants - 64; Viruses - 6; Other
Eukaryotes - 239 (source: NCBI BLink). |
chr5:18015810-18017081 FORWARD LENGTH=423
Length = 423
Score = 77.4 bits (189), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 93/208 (44%), Gaps = 59/208 (28%)
Query: 191 SNKSIKHMKTPENDGLKCSALCLYLPGFGGSKVKPVKTR--------------------- 229
SNKSI + T E D KC+ALCL+LPGF SK KP+++
Sbjct: 226 SNKSISNNSTLE-DRFKCNALCLFLPGF--SKGKPIRSSQKDDSSSFTRTTTMTRSSSST 282
Query: 230 ---------KEGSKREAVMSRTVSLENFECGSWASAAMFHEIEGDSVNNYYFDLPIELMK 280
+E + V+S S+E F+CGS+ S E G+ N++FDLP EL+K
Sbjct: 283 ITVSRTVSVRESTTTTTVISARASMEKFDCGSYTS-----ESCGEEGGNHFFDLPSELIK 337
Query: 281 HNSASE------------ECSPV----KGVLKNSSSRGSARKSDTSSPRHHVRFTTXXXX 324
S + PV KGVLK S S+ RK+ S VRF+T
Sbjct: 338 SGSGDNDHDEPVSAAFVFDKEPVEKEIKGVLKVSGSKN--RKAMESPSLRQVRFST---S 392
Query: 325 XXXXXXXXXCITPRLRKARDDFNSFLAA 352
I+PRL +A +FN+FL A
Sbjct: 393 SPVSYPTSPAISPRLLEATKNFNAFLEA 420