Miyakogusa Predicted Gene
- Lj2g3v1874590.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1874590.1 Non Chatacterized Hit- tr|J2H350|J2H350_9CAUL
Uncharacterized protein OS=Caulobacter sp. AP07 PE=4
S,39.86,3e-18,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.37982.1
(403 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G50340.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 439 e-123
AT5G67020.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 433 e-122
AT2G22790.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 90 2e-18
>AT3G50340.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G67020.1); Has 128 Blast hits to 128 proteins
in 39 species: Archae - 0; Bacteria - 46; Metazoa - 0;
Fungi - 3; Plants - 76; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr3:18665333-18666544 REVERSE
LENGTH=403
Length = 403
Score = 439 bits (1130), Expect = e-123, Method: Compositional matrix adjust.
Identities = 250/409 (61%), Positives = 293/409 (71%), Gaps = 12/409 (2%)
Query: 1 MVDVDRRMTGLNQAHIAGXXXXXXXXXXXXXXXXXXXNGLLSFSPLADKVITHLRTSGIE 60
MVDVDRRMTGL AH AG N L+SFS LAD+VI+HL TS I+
Sbjct: 1 MVDVDRRMTGLRPAHAAGLRRLSARAAAPTTPTVR--NSLVSFSSLADQVISHLHTSRIQ 58
Query: 61 VQPGLSDXXXXXXXXXXXXXXPPDLRAILAAGLPVGAGFPNWXXXXXXXXXXXXSLDLPI 120
VQPGL+D PPDLRA+L AGLPVGAGFP+W +DLPI
Sbjct: 59 VQPGLTDSEFARAEAEFAFAFPPDLRAVLTAGLPVGAGFPDWRSPGARLHLRAM-IDLPI 117
Query: 121 AAISFQIARNALWARCWGPRPAEPEKALRVARNALKRAPLLIPIFNHCYIPCNPSLAGNP 180
AA+SFQIARN LW++ WG RP++PEKALRVARNALKRAPL+IPIF+HCYIPCNPSLAGNP
Sbjct: 118 AAVSFQIARNTLWSKSWGLRPSDPEKALRVARNALKRAPLMIPIFDHCYIPCNPSLAGNP 177
Query: 181 IFFVDESRVFCCGFDLSDFFDRESPFRGSESEPGPLVLKKQRSVAEKTKTTSVCSASGFQ 240
+F++DE+R+FCCG DLSDFF+RES FRGS++ P+VL KQRSV+EK+ +S S+S F
Sbjct: 178 VFYIDETRIFCCGSDLSDFFERESVFRGSDT--CPVVLTKQRSVSEKSAGSSSSSSSNFS 235
Query: 241 RRSLDA----GGRTPRWVEFWXXXXX--XXXXXXXXXXXXXXXXPEKFFDVRRWELPKWV 294
R SLD+ G TPRWVEFW PE++ D+ R E PKWV
Sbjct: 236 RMSLDSGRVHGSSTPRWVEFWSDAAVDRRRRNSASSMSSSHSSSPERYLDLPRSETPKWV 295
Query: 295 EDYVGGIGSVLREGGWSESDISEMVEVSGSGFFEGDMVMLDNQAVLDAMLLKVDRFSDSL 354
+DYV IGSVLR GGWSESD+ ++V VS SGFFEG+MV+LDNQAVLDA+LLK RFS+SL
Sbjct: 296 DDYVNRIGSVLRGGGWSESDVDDIVHVSASGFFEGEMVILDNQAVLDALLLKAGRFSESL 355
Query: 355 RKSGWSSEEVSDALGFDFRLPEKERRPPMKLSPELVQRIEKLAESVSRS 403
RK+GWSSEEVSDALGFDFR EKE++P KLSPELVQRI KLAESVSRS
Sbjct: 356 RKAGWSSEEVSDALGFDFRP-EKEKKPVKKLSPELVQRIGKLAESVSRS 403
>AT5G67020.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G50340.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:26749962-26751146
FORWARD LENGTH=394
Length = 394
Score = 433 bits (1114), Expect = e-122, Method: Compositional matrix adjust.
Identities = 238/408 (58%), Positives = 276/408 (67%), Gaps = 19/408 (4%)
Query: 1 MVDVDRRMTGLNQAHIAGXXXXXXXXXXXXXXXXXXXNGLLSFSPLADKVITHLRTSGIE 60
MVDVDRRMTGL AH AG N L SFSP ADKVI HL+ SGI+
Sbjct: 1 MVDVDRRMTGLTPAHAAGLRRLSARAAAPSTPTIR--NSLQSFSPFADKVINHLKNSGIK 58
Query: 61 VQPGLSDXXXXXXXXXXXXXXPPDLRAILAAGLPVGAGFPNWXXXXXXXXXXXXSLDLPI 120
+QPGLSD PPDLR IL+AGL VGAGFP+W +DLP+
Sbjct: 59 IQPGLSDTEFARVEAEFGFTFPPDLRVILSAGLSVGAGFPDWRSPGARLHLRAM-IDLPV 117
Query: 121 AAISFQIARNALWARCWGPRPAEPEKALRVARNALKRAPLLIPIFNHCYIPCNPSLAGNP 180
AA+SFQIA+N+LW + WG +P +PEKALRVARNALKRAPLLIPIF+HCYIPCNPSLAGNP
Sbjct: 118 AAVSFQIAKNSLWCKSWGLKPPDPEKALRVARNALKRAPLLIPIFDHCYIPCNPSLAGNP 177
Query: 181 IFFVDESRVFCCGFDLSDFFDRESPFRGSESEPGPLVLKKQRSVAEKTKTTSVCSASGFQ 240
+FF+DE+R+FCCG DLS+FF+RES FR SE P +L KQRSV+EK S S+S F
Sbjct: 178 VFFIDETRIFCCGSDLSEFFERESAFRSSEF--FPRILTKQRSVSEK----SAGSSSNFS 231
Query: 241 RRSLDAG-----GRTPRWVEFWXXXXXXXXXXXXXXXXXXXXXPEKFFDVRRWELPKWVE 295
RRSLD G G++ RWVEFW D+ + E PKWV
Sbjct: 232 RRSLDLGRANGAGKS-RWVEFWSDAAVDRCRRNSASTSSSSSSSP---DLPKTETPKWVN 287
Query: 296 DYVGGIGSVLREGGWSESDISEMVEVSGSGFFEGDMVMLDNQAVLDAMLLKVDRFSDSLR 355
YV IGSVLR GGWSESDI E++ VS SGFFEG+MV++DNQ VLD +LLK R S+SLR
Sbjct: 288 QYVNRIGSVLRRGGWSESDIDEIIHVSASGFFEGEMVIIDNQTVLDVLLLKAGRISESLR 347
Query: 356 KSGWSSEEVSDALGFDFRLPEKERRPPMKLSPELVQRIEKLAESVSRS 403
KSGWSSEEVSDALGFDFR PEKER+P KLSP LV++ EKLAE VS+S
Sbjct: 348 KSGWSSEEVSDALGFDFR-PEKERKPVKKLSPMLVEQFEKLAEWVSQS 394
>AT2G22790.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G67020.1); Has 111 Blast hits to 111 proteins
in 33 species: Archae - 0; Bacteria - 44; Metazoa - 0;
Fungi - 0; Plants - 67; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr2:9695932-9696909 FORWARD
LENGTH=325
Length = 325
Score = 90.1 bits (222), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 72/153 (47%), Gaps = 7/153 (4%)
Query: 50 VITHLRT-SGIEVQPGLSDXXXXXXXXXXXXXXPPDLRAILAAGLPVGAGFPNWXXXXXX 108
++ H ++ +G V PGL++ P DLR+IL GLPVG FPNW
Sbjct: 36 IVNHFKSQTGNHVSPGLTNQEISAVESSHGFSFPLDLRSILQTGLPVGTNFPNWRTGSNR 95
Query: 109 XXXXXXSLDLPIAAISFQIARNALWARCWGPRPAEPEKALRVARNALKRAPLLIPIFNHC 168
+S + RN W WG RP +AL + + ++ AP+L+P++
Sbjct: 96 NNLLLPL-----LNLSQHVVRNGFWVDSWGIRPGNDAEALSLVKKLIEIAPVLVPVYGDF 150
Query: 169 YIP-CNPSLAGNPIFFVDESRVFCCGFDLSDFF 200
Y+P P+LAGNP+F +D V D+ F
Sbjct: 151 YVPSTTPNLAGNPVFQIDGDGVRELSCDVVGFL 183