Miyakogusa Predicted Gene
- Lj5g3v1749230.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1749230.1 tr|C5CVY7|C5CVY7_VARPS Vault protein
inter-alpha-trypsin domain protein (Precursor) OS=Variovorax
pa,24.85,3e-16,VWA_3,NULL; VWFA,von Willebrand factor, type A;
vWA-like,NULL; INTER-ALPHA-TRYPSIN INHIBITOR HEAVY C,CUFF.55921.1
(375 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value
AT1G19110.1 | Symbols: | inter-alpha-trypsin inhibitor heavy ch...     511  e-145
AT1G72500.1 | Symbols: | LOCATED IN: plasma membrane; EXPRESSED...     353  8e-98
>AT1G19110.1 | Symbols: | inter-alpha-trypsin inhibitor heavy
chain-related | chr1:6602270-6605766 FORWARD LENGTH=754
Length = 754
Score = 511 bits (1316), Expect = e-145, Method: Compositional matrix adjust.
Identities = 247/377 (65%), Positives = 302/377 (80%), Gaps = 2/377 (0%)
Query: 1   MAEEFSRAVDDGLKLSKRIYFGKDRAVAPPKLPAPMARSPDA--LLPTAPMVYAVIYDPG 58
           MAE+F+RAVDDGLKL+KRIYFGKDRAVA P+ PAPM RS      LPTAPMVYAVI DPG
Sbjct: 1   MAEDFARAVDDGLKLAKRIYFGKDRAVAAPRPPAPMDRSSTTQPYLPTAPMVYAVIPDPG 60

Query: 59  IVDNPDIPSYQPHVYGRCDPPALIPLQMNGVEMEIDCYLDTAFVTVSGSWRLHCVMGSRA 118
           IVDNPD+PSYQPHV+GRCDPPALIPLQMN +E+++DCYLDTA VTV+GSWR+HCVMGS+
Sbjct: 61  IVDNPDLPSYQPHVHGRCDPPALIPLQMNSIELDVDCYLDTALVTVTGSWRVHCVMGSKR 120

Query: 119 CDCRIAIPMGHQGSILGVEVTAHRKTYSTQLVVMEDNSVNENATIAQQGGFLKSNIFTLT 178
           CDCRIAIPMG QGSILGVEV   RK+Y+TQL+  ED +  E   + + GGFLK NIFTLT
Sbjct: 121 CDCRIAIPMGEQGSILGVEVEIPRKSYTTQLITAEDGNEFEKTALPETGGFLKPNIFTLT 180

Query: 179 VPQIDGGTNLSIKIQWSQKIVNCNGELTLNVPFTFPEFVTPAGKKMSKKEKIQINVEAVA 238
           +PQ+DGGTNLSIK+ WSQK+    G+  L++PF FPE+VTPA KK+SK+EKI ++V A
Sbjct: 181 IPQVDGGTNLSIKMTWSQKLTYNQGQFFLDIPFNFPEYVTPAVKKISKREKIYLSVNAGT 240

Query: 239 GSELLCRTMSHPLKEMRRDAGSIGFLYDSQVLSWSNTDFSFSYAVSSSRIDGGVLLQSAS 298
           G+E+LC+  SH LKE  R AG + F Y++ VL WSNTDFSFSY  SSS I GG+ LQSA
Sbjct: 241 GTEVLCKGCSHQLKEKLRSAGKLRFAYEADVLKWSNTDFSFSYTASSSNIVGGLFLQSAP 300

Query: 299 VHDFDQREMFYMYLSPGDIHRKKVFRKDIIFVIDISGSMQGKLIDDTKNALSSALSKLNP 358
           VHD DQR++F  YL PG   + K F+++++FV+DIS SM GK ++D KNA+S+ALSKL+P
Sbjct: 301 VHDVDQRDIFSFYLFPGKQQKTKAFKREVVFVVDISKSMTGKPLEDVKNAISTALSKLDP 360

Query: 359 DDSFNIIAFNGESFLFS 375
            DSFNII F+ ++ LFS
Sbjct: 361 GDSFNIITFSNDTALFS 377
>AT1G72500.1 | Symbols: | LOCATED IN: plasma membrane; EXPRESSED
IN: 23 plant structures; EXPRESSED DURING: 13 growth
stages; CONTAINS InterPro DOMAIN/s: von Willebrand
factor, type A (InterPro:IPR002035); BEST Arabidopsis
thaliana protein match is: inter-alpha-trypsin inhibitor
heavy chain-related (TAIR:AT1G19110.1); Has 1407 Blast
hits to 1406 proteins in 307 species: Archae - 6;
Bacteria - 522; Metazoa - 484; Fungi - 59; Plants - 110;
Viruses - 0; Other Eukaryotes - 226 (source: NCBI
BLink). | chr1:27295336-27298556 REVERSE LENGTH=756
Length = 756
Score = 353 bits (907), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 179/378 (47%), Positives = 248/378 (65%), Gaps = 6/378 (1%)
Query: 1   MAEEFSRAVDDGLKLSKRIYFGKDRAVAPPKLPAPMARSPDALLPTAPMVYAVIYDPGIV 60
           M+EEF+  V+ GL+L++RIY+GK  A     +      SP+  LPTA   YA I DP  V
Sbjct: 1   MSEEFALRVEQGLQLARRIYYGKGIAPP---VVPDPPSSPENFLPTAITAYASITDPVAV 57

Query: 61  DNPDIPSYQPHVYGRCDPPALIPLQMNGVEMEIDCYLDTAFVTVSGSWRLHCVMGSRACD 120
           DNPD+PSYQP+V+ RCDP AL+PLQM G+EM IDC+LDTAFVTV+G WR+HCV  S+  D
Sbjct: 58  DNPDVPSYQPYVHARCDPSALVPLQMLGIEMNIDCWLDTAFVTVTGRWRVHCVRPSKRFD 117

Query: 121 CRIAIPMGHQGSILGVE--VTAHRKTYSTQLVVMEDNSVNENATIAQQGGFLKSNIFTLT 178
           C + +PMG +GS LG E  V  + K+Y T+LV  ++ S  +N    +   F KS+I+T
Sbjct: 118 CCVGVPMGEKGSFLGAEIDVLNNEKSYQTKLVTEDETSDFDNVHKDKDSRFFKSHIYTFK 177

Query: 179 VPQIDGGTNLSIKIQWSQKIVNCNGELTLNVPFTFPEFVTPAGKKMSKKEKIQINVEA-V 237
           +P + GG+  S+ + WSQK++  +G+  LNVPF FP +V P GK++ K+EKI +N+ + V
Sbjct: 178 IPHVAGGSIFSVNVTWSQKLIYKDGKFHLNVPFRFPSYVNPIGKEIIKREKIVLNMNSCV 237

Query: 238 AGSELLCRTMSHPLKEMRRDAGSIGFLYDSQVLSWSNTDFSFSYAVSSSRIDGGVLLQSA 297
           +G E+     SHPLK + R AG +   Y+++V SWS  DF  S+ VSS  + G VL++S
Sbjct: 238 SGGEIASSFTSHPLKIIHRVAGELSCEYEAEVPSWSRVDFGVSFTVSSGDLCGNVLVKSP 297

Query: 298 SVHDFDQREMFYMYLSPGDIHRKKVFRKDIIFVIDISGSMQGKLIDDTKNALSSALSKLN 357
           S  D D R +F +YL PG     K+F++ ++FVIDIS SM+ K ++D K AL   L+KL
Sbjct: 298 SPWDSDDRGIFCLYLFPGTTKHTKLFKRRVVFVIDISASMKWKPLEDVKKALLECLAKLQ 357

Query: 358 PDDSFNIIAFNGESFLFS 375
            +D FNIIAFN E   FS
Sbjct: 358 AEDVFNIIAFNDEILEFS 375