Miyakogusa Predicted Gene
- Lj0g3v0274219.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0274219.1 Non Chatacterized Hit- tr|I1L3K9|I1L3K9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.21254
PE,82.6,0,seg,NULL; Ribonuclease H-like,Ribonuclease H-like
domain,CUFF.18208.1
(811 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G12380.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 876 0.0
AT1G62870.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 864 0.0
>AT1G12380.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G62870.1); Has 173 Blast hits to 170 proteins
in 34 species: Archae - 0; Bacteria - 4; Metazoa - 25;
Fungi - 8; Plants - 123; Viruses - 7; Other Eukaryotes -
6 (source: NCBI BLink). | chr1:4214499-4216880 REVERSE
LENGTH=793
Length = 793
Score = 876 bits (2263), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/764 (56%), Positives = 562/764 (73%), Gaps = 33/764 (4%)
Query: 74 VAVKAVNKRYEGLVTVRTKAIKGKGAWYWAHLEPLLVRNPDTGMPKSVKLKCSLCDSVFS 133
+ KA+NKRYEGL+TVRTKA+KGKGAWYW HLEP+LVRN DTG+PK+VKL+CSLCD+VFS
Sbjct: 37 LTAKALNKRYEGLMTVRTKAVKGKGAWYWTHLEPILVRNTDTGLPKAVKLRCSLCDAVFS 96
Query: 134 ASNPSRTASEHLKRGTCANFSSGLKXXXXXXXXXXXXXXXXXXXXXXPEMGGVGSPVASY 193
ASNPSRTASEHLKRGTC NF+S G V + + S
Sbjct: 97 ASNPSRTASEHLKRGTCPNFNSVTPISTITPSPTSSSSSPQTHHRKRNSSGAVTTAIPSR 156
Query: 194 QN--------HALAMVEMGYPQVHGNSV---SPHQQHHLVLSGGKEDLCALAMFEDSVKK 242
N H + + + G + +P HL+LSGGK+DL LAM EDSVKK
Sbjct: 157 LNPPPIGGSYHVTPITVVDPSRFCGGELHYSTPPPPQHLMLSGGKDDLGPLAMLEDSVKK 216
Query: 243 LKSPKTSPGPALSKEQVNSALELLADWFYECCGSVSLSTLEHRKFQAFLGQVGLPTTLKR 302
LKSPK S +L++ Q+ SAL+ L+DW +E CGSVSLS LEH KF+AFL QVGLP KR
Sbjct: 217 LKSPKPSQTQSLTRSQIESALDSLSDWVFESCGSVSLSGLEHPKFRAFLTQVGLPIISKR 276
Query: 303 EISGPRLDARFSEAKAESEAKVRDAMFFQVASDGWKNRNWYSLCCGGESLVKFMVNLPNG 362
+ + RLD + EA+AE+E+++RDAMFFQ++SDGWK GESLV +VNLPNG
Sbjct: 277 DFATTRLDLKHEEARAEAESRIRDAMFFQISSDGWKPGE------SGESLVNLIVNLPNG 330
Query: 363 TSVFQKAVSTGGVVSSKYAEEVLWESVTGVSGSVVQRCVGIVADKFKAKALRNLEIQNHW 422
TS++++AV G V S YAEEVL E+V G+ G+ QRCVGIV+DKFK KALRNLE Q+ W
Sbjct: 331 TSLYRRAVLVNGAVPSNYAEEVLLETVKGICGNSPQRCVGIVSDKFKTKALRNLESQHQW 390
Query: 423 MVNTSCQLQGFISLIKDFNKELPLFRVVTENCLKVANFIDTESQVRNIFLKYRMQEMDYP 482
MVN SCQ QG SLIKDF KELPLF+ V++NC+++A FI+ +Q+RN KY++QE
Sbjct: 391 MVNLSCQFQGLNSLIKDFVKELPLFKSVSQNCVRLAKFINNTAQIRNAHCKYQLQEHGES 450
Query: 483 GLLRVP---------------PPKCDPMTNFASVFPMLEDILSCARIIQMVMLEDSFKVM 527
+LR+P + + + +F +LED+LS AR IQ+V+ +D+ KV+
Sbjct: 451 IMLRLPLHCYYDDERRSCSSSSSGSNKVCFYEPLFNLLEDVLSSARAIQLVVHDDACKVV 510
Query: 528 CMEDPLAREVAGMVQNEEFWNELEAVYSLVKIIKGMVQDMEAERPLIGRCLPLWEELRTK 587
MED +AREV MV +E FWNE+EAV++L+K++K M + +E E+ L+G+CLPLW+ELR K
Sbjct: 511 LMEDHMAREVREMVGDEGFWNEVEAVHALIKLVKEMARRIEEEKLLVGQCLPLWDELRAK 570
Query: 588 VKEWCGKYNVMEGPVDKIVEKRFRKNYHPAWSAAFILDPLYLIKDTSGKYLPPFKCLTRE 647
VK+W K+NV EG V+K+VE+RF+K+YHPAW+AAFILDPLYLI+D+SGKYLPPFKCL+ E
Sbjct: 571 VKDWDSKFNVGEGHVEKVVERRFKKSYHPAWAAAFILDPLYLIRDSSGKYLPPFKCLSPE 630
Query: 648 QEKDVDKLLTRLASREEAHVVLMELMKWRSEGLDPLYAQAVQMKQRDPMTGKMKVANPLS 707
QEKDVDKL+TRL SR+EAH+ LMELMKWR+EGLDP+YA+AVQMK+RDP++GKM++ANP S
Sbjct: 631 QEKDVDKLITRLVSRDEAHIALMELMKWRTEGLDPMYARAVQMKERDPVSGKMRIANPQS 690
Query: 708 SRLVWETCLCEFKSLGKIAVRLIFLHATSSGFKSNWSFMRKISANKQHSSASLERAQKMV 767
SRLVWET L EF+SLGK+AVRLIFLHAT+ GFK N S ++ +++N + S A+++RAQK++
Sbjct: 691 SRLVWETYLSEFRSLGKVAVRLIFLHATTGGFKCNSSLLKWVNSNGR-SHAAVDRAQKLI 749
Query: 768 YIAAHAKLERRDFSSEEEKDAELFAMSGSEDGMLAEVYADATLV 811
+I+A++K ERRDFS+EE++DAEL AM+ +D ML +V D + V
Sbjct: 750 FISANSKFERRDFSNEEDRDAELLAMANGDDHMLNDVLVDTSSV 793
>AT1G62870.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G12380.1); Has 351 Blast hits to 343 proteins
in 42 species: Archae - 2; Bacteria - 0; Metazoa - 27;
Fungi - 5; Plants - 299; Viruses - 0; Other Eukaryotes -
18 (source: NCBI BLink). | chr1:23284220-23286508
REVERSE LENGTH=762
Length = 762
Score = 864 bits (2233), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/747 (56%), Positives = 556/747 (74%), Gaps = 23/747 (3%)
Query: 74 VAVKAVNKRYEGLVTVRTKAIKGKGAWYWAHLEPLLVRNPDTGMPKSVKLKCSLCDSVFS 133
+A KA+ KRYEGL+ VRTKA+KGKGAWYW+HLEP+L+ N DTG PK+VKL+CSLCD+VFS
Sbjct: 30 LATKALQKRYEGLMMVRTKAVKGKGAWYWSHLEPILLHNTDTGFPKAVKLRCSLCDAVFS 89
Query: 134 ASNPSRTASEHLKRGTCANFSSGLKXXXXXXXXXXXXXXXXXXXXXXPEMGGVGSPV--- 190
ASNPSRTASEHLKRGTC NF+S K + +
Sbjct: 90 ASNPSRTASEHLKRGTCPNFNSLPKPISTISPSPPPPPSSSHRKRNSSAVEALNHHHHHP 149
Query: 191 -----ASYQNHALAMVEMGYPQVHGNSVSPHQQHHLVLSGGKEDLCALAMFEDSVKKLKS 245
SY L++V+ P QQ HL+LSGGK+DL LAM EDSVKKLKS
Sbjct: 150 HHHHQGSYNVTPLSVVD---PSRFCGQFPVTQQPHLMLSGGKDDLGPLAMLEDSVKKLKS 206
Query: 246 PKTSPGPALSKEQVNSALELLADWFYECCGSVSLSTLEHRKFQAFLGQVGLPTTLKREIS 305
PKTS L+K Q++SAL+ L+DW +E CGSVSLS LEH K +AFL QVGLP +R+
Sbjct: 207 PKTSQTRNLTKAQIDSALDSLSDWVFESCGSVSLSGLEHPKLRAFLTQVGLPIISRRDFV 266
Query: 306 GPRLDARFSEAKAESEAKVRDAMFFQVASDGWKNRNWYSLCCGGESLVKFMVNLPNGTSV 365
RLD ++ +++AE+E+++ DAMFFQ+ASDGWK GE+LV +VNLPNGTS+
Sbjct: 267 TGRLDLKYEDSRAEAESRIHDAMFFQIASDGWK------FDSSGENLVNLIVNLPNGTSL 320
Query: 366 FQKAVSTGGVVSSKYAEEVLWESVTGVSGSVVQRCVGIVADKFKAKALRNLEIQNHWMVN 425
+++AV G V S YAEEVLWE+V G+ G+ QRCVGIV+D+F +KALRNLE Q+ WMVN
Sbjct: 321 YRRAVFVNGAVPSNYAEEVLWETVRGICGNSPQRCVGIVSDRFMSKALRNLESQHQWMVN 380
Query: 426 TSCQLQGFISLIKDFNKELPLFRVVTENCLKVANFIDTESQVRNIFLKYRMQEMDYPGLL 485
SCQ QGF SLI+DF KELPLF+ V+++C ++ NF+++ +Q+RN KY++QE +L
Sbjct: 381 LSCQFQGFNSLIRDFVKELPLFKSVSQSCSRLVNFVNSTAQIRNAVCKYQLQEQGETRML 440
Query: 486 RVPPPKCDPMTNFASVFPMLEDILSCARIIQMVMLEDSFKVMCMEDPLAREVAGMVQNEE 545
+P + F ++ +LED+LS AR IQ+VM +D K + MED +AREV MV +
Sbjct: 441 HLPLDS----SLFEPLYNLLEDVLSFARAIQLVMHDDVCKAVLMEDHMAREVGEMVGDVG 496
Query: 546 FWNELEAVYSLVKIIKGMVQDMEAERPLIGRCLPLWEELRTKVKEWCGKYNVME-GPVDK 604
FWNE+EAVY L+K++K M + +E ERPL+G+CLPLW+ELR+K+K+W K+NV+E V+K
Sbjct: 497 FWNEVEAVYLLLKLVKEMARRIEEERPLVGQCLPLWDELRSKIKDWYAKFNVVEERQVEK 556
Query: 605 IVEKRFRKNYHPAWSAAFILDPLYLIKDTSGKYLPPFKCLTREQEKDVDKLLTRLASREE 664
IVE+RF+K+YHPAW+AAFILDPLYLIKD+SGKYLPPFKCL+ EQEKDVDKL+TRL SR+E
Sbjct: 557 IVERRFKKSYHPAWAAAFILDPLYLIKDSSGKYLPPFKCLSPEQEKDVDKLITRLVSRDE 616
Query: 665 AHVVLMELMKWRSEGLDPLYAQAVQMKQRDPMTGKMKVANPLSSRLVWETCLCEFKSLGK 724
AH+ +MELMKWR+EGLDP+YA+AVQMK+RDP++GKM++ANP SSRLVWET L EF+SLG+
Sbjct: 617 AHIAMMELMKWRTEGLDPVYARAVQMKERDPVSGKMRIANPQSSRLVWETYLSEFRSLGR 676
Query: 725 IAVRLIFLHATSSGFKSNWSFMRKISANKQHSSASLERAQKMVYIAAHAKLERRDFSSEE 784
+AVRLIFLHATS GFK N S +R +++N + S A+++RAQK+++I+A++K ERRDFS+EE
Sbjct: 677 VAVRLIFLHATSCGFKCNSSVLRWVNSNGR-SRAAVDRAQKLIFISANSKFERRDFSNEE 735
Query: 785 EKDAELFAMSGSEDGMLAEVYADATLV 811
E+DAEL AM+ ED +L +V D + V
Sbjct: 736 ERDAELLAMANGEDDVLNDVLIDTSSV 762