Miyakogusa Predicted Gene
- Lj2g3v2002620.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v2002620.1 Non Chatacterized Hit- tr|I1LGL0|I1LGL0_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.51901
PE,75.89,0,seg,NULL; Ribonuclease H-like,Ribonuclease H-like
domain,CUFF.38369.1
(785 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G62870.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 960 0.0
AT1G12380.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 951 0.0
>AT1G62870.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G12380.1); Has 351 Blast hits to 343 proteins
in 42 species: Archae - 2; Bacteria - 0; Metazoa - 27;
Fungi - 5; Plants - 299; Viruses - 0; Other Eukaryotes -
18 (source: NCBI BLink). | chr1:23284220-23286508
REVERSE LENGTH=762
Length = 762
Score = 960 bits (2481), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/749 (64%), Positives = 585/749 (78%), Gaps = 35/749 (4%)
Query: 58 DELTVKAVQKRYDGLVTVRTKAVKGKGAWYWAHLEPMLVHNADTGLPKAVKLKCSLCDAV 117
+EL KA+QKRY+GL+ VRTKAVKGKGAWYW+HLEP+L+HN DTG PKAVKL+CSLCDAV
Sbjct: 28 EELATKALQKRYEGLMMVRTKAVKGKGAWYWSHLEPILLHNTDTGFPKAVKLRCSLCDAV 87
Query: 118 FSASNPSRTASEHLKRGTCPNFHSAAKPISSVSPGGATVSSPPSPANHSQRKRTCSA--- 174
FSASNPSRTASEHLKRGTCPNF+S KPIS++SP S S RKR SA
Sbjct: 88 FSASNPSRTASEHLKRGTCPNFNSLPKPISTISPSPPPPPS------SSHRKRNSSAVEA 141
Query: 175 ----------GPTSSYHVPPL--VDPCXXXXXXXXXXXXXXXXXXXXXSGGKDDLGALAM 222
SY+V PL VDP SGGKDDLG LAM
Sbjct: 142 LNHHHHHPHHHHQGSYNVTPLSVVDP-----SRFCGQFPVTQQPHLMLSGGKDDLGPLAM 196
Query: 223 LEDSVKKLKSPKTSPGTSLSKIQVDTAFDYLADWVYESYGSVSFSGLEHPKFRAFLNQVG 282
LEDSVKKLKSPKTS +L+K Q+D+A D L+DWV+ES GSVS SGLEHPK RAFL QVG
Sbjct: 197 LEDSVKKLKSPKTSQTRNLTKAQIDSALDSLSDWVFESCGSVSLSGLEHPKLRAFLTQVG 256
Query: 283 LPPVSRRDFTGSRLDTRFEEAKADSEARIRDAMFFQIASDGWKQRNHGEEKLVNMTVNLP 342
LP +SRRDF RLD ++E+++A++E+RI DAMFFQIASDGWK + GE LVN+ VNLP
Sbjct: 257 LPIISRRDFVTGRLDLKYEDSRAEAESRIHDAMFFQIASDGWKFDSSGE-NLVNLIVNLP 315
Query: 343 NGTILYRRAVFVPGSAPSNYAEDVLWETITGICGNVLQQCVGIVADKFNAKALRNLENQN 402
NGT LYRRAVFV G+ PSNYAE+VLWET+ GICGN Q+CVGIV+D+F +KALRNLE+Q+
Sbjct: 316 NGTSLYRRAVFVNGAVPSNYAEEVLWETVRGICGNSPQRCVGIVSDRFMSKALRNLESQH 375
Query: 403 HWMVNLSCQYQGFISLIKDFTKEIPLFTTVSESCFKLANFINYKSQLVRNSFHKYQMQEH 462
WMVNLSCQ+QGF SLI+DF KE+PLF +VS+SC +L NF+N +Q +RN+ KYQ+QE
Sbjct: 376 QWMVNLSCQFQGFNSLIRDFVKELPLFKSVSQSCSRLVNFVNSTAQ-IRNAVCKYQLQEQ 434
Query: 463 GHAWLIRMHSFEDFNFGSVYTMMEDILSSARALEHVLLDEPFKIASIKDASAREVGDIIR 522
G ++ + + F +Y ++ED+LS ARA++ V+ D+ K ++D AREVG+++
Sbjct: 435 GETRMLHL-PLDSSLFEPLYNLLEDVLSFARAIQLVMHDDVCKAVLMEDHMAREVGEMVG 493
Query: 523 DVGFWNDLEAAHSLIKLVKDMVCEIETERPLVGQCLPLWDELRAKVKIWCSKFQF-SEAL 581
DVGFWN++EA + L+KLVK+M IE ERPLVGQCLPLWDELR+K+K W +KF E
Sbjct: 494 DVGFWNEVEAVYLLLKLVKEMARRIEEERPLVGQCLPLWDELRSKIKDWYAKFNVVEERQ 553
Query: 582 LEKVIEKRFRKNYHPAWAAAYILDPLYLVRDTSGKYLPPYKFLAPEQEKDVDKLITRLVS 641
+EK++E+RF+K+YHPAWAAA+ILDPLYL++D+SGKYLPP+K L+PEQEKDVDKLITRLVS
Sbjct: 554 VEKIVERRFKKSYHPAWAAAFILDPLYLIKDSSGKYLPPFKCLSPEQEKDVDKLITRLVS 613
Query: 642 RDEAHVVLMELMKWRTQGLDPVYARAVQMKERDPVTGKMRIVNPQSSRLVWETYLTEFRS 701
RDEAH+ +MELMKWRT+GLDPVYARAVQMKERDPV+GKMRI NPQSSRLVWETYL+EFRS
Sbjct: 614 RDEAHIAMMELMKWRTEGLDPVYARAVQMKERDPVSGKMRIANPQSSRLVWETYLSEFRS 673
Query: 702 LGRVAVRLIFLHATSRGFKCN-----WPLRLGHSRVALERAQKLIFIAAHSKLERRDFSS 756
LGRVAVRLIFLHATS GFKCN W G SR A++RAQKLIFI+A+SK ERRDFS+
Sbjct: 674 LGRVAVRLIFLHATSCGFKCNSSVLRWVNSNGRSRAAVDRAQKLIFISANSKFERRDFSN 733
Query: 757 DEDKDAELFALVNGEDDVLNEVFLDTSSV 785
+E++DAEL A+ NGEDDVLN+V +DTSSV
Sbjct: 734 EEERDAELLAMANGEDDVLNDVLIDTSSV 762
>AT1G12380.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G62870.1); Has 173 Blast hits to 170 proteins
in 34 species: Archae - 0; Bacteria - 4; Metazoa - 25;
Fungi - 8; Plants - 123; Viruses - 7; Other Eukaryotes -
6 (source: NCBI BLink). | chr1:4214499-4216880 REVERSE
LENGTH=793
Length = 793
Score = 951 bits (2459), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 481/766 (62%), Positives = 583/766 (76%), Gaps = 45/766 (5%)
Query: 58 DELTVKAVQKRYDGLVTVRTKAVKGKGAWYWAHLEPMLVHNADTGLPKAVKLKCSLCDAV 117
DELT KA+ KRY+GL+TVRTKAVKGKGAWYW HLEP+LV N DTGLPKAVKL+CSLCDAV
Sbjct: 35 DELTAKALNKRYEGLMTVRTKAVKGKGAWYWTHLEPILVRNTDTGLPKAVKLRCSLCDAV 94
Query: 118 FSASNPSRTASEHLKRGTCPNFHSAAKPISSVSPGGATVSSPPSPANHSQRKRTCSAGPT 177
FSASNPSRTASEHLKRGTCPNF+S ++ +S + +S S RKR S T
Sbjct: 95 FSASNPSRTASEHLKRGTCPNFNS----VTPISTITPSPTSSSSSPQTHHRKRNSSGAVT 150
Query: 178 S-------------SYHVPPL--VDPCXXXXXXXXXXXXXXXXXXXXXSGGKDDLGALAM 222
+ SYHV P+ VDP SGGKDDLG LAM
Sbjct: 151 TAIPSRLNPPPIGGSYHVTPITVVDP-SRFCGGELHYSTPPPPQHLMLSGGKDDLGPLAM 209
Query: 223 LEDSVKKLKSPKTSPGTSLSKIQVDTAFDYLADWVYESYGSVSFSGLEHPKFRAFLNQVG 282
LEDSVKKLKSPK S SL++ Q+++A D L+DWV+ES GSVS SGLEHPKFRAFL QVG
Sbjct: 210 LEDSVKKLKSPKPSQTQSLTRSQIESALDSLSDWVFESCGSVSLSGLEHPKFRAFLTQVG 269
Query: 283 LPPVSRRDFTGSRLDTRFEEAKADSEARIRDAMFFQIASDGWKQRNHGEEKLVNMTVNLP 342
LP +S+RDF +RLD + EEA+A++E+RIRDAMFFQI+SDGWK GE LVN+ VNLP
Sbjct: 270 LPIISKRDFATTRLDLKHEEARAEAESRIRDAMFFQISSDGWKPGESGES-LVNLIVNLP 328
Query: 343 NGTILYRRAVFVPGSAPSNYAEDVLWETITGICGNVLQQCVGIVADKFNAKALRNLENQN 402
NGT LYRRAV V G+ PSNYAE+VL ET+ GICGN Q+CVGIV+DKF KALRNLE+Q+
Sbjct: 329 NGTSLYRRAVLVNGAVPSNYAEEVLLETVKGICGNSPQRCVGIVSDKFKTKALRNLESQH 388
Query: 403 HWMVNLSCQYQGFISLIKDFTKEIPLFTTVSESCFKLANFINYKSQLVRNSFHKYQMQEH 462
WMVNLSCQ+QG SLIKDF KE+PLF +VS++C +LA FIN +Q +RN+ KYQ+QEH
Sbjct: 389 QWMVNLSCQFQGLNSLIKDFVKELPLFKSVSQNCVRLAKFINNTAQ-IRNAHCKYQLQEH 447
Query: 463 GHAWLIRM--HSFEDFN----------------FGSVYTMMEDILSSARALEHVLLDEPF 504
G + ++R+ H + D + ++ ++ED+LSSARA++ V+ D+
Sbjct: 448 GESIMLRLPLHCYYDDERRSCSSSSSGSNKVCFYEPLFNLLEDVLSSARAIQLVVHDDAC 507
Query: 505 KIASIKDASAREVGDIIRDVGFWNDLEAAHSLIKLVKDMVCEIETERPLVGQCLPLWDEL 564
K+ ++D AREV +++ D GFWN++EA H+LIKLVK+M IE E+ LVGQCLPLWDEL
Sbjct: 508 KVVLMEDHMAREVREMVGDEGFWNEVEAVHALIKLVKEMARRIEEEKLLVGQCLPLWDEL 567
Query: 565 RAKVKIWCSKFQFSEALLEKVIEKRFRKNYHPAWAAAYILDPLYLVRDTSGKYLPPYKFL 624
RAKVK W SKF E +EKV+E+RF+K+YHPAWAAA+ILDPLYL+RD+SGKYLPP+K L
Sbjct: 568 RAKVKDWDSKFNVGEGHVEKVVERRFKKSYHPAWAAAFILDPLYLIRDSSGKYLPPFKCL 627
Query: 625 APEQEKDVDKLITRLVSRDEAHVVLMELMKWRTQGLDPVYARAVQMKERDPVTGKMRIVN 684
+PEQEKDVDKLITRLVSRDEAH+ LMELMKWRT+GLDP+YARAVQMKERDPV+GKMRI N
Sbjct: 628 SPEQEKDVDKLITRLVSRDEAHIALMELMKWRTEGLDPMYARAVQMKERDPVSGKMRIAN 687
Query: 685 PQSSRLVWETYLTEFRSLGRVAVRLIFLHATSRGFKCN-----WPLRLGHSRVALERAQK 739
PQSSRLVWETYL+EFRSLG+VAVRLIFLHAT+ GFKCN W G S A++RAQK
Sbjct: 688 PQSSRLVWETYLSEFRSLGKVAVRLIFLHATTGGFKCNSSLLKWVNSNGRSHAAVDRAQK 747
Query: 740 LIFIAAHSKLERRDFSSDEDKDAELFALVNGEDDVLNEVFLDTSSV 785
LIFI+A+SK ERRDFS++ED+DAEL A+ NG+D +LN+V +DTSSV
Sbjct: 748 LIFISANSKFERRDFSNEEDRDAELLAMANGDDHMLNDVLVDTSSV 793