Miyakogusa Predicted Gene
- Lj4g3v2400560.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2400560.1 Non Chatacterized Hit- tr|I1KN86|I1KN86_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.45741 PE,86.08,0,
,CUFF.50913.1
(580 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G06150.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 766 0.0
AT5G19060.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Immunoglob... 738 0.0
>AT3G06150.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G19060.1); Has 61 Blast hits to 59 proteins in
10 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi
- 0; Plants - 58; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:1861971-1863755 REVERSE
LENGTH=594
Length = 594
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/590 (61%), Positives = 455/590 (77%), Gaps = 17/590 (2%)
Query: 7 VVISPPPNQWP-LWSIPVLHWRVGLLTALVLVGMVLVWSIDGCTVKTFIQLWRY-----R 60
++ P P+Q L P+L WR+G LTALV M++VWSIDGC++++F+Q WR+ R
Sbjct: 6 MIFPPVPSQLVILRPSPLLQWRLGALTALVCFLMLVVWSIDGCSIQSFVQPWRFNAYSVR 65
Query: 61 QDYQNSPFVSPYDPVSSDQSQSNTEKLGKSSIKSLLVKGHSSS--------WVSSELEPN 112
SPF+S + S++ L + + K + +S W+++ + N
Sbjct: 66 ISPSPSPFMSTKPNLVSEKPHRQNLTLMMAPRNLVPKKTNLTSNSTRVQFEWITAGSQKN 125
Query: 113 LTSNLLARWSARGGEPCKDSKTVEIAIPGLDGGGKLMELSAGDVHEFGFQALDESGKVHC 172
T+NL+ W A GG PC+++KTVEI++PG+DG + EL+AG++HEF FQA+DESGK C
Sbjct: 126 FTANLMRGWLAPGGAPCREAKTVEISVPGVDGIDSV-ELTAGEIHEFKFQAIDESGKNVC 184
Query: 173 SGGDYFETDLSGESWKSRPLVKDFSNGSYSISLQVHPDFDGVYNLTIILLYRHFEGLKFT 232
GGDYFETD+SGE+WKSRP VKDF NG+YS SLQVHP+F G +NLT+ILL+RH++GLKF+
Sbjct: 185 IGGDYFETDISGENWKSRPPVKDFGNGTYSFSLQVHPEFAGDFNLTVILLFRHYQGLKFS 244
Query: 233 PWRFAYDRVLRNVTIRFYKSSVQMMPGLQTCKASDFARDVWCGRWTRHGKNDDCNIGNDG 292
R +DR LRNV +RF K+ +P L++CK SDF RD W GRWTR GKND+C I NDG
Sbjct: 245 TSRLGFDRKLRNVRLRFVKTPDVTLPELRSCKKSDFNRDAWSGRWTRLGKNDECQISNDG 304
Query: 293 RYRCLAPDFPCTAPWCDGSLGILESNGWVYSSHCSFKMYSADSAWNCLKNRWIFFWGDSN 352
RYRCLA DFPC PWCDG++G +ESNGWVYSSHCSFK++SA+ AW+CLK +WIFFWGDSN
Sbjct: 305 RYRCLAADFPCRKPWCDGAVGAIESNGWVYSSHCSFKLFSAEKAWDCLKGKWIFFWGDSN 364
Query: 353 HVDTIRNLLNFVLDLPEVHSVPRRFDMNFSNPRDSSQSVRITSIFNGHWNETQNYLGLDS 412
HVD+IRNLLNFVL PE+ +VPRRFDM FSNP++ S++VRITSIFNGHWNET+NY GLDS
Sbjct: 365 HVDSIRNLLNFVLGHPEIPAVPRRFDMKFSNPKNPSETVRITSIFNGHWNETKNYQGLDS 424
Query: 413 LRDEGFQTLLKKYFSGET--IPDTMIMNSGLHDGVHWRNIRAFSVGADYAASFWGDVMKT 470
L+D F+ LLKKYF+ ET +PD MI+NSGLHDG+HW ++RAF+ GA+ AA+FW +V
Sbjct: 425 LKDRDFRELLKKYFNEETNRVPDVMIVNSGLHDGIHWTSLRAFAKGAETAAAFWREVFDG 484
Query: 471 VKQRGLAWPRVFYRTTVATGGYARSLAFNPNKMEVFNGVFLEKLKQAGVVSGVIDNFDMT 530
VK RGL P V +R T+ATGGYAR LAFNP+KME FNGVFLEK++ AG+V+ V+DNFDMT
Sbjct: 485 VKSRGLQPPEVIFRNTIATGGYARMLAFNPSKMEAFNGVFLEKMRDAGLVTSVVDNFDMT 544
Query: 531 FPWHFDNRCNDGVHYGRAPLKMKWRDGQIGHQYFVDLMLAHVLLNALCAR 580
+PWH+DNRCNDGVHYGRAP KM+WRDG+IGHQYFVDLML HVLLNALC R
Sbjct: 545 YPWHYDNRCNDGVHYGRAPAKMRWRDGEIGHQYFVDLMLVHVLLNALCVR 594
>AT5G19060.1 | Symbols: | CONTAINS InterPro DOMAIN/s:
Immunoglobulin-like fold (InterPro:IPR013783); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G06150.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:6373011-6374666
FORWARD LENGTH=551
Length = 551
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/560 (61%), Positives = 436/560 (77%), Gaps = 23/560 (4%)
Query: 22 PVLHWRVGLLTALVLVGMVLVWSIDGCTVKTFIQLWRYRQDYQNSPFVSPYDPVSSDQSQ 81
P+ WR+ LT+LV +V+VWSID C++++FI+ WR+ NS + P S D
Sbjct: 14 PLHQWRLSALTSLVFFLIVVVWSIDSCSIRSFIKSWRF-----NSYSIRLTSPPSLD--- 65
Query: 82 SNTEKLGKSSIKSLLVKGHSSSWVSSELEPNLTSNLLARWSARGGEPCKDSKTVEIAIPG 141
L + +K +W+S E E N T+N+L W A GGE C+++ TVEI++PG
Sbjct: 66 -----LDPTRVKL--------AWISVEQEQNFTANVLKNWLAPGGEKCREANTVEISVPG 112
Query: 142 LDGGGKLMELSAGDVHEFGFQALDESGKVHCSGGDYFETDLSGESWKSRPLVKDFSNGSY 201
++G G L+EL+AG++HEF F +LD+SG+ C GGDYFETDLSGE+WKSRP VKD NG+Y
Sbjct: 113 IEGKG-LVELTAGEIHEFRFHSLDDSGERVCIGGDYFETDLSGENWKSRPPVKDLGNGTY 171
Query: 202 SISLQVHPDFDGVYNLTIILLYRHFEGLKFTPWRFAYDRVLRNVTIRFYKSSVQMMPGLQ 261
S+SLQ+HPDF G Y+LT++LL+R F+GLK +P RFA++R LRN +RF K ++P L+
Sbjct: 172 SLSLQIHPDFAGDYDLTVVLLFRRFQGLKLSPARFAFNRTLRNFKLRFIKKPHVVLPELR 231
Query: 262 TCKASDFARDVWCGRWTRHGKNDDCNIGNDGRYRCLAPDFPCTAPWCDGSLGILESNGWV 321
C+ SDF RDVW GRW R GKND+C I NDGRYRCL + C PWCDG+L LESNGWV
Sbjct: 232 RCELSDFDRDVWSGRWIRLGKNDECEISNDGRYRCLPDGYRCREPWCDGALSALESNGWV 291
Query: 322 YSSHCSFKMYSADSAWNCLKNRWIFFWGDSNHVDTIRNLLNFVLDLPEVHSVPRRFDMNF 381
YSSHCSFK++S++SAW+CLKN+WIFFWGDSNHVD+IRNLLNFVL PE+ +VPRRFD+ F
Sbjct: 292 YSSHCSFKLFSSESAWDCLKNKWIFFWGDSNHVDSIRNLLNFVLGHPEIGAVPRRFDLKF 351
Query: 382 SNPRDSSQSVRITSIFNGHWNETQNYLGLDSLRDEGFQTLLKKYFSGET-IPDTMIMNSG 440
SNP++SS++VRITSIFNGHWNETQNYLGLDSL D+ F+ LLK YF ET +PD MI+NSG
Sbjct: 352 SNPKNSSETVRITSIFNGHWNETQNYLGLDSLEDDSFRELLKSYFVEETGVPDVMIVNSG 411
Query: 441 LHDGVHWRNIRAFSVGADYAASFWGDVMKTVKQRGLAWPRVFYRTTVATGGYARSLAFNP 500
LHDG+HW N+RAF+ GA+ AA+FW +V +VK RGL P+V +R T+ATGGYAR LAFNP
Sbjct: 412 LHDGIHWSNLRAFTKGAETAAAFWRNVFDSVKARGLRPPKVIFRNTIATGGYARKLAFNP 471
Query: 501 NKMEVFNGVFLEKLKQAGVVSGVIDNFDMTFPWHFDNRCNDGVHYGRAPLKMKWRDGQIG 560
+KMEV+NGVFLEK+K G+VS VIDNFDMT+PWHFDNRCNDGVHYGR P K++W DG+IG
Sbjct: 472 SKMEVYNGVFLEKMKGLGLVSSVIDNFDMTYPWHFDNRCNDGVHYGRPPAKVRWIDGEIG 531
Query: 561 HQYFVDLMLAHVLLNALCAR 580
HQYFVDLML HVLLNA+C R
Sbjct: 532 HQYFVDLMLVHVLLNAVCLR 551