Miyakogusa Predicted Gene
- Lj2g3v2736600.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v2736600.1 Non Chatacterized Hit- tr|F6I4J6|F6I4J6_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,26.8,2e-18,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.39223.1
(561 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G40070.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 283 3e-76
AT2G40070.2 | Symbols: | FUNCTIONS IN: molecular_function unkno... 282 4e-76
AT3G09000.1 | Symbols: | proline-rich family protein | chr3:274... 170 3e-42
AT5G01280.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 92 8e-19
AT2G38160.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 76 6e-14
AT2G38160.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 76 6e-14
AT3G08670.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 67 4e-11
AT1G27850.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 51 2e-06
>AT2G40070.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: proline-rich family protein (TAIR:AT3G09000.1); Has
35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr2:16728378-16731160 REVERSE
LENGTH=607
Length = 607
Score = 283 bits (723), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 172/266 (64%), Positives = 199/266 (74%), Gaps = 10/266 (3%)
Query: 301 KPAPAVSRQPVPSRGTSPTVGSRSWKPSEMPGFSLDAPPNLRTTIPERSLSTTRGRPGAP 360
KP P S+ P SR SPTV SR WKPS+MPGFSL+ PPNLRTT+PER LS TRGRPGAP
Sbjct: 347 KPMPTPSKNPALSRAASPTVRSRPWKPSDMPGFSLETPPNLRTTLPERPLSATRGRPGAP 406
Query: 361 ASRSSSV--GPASGGRPKRQSCSPSRGRXXXXXXXXXXXXMPAVNRGYSKANDNESPVLM 418
+SRS SV G GGRP+RQSCSPSRGR +PAVNRGYSKA+DN SPV+M
Sbjct: 407 SSRSGSVEPGGPPGGRPRRQSCSPSRGR---APMYSSGSSVPAVNRGYSKASDNVSPVMM 463
Query: 419 GTKMVERVINMSKLPPSRLEVKIYPHSNLPGKPSSSPDNSGFGRSLSKKSLDMAIRHMEI 478
GTKMVERVINM KL P R + K PH NL K SSSPD++GFGR+LSKKSLDMAIRHM+I
Sbjct: 464 GTKMVERVINMRKLAPPRSDDKGSPHGNLSAK-SSSPDSAGFGRTLSKKSLDMAIRHMDI 522
Query: 479 RQRAPGNLRRPLMTNIPASSMYSVRSSPRHSQTVSYS-GSPHATSSNAGSEVSV-DQNGL 536
R+ PGNL RPLMTNIPASSMYSVRS + ++ S SP ATSSNA SE+SV + NG+
Sbjct: 523 RRTIPGNL-RPLMTNIPASSMYSVRSGHTRGRPMNVSDSSPLATSSNASSEISVCNNNGI 581
Query: 537 CI-YNETDDDIVGERGDRSPSSVRYR 561
C+ +E +DD ERG RSP+S++ R
Sbjct: 582 CLEASEKEDDAGSERGCRSPASLQGR 607
Score = 128 bits (321), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 94/163 (57%), Gaps = 14/163 (8%)
Query: 1 MNRSFRAQESL----------KPPSSRVKDNDDELALFLEMRRRENERNGVLLRASNREL 50
MNRSFRA+ESL + +S + + D+EL+LFLEMRRRE E++ +LL + E
Sbjct: 1 MNRSFRAKESLLLDSAERQRQQLRASMMAEKDEELSLFLEMRRREKEQDNLLLNNNPDEF 60
Query: 51 ADSPPLXXXXXXXXXXXXXXXXXXXXVLKTGVDDFLNFEDDKNDYEWLLTPPGTPRFPSL 110
PL K DDFLN E DKNDYEWLLTPPGTP FPSL
Sbjct: 61 ET--PLGSKHGTSPVFNISSGAPPSR--KAAPDDFLNSEGDKNDYEWLLTPPGTPLFPSL 116
Query: 111 EMETQKTVKSQLDAPTTRPTALTSRLANPPSELIGRNNLVSKK 153
EME+ +T+ SQ +RP LTSRLAN +E RN+L S++
Sbjct: 117 EMESHRTMMSQTGDSKSRPATLTSRLANSSTESAARNHLTSRQ 159
>AT2G40070.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 17 plant
structures; EXPRESSED DURING: 7 growth stages; BEST
Arabidopsis thaliana protein match is: proline-rich
family protein (TAIR:AT3G09000.1); Has 108635 Blast hits
to 60786 proteins in 2176 species: Archae - 287;
Bacteria - 15142; Metazoa - 39415; Fungi - 26849; Plants
- 4416; Viruses - 2864; Other Eukaryotes - 19662
(source: NCBI BLink). | chr2:16728378-16731040 REVERSE
LENGTH=567
Length = 567
Score = 282 bits (722), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 172/266 (64%), Positives = 199/266 (74%), Gaps = 10/266 (3%)
Query: 301 KPAPAVSRQPVPSRGTSPTVGSRSWKPSEMPGFSLDAPPNLRTTIPERSLSTTRGRPGAP 360
KP P S+ P SR SPTV SR WKPS+MPGFSL+ PPNLRTT+PER LS TRGRPGAP
Sbjct: 307 KPMPTPSKNPALSRAASPTVRSRPWKPSDMPGFSLETPPNLRTTLPERPLSATRGRPGAP 366
Query: 361 ASRSSSV--GPASGGRPKRQSCSPSRGRXXXXXXXXXXXXMPAVNRGYSKANDNESPVLM 418
+SRS SV G GGRP+RQSCSPSRGR +PAVNRGYSKA+DN SPV+M
Sbjct: 367 SSRSGSVEPGGPPGGRPRRQSCSPSRGR---APMYSSGSSVPAVNRGYSKASDNVSPVMM 423
Query: 419 GTKMVERVINMSKLPPSRLEVKIYPHSNLPGKPSSSPDNSGFGRSLSKKSLDMAIRHMEI 478
GTKMVERVINM KL P R + K PH NL K SSSPD++GFGR+LSKKSLDMAIRHM+I
Sbjct: 424 GTKMVERVINMRKLAPPRSDDKGSPHGNLSAK-SSSPDSAGFGRTLSKKSLDMAIRHMDI 482
Query: 479 RQRAPGNLRRPLMTNIPASSMYSVRSSPRHSQTVSYS-GSPHATSSNAGSEVSV-DQNGL 536
R+ PGNL RPLMTNIPASSMYSVRS + ++ S SP ATSSNA SE+SV + NG+
Sbjct: 483 RRTIPGNL-RPLMTNIPASSMYSVRSGHTRGRPMNVSDSSPLATSSNASSEISVCNNNGI 541
Query: 537 CI-YNETDDDIVGERGDRSPSSVRYR 561
C+ +E +DD ERG RSP+S++ R
Sbjct: 542 CLEASEKEDDAGSERGCRSPASLQGR 567
Score = 100 bits (248), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/123 (46%), Positives = 69/123 (56%), Gaps = 4/123 (3%)
Query: 31 MRRRENERNGVLLRASNRELADSPPLXXXXXXXXXXXXXXXXXXXXVLKTGVDDFLNFED 90
MRRRE E++ +LL + E PL K DDFLN E
Sbjct: 1 MRRREKEQDNLLLNNNPDEFET--PLGSKHGTSPVFNISSGAPPSR--KAAPDDFLNSEG 56
Query: 91 DKNDYEWLLTPPGTPRFPSLEMETQKTVKSQLDAPTTRPTALTSRLANPPSELIGRNNLV 150
DKNDYEWLLTPPGTP FPSLEME+ +T+ SQ +RP LTSRLAN +E RN+L
Sbjct: 57 DKNDYEWLLTPPGTPLFPSLEMESHRTMMSQTGDSKSRPATLTSRLANSSTESAARNHLT 116
Query: 151 SKK 153
S++
Sbjct: 117 SRQ 119
>AT3G09000.1 | Symbols: | proline-rich family protein |
chr3:2746014-2748326 FORWARD LENGTH=541
Length = 541
Score = 170 bits (430), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 171/288 (59%), Gaps = 36/288 (12%)
Query: 288 TVSAPSVKTSSIPK----PAPAV-SRQPVPSRGTSPT---VGSRSWKPSEMPGFSLDAPP 339
T + PS+ +S P P+P V S PSRGTSP+ SR WKP EMPGFSL+APP
Sbjct: 250 TPTGPSIVSSKAPSRGTSPSPTVNSLSKAPSRGTSPSPTLNSSRPWKPPEMPGFSLEAPP 309
Query: 340 NLRTTIPERSLSTTRGRPG---APASRSSSV------GPASGGRPKRQSCSPSRGRXXXX 390
NLRTT+ +R +S +RGRPG AP SRS S+ G +RQSCSPSRGR
Sbjct: 310 NLRTTLADRPVSASRGRPGVASAPGSRSGSIERGGGPTSGGSGNARRQSCSPSRGRAPIG 369
Query: 391 XXXXXXXXMPAVNRGYSKAN------DNESPVLMGTKMVERVINMSKLPPSRLEVKIYPH 444
+ RG +KA+ DN SPV MG KMVERV+NM KL P RL
Sbjct: 370 NTNGSLTGV----RGRAKASNGGSGCDNLSPVAMGNKMVERVVNMRKLGPPRLTENGGRG 425
Query: 445 SNLPGKPSSSPDNSGFGRSLSKKSLDMAIRHMEIRQRAPGNLRRPLMTNIPASSMYSVRS 504
S K SS+ ++ G+GR+LSK S+DMAIRHM+IR+ GNL RPL+T +PASSMYSVRS
Sbjct: 426 SG---KSSSAFNSLGYGRNLSKSSIDMAIRHMDIRRGMTGNL-RPLVTKVPASSMYSVRS 481
Query: 505 SPRHSQTVSYSGSPHATSSNAGSEVSVDQNGLCI-YNETD-DDIVGER 550
P +VS S +++ ++ + N LC+ NE + DD++ ER
Sbjct: 482 RP---GSVSSSPVATSSTVSSSDPSVDNINILCLDGNEAENDDLLSER 526
Score = 96.3 bits (238), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 74/132 (56%), Gaps = 4/132 (3%)
Query: 18 VKDNDDELALFLEMRRRENE-RNGVLLRASNRELADSPPLXXXXXXXXXXXXXXXXXXXX 76
D D+EL+LFLEMRRRE E R LL S+ ++
Sbjct: 3 THDRDEELSLFLEMRRREKEHRADSLLTGSDNVSINATLTAAAAAALSGVSETASSQRYP 62
Query: 77 VLKTGVDDFLNFEDDKNDYEWLLTPPGTPRFPSLEMETQKTVKSQLDAPTTRPTALTSRL 136
+ +T ++FL E++K+DY+WLLTPPGTP+F E E+ ++V +Q DAP +RPT L SRL
Sbjct: 63 LRRTAAENFLYSENEKSDYDWLLTPPGTPQF---EKESHRSVMNQHDAPNSRPTVLKSRL 119
Query: 137 ANPPSELIGRNN 148
N +++ NN
Sbjct: 120 GNCREDIVSGNN 131
>AT5G01280.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: proline-rich family protein (TAIR:AT3G09000.1); Has
1807 Blast hits to 1807 proteins in 277 species: Archae
- 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants -
385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:114185-116237 REVERSE LENGTH=460
Length = 460
Score = 92.0 bits (227), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 116/208 (55%), Gaps = 16/208 (7%)
Query: 309 QPVPSRGTSPTVGSRSWKPSEMPGFSLDAPPNLRTTIPERSLSTTRGRPGA-PASRSSSV 367
+P S SP V SR W+P EMPGFS++AP NLRTT+P+R + + R A AS SS
Sbjct: 223 KPALSLEASPIVRSRPWEPYEMPGFSVEAPSNLRTTLPDRPQTASSSRTRAFDASSSSRS 282
Query: 368 GPASGGRPKRQSCSPSRGRXXXXXXXXXXXXMPAVNRGYSKANDNESPVL----MGTKMV 423
KRQSCSPSR R +P++ +K N+++ ++ G + V
Sbjct: 283 ASTERDVAKRQSCSPSRSRAPNGNVNGA---VPSLRGQRAKTNNDDGRLISHAAKGNQKV 339
Query: 424 ERVINMSKLPPSRL------EVKIYPHSNLPGKPSSSPDNSGFGRSLSKKSLDMAIRHME 477
E+V+NM KL RL + + GK SS GFGR+LSK S+DMA+RHM+
Sbjct: 340 EKVVNMRKLATPRLTESGSRRLGGGGGDSSAGKSSSGSGGFGFGRNLSKSSIDMALRHMD 399
Query: 478 IRQRA-PGNLRRPLMTNIPASSMYSVRS 504
+R+ + GN R + T PA+S+YSVRS
Sbjct: 400 VRKGSMAGNFRHSV-TKAPATSVYSVRS 426
>AT2G38160.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 9 plant
structures; EXPRESSED DURING: 4 anthesis, F mature
embryo stage, petal differentiation and expansion stage,
E expanded cotyledon stage, D bilateral stage; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G40070.2). | chr2:15986643-15988464 REVERSE
LENGTH=314
Length = 314
Score = 76.3 bits (186), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 53/85 (62%), Gaps = 19/85 (22%)
Query: 405 GYSKANDNESPVLMGTKMVERVINMSKLPPSRLEVKIYPHSNLPGKPSSSPDNS--GFGR 462
G SK +++ +PVLMGT+MVERV+NM KLPP + + DN+ GFGR
Sbjct: 231 GRSKISNDVNPVLMGTQMVERVVNMRKLPPPKHD-----------------DNTTLGFGR 273
Query: 463 SLSKKSLDMAIRHMEIRQRAPGNLR 487
+LS+ SLDMA+RHM IR NLR
Sbjct: 274 TLSRSSLDMALRHMNIRHSVSKNLR 298
>AT2G38160.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G40070.2); Has 972 Blast hits to 731 proteins
in 211 species: Archae - 0; Bacteria - 236; Metazoa -
194; Fungi - 201; Plants - 218; Viruses - 32; Other
Eukaryotes - 91 (source: NCBI BLink). |
chr2:15986643-15988464 REVERSE LENGTH=314
Length = 314
Score = 76.3 bits (186), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 53/85 (62%), Gaps = 19/85 (22%)
Query: 405 GYSKANDNESPVLMGTKMVERVINMSKLPPSRLEVKIYPHSNLPGKPSSSPDNS--GFGR 462
G SK +++ +PVLMGT+MVERV+NM KLPP + + DN+ GFGR
Sbjct: 231 GRSKISNDVNPVLMGTQMVERVVNMRKLPPPKHD-----------------DNTTLGFGR 273
Query: 463 SLSKKSLDMAIRHMEIRQRAPGNLR 487
+LS+ SLDMA+RHM IR NLR
Sbjct: 274 TLSRSSLDMALRHMNIRHSVSKNLR 298
>AT3G08670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G51540.1); Has 48380 Blast hits to 29827
proteins in 1356 species: Archae - 46; Bacteria - 5589;
Metazoa - 17361; Fungi - 13192; Plants - 2237; Viruses -
905; Other Eukaryotes - 9050 (source: NCBI BLink). |
chr3:2633946-2636536 FORWARD LENGTH=567
Length = 567
Score = 67.0 bits (162), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 117/269 (43%), Gaps = 38/269 (14%)
Query: 287 PTVSAPSVKTSSIPKPAPAVSRQPVPSRGTSPTVGSRSWKPSEMPGFSLDAPPNLRTTIP 346
PT+S + +S + P++SR P P V + +P + F LD PPNLRT++P
Sbjct: 288 PTISG--GRAASNGRTGPSLSRPSSPG----PRVRNTPQQPIVLADFPLDTPPNLRTSLP 341
Query: 347 ERSLSTTRGRPGAPASRSSSVGPASGGRPKRQSCSP--SRGRXXXXXXXXXXXXMPAVNR 404
+R +S R RP S + P G R++ SP +RGR +
Sbjct: 342 DRPISAGRSRP-VGGSSMAKASPEPKGPITRRNSSPIVTRGRLTETQ-----------GK 389
Query: 405 GYSKANDNESPVLMGTKMVERVINMSKLPPSRLEVKIYPHSNLPGKPSSSPDNSGFGRSL 464
G N L R+ N+S + SR VK + + +N+G GRS
Sbjct: 390 GRFGGNGQH---LTDAPEPRRISNVSDI-TSRRTVKT--------STTVTDNNNGLGRSF 437
Query: 465 SKKSLDMAIRHMEIRQRAPGNLRRPLMTNIP------ASSMYSVRSSPRHSQTVSYSGSP 518
SK SLDMAIRHM+IR T P +S + +RS HS ++S +G+
Sbjct: 438 SKSSLDMAIRHMDIRNGKTNGCALSTTTLFPQSIRPASSKIQPIRSGNNHSDSISSNGTE 497
Query: 519 HATSSNAGSEVSVDQNGLCIYNETDDDIV 547
+ +N G + + + +Y + D +
Sbjct: 498 NGNEANEGRRLMGKLSDMDMYESSRYDAL 526
>AT1G27850.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G40070.1); Has 9215 Blast
hits to 5316 proteins in 473 species: Archae - 6;
Bacteria - 773; Metazoa - 3392; Fungi - 1710; Plants -
539; Viruses - 143; Other Eukaryotes - 2652 (source:
NCBI BLink). | chr1:9699265-9703701 FORWARD LENGTH=1148
Length = 1148
Score = 51.2 bits (121), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 49/79 (62%), Gaps = 5/79 (6%)
Query: 287 PTVSAPSVKTSSIPKPAPAV-SRQPVPS-RGTSPTVGSRSWKPSEMPGFSLDAPPNLRTT 344
PT ++ + T S +PAV PV S RG SP+ + W+ S +PGFSLDAPPNLRT+
Sbjct: 206 PTPTSRRMSTGSTTMASPAVRGTSPVSSSRGNSPSPKIKVWQ-SNIPGFSLDAPPNLRTS 264
Query: 345 IPERSLSTTRGRPGAPASR 363
+ +R S RG +PASR
Sbjct: 265 LGDRPASYVRG--SSPASR 281