Miyakogusa Predicted Gene
- Lj1g3v4289080.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4289080.2 tr|C7J9Q6|C7J9Q6_ORYSJ Os12g0236050 protein
(Fragment) OS=Oryza sativa subsp. japonica
GN=Os12g02360,53.01,2e-18,coiled-coil,NULL; seg,NULL; SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL,CUFF.32219.2
(544 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G42320.2 | Symbols: | nucleolar protein gar2-related | chr2:... 348 8e-96
AT2G42320.1 | Symbols: | nucleolar protein gar2-related | chr2:... 348 8e-96
AT3G57780.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 308 6e-84
AT3G01810.3 | Symbols: | FUNCTIONS IN: molecular_function unkno... 237 2e-62
AT3G01810.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 237 2e-62
AT3G01810.2 | Symbols: | FUNCTIONS IN: molecular_function unkno... 237 2e-62
AT5G06930.1 | Symbols: | LOCATED IN: chloroplast; EXPRESSED IN:... 214 2e-55
AT5G43230.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 209 3e-54
>AT2G42320.2 | Symbols: | nucleolar protein gar2-related |
chr2:17628102-17630657 FORWARD LENGTH=669
Length = 669
Score = 348 bits (892), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 166/294 (56%), Positives = 211/294 (71%), Gaps = 17/294 (5%)
Query: 233 EILDGASNGAQSLGSEDERHETVNAEENGEHEDKAALELKIXXXXXXXXXXXXXXXXXXX 292
++ + ASNGA S GSE+E + E NG + + + E KI
Sbjct: 139 DVWEDASNGALSAGSENEAADVT--ENNGGNFEDGSSEEKIERLETRIEKLEEELREVAA 196
Query: 293 XXVSLYSVVPEHGSSAHKVHTPARRLSRLYLHASKHWTQNRRATIAKNTVSGLILVAKSC 352
+SLYSVVP+H SSAHK+HTPARR+SR+Y+HA KH+TQ +RATIA+N+VSGL+LVAKSC
Sbjct: 197 LEISLYSVVPDHCSSAHKLHTPARRISRIYIHACKHFTQGKRATIARNSVSGLVLVAKSC 256
Query: 353 GNDVSRLTFWLSNTIVLREIISHAFGNSCQVSPLMRLAESNGAGN---GKSASLKWKGFP 409
GNDVSRLTFWLSN I LR+IIS AFG S + +++E N +GN GK +L+WK
Sbjct: 257 GNDVSRLTFWLSNIIALRQIISQAFGRS----RITQISEPNESGNSDSGKKTNLRWK--- 309
Query: 410 NGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPAGDFSSNK 469
+GF Q +EDWQET TFT+ALE++E W+FSR+VESVWWQ TP+MQSP D S++K
Sbjct: 310 -----NGFQQLLEDWQETETFTTALEKIEFWVFSRIVESVWWQVFTPHMQSPEDDSSASK 364
Query: 470 SFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLPVLARMV 523
S G+++GP+LGD NQG FSI+LW+ AF DA QR+CP+R GHECGCLPVLARMV
Sbjct: 365 SNGKLMGPSLGDQNQGTFSISLWKNAFRDALQRICPMRGAGHECGCLPVLARMV 418
>AT2G42320.1 | Symbols: | nucleolar protein gar2-related |
chr2:17628102-17630657 FORWARD LENGTH=669
Length = 669
Score = 348 bits (892), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 166/294 (56%), Positives = 211/294 (71%), Gaps = 17/294 (5%)
Query: 233 EILDGASNGAQSLGSEDERHETVNAEENGEHEDKAALELKIXXXXXXXXXXXXXXXXXXX 292
++ + ASNGA S GSE+E + E NG + + + E KI
Sbjct: 139 DVWEDASNGALSAGSENEAADVT--ENNGGNFEDGSSEEKIERLETRIEKLEEELREVAA 196
Query: 293 XXVSLYSVVPEHGSSAHKVHTPARRLSRLYLHASKHWTQNRRATIAKNTVSGLILVAKSC 352
+SLYSVVP+H SSAHK+HTPARR+SR+Y+HA KH+TQ +RATIA+N+VSGL+LVAKSC
Sbjct: 197 LEISLYSVVPDHCSSAHKLHTPARRISRIYIHACKHFTQGKRATIARNSVSGLVLVAKSC 256
Query: 353 GNDVSRLTFWLSNTIVLREIISHAFGNSCQVSPLMRLAESNGAGN---GKSASLKWKGFP 409
GNDVSRLTFWLSN I LR+IIS AFG S + +++E N +GN GK +L+WK
Sbjct: 257 GNDVSRLTFWLSNIIALRQIISQAFGRS----RITQISEPNESGNSDSGKKTNLRWK--- 309
Query: 410 NGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPAGDFSSNK 469
+GF Q +EDWQET TFT+ALE++E W+FSR+VESVWWQ TP+MQSP D S++K
Sbjct: 310 -----NGFQQLLEDWQETETFTTALEKIEFWVFSRIVESVWWQVFTPHMQSPEDDSSASK 364
Query: 470 SFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLPVLARMV 523
S G+++GP+LGD NQG FSI+LW+ AF DA QR+CP+R GHECGCLPVLARMV
Sbjct: 365 SNGKLMGPSLGDQNQGTFSISLWKNAFRDALQRICPMRGAGHECGCLPVLARMV 418
>AT3G57780.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: nucleolar protein gar2-related (TAIR:AT2G42320.2);
Has 3163 Blast hits to 2460 proteins in 357 species:
Archae - 16; Bacteria - 291; Metazoa - 841; Fungi - 335;
Plants - 248; Viruses - 72; Other Eukaryotes - 1360
(source: NCBI BLink). | chr3:21399766-21402329 REVERSE
LENGTH=671
Length = 671
Score = 308 bits (789), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 157/311 (50%), Positives = 201/311 (64%), Gaps = 13/311 (4%)
Query: 213 ASSESSEGVDENHVLEVKEIEILDGASNGAQSLGSEDERHETVNAEENGEHEDKAALELK 272
+S E+ EGV+ + V +E+ D ASNG S GSE+E + EN E ED+ L+
Sbjct: 115 SSPETCEGVNVDKV-----VEVWDDASNGGLSGGSENEAGDVKEKNENFE-EDEEMLKQM 168
Query: 273 IXXXXXXXXXXXXXXXXXXXXXVSLYSVVPEHGSSAHKVHTPARRLSRLYLHASKHWTQN 332
+ +SLYSVVP+H SSAHK+HTPARR+SR+Y+HA KHW+Q
Sbjct: 169 VETLETRVEKLEEELREVAALEISLYSVVPDHSSSAHKLHTPARRISRIYIHACKHWSQG 228
Query: 333 RRATIAKNTVSGLILVAKSCGNDVSRLTFWLSNTIVLREIISHAFGNSCQVSPLMRLAES 392
+RAT+A+N+VSGLIL AKSCGNDVSRLTFWLSN I LREII AFG + S + S
Sbjct: 229 KRATVARNSVSGLILAAKSCGNDVSRLTFWLSNIISLREIILQAFGKTSVPSHFTETSAS 288
Query: 393 NGAGNGKSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQ 452
NG+ + ++ K K +GF Q EDWQE+ TFT+ALE+VE WIFSR+VESVWWQ
Sbjct: 289 NGSEHNVLGKVRRKKNQWTKQSNGFKQVFEDWQESQTFTAALEKVEFWIFSRIVESVWWQ 348
Query: 453 ALTPYMQSPAGDFSSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHE 512
TP+MQSP ++ G+ LGD QG+FSI+LW+ AF+ RLCP+R HE
Sbjct: 349 VFTPHMQSP-------ENGGKTKEHILGDIEQGSFSISLWKNAFKVTLSRLCPMRGARHE 401
Query: 513 CGCLPVLARMV 523
CGCLP+LA+MV
Sbjct: 402 CGCLPILAKMV 412
>AT3G01810.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; EXPRESSED IN:
21 plant structures; EXPRESSED DURING: 13 growth stages;
BEST Arabidopsis thaliana protein match is: nucleolar
protein gar2-related (TAIR:AT2G42320.2). |
chr3:289218-292557 FORWARD LENGTH=921
Length = 921
Score = 237 bits (604), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 126/246 (51%), Positives = 155/246 (63%), Gaps = 30/246 (12%)
Query: 296 SLYSVVPEHGSSAHKVHTPARRLSRLYLHASKHWTQNRRATIAKNTVSGLILVAKSCGND 355
+LYSVV EHGSS+ KVH PARRL RLYLHA + +RRA A++ VSGL+LVAK+CGND
Sbjct: 430 ALYSVVAEHGSSSSKVHAPARRLLRLYLHACRETHLSRRANAAESAVSGLVLVAKACGND 489
Query: 356 VSRLTFWLSNTIVLREIISHAFGNSCQVSPLMRLAESNGAG---------NGKSASLKWK 406
V RLTFWLSNTIVLR IIS L S G G K +SLKWK
Sbjct: 490 VPRLTFWLSNTIVLRTIISDTSAEE-------ELPVSAGPGPRKQKAERETEKRSSLKWK 542
Query: 407 GFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPAG--- 463
P K + W + TF +ALE+VE+WIFSR+VES+WWQ LTP MQS A
Sbjct: 543 DSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRMQSSAASTR 599
Query: 464 DF------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLP 517
+F +S K+FGR P+ + G+FS+ LW+ AF +A +RLCPLR GHECGCLP
Sbjct: 600 EFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGSGHECGCLP 657
Query: 518 VLARMV 523
+ AR++
Sbjct: 658 IPARLI 663
>AT3G01810.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
plasma membrane; EXPRESSED IN: 21 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: nucleolar protein
gar2-related (TAIR:AT2G42320.2); Has 1327 Blast hits to
470 proteins in 132 species: Archae - 2; Bacteria - 131;
Metazoa - 139; Fungi - 114; Plants - 114; Viruses - 0;
Other Eukaryotes - 827 (source: NCBI BLink). |
chr3:289218-292557 FORWARD LENGTH=921
Length = 921
Score = 237 bits (604), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 126/246 (51%), Positives = 155/246 (63%), Gaps = 30/246 (12%)
Query: 296 SLYSVVPEHGSSAHKVHTPARRLSRLYLHASKHWTQNRRATIAKNTVSGLILVAKSCGND 355
+LYSVV EHGSS+ KVH PARRL RLYLHA + +RRA A++ VSGL+LVAK+CGND
Sbjct: 430 ALYSVVAEHGSSSSKVHAPARRLLRLYLHACRETHLSRRANAAESAVSGLVLVAKACGND 489
Query: 356 VSRLTFWLSNTIVLREIISHAFGNSCQVSPLMRLAESNGAG---------NGKSASLKWK 406
V RLTFWLSNTIVLR IIS L S G G K +SLKWK
Sbjct: 490 VPRLTFWLSNTIVLRTIISDTSAEE-------ELPVSAGPGPRKQKAERETEKRSSLKWK 542
Query: 407 GFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPAG--- 463
P K + W + TF +ALE+VE+WIFSR+VES+WWQ LTP MQS A
Sbjct: 543 DSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRMQSSAASTR 599
Query: 464 DF------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLP 517
+F +S K+FGR P+ + G+FS+ LW+ AF +A +RLCPLR GHECGCLP
Sbjct: 600 EFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGSGHECGCLP 657
Query: 518 VLARMV 523
+ AR++
Sbjct: 658 IPARLI 663
>AT3G01810.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
plasma membrane; EXPRESSED IN: 21 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: nucleolar protein
gar2-related (TAIR:AT2G42320.2); Has 1232 Blast hits to
443 proteins in 120 species: Archae - 2; Bacteria - 119;
Metazoa - 136; Fungi - 117; Plants - 114; Viruses - 0;
Other Eukaryotes - 744 (source: NCBI BLink). |
chr3:289218-292375 FORWARD LENGTH=859
Length = 859
Score = 237 bits (604), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 126/246 (51%), Positives = 155/246 (63%), Gaps = 30/246 (12%)
Query: 296 SLYSVVPEHGSSAHKVHTPARRLSRLYLHASKHWTQNRRATIAKNTVSGLILVAKSCGND 355
+LYSVV EHGSS+ KVH PARRL RLYLHA + +RRA A++ VSGL+LVAK+CGND
Sbjct: 430 ALYSVVAEHGSSSSKVHAPARRLLRLYLHACRETHLSRRANAAESAVSGLVLVAKACGND 489
Query: 356 VSRLTFWLSNTIVLREIISHAFGNSCQVSPLMRLAESNGAG---------NGKSASLKWK 406
V RLTFWLSNTIVLR IIS L S G G K +SLKWK
Sbjct: 490 VPRLTFWLSNTIVLRTIISDTSAEE-------ELPVSAGPGPRKQKAERETEKRSSLKWK 542
Query: 407 GFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPAG--- 463
P K + W + TF +ALE+VE+WIFSR+VES+WWQ LTP MQS A
Sbjct: 543 DSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRMQSSAASTR 599
Query: 464 DF------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLP 517
+F +S K+FGR P+ + G+FS+ LW+ AF +A +RLCPLR GHECGCLP
Sbjct: 600 EFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGSGHECGCLP 657
Query: 518 VLARMV 523
+ AR++
Sbjct: 658 IPARLI 663
>AT5G06930.1 | Symbols: | LOCATED IN: chloroplast; EXPRESSED IN: 15
plant structures; EXPRESSED DURING: 7 growth stages;
BEST Arabidopsis thaliana protein match is: nucleolar
protein gar2-related (TAIR:AT2G42320.2); Has 3369 Blast
hits to 1526 proteins in 313 species: Archae - 2;
Bacteria - 910; Metazoa - 754; Fungi - 336; Plants -
137; Viruses - 11; Other Eukaryotes - 1219 (source: NCBI
BLink). | chr5:2145139-2147849 FORWARD LENGTH=723
Length = 723
Score = 214 bits (544), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 156/473 (32%), Positives = 217/473 (45%), Gaps = 65/473 (13%)
Query: 57 VGDSNTASENSETYENVVIDYVDDVNRSEEALAEMKVNAMVANQASDTEK-----EQKEG 111
V DS T SE+SE YENV + Y+DD AN+ S T+ E++EG
Sbjct: 64 VTDSTTGSESSEVYENVNVHYMDD-----------------ANEKSRTDGNLVGCEEEEG 106
Query: 112 NG-EXXXXXXXXXXXXXQGDSFTNXXXXXXXXXXXXXXXXXXXXXXXXRGLKERSDRKTN 170
NG E Q + F+ +G +D +
Sbjct: 107 NGDESDTETNNGSVSWSQCELFSPEEKKSERPSMVSKSKSSQDRPLTSKGRTNIAD--SV 164
Query: 171 KLQSKVSDSNQKKPMNSTKGPSRVXXXXXXXXXXXPVKAPVKASSESSEGVDENHVLEVK 230
+ +S S +K + S+K ++ KA AS VD E K
Sbjct: 165 RSRSNTFHSTARKTVRSSKSQAKALSDFSSYRSSENNKAFSSASP-----VDSTPFEEGK 219
Query: 231 EIEILDGASNGAQSLGSEDERHETVNAEENGEHEDKAALELKIXXXXXXXXXXXXXXXXX 290
E + + A N S+ + + +ET+ +E + + L KI
Sbjct: 220 EDDEFEDALN---SVHNNESDNETLVYKEKKRSDVEKVLAQKIETMEARIEKLEEELREV 276
Query: 291 XXXXVSLYSVVPEHGSSAHKVHTPARRLSRLYLHASKHWTQNRRATIAKNTVSGLILVAK 350
+SLYSV PEHGSS+HK+H PAR LSRLY A K+ ++N+ ++ KN VSGL L+ K
Sbjct: 277 AALEMSLYSVFPEHGSSSHKLHKPARNLSRLYALARKNQSENKIISVTKNIVSGLSLLLK 336
Query: 351 SCGNDVSRLTFWLSNTIVLREIISHAFGNSCQVSPLMRLAESNGAGNGKSASLKWKGFPN 410
SCG+DVSRLT+WLSNT++LREIIS FG+S
Sbjct: 337 SCGSDVSRLTYWLSNTVMLREIISLDFGSS------------------------------ 366
Query: 411 GKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPAGDFSSNKS 470
+G EDW + T +AL RVES F++ VES+W Q + +M D + +
Sbjct: 367 --KLNGLNSLKEDWGDVRTLIAALRRVESCFFTQAVESIWSQVMMVHMIPQGVDSTMGEM 424
Query: 471 FGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLPVLARMV 523
G PA D Q +FS+NLW+ AFE+A QRLCP++A +CGCL VL RMV
Sbjct: 425 IGNFSEPATCDRLQESFSVNLWKEAFEEALQRLCPVQATRRQCGCLHVLTRMV 477
>AT5G43230.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01810.3); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:17349125-17352747 FORWARD LENGTH=848
Length = 848
Score = 209 bits (533), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 114/232 (49%), Positives = 143/232 (61%), Gaps = 40/232 (17%)
Query: 296 SLYSVVPEHGSSAHKVHTPARRLSRLYLHASKH--WTQNRRATIAKNTVSGLILVAKSCG 353
++YSVV EH SS KVH PARRL+R YLHA K ++RAT A+ VSGLILV+K+CG
Sbjct: 384 AIYSVVAEHTSSMSKVHAPARRLARFYLHACKGNGSDHSKRATAARAAVSGLILVSKACG 443
Query: 354 NDVSRLTFWLSNTIVLREIISHAFGNSCQVSPLMRLAESNGAGNGKSASLKWKGFPNGKA 413
NDV RLTFWLSN+IVLR I+S K K P KA
Sbjct: 444 NDVPRLTFWLSNSIVLRAILSRGME-------------------------KMKIVPE-KA 477
Query: 414 GSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPA--GDFSSNKSF 471
GS ++W++ F +ALE+ ESWIFSR+V+SVWWQ++TP+MQSPA G + S
Sbjct: 478 GS------DEWEDPRAFLAALEKFESWIFSRVVKSVWWQSMTPHMQSPAVKGSIARKVSG 531
Query: 472 GRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLPVLARMV 523
R LG NQG ++I LW+ AF A +RLCPLR ECGCLP+LA++V
Sbjct: 532 KR----RLGHRNQGLYAIELWKNAFRAACERLCPLRGSRQECGCLPMLAKLV 579