Miyakogusa Predicted Gene
- Lj0g3v0132739.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0132739.1 tr|A9SCD4|A9SCD4_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_164112,24.04,8e-17,seg,NULL; SUBFAMILY NOT NAMED,NULL;
FAMILY NOT NAMED,NULL; coiled-coil,NULL,CUFF.8094.1
(540 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G12900.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 434 e-122
AT1G12330.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 204 2e-52
AT3G14870.2 | Symbols: | Plant protein of unknown function (DUF... 52 7e-07
AT3G14870.1 | Symbols: | Plant protein of unknown function (DUF... 52 8e-07
AT3G14870.3 | Symbols: | Plant protein of unknown function (DUF... 52 8e-07
AT3G60680.1 | Symbols: | Plant protein of unknown function (DUF... 52 9e-07
>AT5G12900.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G12330.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:4072151-4074445 REVERSE LENGTH=562
Length = 562
Score = 434 bits (1117), Expect = e-122, Method: Compositional matrix adjust.
Identities = 253/528 (47%), Positives = 335/528 (63%), Gaps = 48/528 (9%)
Query: 32 EIQTAIAKAVELRALHAALMRGNXXXXX--------XXXXXXXXXXXVSQFSAQDYPVFT 83
E + AIA V+LRAL A+LM+ + SA DYPVFT
Sbjct: 13 EKEEAIANGVKLRALQASLMQMKSSPSSNYSLRNPSSSSAASPASRPLPNLSAHDYPVFT 72
Query: 84 PSYEDDSILGYSQNLMSQTISESWEESALEGGNSIESIVQDYKQKSSSRKGLTSAFANLV 143
PSYED+ + + + T+SE+W+E + G ++ + D + S+SRK +
Sbjct: 73 PSYEDEPVSAFHHKNL--TLSETWDEDGV-GLVDGDTYLSDSYKTSTSRKTVM------- 122
Query: 144 SHSCPADDSKSVTGSCANHITVLQTSPCNDYYKSSRRN-SLEDFKSISSCNNCNPAIITT 202
P DS + ++ + SP +Y + R N DF+S+SSCN+ N
Sbjct: 123 ---MPHQDSHHHVYTMSDALR----SPPLHFYTTGRSNCGSVDFRSVSSCNDYNKQ---K 172
Query: 203 EFESARNSKSSNTVVLPVTDXXXXXXXXX-KTKG--LISWLLPRLKKKKQKNENFPN--- 256
F++ ++ KSSN VV P+TD + +G ++SWL P+LKKK++ N F +
Sbjct: 173 GFDT-KSLKSSNLVV-PLTDSHSAVVSSQPRNRGGRVMSWLFPKLKKKQKSNSIFNSPSI 230
Query: 257 TAESEDVSQALKDFGIMSIDTLKKELNEANENRDLALMEVSEMRSSLGELQQKLEYLESY 316
T +SE+VS+ LKD G ++ LK+EL EAN +RD AL +VSEM+SSLGEL +KL+YLESY
Sbjct: 231 TEKSEEVSEVLKDSG-SGVEKLKRELMEANRSRDAALTQVSEMKSSLGELSEKLQYLESY 289
Query: 317 CEDLKKALRQAMEPKDTQLYEQLNNIPKPFDGNGKNSMPVSEEAMVEGFLQIVSESRLSV 376
C++LKKALR+A T++ Q N+ + G + MPVSEE MVEGFLQIVSE+RLS+
Sbjct: 290 CDNLKKALREA-----TEVVSQENSGGRS-SGKKNSEMPVSEEVMVEGFLQIVSEARLSI 343
Query: 377 KQFCKILINNIEETDHSL----NLLLQPYKLSLNSKNSKAVLYHFEAFINQCFFQDFENC 432
KQF K L++ I+E D +L N LLQP+ LS SK SK + YH EA I+Q +QDFENC
Sbjct: 344 KQFLKTLVSEIDEEDSTLIGNINTLLQPHNLSFTSKYSKIIQYHLEAIISQSVYQDFENC 403
Query: 433 VFQKNGCAKLLDPRVYRESQFSSFVALRKLSWNEVLRKGTKYYSEGFSKFCDQKMSCVNT 492
VFQKNG KLLDP R++ FSSF +LR LSWNEVL+KGTKYYS+ FS+FCD+KMS + T
Sbjct: 404 VFQKNGKPKLLDPEQDRQANFSSFASLRNLSWNEVLKKGTKYYSDEFSRFCDEKMSLIIT 463
Query: 493 SLNWTRPWPEQLLQAFFVAAKCMWLLHFLAFSFNPPLGVFRVEENATF 540
+LNWTRPW EQ+LQAFFVAAKC+WLLH LAFSFNP LG+ RVEEN F
Sbjct: 464 TLNWTRPWSEQMLQAFFVAAKCVWLLHLLAFSFNPALGILRVEENREF 511
>AT1G12330.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G12900.1); Has 249 Blast
hits to 249 proteins in 27 species: Archae - 0; Bacteria
- 0; Metazoa - 7; Fungi - 14; Plants - 217; Viruses - 0;
Other Eukaryotes - 11 (source: NCBI BLink). |
chr1:4194673-4196627 FORWARD LENGTH=505
Length = 505
Score = 204 bits (518), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 109/278 (39%), Positives = 172/278 (61%), Gaps = 30/278 (10%)
Query: 273 MSIDTLKKELNEANENRDLALMEVSEMRSSLGELQQKLEYLESYCEDLKKALRQAMEPKD 332
+++ L +L +A + ++ A++E S ++SS+ EL++KL LE YC +LK L +
Sbjct: 176 LAVAELSHKLIQATKKKEDAVIEASRLKSSMAELEKKLNKLEIYCHNLKSGLDEC----- 230
Query: 333 TQLYEQLNNIPKPFDGNGKNSMPVSEEA----MVEGFLQIVSESRLSVKQFCKILINNIE 388
N K S+P+ ++ +++ FL VSESR S++ + L + +
Sbjct: 231 ---------------SNKKQSVPIRKDGFNDRIIQQFLVSVSESRSSIRALSRSLASQLR 275
Query: 389 ETD----HSLNLLLQPYKLSLNS--KNSKAVLYHFEAFINQCFFQDFENCVFQKNGCAKL 442
L+LLLQP+ + +NS KN K+++++ EA +++ FF+DFE FQKNG ++
Sbjct: 276 TVGGKVYERLSLLLQPFDVKINSFAKNPKSLIFYLEAILSRAFFEDFEAPGFQKNGSTRI 335
Query: 443 LDPRVYRESQFSSFVALRKLSWNEVLRKGTKYYSEGFSKFCDQKMSCVNTSLNWTRPWPE 502
L+P ES ++SF L +L+W+EVL +GTK++SE FS+FCD+KMS V + L+W R WPE
Sbjct: 336 LNPIDRCESNYASFNVLMELTWDEVLSRGTKHFSEEFSRFCDRKMSDVVSMLSWNRAWPE 395
Query: 503 QLLQAFFVAAKCMWLLHFLAFSFNPPLGVFRVEENATF 540
LLQAFF A+K +WL+H LA S NP L +FRVE++ F
Sbjct: 396 PLLQAFFGASKSVWLVHLLANSVNPGLQIFRVEKDDRF 433
>AT3G14870.2 | Symbols: | Plant protein of unknown function
(DUF641) | chr3:5004171-5005586 FORWARD LENGTH=471
Length = 471
Score = 52.4 bits (124), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 131/316 (41%), Gaps = 45/316 (14%)
Query: 255 PNTAESED--VSQALKDFGIMSIDTLKKELNEANENRDLALMEVSEMRSSLG-------E 305
PN + D V LK + LKK+L+ N +R L L E+ E+RS L +
Sbjct: 111 PNGIQKADNLVVAELKTLSELKQSFLKKQLD-PNPDRTLVLAEIQELRSVLKTYEIMGKK 169
Query: 306 LQQKLEYLESYCEDLKKALRQAMEPKDTQLYEQLNN---IPKPFDGNGKNSMPVSEEAMV 362
L+ +L+ +S LK+ +++M ++ + ++LN + P D N S S
Sbjct: 170 LECQLKLKDSEIIFLKEKFQESM-TQNKLMEKRLNQSGQLCNPLDHNLHLSAVSSTH--- 225
Query: 363 EGFLQIVSESRLSVKQFCKILINNIEETDHSLNL---LLQPYKLSLNSKNSKAVLYHFEA 419
F+ + + S++ F K+++ ++ +++ L+QP L + L H+
Sbjct: 226 --FVTYLHHTVKSIRGFVKLMVEQMKLAAWDIDMAAELIQPDVLYYKQDHKCFALEHY-- 281
Query: 420 FINQCFFQDFENCVFQKNGCAKLLDPRVYRESQFSSFVALRKLSWNEVLRKGTKYYSEGF 479
+ + + F+ F K R + F F LR + E L K
Sbjct: 282 -VCKIMLEAFQLPYFSNESSKK--TSREDKAMFFERFTELRSMKPREYLASRPK---SRL 335
Query: 480 SKFCDQK-MSCVNTSLNWT-------------RPWPE-QLLQAFFVAAKCMWLLHFLAFS 524
+KFC K + ++ + +PE L AF AK +WLLH LAFS
Sbjct: 336 AKFCRTKYLQLIHPKMEQAFFGHLHQRNQVTAGEFPETSLCTAFLEMAKRVWLLHCLAFS 395
Query: 525 FNPPLGVFRVEENATF 540
F+P +F+V F
Sbjct: 396 FDPEASIFQVSRGCRF 411
>AT3G14870.1 | Symbols: | Plant protein of unknown function
(DUF641) | chr3:5004159-5005586 FORWARD LENGTH=475
Length = 475
Score = 52.4 bits (124), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 131/316 (41%), Gaps = 45/316 (14%)
Query: 255 PNTAESED--VSQALKDFGIMSIDTLKKELNEANENRDLALMEVSEMRSSLG-------E 305
PN + D V LK + LKK+L+ N +R L L E+ E+RS L +
Sbjct: 115 PNGIQKADNLVVAELKTLSELKQSFLKKQLD-PNPDRTLVLAEIQELRSVLKTYEIMGKK 173
Query: 306 LQQKLEYLESYCEDLKKALRQAMEPKDTQLYEQLNN---IPKPFDGNGKNSMPVSEEAMV 362
L+ +L+ +S LK+ +++M ++ + ++LN + P D N S S
Sbjct: 174 LECQLKLKDSEIIFLKEKFQESM-TQNKLMEKRLNQSGQLCNPLDHNLHLSAVSSTH--- 229
Query: 363 EGFLQIVSESRLSVKQFCKILINNIEETDHSLNL---LLQPYKLSLNSKNSKAVLYHFEA 419
F+ + + S++ F K+++ ++ +++ L+QP L + L H+
Sbjct: 230 --FVTYLHHTVKSIRGFVKLMVEQMKLAAWDIDMAAELIQPDVLYYKQDHKCFALEHY-- 285
Query: 420 FINQCFFQDFENCVFQKNGCAKLLDPRVYRESQFSSFVALRKLSWNEVLRKGTKYYSEGF 479
+ + + F+ F K R + F F LR + E L K
Sbjct: 286 -VCKIMLEAFQLPYFSNESSKK--TSREDKAMFFERFTELRSMKPREYLASRPK---SRL 339
Query: 480 SKFCDQK-MSCVNTSLNWT-------------RPWPE-QLLQAFFVAAKCMWLLHFLAFS 524
+KFC K + ++ + +PE L AF AK +WLLH LAFS
Sbjct: 340 AKFCRTKYLQLIHPKMEQAFFGHLHQRNQVTAGEFPETSLCTAFLEMAKRVWLLHCLAFS 399
Query: 525 FNPPLGVFRVEENATF 540
F+P +F+V F
Sbjct: 400 FDPEASIFQVSRGCRF 415
>AT3G14870.3 | Symbols: | Plant protein of unknown function
(DUF641) | chr3:5004040-5005586 FORWARD LENGTH=472
Length = 472
Score = 52.4 bits (124), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 131/316 (41%), Gaps = 45/316 (14%)
Query: 255 PNTAESED--VSQALKDFGIMSIDTLKKELNEANENRDLALMEVSEMRSSLG-------E 305
PN + D V LK + LKK+L+ N +R L L E+ E+RS L +
Sbjct: 112 PNGIQKADNLVVAELKTLSELKQSFLKKQLD-PNPDRTLVLAEIQELRSVLKTYEIMGKK 170
Query: 306 LQQKLEYLESYCEDLKKALRQAMEPKDTQLYEQLNN---IPKPFDGNGKNSMPVSEEAMV 362
L+ +L+ +S LK+ +++M ++ + ++LN + P D N S S
Sbjct: 171 LECQLKLKDSEIIFLKEKFQESM-TQNKLMEKRLNQSGQLCNPLDHNLHLSAVSSTH--- 226
Query: 363 EGFLQIVSESRLSVKQFCKILINNIEETDHSLNL---LLQPYKLSLNSKNSKAVLYHFEA 419
F+ + + S++ F K+++ ++ +++ L+QP L + L H+
Sbjct: 227 --FVTYLHHTVKSIRGFVKLMVEQMKLAAWDIDMAAELIQPDVLYYKQDHKCFALEHY-- 282
Query: 420 FINQCFFQDFENCVFQKNGCAKLLDPRVYRESQFSSFVALRKLSWNEVLRKGTKYYSEGF 479
+ + + F+ F K R + F F LR + E L K
Sbjct: 283 -VCKIMLEAFQLPYFSNESSKK--TSREDKAMFFERFTELRSMKPREYLASRPK---SRL 336
Query: 480 SKFCDQK-MSCVNTSLNWT-------------RPWPE-QLLQAFFVAAKCMWLLHFLAFS 524
+KFC K + ++ + +PE L AF AK +WLLH LAFS
Sbjct: 337 AKFCRTKYLQLIHPKMEQAFFGHLHQRNQVTAGEFPETSLCTAFLEMAKRVWLLHCLAFS 396
Query: 525 FNPPLGVFRVEENATF 540
F+P +F+V F
Sbjct: 397 FDPEASIFQVSRGCRF 412
>AT3G60680.1 | Symbols: | Plant protein of unknown function
(DUF641) | chr3:22430246-22431745 FORWARD LENGTH=499
Length = 499
Score = 52.4 bits (124), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 122/279 (43%), Gaps = 44/279 (15%)
Query: 290 DLALMEVSEMRSSLGELQQKLEYLESYCEDLKKALRQAMEPKDTQLYEQLNNIPKPFDGN 349
DLA+ E R + E Q KL LE+ + L+ M+ KD Q++ N + +
Sbjct: 184 DLAIGSCLESR--VQENQSKLRALET----VSNRLQAEMDAKDLQVWSLRNKLGEIQKST 237
Query: 350 GKNSMPVSEEAMVEGFLQI-VSESRL-----SVKQFCKILINNIEETDHSLNLLLQPYKL 403
K S +S + ++ L + V ES L + ++F KILI +E+ L+L+ +
Sbjct: 238 SKLSKRLSSNSSLDVLLSVRVFESLLYDAFKATQKFTKILIELMEKAGWDLDLVAKSVHP 297
Query: 404 SLNSKNSKAVLYHFEAFINQCFFQDFENCVFQKNGCAKLLDPRVYRESQFSSF-VALRKL 462
++ + Y +++ F+ F+ F L+ Y ES+ SS +LR+L
Sbjct: 298 EVDYAKERHNRYALLSYVCLGMFRGFDGEGFD-------LNENDYEESERSSVDSSLREL 350
Query: 463 SWN------EVLRKGTKYYSEGFSKFCDQK---------MSCVNTSLNWTRP----WPE- 502
+ E+L + FS+FCD+K S + ++++ W
Sbjct: 351 MQHVSSNPMELLDRDKDC---AFSRFCDKKYHELIHPNMASSIFSNMDENEAVLSSWRSL 407
Query: 503 -QLLQAFFVAAKCMWLLHFLAFSFNPPLGVFRVEENATF 540
++F A +W LH LA SF+P + +F+VE F
Sbjct: 408 STFYESFVTMASSIWTLHKLALSFDPAVEIFQVESGVEF 446