Miyakogusa Predicted Gene
- Lj0g3v0214359.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0214359.1 CUFF.13808.1
(447 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score  E
Sequences producing significant alignments:                         (bits) Value
AT5G67100.1 | Symbols: ICU2 | DNA-directed DNA polymerases | chr...    600 e-172
AT5G63960.1 | Symbols: EMB2780 | DNA binding;nucleotide binding;...     93 5e-19
AT5G63960.2 | Symbols: EMB2780 | DNA binding;nucleotide binding;...     92 6e-19
AT1G67500.2 | Symbols: REV3 | recovery protein 3 | chr1:25287707...     74 2e-13
AT1G67500.1 | Symbols: ATREV3, REV3 | recovery protein 3 | chr1:...     74 2e-13
>AT5G67100.1 | Symbols: ICU2 | DNA-directed DNA polymerases |
chr5:26776994-26785104 FORWARD LENGTH=1524
Length = 1524
Score = 600 bits (1547), Expect = e-172, Method: Compositional matrix adjust.
Identities = 301/444 (67%), Positives = 353/444 (79%), Gaps = 5/444 (1%)
Query: 2 PPVVVTAINLKTILNEEQNINEIVSASVVCCNMVKIDTPMLASEWKRPGKLTHFTVIRKL 61
PP VVTAINLKTI+NE+QNI+EIVSASV+C + KID PM A E KR G L+HFTV+R
Sbjct: 559 PPAVVTAINLKTIVNEKQNISEIVSASVLCFHNAKIDVPMPAPERKRSGILSHFTVVRNP 618
Query: 62 HGNIFPLGFNKEVTDRNTKAGSNIVCAESSERALLNRLMLELHKLDTDVLVGHNISGFDL 121
G +P+G+ KEV+DRN+K G N++ E+SERALLNRL LEL+KLD+D+LVGHNISGFDL
Sbjct: 619 EGTGYPIGWKKEVSDRNSKNGCNVLSIENSERALLNRLFLELNKLDSDILVGHNISGFDL 678
Query: 122 DVLLHRSQACKVPSSMWSKLGRLNRSTMPKLDRRKKTFGSGADPGIMSCIAGRLLCDTYL 181
DVLL R+QACKV SSMWSK+GRL RS MPKL + +GSGA PG+MSCIAGRLLCDT L
Sbjct: 679 DVLLQRAQACKVQSSMWSKIGRLKRSFMPKL-KGNSNYGSGATPGLMSCIAGRLLCDTDL 737
Query: 182 SSRDLLKEVSYSLTELAKTQLNKFRKEVAPHGIPKMFQTAESLMELIEYGETDAWLSMEL 241
SRDLLKEVSYSLT+L+KTQLN+ RKE+AP+ IPKMFQ++++L+ELIE GETDAWLSMEL
Sbjct: 738 CSRDLLKEVSYSLTDLSKTQLNRDRKEIAPNDIPKMFQSSKTLVELIECGETDAWLSMEL 797
Query: 242 MFHLSILPLTRQLTNISGNLWGKTLQGARAQRVEYLLLHAFHAKKYIVPDKFSNYAKETK 301
MFHLS+LPLT QLTNISGNLWGKTLQGARAQR+EY LLH FH+KK+I+PDK S KE K
Sbjct: 798 MFHLSVLPLTLQLTNISGNLWGKTLQGARAQRIEYYLLHTFHSKKFILPDKISQRMKEIK 857
Query: 302 LTKRRVTHGVXXXXXXXXXXXXXXYHNNASEIDHXXXXXXXXYAGGLVLEPKKGLYDKYI 361
+KRR+ + N+ S+ YAGGLVLEPK+GLYDKY+
Sbjct: 858 SSKRRMDY-APEDRNVDELDADLTLENDPSK--GSKTKKGPAYAGGLVLEPKRGLYDKYV 914
Query: 362 LLLDFNSLYPSIIQEYNICFTTVERSFDGSFPRLPSSTITGILPELLENLVKRRKSVKTW 421
LLLDFNSLYPSIIQEYNICFTT+ RS DG PRLPSS GILP+L+E+LV RKSVK
Sbjct: 915 LLLDFNSLYPSIIQEYNICFTTIPRSEDG-VPRLPSSQTPGILPKLMEHLVSIRKSVKLK 973
Query: 422 MKTASGLKYQQFDIQQQALKLTAN 445
MK +GLKY + DI+QQALKLTAN
Sbjct: 974 MKKETGLKYWELDIRQQALKLTAN 997
>AT5G63960.1 | Symbols: EMB2780 | DNA binding;nucleotide
binding;nucleic acid binding;DNA-directed DNA
polymerases;DNA-directed DNA polymerases |
chr5:25599597-25606672 FORWARD LENGTH=1095
Length = 1095
Score = 92.8 bits (229), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 162/387 (41%), Gaps = 74/387 (19%)
Query: 82 GSNIVCAESSERALLNRLMLELHKLDTDVLVGHNISGFDLDVLLHRSQACKVPSSMWSKL 141
G +++ E+ LL L + +D D+++G+NI FDL L+ R+ + + L
Sbjct: 361 GVDVMSFETEREVLLAWRDL-IRDVDPDIIIGYNICKFDLPYLIERAATLGIEE--FPLL 417
Query: 142 GRLNRSTMPKLDRRKKTFGS---GADPGIMSCIAGRLLCDTYLSSRDLLKEVSYSLTELA 198
GR+ S ++ R TF S G + I GR D + K SYSL ++
Sbjct: 418 GRVKNS---RVRVRDSTFSSRQQGIRESKETTIEGRFQFDLIQAIHRDHKLSSYSLNSVS 474
Query: 199 KTQLNKFRKEVAPHGIPKMFQT--AESLMELIEYGETDAWLSMELMFHLSILPLTRQLTN 256
L++ +KE H I Q AE+ L Y DA+L L+ L + ++
Sbjct: 475 AHFLSE-QKEDVHHSIITDLQNGNAETRRRLAVYCLKDAYLPQRLLDKLMFIYNYVEMAR 533
Query: 257 ISGNLWGKTLQGARAQRVEYL--LLHAFHAKKYIVPDKFSNYAKETKLTKRRVTHGVXXX 314
++G L AR Q ++ L LL K ++P+ AK++
Sbjct: 534 VTGVPISFLL--ARGQSIKVLSQLLRKGKQKNLVLPN-----AKQS-------------- 572
Query: 315 XXXXXXXXXXXYHNNASEIDHXXXXXXXXYAGGLVLEPKKGLYDKYILLLDFNSLYPSII 374
SE Y G VLE + G Y+K I LDF SLYPSI+
Sbjct: 573 ---------------GSE--------QGTYEGATVLEARTGFYEKPIATLDFASLYPSIM 609
Query: 375 QEYNICFTTVERSFDGSFPRLPSSTIT---------------GILPELLENLVKRRKSVK 419
YN+C+ T+ D LP +T GILPE+LE L+ RK K
Sbjct: 610 MAYNLCYCTLVTPEDVRKLNLPPEHVTKTPSGETFVKQTLQKGILPEILEELLTARKRAK 669
Query: 420 TWMKTASG-LKYQQFDIQQQALKLTAN 445
+K A L+ D +Q ALK++AN
Sbjct: 670 ADLKEAKDPLEKAVLDGRQLALKISAN 696
>AT5G63960.2 | Symbols: EMB2780 | DNA binding;nucleotide
binding;nucleic acid binding;DNA-directed DNA
polymerases;DNA-directed DNA polymerases |
chr5:25599597-25606672 FORWARD LENGTH=1112
Length = 1112
Score = 92.4 bits (228), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 158/383 (41%), Gaps = 80/383 (20%)
Query: 86 VCAESSERALLNRLMLELHKLDTDVLVGHNISGFDLDVLLHRSQACKVPSSMWSKLGRLN 145
+C SS L+ +D D+++G+NI FDL L+ R+ + + LGR+
Sbjct: 388 ICITSSSSDLI-------RDVDPDIIIGYNICKFDLPYLIERAATLGIEE--FPLLGRVK 438
Query: 146 RSTMPKLDRRKKTFGS---GADPGIMSCIAGRLLCDTYLSSRDLLKEVSYSLTELAKTQL 202
S ++ R TF S G + I GR D + K SYSL ++ L
Sbjct: 439 NS---RVRVRDSTFSSRQQGIRESKETTIEGRFQFDLIQAIHRDHKLSSYSLNSVSAHFL 495
Query: 203 NKFRKEVAPHGIPKMFQT--AESLMELIEYGETDAWLSMELMFHLSILPLTRQLTNISGN 260
++ +KE H I Q AE+ L Y DA+L L+ L + ++ ++G
Sbjct: 496 SE-QKEDVHHSIITDLQNGNAETRRRLAVYCLKDAYLPQRLLDKLMFIYNYVEMARVTGV 554
Query: 261 LWGKTLQGARAQRVEYL--LLHAFHAKKYIVPDKFSNYAKETKLTKRRVTHGVXXXXXXX 318
L AR Q ++ L LL K ++P+ AK++
Sbjct: 555 PISFLL--ARGQSIKVLSQLLRKGKQKNLVLPN-----AKQS------------------ 589
Query: 319 XXXXXXXYHNNASEIDHXXXXXXXXYAGGLVLEPKKGLYDKYILLLDFNSLYPSIIQEYN 378
SE Y G VLE + G Y+K I LDF SLYPSI+ YN
Sbjct: 590 -----------GSE--------QGTYEGATVLEARTGFYEKPIATLDFASLYPSIMMAYN 630
Query: 379 ICFTTVERSFDGSFPRLPSSTIT---------------GILPELLENLVKRRKSVKTWMK 423
+C+ T+ D LP +T GILPE+LE L+ RK K +K
Sbjct: 631 LCYCTLVTPEDVRKLNLPPEHVTKTPSGETFVKQTLQKGILPEILEELLTARKRAKADLK 690
Query: 424 TASG-LKYQQFDIQQQALKLTAN 445
A L+ D +Q ALK++AN
Sbjct: 691 EAKDPLEKAVLDGRQLALKISAN 713
>AT1G67500.2 | Symbols: REV3 | recovery protein 3 |
chr1:25287707-25296714 REVERSE LENGTH=1916
Length = 1916
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/411 (22%), Positives = 153/411 (37%), Gaps = 99/411 (24%)
Query: 92 ERALLNRLMLELHKLDTDVLVGHNISGFDLDVLLHRSQACKVPSSMWSKLGRLNRSTMPK 151
ER L + L K D DVL+G +I G + L R A ++ + + R T
Sbjct: 1129 ERQLFRYFIETLCKWDPDVLLGWDIQGGSIGFLAER--AAQLGIRFLNNISRTPSPTTTN 1186
Query: 152 LDRRKKTFGSG--ADPGIMSC---------------------IAGRLLCDTYLSSRDLLK 188
K+ G+ DP + + + GR++ + + R +K
Sbjct: 1187 NSDNKRKLGNNLLPDPLVANPAQVEEVVIEDEWGRTHASGVHVGGRIVLNAWRLIRGEVK 1246
Query: 189 EVSYSLTELAKTQLNKFRKEVAPHGIPKMFQT--AESLMELIEYGETDAWLSMELMFHLS 246
Y++ +++ L + + + + F + A + IEY A L++E+M L
Sbjct: 1247 LNMYTIEAVSEAVLRQKVPSIPYKVLTEWFSSGPAGARYRCIEYVIRRANLNLEIMSQLD 1306
Query: 247 ILPLTRQLTNISGNLWGKTLQGARAQRVEYLLLHAFHAKKYIVPDKFSNYAKETKLTKRR 306
++ T +L + G + L RVE +LL H + Y+
Sbjct: 1307 MINRTSELARVFGIDFFSVLSRGSQYRVESMLLRLAHTQNYLA----------------- 1349
Query: 307 VTHGVXXXXXXXXXXXXXXYHNNASEIDHXXXXXXXXYAGGLVLEPKKGLYDKYILLLDF 366
++ G + LV+EP+ YD +++LDF
Sbjct: 1350 ISPG-----------------------NQQVASQPAMECVPLVMEPESAFYDDPVIVLDF 1386
Query: 367 NSLYPSIIQEYNICFTTV---------------ERSFD--------------GSFPRLPS 397
SLYPS+I YN+CF+T S D S +P
Sbjct: 1387 QSLYPSMIIAYNLCFSTCLGKLAHLKMNTLGVSSYSLDLDVLQDLNQILQTPNSVMYVPP 1446
Query: 398 STITGILPELLENLVKRRKSVKTWMK---TASGLKYQQFDIQQQALKLTAN 445
GILP LLE ++ R VK MK + + ++ F+ +Q ALKL AN
Sbjct: 1447 EVRRGILPRLLEEILSTRIMVKKAMKKLTPSEAVLHRIFNARQLALKLIAN 1497
>AT1G67500.1 | Symbols: ATREV3, REV3 | recovery protein 3 |
chr1:25287707-25296714 REVERSE LENGTH=1890
Length = 1890
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/411 (22%), Positives = 153/411 (37%), Gaps = 99/411 (24%)
Query: 92 ERALLNRLMLELHKLDTDVLVGHNISGFDLDVLLHRSQACKVPSSMWSKLGRLNRSTMPK 151
ER L + L K D DVL+G +I G + L R A ++ + + R T
Sbjct: 1103 ERQLFRYFIETLCKWDPDVLLGWDIQGGSIGFLAER--AAQLGIRFLNNISRTPSPTTTN 1160
Query: 152 LDRRKKTFGSG--ADPGIMSC---------------------IAGRLLCDTYLSSRDLLK 188
K+ G+ DP + + + GR++ + + R +K
Sbjct: 1161 NSDNKRKLGNNLLPDPLVANPAQVEEVVIEDEWGRTHASGVHVGGRIVLNAWRLIRGEVK 1220
Query: 189 EVSYSLTELAKTQLNKFRKEVAPHGIPKMFQT--AESLMELIEYGETDAWLSMELMFHLS 246
Y++ +++ L + + + + F + A + IEY A L++E+M L
Sbjct: 1221 LNMYTIEAVSEAVLRQKVPSIPYKVLTEWFSSGPAGARYRCIEYVIRRANLNLEIMSQLD 1280
Query: 247 ILPLTRQLTNISGNLWGKTLQGARAQRVEYLLLHAFHAKKYIVPDKFSNYAKETKLTKRR 306
++ T +L + G + L RVE +LL H + Y+
Sbjct: 1281 MINRTSELARVFGIDFFSVLSRGSQYRVESMLLRLAHTQNYLA----------------- 1323
Query: 307 VTHGVXXXXXXXXXXXXXXYHNNASEIDHXXXXXXXXYAGGLVLEPKKGLYDKYILLLDF 366
++ G + LV+EP+ YD +++LDF
Sbjct: 1324 ISPG-----------------------NQQVASQPAMECVPLVMEPESAFYDDPVIVLDF 1360
Query: 367 NSLYPSIIQEYNICFTTV---------------ERSFD--------------GSFPRLPS 397
SLYPS+I YN+CF+T S D S +P
Sbjct: 1361 QSLYPSMIIAYNLCFSTCLGKLAHLKMNTLGVSSYSLDLDVLQDLNQILQTPNSVMYVPP 1420
Query: 398 STITGILPELLENLVKRRKSVKTWMK---TASGLKYQQFDIQQQALKLTAN 445
GILP LLE ++ R VK MK + + ++ F+ +Q ALKL AN
Sbjct: 1421 EVRRGILPRLLEEILSTRIMVKKAMKKLTPSEAVLHRIFNARQLALKLIAN 1471