Miyakogusa Predicted Gene
- Lj3g3v2655630.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2655630.1 tr|A3LNE7|A3LNE7_PICST Poly(A)-binding protein
binding protein (Fragment) OS=Scheffersomyces
stipiti,29.86,2e-18,ATAXIN 2-RELATED,NULL; coiled-coil,NULL; SM-ATX,SM
domain found in ataxin-2; LsmAD,LsmAD domain; seg,CUFF.44339.1
(523 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G14010.4 | Symbols: CID4 | CTC-interacting domain 4 | chr3:46... 325 4e-89
AT3G14010.3 | Symbols: CID4 | CTC-interacting domain 4 | chr3:46... 324 9e-89
AT3G14010.2 | Symbols: CID4 | CTC-interacting domain 4 | chr3:46... 324 9e-89
AT3G14010.1 | Symbols: CID4 | CTC-interacting domain 4 | chr3:46... 324 9e-89
AT1G54170.1 | Symbols: CID3 | CTC-interacting domain 3 | chr1:20... 301 9e-82
AT4G26990.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 69 7e-12
AT5G54920.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 66 5e-11
AT5G54920.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 66 5e-11
>AT3G14010.4 | Symbols: CID4 | CTC-interacting domain 4 |
chr3:4637164-4640324 FORWARD LENGTH=549
Length = 549
Score = 325 bits (834), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 179/358 (50%), Positives = 231/358 (64%), Gaps = 22/358 (6%)
Query: 11 SSNGYGRRKSEKEGATKSDNKIPSGKSNTSRLVSTGAMTGNKSGSDESPSHDRLVYLTTC 70
SSNG+ ++ E E K+NTS V G ++GS + S DRLVY TTC
Sbjct: 9 SSNGFPLKRGETEEVLH--------KTNTSNTVFNG-----EAGSLKRLSLDRLVYFTTC 55
Query: 71 LIGHQVEVQVKNGSIYSGIFHATDTDKDFGIILKMARLTKDGSSRGQRSGVELLNKAPSK 130
IGH VEV ++NGS+Y+GIFHA + +KDFGIILKMA L KDG+ RG +S E + K PSK
Sbjct: 56 KIGHHVEVHLRNGSVYTGIFHAANVEKDFGIILKMACLIKDGTLRGHKSRSEFVRKPPSK 115
Query: 131 TFIIPGKELVQVIAKDIAVSRDGQSSESHYDMHQEIMVDSVISQSHHVEIGRELQRWVPD 190
TFIIP ELVQVIAKD++VS + S+ + E++ DS ISQS+HV+ R+LQRWVPD
Sbjct: 116 TFIIPADELVQVIAKDLSVSSNNMSNAVQGEKPSELLTDSSISQSYHVDRERQLQRWVPD 175
Query: 191 EDAPQCPELENIFDDPWNRGWNQFETNEMLFGVKSTFNEEIYTTKLEKGPRTRELERQAL 250
E P +LEN+FD+PWNR WNQFE N+ LFGVKSTF+E++YTT+LE+GP+T++LE A
Sbjct: 176 ETIPHGADLENVFDNPWNRKWNQFEVNKSLFGVKSTFDEDLYTTRLERGPQTKQLEEHAQ 235
Query: 251 RIAREIEGEETQDLHLAEERGL-YHDNFDID------AVYRGKGVDDSGCDENEDKLLDS 303
+IAREIE E T+D+H+AEERGL ++NFD D +V G DSG D ++ LLD+
Sbjct: 236 KIAREIEAETTRDIHVAEERGLQLNENFDFDEEARYSSVRPVTGFGDSGFDLEDNALLDT 295
Query: 304 LNTETFGDMSGSVIMRPGDISGGKGNNGAQTWSNLSSVDHLRSGSYDHAKQLVSDLPA 361
N TFG S S +P S GKG + S S + S +KQ D PA
Sbjct: 296 CNDLTFGGSSTSDGQKPA--SSGKGCEELRGDSQSSRKNKNVDQSCSTSKQQSKDFPA 351
>AT3G14010.3 | Symbols: CID4 | CTC-interacting domain 4 |
chr3:4637164-4640691 FORWARD LENGTH=595
Length = 595
Score = 324 bits (831), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 171/325 (52%), Positives = 220/325 (67%), Gaps = 22/325 (6%)
Query: 11 SSNGYGRRKSEKEGATKSDNKIPSGKSNTSRLVSTGAMTGNKSGSDESPSHDRLVYLTTC 70
SSNG+ ++ E E K+NTS V G ++GS + S DRLVY TTC
Sbjct: 9 SSNGFPLKRGETEEVLH--------KTNTSNTVFNG-----EAGSLKRLSLDRLVYFTTC 55
Query: 71 LIGHQVEVQVKNGSIYSGIFHATDTDKDFGIILKMARLTKDGSSRGQRSGVELLNKAPSK 130
IGH VEV ++NGS+Y+GIFHA + +KDFGIILKMA L KDG+ RG +S E + K PSK
Sbjct: 56 KIGHHVEVHLRNGSVYTGIFHAANVEKDFGIILKMACLIKDGTLRGHKSRSEFVRKPPSK 115
Query: 131 TFIIPGKELVQVIAKDIAVSRDGQSSESHYDMHQEIMVDSVISQSHHVEIGRELQRWVPD 190
TFIIP ELVQVIAKD++VS + S+ + E++ DS ISQS+HV+ R+LQRWVPD
Sbjct: 116 TFIIPADELVQVIAKDLSVSSNNMSNAVQGEKPSELLTDSSISQSYHVDRERQLQRWVPD 175
Query: 191 EDAPQCPELENIFDDPWNRGWNQFETNEMLFGVKSTFNEEIYTTKLEKGPRTRELERQAL 250
E P +LEN+FD+PWNR WNQFE N+ LFGVKSTF+E++YTT+LE+GP+T++LE A
Sbjct: 176 ETIPHGADLENVFDNPWNRKWNQFEVNKSLFGVKSTFDEDLYTTRLERGPQTKQLEEHAQ 235
Query: 251 RIAREIEGEETQDLHLAEERGL-YHDNFDID------AVYRGKGVDDSGCDENEDKLLDS 303
+IAREIE E T+D+H+AEERGL ++NFD D +V G DSG D ++ LLD+
Sbjct: 236 KIAREIEAETTRDIHVAEERGLQLNENFDFDEEARYSSVRPVTGFGDSGFDLEDNALLDT 295
Query: 304 LNTETFGDMSGSVIMRPGDISGGKG 328
N TFG S S +P S GKG
Sbjct: 296 CNDLTFGGSSTSDGQKPA--SSGKG 318
>AT3G14010.2 | Symbols: CID4 | CTC-interacting domain 4 |
chr3:4637164-4640691 FORWARD LENGTH=595
Length = 595
Score = 324 bits (831), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 171/325 (52%), Positives = 220/325 (67%), Gaps = 22/325 (6%)
Query: 11 SSNGYGRRKSEKEGATKSDNKIPSGKSNTSRLVSTGAMTGNKSGSDESPSHDRLVYLTTC 70
SSNG+ ++ E E K+NTS V G ++GS + S DRLVY TTC
Sbjct: 9 SSNGFPLKRGETEEVLH--------KTNTSNTVFNG-----EAGSLKRLSLDRLVYFTTC 55
Query: 71 LIGHQVEVQVKNGSIYSGIFHATDTDKDFGIILKMARLTKDGSSRGQRSGVELLNKAPSK 130
IGH VEV ++NGS+Y+GIFHA + +KDFGIILKMA L KDG+ RG +S E + K PSK
Sbjct: 56 KIGHHVEVHLRNGSVYTGIFHAANVEKDFGIILKMACLIKDGTLRGHKSRSEFVRKPPSK 115
Query: 131 TFIIPGKELVQVIAKDIAVSRDGQSSESHYDMHQEIMVDSVISQSHHVEIGRELQRWVPD 190
TFIIP ELVQVIAKD++VS + S+ + E++ DS ISQS+HV+ R+LQRWVPD
Sbjct: 116 TFIIPADELVQVIAKDLSVSSNNMSNAVQGEKPSELLTDSSISQSYHVDRERQLQRWVPD 175
Query: 191 EDAPQCPELENIFDDPWNRGWNQFETNEMLFGVKSTFNEEIYTTKLEKGPRTRELERQAL 250
E P +LEN+FD+PWNR WNQFE N+ LFGVKSTF+E++YTT+LE+GP+T++LE A
Sbjct: 176 ETIPHGADLENVFDNPWNRKWNQFEVNKSLFGVKSTFDEDLYTTRLERGPQTKQLEEHAQ 235
Query: 251 RIAREIEGEETQDLHLAEERGL-YHDNFDID------AVYRGKGVDDSGCDENEDKLLDS 303
+IAREIE E T+D+H+AEERGL ++NFD D +V G DSG D ++ LLD+
Sbjct: 236 KIAREIEAETTRDIHVAEERGLQLNENFDFDEEARYSSVRPVTGFGDSGFDLEDNALLDT 295
Query: 304 LNTETFGDMSGSVIMRPGDISGGKG 328
N TFG S S +P S GKG
Sbjct: 296 CNDLTFGGSSTSDGQKPA--SSGKG 318
>AT3G14010.1 | Symbols: CID4 | CTC-interacting domain 4 |
chr3:4637164-4640691 FORWARD LENGTH=595
Length = 595
Score = 324 bits (831), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 171/325 (52%), Positives = 220/325 (67%), Gaps = 22/325 (6%)
Query: 11 SSNGYGRRKSEKEGATKSDNKIPSGKSNTSRLVSTGAMTGNKSGSDESPSHDRLVYLTTC 70
SSNG+ ++ E E K+NTS V G ++GS + S DRLVY TTC
Sbjct: 9 SSNGFPLKRGETEEVLH--------KTNTSNTVFNG-----EAGSLKRLSLDRLVYFTTC 55
Query: 71 LIGHQVEVQVKNGSIYSGIFHATDTDKDFGIILKMARLTKDGSSRGQRSGVELLNKAPSK 130
IGH VEV ++NGS+Y+GIFHA + +KDFGIILKMA L KDG+ RG +S E + K PSK
Sbjct: 56 KIGHHVEVHLRNGSVYTGIFHAANVEKDFGIILKMACLIKDGTLRGHKSRSEFVRKPPSK 115
Query: 131 TFIIPGKELVQVIAKDIAVSRDGQSSESHYDMHQEIMVDSVISQSHHVEIGRELQRWVPD 190
TFIIP ELVQVIAKD++VS + S+ + E++ DS ISQS+HV+ R+LQRWVPD
Sbjct: 116 TFIIPADELVQVIAKDLSVSSNNMSNAVQGEKPSELLTDSSISQSYHVDRERQLQRWVPD 175
Query: 191 EDAPQCPELENIFDDPWNRGWNQFETNEMLFGVKSTFNEEIYTTKLEKGPRTRELERQAL 250
E P +LEN+FD+PWNR WNQFE N+ LFGVKSTF+E++YTT+LE+GP+T++LE A
Sbjct: 176 ETIPHGADLENVFDNPWNRKWNQFEVNKSLFGVKSTFDEDLYTTRLERGPQTKQLEEHAQ 235
Query: 251 RIAREIEGEETQDLHLAEERGL-YHDNFDID------AVYRGKGVDDSGCDENEDKLLDS 303
+IAREIE E T+D+H+AEERGL ++NFD D +V G DSG D ++ LLD+
Sbjct: 236 KIAREIEAETTRDIHVAEERGLQLNENFDFDEEARYSSVRPVTGFGDSGFDLEDNALLDT 295
Query: 304 LNTETFGDMSGSVIMRPGDISGGKG 328
N TFG S S +P S GKG
Sbjct: 296 CNDLTFGGSSTSDGQKPA--SSGKG 318
>AT1G54170.1 | Symbols: CID3 | CTC-interacting domain 3 |
chr1:20221353-20224919 REVERSE LENGTH=587
Length = 587
Score = 301 bits (770), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 183/371 (49%), Positives = 231/371 (62%), Gaps = 36/371 (9%)
Query: 11 SSNGYGRRKSEKEGATKSDNKIPSGKSNTSRLVSTGAMTGNKSGSDESPSHDRLVYLTTC 70
SSNG+ R+ EKE +N PS NT G + S ++PS LVY TTC
Sbjct: 10 SSNGFSHRRFEKEAWM--NNAQPS-VDNTE--------NGWDAESVDTPSQKLLVYFTTC 58
Query: 71 LIGHQVEVQVKNGSIYSGIFHATDTDKDFGIILKMARLTKDGSSRGQRSGVELLNKAPSK 130
IGHQVEV +KNGS+YSGIFHA + +KDFGIILKMA L +D SRG +S ++K SK
Sbjct: 59 NIGHQVEVHLKNGSVYSGIFHAANVEKDFGIILKMACLIRD--SRGTKS--RTVSKPSSK 114
Query: 131 TFIIPGKELVQVIAKDIAVSRDGQSSESHYDMHQEIMVDSVISQSHHVEIGRELQRWVPD 190
IP ELVQVIAKD+ +S D S + E++ DS+ISQ ++V++ REL+ WVPD
Sbjct: 115 LLKIPADELVQVIAKDLPLSSDSVSDSVQCEKPLELLTDSLISQFYNVDLERELKPWVPD 174
Query: 191 EDAPQCPELENIFDDPWNRGWNQFETNEMLFGVKSTFNEEIYTTKLEKGPRTRELERQAL 250
ED P C +LEN+FDDPW RGWNQFE N+ LFGV STF+EE+YTTKLE+GP TRELE QAL
Sbjct: 175 EDVPDCSDLENVFDDPWKRGWNQFEVNKTLFGVTSTFDEELYTTKLERGPGTRELEEQAL 234
Query: 251 RIAREIEGEETQDLHLAEERGL-YHDNFDID------AVYRGKGVDDSGCDENEDK---- 299
RIAREI GE T+D+H+AEERGL FDID +V DD+ +++E++
Sbjct: 235 RIAREIVGENTRDIHVAEERGLQLSGKFDIDEETKYSSVCATNRFDDTCYEDDEEEEEDI 294
Query: 300 LLDSLNTETFGDMSGSVIMRPGDISGGKGNNGAQTWSN---LSS---VDHLRSGSYDHAK 353
LLD N TFGD S S G G +W + L S VD S S H +
Sbjct: 295 LLDCCNNLTFGDSSASD----GKEPASTGKFYEDSWGDSLHLRSNKMVDQSWSNSNKHTR 350
Query: 354 QLVSDLPAQSL 364
QL+S+LP++
Sbjct: 351 QLMSELPSKDF 361
>AT4G26990.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G54920.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr4:13551150-13554253 REVERSE LENGTH=474
Length = 474
Score = 69.3 bits (168), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 54/89 (60%), Gaps = 5/89 (5%)
Query: 64 LVYLTTCLIGHQVEVQVKNGSIYSGIFHATDTDKDFGIILKMARLTKDGSSRGQRSGVEL 123
L+ T C+IG QV V VK+GS++SGIF D FGI+LK AR+TK G+S + +
Sbjct: 23 LIAATMCIIGLQVHVHVKDGSVFSGIFFTASVDNGFGIVLKDARITKKGTSISNVASGSV 82
Query: 124 LNKAPSKTFIIPGKELVQVIAKDIAVSRD 152
++ T +I +VQ+IA+ +++ +
Sbjct: 83 VD-----TLVILSSTIVQIIAEGVSLPSN 106
>AT5G54920.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G26990.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:22302111-22305576 FORWARD LENGTH=517
Length = 517
Score = 66.2 bits (160), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 56/91 (61%), Gaps = 5/91 (5%)
Query: 62 DRLVYLTTCLIGHQVEVQVKNGSIYSGIFHATDTDKDFGIILKMARLTKDGSSRGQRSGV 121
+ L+ T C+IG QV V + +GS++SGIF+ + +F I+LK A+LTK G S+
Sbjct: 25 EALLISTMCIIGLQVHVHINDGSVFSGIFYTVSLENEFSIVLKNAKLTKKGRSKSNVESG 84
Query: 122 ELLNKAPSKTFIIPGKELVQVIAKDIAVSRD 152
+++ +T +I +VQ++A+ +++S +
Sbjct: 85 KIV-----ETLVILSSNIVQIVAEGVSLSSN 110
>AT5G54920.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G26990.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr5:22302111-22305576
FORWARD LENGTH=522
Length = 522
Score = 66.2 bits (160), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 56/91 (61%), Gaps = 5/91 (5%)
Query: 62 DRLVYLTTCLIGHQVEVQVKNGSIYSGIFHATDTDKDFGIILKMARLTKDGSSRGQRSGV 121
+ L+ T C+IG QV V + +GS++SGIF+ + +F I+LK A+LTK G S+
Sbjct: 25 EALLISTMCIIGLQVHVHINDGSVFSGIFYTVSLENEFSIVLKNAKLTKKGRSKSNVESG 84
Query: 122 ELLNKAPSKTFIIPGKELVQVIAKDIAVSRD 152
+++ +T +I +VQ++A+ +++S +
Sbjct: 85 KIV-----ETLVILSSNIVQIVAEGVSLSSN 110