Miyakogusa Predicted Gene
- Lj1g3v4579480.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4579480.1 CUFF.32748.1
(457 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G53400.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 375 e-104
AT5G03190.1 | Symbols: CPUORF47 | conserved peptide upstream ope... 370 e-102
AT5G03190.2 | Symbols: CPUORF47 | conserved peptide upstream ope... 360 1e-99
AT5G01710.1 | Symbols: | methyltransferases | chr5:263709-26525... 131 1e-30
AT1G58120.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 75 1e-13
>AT3G53400.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: conserved peptide upstream open reading frame 47
(TAIR:AT5G03190.1); Has 285 Blast hits to 285 proteins
in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 0; Plants - 279; Viruses - 0; Other Eukaryotes -
5 (source: NCBI BLink). | chr3:19798526-19799926 FORWARD
LENGTH=466
Length = 466
Score = 375 bits (964), Expect = e-104, Method: Compositional matrix adjust.
Identities = 215/481 (44%), Positives = 280/481 (58%), Gaps = 39/481 (8%)
Query: 1 METKTLKFQILNGSIARRVFLRSIVLASAITMVTLLR--------------APVIFDLGT 46
M + LK +++ S RRV LR++V+ A ++V++LR P D
Sbjct: 1 MGFEVLKLKLMPSSFRRRVILRALVIVCAFSVVSVLRNLAGAYEGETINNTQPCKVD--E 58
Query: 47 LVLNLGSENATAVSPGSYLFKTRF----WG---SGNCEKDFNLTVNVVNELMGKQFLVDC 99
+N S LF RF W S C+ + LT VV EL G L +
Sbjct: 59 CAVNFAFLGPFLFSGNGLLFSNRFLKPVWNYLESDKCKDNIYLTTQVVRELTGMNLLSN- 117
Query: 100 GAESLCIGEGSSMAVTAMKQLGFSSVRGVSTHRFFSLKQRKFVFELDYEDSSFDFVLSRD 159
+++LCIG S AV AM + G S V F+ K RKF EL YED+SF FV S D
Sbjct: 118 DSKALCIGRRSVSAVLAMNRQGISDVSVAYMPPVFAFKHRKFTSELHYEDASFGFVFSMD 177
Query: 160 MDKVSVPALLVLEAERVLKPGGIGALLVSST-GSHPNDLIRSATPXXXXXXXXXXXXXXX 218
++ V+VPA LV E ER+LKPGG GA+LV +T GS N+L+RS +P
Sbjct: 178 LESVAVPASLVYEIERILKPGGTGAMLVGTTSGSDSNELVRSVSPVSSLLKNSSVVHVAS 237
Query: 219 FSELNLVVFKKRSENVSSFYQ--HGLPADCPSLTLTKPLIDLMEPILGEKPPPESEIKIN 276
+ LVVFK+ E+ Q H PADC S+ +P I L+EP+L EK + E +I+
Sbjct: 238 LGKQVLVVFKRDGEDSFRLDQTHHDFPADCSSVLNNRPYIGLLEPLLDEKRS-DFERRIH 296
Query: 277 YLPKFVDVSTRKRLVYIDIGVGEFLNANVSDWFLPSYPIDQKAFNVYFVHYNTSMLLSYV 336
YLP+F+D+S+RKRLVYIDIG + + A S+WF PSYPID+KAFN YFVH+NTS+L SYV
Sbjct: 297 YLPEFIDLSSRKRLVYIDIGAVDHVKAR-SNWFFPSYPIDRKAFNSYFVHHNTSILTSYV 355
Query: 337 KRPGVTFVYHPGLAGEVSEAYATDGDLDPYLGEDEFDFVAWFKETVQYADFVVLKMNAGK 396
K PGVTF+YHPGLA + T +P++ +D FDF+AWFKET +ADFVVLKMN
Sbjct: 356 KSPGVTFIYHPGLAATKTTIANTGDHEEPFVEDDSFDFLAWFKETASFADFVVLKMNTSD 415
Query: 397 VELKFLSDIFETGAICFVDELFLRCPEKGNYDKTMTKKESCIDLYKGLRSNGVYVHQWWG 456
ELKFLS++ +TGAIC VDELFL C T C + K LR++GV+VHQWW
Sbjct: 416 TELKFLSELIKTGAICSVDELFLHC----------TGYSDCTGIIKSLRNSGVFVHQWWE 465
Query: 457 D 457
D
Sbjct: 466 D 466
>AT5G03190.1 | Symbols: CPUORF47 | conserved peptide upstream open
reading frame 47 | chr5:758793-760148 FORWARD LENGTH=451
Length = 451
Score = 370 bits (949), Expect = e-102, Method: Compositional matrix adjust.
Identities = 216/474 (45%), Positives = 292/474 (61%), Gaps = 40/474 (8%)
Query: 1 METKTLKFQILNGSIARRVFLRSIVLASAITMVTLL----RAPVIFDLGTL-VLNLGSEN 55
M K LK I GS R R++++ASA+++V LL + V D G + +++LG
Sbjct: 1 MGIKILKLNIFRGSTNRHALFRALIIASALSVVPLLELTAKNHVFGDHGAVDLVDLGR-- 58
Query: 56 ATAVSPGSYLFKTR----FWGSGNCEKDFNLTV-NVVNELMGKQFLVDCGAESLCIGEGS 110
V PG LF + FWG EK + + ++V+ELMG + L+ A+ LCIG+GS
Sbjct: 59 --FVIPGPKLFFHKLVQPFWGKIETEKYPQVVIADLVDELMGLK-LLHYDAKILCIGQGS 115
Query: 111 SMAVTAMKQLGFSSVRGVSTHRFFSLKQRKFVFELDYE-DSSFDFVLSRDMDKVSVPALL 169
AV+ K++GFS V+GV H FS RK V EL+ D SFDFVL D+D V+ PALL
Sbjct: 116 DSAVSGFKEMGFSVVQGVPKHPLFSFFSRKHVNELELSGDKSFDFVLCGDVDHVASPALL 175
Query: 170 VLEAERVLKPGGIGALLVSSTGSHPNDLIRSATPXXXXXXXXXXXXXXXFSELNLVVFKK 229
VLE ERVLKPGG GA+LVS+ N L++S T F+ ++VFK+
Sbjct: 176 VLEMERVLKPGGTGAVLVSTNA---NRLVKSVTSGLKQSEIVRVNNLDKFT---VIVFKR 229
Query: 230 R-SENVSSFYQHGLPADCPSLTLTKPLIDLMEPILGEKPP--PESEIKINYLPKFVDVST 286
+E + LP DC S+ +P + MEP+L +KP P+S + YLPKF+D+S
Sbjct: 230 NVTETAYCIGKSQLPRDCKSVDTNRPYTEFMEPLLEQKPADFPKS---VAYLPKFLDLSL 286
Query: 287 RKRLVYIDIGVGEFLNANVS-DWFLPSYPIDQKAFNVYFVHYNTSMLLSYVKRPGVTFVY 345
+K LVYIDIG E ++AN++ +WF P YP+D KAFNVYFV +NTS++LSYVK+PGVTFVY
Sbjct: 287 KKSLVYIDIGAAEHMDANLTPNWFFPLYPLDSKAFNVYFVDHNTSVMLSYVKKPGVTFVY 346
Query: 346 HPGLAGEVSEAYATD--GDLDPYLGEDEFDFVAWFKETVQYADFVVLKMNAGKVELKFLS 403
HP LA S L+P+ ++ FDF+AWF+ET +YADFVVLKMN +VE+KFL+
Sbjct: 347 HPDLAENNSTGKKITPLEQLEPFPEDERFDFLAWFEETAKYADFVVLKMNTNQVEMKFLT 406
Query: 404 DIFETGAICFVDELFLRCPEKGNYDKTMTKKESCIDLYKGLRSNGVYVHQWWGD 457
+ ETG IC+VDELFLRC K CI++ + LR+ GV+VHQWW D
Sbjct: 407 VLLETGVICYVDELFLRCS---------NHKSDCINMLQTLRARGVFVHQWWED 451
>AT5G03190.2 | Symbols: CPUORF47 | conserved peptide upstream open
reading frame 47 | chr5:758539-760148 FORWARD LENGTH=475
Length = 475
Score = 360 bits (924), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 210/463 (45%), Positives = 288/463 (62%), Gaps = 40/463 (8%)
Query: 12 NGSIARRVFLRSIVLASAITMVTLL----RAPVIFDLGTL-VLNLGSENATAVSPGSYLF 66
N + +R R++++ASA+++V LL + V D G + +++LG V PG LF
Sbjct: 36 NHTSSRHALFRALIIASALSVVPLLELTAKNHVFGDHGAVDLVDLGR----FVIPGPKLF 91
Query: 67 KTR----FWGSGNCEKDFNLTV-NVVNELMGKQFLVDCGAESLCIGEGSSMAVTAMKQLG 121
+ FWG EK + + ++V+ELMG + L+ A+ LCIG+GS AV+ K++G
Sbjct: 92 FHKLVQPFWGKIETEKYPQVVIADLVDELMGLK-LLHYDAKILCIGQGSDSAVSGFKEMG 150
Query: 122 FSSVRGVSTHRFFSLKQRKFVFELDYE-DSSFDFVLSRDMDKVSVPALLVLEAERVLKPG 180
FS V+GV H FS RK V EL+ D SFDFVL D+D V+ PALLVLE ERVLKPG
Sbjct: 151 FSVVQGVPKHPLFSFFSRKHVNELELSGDKSFDFVLCGDVDHVASPALLVLEMERVLKPG 210
Query: 181 GIGALLVSSTGSHPNDLIRSATPXXXXXXXXXXXXXXXFSELNLVVFKKR-SENVSSFYQ 239
G GA+LVS+ N L++S T F+ ++VFK+ +E +
Sbjct: 211 GTGAVLVSTNA---NRLVKSVTSGLKQSEIVRVNNLDKFT---VIVFKRNVTETAYCIGK 264
Query: 240 HGLPADCPSLTLTKPLIDLMEPILGEKPP--PESEIKINYLPKFVDVSTRKRLVYIDIGV 297
LP DC S+ +P + MEP+L +KP P+S + YLPKF+D+S +K LVYIDIG
Sbjct: 265 SQLPRDCKSVDTNRPYTEFMEPLLEQKPADFPKS---VAYLPKFLDLSLKKSLVYIDIGA 321
Query: 298 GEFLNANVS-DWFLPSYPIDQKAFNVYFVHYNTSMLLSYVKRPGVTFVYHPGLAGEVSEA 356
E ++AN++ +WF P YP+D KAFNVYFV +NTS++LSYVK+PGVTFVYHP LA S
Sbjct: 322 AEHMDANLTPNWFFPLYPLDSKAFNVYFVDHNTSVMLSYVKKPGVTFVYHPDLAENNSTG 381
Query: 357 YATDG--DLDPYLGEDEFDFVAWFKETVQYADFVVLKMNAGKVELKFLSDIFETGAICFV 414
L+P+ ++ FDF+AWF+ET +YADFVVLKMN +VE+KFL+ + ETG IC+V
Sbjct: 382 KKITPLEQLEPFPEDERFDFLAWFEETAKYADFVVLKMNTNQVEMKFLTVLLETGVICYV 441
Query: 415 DELFLRCPEKGNYDKTMTKKESCIDLYKGLRSNGVYVHQWWGD 457
DELFLRC K CI++ + LR+ GV+VHQWW D
Sbjct: 442 DELFLRCS---------NHKSDCINMLQTLRARGVFVHQWWED 475
>AT5G01710.1 | Symbols: | methyltransferases | chr5:263709-265250
REVERSE LENGTH=513
Length = 513
Score = 131 bits (329), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 117/450 (26%), Positives = 188/450 (41%), Gaps = 62/450 (13%)
Query: 58 AVSPGSYLFKTRFWGSGNCEKDFNLTVNVVNELMGKQFLVDCGAESLCIGEGSSMAVTAM 117
+ SPG L+ TR W K ++ +L+ +L +++LC+ V ++
Sbjct: 74 STSPGDDLYTTRDW-----IKSVQFYSSIFQDLIADGYL-SPESKTLCVETAIGQEVHSL 127
Query: 118 KQLGFSSVRGVSTHRFFSLKQRKFVFELDYEDSSFDFVLSR-DMDKVSVPAL-LVLEAER 175
+++G + G+S L R + +ED++FDFV S D S+ L E R
Sbjct: 128 REIGVKNSVGISKKASRPLVVRGEGHAIPFEDNAFDFVFSGGDRLGKSLKQLEFADEITR 187
Query: 176 VLKPGGIGALLVSSTGSHP-NDLIRSATPXXXXXXXXXXXXXXXFSELNLVVFKKRSE-- 232
LKP G + V +T ++ N + + V +K SE
Sbjct: 188 TLKPEGFAVVHVGATDTYSFNSFLDLFNSCRLVKMRDIDGFDPSMPHIREFVIQKYSEID 247
Query: 233 --NVSSFYQHGLPADCPSLTLTKPLIDLMEPILGEKP-PPESEIK-----INYLPKFVDV 284
C + LI EP++ E+P P +K I Y+P VD+
Sbjct: 248 GGGHHHRGGGDSGGKCWIPGYKRDLIRDAEPLIQEEPLKPWITLKRNIKNIKYVPSMVDI 307
Query: 285 STRKRLVYIDIGVGEFLNANVSDWFLPSYPIDQKAFNVYFVHYNTSMLLSY-VKR----- 338
+ R VY+D+G + +++ WF YP K F+V+ + + + Y +K+
Sbjct: 308 RFKSRYVYVDVGARSY-GSSIGSWFKKEYPKQNKTFDVFAIEADKAFHEEYKIKKKVNLL 366
Query: 339 PGVTFVYHPGLAGEVS-------EAYATD---GDLDPYLGEDE------------FDFVA 376
P +V + L+ E++ EA A G + P D FDF
Sbjct: 367 PYAAWVRNETLSFEINHDPGKEVEAKAKGRGMGRIQPVKKSDSSDLAGEVNLIQGFDFAD 426
Query: 377 WFKETVQYADFVVLKMNAGKVELKFLSDIFETGAICFVDELFLRC-----------PEKG 425
W K++V+ DFVV+KM+ E + + +TGAIC +DELFL C
Sbjct: 427 WLKKSVRERDFVVMKMDVEGTEFDLIPRLIKTGAICLIDELFLECHYNRWQRCCPGQRSQ 486
Query: 426 NYDKTMTKKESCIDLYKGLRSNGVYVHQWW 455
Y+KT + C++L+ LR GV VHQWW
Sbjct: 487 KYNKTYNQ---CLELFNSLRQRGVLVHQWW 513
>AT1G58120.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: methyltransferases (TAIR:AT5G01710.1); Has 93 Blast
hits to 93 proteins in 15 species: Archae - 0; Bacteria
- 0; Metazoa - 1; Fungi - 0; Plants - 92; Viruses - 0;
Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:21520962-21522224 FORWARD LENGTH=420
Length = 420
Score = 74.7 bits (182), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 90/215 (41%), Gaps = 57/215 (26%)
Query: 259 MEPILGEKPPPESEI------KINYLPKFV----DVSTRKRLVYIDIGVGEFLNANVSDW 308
+E +L E P S + YLP + D+ + R V+ID+G G+ ++ +W
Sbjct: 246 LEDVLLEPPRAASRKSRTYFKRTRYLPDLMGDNLDLESYSRRVFIDVGNGK--GSSGMEW 303
Query: 309 FLPSYPIDQKAFNVYFVHYNTSMLLSYVKRPGVTFVYHPGLAGEVSEAYATDGDLDPYLG 368
F+ +YP + F +Y + + E+S L
Sbjct: 304 FVENYPTRNQKFEMYKIE---------------------TVNDEMS------------LE 330
Query: 369 EDEFDFVAWFKETVQYADFVVLKMNAGKVELKFLSDIFETGAICFVDELFLRCPEKGN-- 426
++ W KE V+ ++VV+K A VE ++ + +I VDELFL C KG
Sbjct: 331 SEKMGMTEWLKENVKEEEYVVMKAEAEMVE-----EMMRSKSIKMVDELFLECKPKGLGL 385
Query: 427 YDKTMTKKE-----SCIDLYKGLRSNGVYVHQWWG 456
+ M K C+ LY LR GV VHQWWG
Sbjct: 386 RGRKMQSKSGRAYWECLALYGKLRDEGVAVHQWWG 420