Miyakogusa Predicted Gene
- Lj0g3v0128619.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0128619.1 Non Characterized Hit- tr|I1NI37|I1NI37_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max
PE=4,42.21,3e-18,seg,NULL; SUBFAMILY NOT NAMED,NULL;
THIOREDOXIN,NULL,CUFF.7756.1
(680 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator c...    305  6e-83
AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    211  2e-54
AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     96  7e-20
>AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator
complex subunit Med28 (InterPro:IPR021640); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G01670.1); Has 122 Blast hits to 112 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:252033-255246 FORWARD
LENGTH=740
Length = 740
Score = 305 bits (781), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 219/708 (30%), Positives = 356/708 (50%), Gaps = 79/708 (11%)
Query: 29 LTLSDDQ--ILEEIYSTHVHSDAKFDVNSLFSVVDNIVERSTRIADNVVQGSHGSPEQTD 86
L +S D+ +L+ I TH + V L S+V++I++R+T D+ + P T+
Sbjct: 34 LAMSSDESMMLKLIQQTHSPDAREVQVRGLLSLVEDILDRAT--LDSEDTNASMLPLPTE 91
Query: 87 IKTPSANFTSPLCT----LKQINSELSCKPPGEEIAHETTLAILNKLSTYSWVAKPLLTL 142
K ++ S L + + ++ E++ K +HE T+++ LS++ W K +LTL
Sbjct: 92 DKLMQSSMMSVLDSVSYAIDRVACEIAYKSLTGSDSHEITMSVFEHLSSFQWDGKLVLTL 151
Query: 143 GAFALEYGEFWFLSLHQQTEPLAKSLAIIKRVPELTKPSSLKTHRNAILEINNLVTATWQ 202
AFAL YGEFW L LAKSLA++K VP + T + +N+L+
Sbjct: 152 AAFALNYGEFWLLVQFYSKNQLAKSLAMLKLVPVQNR----VTLESVSQGLNDLIREMKS 207
Query: 203 VIKLIFELDNLNLTYDEKDVPSLELALEQIPVDAYWXXXXXXXXXXQIDLLTTNSDKKQ- 261
V + EL L Y DVP L L IP+ YW QI+++T +
Sbjct: 208 VTACVVELSELPDRYITPDVPQLSRILSTIPIAVYWTIRSVIACISQINMITAMGHEMMN 267
Query: 262 ------ELSQFGQKINIILSKLRKYKQQCEKEIEE---AEYNKILVKLFQTP-TEVIEVL 311
E S K+ I L + + C + IE+ +E K+L LF T + +++L
Sbjct: 268 TQMDLWETSMLANKLKNIHDHLAETLRLCYRHIEKQRSSESLKVLHSLFDTTHIDNMKIL 327
Query: 312 KVLFFWKDVPK---TPIYDGATKTLVSIEALKKKDVFLFFSTLDITIEEISIFNPVYDHI 368
L PK TP+ DG TK V ++ L++K V L S L+I +E+SIF +Y
Sbjct: 328 TALVH----PKPHITPLQDGLTKRKVHLDVLRRKTVLLLISDLNILQDELSIFEQIYTES 383
Query: 369 T--------KSKKPHKIVWIPIVE-----EWNDQLKNKFESLKAKMPWYVLQHFAPIKG- 414
KS P+++VW+P+V+ E + L+ KFE L+ MPWY + I+
Sbjct: 384 RRNLVGVDGKSHMPYEVVWVPVVDPIEDFERSPILQKKFEDLRDPMPWYSVDSPKLIERH 443
Query: 415 -IKYIKEKWQFKKQPMVVVLSPQGKVQHTNAFHMIQVWGIKGFPFTQDIEVNIGKQIIWI 473
+++++ +W F +P++VV+ PQG NA HMI +WG + FPFT+ E + ++ +
Sbjct: 444 VVEFMRGRWHFMNKPILVVIDPQGNEASLNALHMIWIWGTEAFPFTRSREEELWRRETFS 503
Query: 474 DSLLVDFGVE--INTWVKEEKYVFIYGGKNKDWIQEFNKLASTFAIELNKEAKIP-IGLF 530
+L+VD G++ I W+K + Y+F+YGG + DWI+ F A A + N ++ +G
Sbjct: 504 LNLIVD-GIDSVIFNWIKPDNYIFLYGGDDLDWIRRFTMAAKATAKDSNVNLEMAYVGKR 562
Query: 531 N----------LESLQSNIITR----------FWTQVEGLFVTKINKTK----DTVTQQV 566
N E ++S ++ FWT++E + +KI K D V Q +
Sbjct: 563 NHSHREQIRRISEVIRSENLSHSWAEPALMWFFWTRLESMLYSKIQLGKADDHDDVMQGI 622
Query: 567 EKLLSYKGETGWALLIKGPFVVAVGHGTTVLKTVAEFEK-WKELVIKKGFEFAFKE-YLD 624
+K+LSY GWALL KGP +V + HG + +T++ +++ WK V KG+ A + + D
Sbjct: 623 KKILSYDKLGGWALLSKGPEIVMIAHG-AIERTMSVYDRTWKTHVPTKGYTKAMSDHHHD 681
Query: 625 KV-SSSLHICSH--LQIPNINGKIPDTIECPECHRTMEVFISYKCCHN 669
+V + C H I +G+IP+ + C EC R ME ++S+ CCH+
Sbjct: 682 EVLRETGKPCGHFDFHITARSGRIPEKMNCFECQRPMEKYMSFSCCHD 729
>AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 121 Blast hits to 111 proteins
in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:247288-250261 FORWARD
LENGTH=822
Length = 822
Score = 211 bits (536), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 203/727 (27%), Positives = 337/727 (46%), Gaps = 96/727 (13%)
Query: 13 FGGGNKEQPNKAAHNPLTLSDDQIL-EEIYSTHVHSDAKFDVNSLFSVVDNIVERSTRIA 71
FG G K+ ++ +LSDD+++ + + TH FDV SL SVV++I +
Sbjct: 118 FGPGKKQAFHRNGRPMFSLSDDRVMADRVLKTHSPDMIFFDVTSLLSVVNDIFKSHVPSI 177
Query: 72 DNVVQGSHGSPEQTDIKTPSANFTSPLC---TLKQINSELSCK--PPGE----------- 115
D+ +P+ + + A+ TS + QI+ E+ CK GE
Sbjct: 178 DS------SAPKPSLVFKDYADHTSFETFADLIDQISCEIDCKCLHGGESHGMMTSGLHL 231
Query: 116 EIAHETTLAILNKLSTYSWVAKPLLTLGAFALEYGEFWFLSLHQQTEPLAKSLAIIKRVP 175
+ + TT ++L+ +S Y W AK +L L A A++YG F L+ T L KSLA+IK++P
Sbjct: 232 DSRNTTTFSVLSLVSKYRWDAKLVLVLSALAVKYGVFLLLAETHATNQLTKSLALIKQLP 291
Query: 176 EL-TKPSSL--KTHRNAILEINNLVTATWQVIKLIFELDNLNLTYDEKDVPSLELALEQI 232
+ ++ ++L + + IL + ++V T +I I++L ++T D I
Sbjct: 292 SIFSRQNALHQRLDKTRIL-MQDMVDLTTTIID-IYQLPPNHITAAFTD---------HI 340
Query: 233 PVDAYWXXXXXXXXXXQIDLLTTNSDKK----------QELSQFGQKINI-ILSKLRKYK 281
P YW I + + E S+ +KIN +L + +K K
Sbjct: 341 PTAVYWIVRCVLICVSHISGASGFKQDQIMSFMEVSEIHENSERLRKINAYLLEQFKKSK 400
Query: 282 QQCEKEIEEAEYNKILVKLFQTPTEVIEVLKVLFFWKDVPKTPIYDGA--TKTLVSIEAL 339
E+ I E EY + L++ F T V V +L + P +Y GA +K V I L
Sbjct: 401 MTIEEGIIEEEYQE-LIQTFTTIIHVDVVPPLLRLLR--PIDFLYHGAGVSKRRVGINVL 457
Query: 340 KKKDVFLFFSTLDITIEEISIFNPVYDHITKSKKPHKIVWIPIVEEWNDQLKNKFESLKA 399
+K V L S L+ +E+ I +Y ++ +I+W+P+ + W + KFE+L
Sbjct: 458 TQKHVLLLISDLENIEKELYILESLY--TEAWQQSFEILWVPVQDFWTEADDAKFEALHM 515
Query: 400 KMPWYVLQHFAPIK--GIKYIKEKWQFKKQPMVVVLSPQGKVQHTNAFHMIQVWGIKGFP 457
M WYVL ++ I++++E W FK +P++V L P+G+V TNAF M+ +W P
Sbjct: 516 NMRWYVLGEPRKLRRAAIRFVREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQPFAHP 575
Query: 458 FTQDIEVNIGKQIIWIDSLLVDFGVEINTW--VKEEKYVFIYGGKNKDWIQEFNKLASTF 515
FT E ++ + W L+D G + ++ + + KY+ +YGG++ WI+ F L
Sbjct: 576 FTTARERDLWSEQEWNLEFLID-GTDPHSLNQLVDGKYICLYGGEDMQWIKNFTSLWRNV 634
Query: 516 AIELNKEAKIP--------------IGLFNLESLQSNI-----ITRFWTQVEGLFVTKIN 556
A N + ++ I E+L + I FWT+VE ++ +K
Sbjct: 635 AKAANIQLEMVYVGKRNPKNGIQPIINTIREENLSHTLPDLFQIWFFWTRVESMWESKQR 694
Query: 557 ---------------KTKDTVTQQVEKLLSYKGETGWALLI-KGPFVVAVGHGTTVLKTV 600
+ KD V Q+V +L Y GE L+ K ++ G + +
Sbjct: 695 MLKAHGIKGREGFKEEEKDLVLQEVVAMLGYGGEGDGWGLVSKASDMMVRAKGNLFSRGL 754
Query: 601 AEFEKWKELVIKKGFEFAFKEYLDKVSSSLHICSHLQIPNINGKIPDTIECPECHRTMEV 660
AEF +W+ + KGF A ++L + H C+ +P G IP+ +EC EC RTME
Sbjct: 755 AEFNEWEVNIPTKGFLTALNDHL-LMRLPPHHCTRFMLPETAGIIPNEVECTECRRTMEK 813
Query: 661 FISYKCC 667
+ Y+CC
Sbjct: 814 YYLYQCC 820
>AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 208 Blast hits to 125 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:25417542-25420099 REVERSE
LENGTH=576
Length = 576
Score = 95.9 bits (237), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 151/330 (45%), Gaps = 32/330 (9%)
Query: 360 IFNPVYDHI--TKSKKPHKIVWIPI--VEEWNDQLKNKFESLKAKMPWYVLQH--FAPIK 413
+ +YDH T +++ ++I+W+PI ++W D+ K F+ +PW ++
Sbjct: 255 LLQQLYDHPSNTNTEQNYEIIWVPIPSSQKWTDEEKEIFDFYSNSLPWISVRQPWLMSST 314
Query: 414 GIKYIKEKWQFK-KQPMVVVLSPQGKVQHTNAFHMIQVWGIKGFPFTQDIEVNIGKQIIW 472
+ + K++W +K + M+VV+ G+ + NA M+ +WG+K +PF+ E + K+ W
Sbjct: 315 ILNFFKQEWHYKDNEAMLVVIDSNGRFVNMNAMDMVLIWGVKAYPFSVSREDELWKEHGW 374
Query: 473 IDSLLVDFGVEINTWVKEEKYVFIYGGKNKDWIQEFNKLAST-----FAIE---LNKEAK 524
+LL+D G+ E + + I+G +N DWI EF LA F +E L+ + +
Sbjct: 375 SINLLLD-GIHPTF---EGREICIFGSENLDWIDEFVSLARKIQNLGFQLELIYLSNQRR 430
Query: 525 IPIGLFNLESLQSNIITR-FWTQVEGLFVTKINKT------KDTVTQQVEKLL--SYKGE 575
+ L S + + FW ++E + +K+ + D V ++V LL Y
Sbjct: 431 DERAMEESSILFSPTLQQLFWLRLESIERSKLKRIVIEPSKPDRVFEEVRNLLDFDYGKH 490
Query: 576 TGWALLIKGPFVVAVGHGTTVLKTVAEFEKWKELVIKKGFEFAFKEYLDKVSSSLHICSH 635
GW ++ G V G + + + + +W E GF A + +K H
Sbjct: 491 RGWGIIGNGSTAETV-DGEKMTERMRKIVRWGEYAKGLGFTEAIEIAAEKPCELSHTAV- 548
Query: 636 LQIPNINGKIPDTIECPECHRTMEVFISYK 665
+P + C +C M+ F++Y+
Sbjct: 549 --VPFEEALTMKVVTCEKCKWPMKRFVAYQ 576
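
Note on reading the per-hit statistics: the "Score" and "Identities" lines above can be pulled out of a report like this one with two regular expressions. The following is a minimal Python sketch, not part of the BLAST output; the names REPORT_LINES, SCORE_RE, FRACTION_RE and parse_stats are hypothetical and introduced only for illustration. It also checks an observation about this report: every printed percentage equals the fraction rounded down (for example 219/708 is 30.9%, printed as 30%). That rounding is inferred from the numbers shown here, not taken from BLAST documentation.

```python
import re

# Statistics lines copied verbatim from the three alignments in this report.
REPORT_LINES = [
    "Score = 305 bits (781), Expect = 6e-83, Method: Compositional matrix adjust.",
    "Identities = 219/708 (30%), Positives = 356/708 (50%), Gaps = 79/708 (11%)",
    "Score = 211 bits (536), Expect = 2e-54, Method: Compositional matrix adjust.",
    "Identities = 203/727 (27%), Positives = 337/727 (46%), Gaps = 96/727 (13%)",
    "Score = 95.9 bits (237), Expect = 7e-20, Method: Compositional matrix adjust.",
    "Identities = 80/330 (24%), Positives = 151/330 (45%), Gaps = 32/330 (9%)",
]

# Hypothetical patterns written for the line layout seen above.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(lines):
    """Return one dict per alignment: bit score, raw score, E-value, fractions."""
    hits = []
    for line in lines:
        score = SCORE_RE.search(line)
        if score:
            # A "Score" line opens a new alignment record.
            hits.append({
                "bits": float(score.group(1)),
                "raw": int(score.group(2)),
                "evalue": float(score.group(3)),
            })
            continue
        if not hits:
            continue  # ignore fraction lines that precede any "Score" line
        for name, num, den, pct in FRACTION_RE.findall(line):
            hits[-1][name.lower()] = (int(num), int(den), int(pct))
    return hits


hits = parse_stats(REPORT_LINES)
for hit in hits:
    for key in ("identities", "positives", "gaps"):
        num, den, printed = hit[key]
        # Integer division rounds down; compare with the printed percentage.
        print(key, f"{num}/{den}", "printed:", printed, "floor:", num * 100 // den)
```

Because each record is keyed by the "Score" line that precedes it, the same loop can be run over the full report text split into lines; lines that match neither pattern are simply skipped.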