Miyakogusa Predicted Gene
- Lj0g3v0104479.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0104479.1 Non Chatacterized Hit- tr|I1KD24|I1KD24_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,75.5,0,seg,NULL;
DUF4378,Domain of unknown function DUF4378; VARLMGL,NULL; SUBFAMILY
NOT NAMED,NULL; FAMILY,CUFF.5934.1
(1096 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G74160.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 672 0.0
AT1G18620.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 556 e-158
AT1G18620.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 529 e-150
AT3G02170.1 | Symbols: LNG2 | longifolia2 | chr3:396208-399393 R... 242 1e-63
AT5G15580.1 | Symbols: LNG1 | longifolia1 | chr5:5068495-5071930... 238 2e-62
AT3G63430.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 59 3e-08
>AT1G74160.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G18620.1); Has
3407 Blast hits to 2217 proteins in 314 species: Archae -
0; Bacteria - 264; Metazoa - 1296; Fungi - 318; Plants -
346; Viruses - 34; Other Eukaryotes - 1149 (source: NCBI
BLink). | chr1:27887164-27891151 FORWARD LENGTH=1025
Length = 1025
Score = 672 bits (1733), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/1117 (43%), Positives = 611/1117 (54%), Gaps = 116/1117 (10%)
Query: 1 MAAKLLHSLADDNPDLQKQIGCMTGIFQLFDRHPTLTARRSSSQKKFPYGNAYISGGSLD 60
MAAKLLHSLADD+ DLQKQIGCM GIFQ+FDRH LT RR K GN + +
Sbjct: 1 MAAKLLHSLADDSADLQKQIGCMNGIFQIFDRHHVLTGRR----KSLTLGNGNAININYE 56
Query: 61 RDP--NSRHHGHTAIDTSLNKVVTEKQRIXXXXXXXXXXXXXXXXX---XLDCKAEVDA- 114
RD T D+++ V EK+R+ + + DA
Sbjct: 57 RDSVDTIYQQKETFQDSNIGGNVKEKRRVSTESSRVSFSSSCSSSPSSSEFNRGVQPDAS 116
Query: 115 PFDRIIFPETPSRDTVMNQPTISSHLGRHSLDLRDVVKDSMYREARGLSLKT-TAKEEGI 173
+DR F E+P+ D M + SHLG LDLRDVV+DSMYREARGL KT +EE +
Sbjct: 117 AYDRANFQESPTSDPEMTEGNGFSHLG---LDLRDVVRDSMYREARGLLSKTPMTREEVV 173
Query: 174 HAMKHRDSPRPVLLSKSVDGAYGVGIDGKQSAPIDLKESIRVLAKLREAPWYYAETRELP 233
+ DSPRP YG+ KQS P+DL ES RVLA+LRE +Y E
Sbjct: 174 RQSRREDSPRP----------YGL----KQSTPMDLNESFRVLARLRETSQHYNEL---- 215
Query: 234 RSSHEAKDGHWHSIPDGRETSRLSFESRDTIKSTPKLKELPRLSLDSREGSWRAYSSDSK 293
++ R S +S DT+KS KLKELPRLSLDSRE + R S D K
Sbjct: 216 ---------------GMKDAPRYSVDSHDTLKSRQKLKELPRLSLDSRERATRNSSVDPK 260
Query: 294 HSQLQRNVNSGTGTSTSDEKFSSLQQSRPPSVVAKLMGLEALPDSSLGGD-NQSRSTET- 351
S+L + + +S+ S VVAKLMGLE LP S LG D +Q +T
Sbjct: 261 SSKLSESFSESCSSSSKKRPPS---------VVAKLMGLETLPGSPLGRDIHQFGLNKTN 311
Query: 352 FSARNNDQFPRSSSKNGFTRPLRVS-NSPKSSLKDPTSPRRKNPDVVMKPLSSSKFPIEP 410
S +N+D F RS + R +R S +SP+S KDP SPR +N D VMKPLS+++FP+EP
Sbjct: 312 ISDQNDDPFSRSLREKNLNRAIRFSPSSPRSLGKDPASPRWRNSDFVMKPLSNTRFPVEP 371
Query: 411 APWKQQDGNRRSQKLSLRPTKA-PLRTSDSFPSVYSEIEKRLKDLEFKQSGRDLRALKQI 469
APWK D NR QK + P KA P + P+VYSE+E+RL DLEFK SG+DLRALKQI
Sbjct: 372 APWKHADRNRVLQKQASMPVKAKPYEAPNFPPTVYSEMERRLNDLEFKHSGKDLRALKQI 431
Query: 470 LEAMQEKGLLESRKEEQGPNVDGNQSDHYESRATKLIQNSRSARQQNPQGNSFLSSTIKG 529
LE+MQ KG L++ K++Q N + E+ AT ++ S T
Sbjct: 432 LESMQSKGFLDTEKQQQSTNFAVQRDYERENSAT--------------SNHAMSSRTRVQ 477
Query: 530 SDSSARGFESPIVIMKPAKLVEKTGISASSVIPIGGFSGSHKLHNSGAYVDNQNNTSSTR 589
S SS + ++SPIVIMKPAKLVEK GI ASS+IPI +G K+ + ++S R
Sbjct: 478 SSSSNQVYQSPIVIMKPAKLVEKAGIPASSLIPIHSLTGIKKIRREKPDDKGTSASNSKR 537
Query: 590 AAKDQSPKNIRRXXXXXXXXXXXXXXXXXRSAQSQSRSNQLPKDNXXXXXXXXXXXXPRL 649
KD SP N RR RS S + Q+ K++ PRL
Sbjct: 538 VTKDCSPGN-RRAESCTSSFDKKSDSRNVRS--SSKKPQQVSKESASKSSGSVS---PRL 591
Query: 650 QQQKKLELEKXXXXXXXXXXXXXXNKPRRKSSGKKATESGSPGGKLRTKVPNS-QHCDEQ 708
QQ KKLE +K RK S ++ ES SPGG+ R K S Q D+Q
Sbjct: 592 QQ-KKLEYDKRSRPPTPPDSSKS-----RKPSNQQLVESTSPGGRRRPKGQKSLQQVDDQ 645
Query: 709 LSEISNDSLSVSCHGDELSLQSDSTAEVTSRFQSADIDDSRSQS-LKAIKQLVSEAVHEK 767
LS+ SN+S S HG + QS++ A +S + D +S S ++A K +VS + K
Sbjct: 646 LSQASNES-RTSSHG--ICTQSETEASACVE-KSTEADGGKSPSVIEAAKAVVSNLMQNK 701
Query: 768 STPRLDEDETIAELATDAPEHPSPVSVLDDSVYRDDLLSPVKQISEVPNADDAQETEENE 827
S+PR ED A L+ A EHPSP+SVLD S YR+ SPVK V A + +
Sbjct: 702 SSPRFSEDGLSANLSLVALEHPSPISVLDASTYRETEPSPVKTQGNV-----AHDFGDEN 756
Query: 828 AKDEWNPVVDSLSFNSTGSAEINRKKLQNIDHLVQKLRRLNSSHDEARIDYIASLCENTN 887
+D+WNP S+ S EINRKKLQN++HLVQKLRRLNSSHDEA DYIASLCEN +
Sbjct: 757 CEDQWNPAYSFSETTSSFSPEINRKKLQNVEHLVQKLRRLNSSHDEASQDYIASLCENAD 816
Query: 888 P--DHRYISEIXXXXXXXXXXXXXXXXTFQLHSSGHPINPELFLVLEQTKAS---NLLSK 942
P DHRYISEI TFQLH SGHPINPELF VLEQTK S +LL K
Sbjct: 817 PTTDHRYISEILLASGLLLRDLGSGLTTFQLHPSGHPINPELFFVLEQTKGSSTTHLLHK 876
Query: 943 EESNNGKITCLKLNAERFHRKLIFDAVNEILGAKLAS---SPEPWFQPNCDRLTKKSLSA 999
EES K+ L E+ +RKL+FD VNEIL KLAS + P + ++TKK++SA
Sbjct: 877 EES---KV----LKNEKLNRKLVFDLVNEILVEKLASVEATTNPLMKSYA-KVTKKAVSA 928
Query: 1000 QKLLKELCFEIEKVQAKKPGWCXX-XXXXXXXXXKSILWEDVMHEGSESWKNFNGEIPGF 1058
Q+LLKELC IE Q + KSIL EDV S +W +F+GE+ G
Sbjct: 929 QQLLKELCSAIETQQKQATKRSENFLLEEEDDFLKSILAEDVTIR-SGNWADFSGEMSGL 987
Query: 1059 VLDVERLIFKDLVGEIVIGEAASMRLKPSVRRRKLFG 1095
VLDVERL+FKDLV EIV E + ++ K S RRR LF
Sbjct: 988 VLDVERLVFKDLVNEIVHAETSRLQAK-SGRRRTLFA 1023
>AT1G18620.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G74160.1); Has
1987 Blast hits to 1263 proteins in 207 species: Archae -
0; Bacteria - 172; Metazoa - 665; Fungi - 149; Plants -
271; Viruses - 6; Other Eukaryotes - 724 (source: NCBI
BLink). | chr1:6410719-6414269 FORWARD LENGTH=978
Length = 978
Score = 556 bits (1434), Expect = e-158, Method: Compositional matrix adjust.
Identities = 447/1132 (39%), Positives = 587/1132 (51%), Gaps = 200/1132 (17%)
Query: 1 MAAKLLHSLADDNPDLQKQIGCMTGIFQLFDRHPTLTARRSSSQKKFPYGNAYISGGSLD 60
MAAKLLH+LAD+N DLQK+IGCM GIFQ+FDRH LT+RR K GNA+++ + +
Sbjct: 1 MAAKLLHTLADENSDLQKKIGCMNGIFQIFDRHHILTSRR----KSLTLGNAHVNSINFE 56
Query: 61 RDP--------------NSRHHGHTAIDTSLNKVVTEKQRIXXXXXXXXXXXXXXXXXXL 106
RD +S + L ++ TE R+
Sbjct: 57 RDSVDAICQQRSAFQCQDSNLVSSNGLSEKLTRLSTECSRVSFSSSCSSSSPLSSEVNR- 115
Query: 107 DCKAEVDAPFDRIIFPETPSRDTVMNQPTISSHLGRHSLDLRDVVKDSMYREARGLSLKT 166
+ + E+ A DR+IFPE+P+ D VM+Q T + R LDLRDVV+DSMYREARGLS
Sbjct: 116 EVQPEISAD-DRVIFPESPTSDPVMSQGTGA----RVGLDLRDVVRDSMYREARGLS--- 167
Query: 167 TAKEEGIHAMKHRDSPRPVLLSKSVDGAYGVGIDGKQSAPIDLKESIRVLAKLREAPWYY 226
+ + DSPRP YG+ KQS P+D ES R LAKLR
Sbjct: 168 ----DVCRQNRREDSPRP----------YGL----KQSRPVDFNESCRALAKLR------ 203
Query: 227 AETRELPRSSHEAKDGHWHSIPDGRETSRLSFESRDTIKSTPKLKELPRLSLDSREGSWR 286
++SH H+++ D ++TSR +SR KS KLKELPRLSLDSR+
Sbjct: 204 -------KTSH-----HYYNEVDMKDTSRYYVDSRGKSKSGKKLKELPRLSLDSRD---- 247
Query: 287 AYSSDSKHSQLQRNVNSGTGTSTSDEKFSSLQQ-----SRPPSVVAKLMGLEALPDSSLG 341
H L+ SG S S + SS+ + RPPSVVAKLMGLE LP S L
Sbjct: 248 -------HVDLK----SGNKLSESFSRSSSMNKVSGSPKRPPSVVAKLMGLETLPGSPL- 295
Query: 342 GDNQSRSTETFSARNNDQFPRSSSKNGFTRPLRVS-NSPKSSLKDPTS----PRRKNPDV 396
SR N+D F RS +N R LR S +SP+S KDP + PR ++ +
Sbjct: 296 ----SRDRFNMFDDNSDPFARSLRENSLNRSLRFSPSSPRSLGKDPAASSSSPRWRSSEF 351
Query: 397 VMKPLSSSKFPIEPAPWKQQDGNRRSQKLSLRPTKAPLRTSDSFPSVYSEIEKRLKDLEF 456
VMKPLSS ++PIEPAPWKQ + NR SQK + R K S+ +E +LKDLE
Sbjct: 352 VMKPLSSLRYPIEPAPWKQTERNRFSQKQACRSVK----------SLSQSMEGKLKDLEV 401
Query: 457 KQSGRDLRALKQILEAMQEKGLLESRKEEQGPNVDGNQSDHYESRATKLIQNSRSARQQN 516
K SG+DLRALK ILEAMQ KGL ++RK++Q N++ R +L ++ S
Sbjct: 402 KHSGKDLRALKDILEAMQSKGLFDTRKQQQCSNLEAQ-------RDYELADSATSKHDSI 454
Query: 517 PQGNSFLSSTIKGSDSSARGFESPIVIMKPAKLVEKTGISASSVIPIGGFSGSHKLHNSG 576
N + S ++G PIVIMKPA+LVEK+GI +SS+IPI SG +K
Sbjct: 455 DLRNPVIPSNMRG----------PIVIMKPARLVEKSGIPSSSLIPIHSLSGLNKTCREE 504
Query: 577 AYVDNQNNTSSTRAAKDQSPKNIRRXXXXXXXXXXXXXXXXXRSAQSQSRSNQLPKDNXX 636
V+ + +++S +A KD+SP N R +S+ S+Q+ K++
Sbjct: 505 P-VNVRRSSTSRKAVKDRSPGNQR---------AEPCISSDKKSSSRNVMSSQVYKESTS 554
Query: 637 XXXXXXXXXXPRLQQQKKLELEKXXXXXXXXXXXXXXNKPRRKSSGKKATESGSPGGKLR 696
+LQQ K + +K R++ S + + SPGG R
Sbjct: 555 KNSGPASS---KLQQMKP----EHDKRSRPPASPSDSSKLRKQISRQPVESTTSPGG--R 605
Query: 697 TKVPNSQHC----DEQLSEISNDSLSVSCHGDELSLQSDSTAEVTSRFQSADIDDSRSQS 752
P Q D QLS++SN S + + ++T + + +S +
Sbjct: 606 RSRPRDQRSLQQNDGQLSQMSNKSRT----------KIEATLSIENGGKSPSV------- 648
Query: 753 LKAIKQLVSEAVHEKSTPRLDEDETIAELATDAPEHPSPVSVLDDSVYRDDLLSPVK-QI 811
++A K +VS + KS+P ED + EHPSPVSVL+ +YR+ SPVK Q
Sbjct: 649 IEAAKAVVSNLIQNKSSPTFSED--------GSSEHPSPVSVLNAEIYREIEPSPVKIQA 700
Query: 812 SEVPNADDAQETEENEAKDEWNPVVDSLSFNSTGSAEINRKKLQNIDHLVQKLRRLNSSH 871
SE E+ +D+WNP ++ S E+NRKKLQN++HLVQKL+RLNSSH
Sbjct: 701 SEGSVNGSINSGVEHCEEDQWNPAYSFSKTTTSFSPEMNRKKLQNVEHLVQKLKRLNSSH 760
Query: 872 DEARIDYIASLCENTNP--DHRYISEIXXXXXXXXXXXXXXXXTFQLHSSGHPINPELFL 929
DE DYIASLCEN++P DHRYISEI TFQLH SGHPINPELFL
Sbjct: 761 DETSQDYIASLCENSDPDTDHRYISEILLASGLLLRDLGSGLTTFQLHPSGHPINPELFL 820
Query: 930 VLEQTKASNLLSKEESNNGKITCLKLNAERFHRKLIFDAVNEILGAKLA---SSPEPWFQ 986
V+EQTK C + E+ +RKL+FDAVNE+LG KLA S +PW +
Sbjct: 821 VIEQTKG---------------CSSSSNEKINRKLVFDAVNEMLGKKLAFVESYVDPWMK 865
Query: 987 PNCDRLTKKSLSAQKLLKELCFEIEKV--QAKKPGWCXXXXXXXXXXX----KSILWEDV 1040
R KK LSAQ LLKELC EIE + QAKK K IL ED
Sbjct: 866 QAKAR--KKVLSAQNLLKELCSEIEILQKQAKKRSENLLLLEEEEEEEEDFLKCILDED- 922
Query: 1041 MHEGSESWKNFNGEIPGFVLDVERLIFKDLVGEIVIGEAASMRLKPSVRRRK 1092
M SE W +F+ IPG VLD+ERL+FKDLV EIV GE RL+ + RR+K
Sbjct: 923 MAIQSEKWTDFDDAIPGLVLDMERLLFKDLVKEIVHGEID--RLQGNSRRQK 972
>AT1G18620.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G74160.1); Has 1906 Blast hits to 1265 proteins
in 203 species: Archae - 0; Bacteria - 130; Metazoa -
671; Fungi - 139; Plants - 265; Viruses - 11; Other
Eukaryotes - 690 (source: NCBI BLink). |
chr1:6410191-6414269 FORWARD LENGTH=1014
Length = 1014
Score = 529 bits (1363), Expect = e-150, Method: Compositional matrix adjust.
Identities = 432/1117 (38%), Positives = 568/1117 (50%), Gaps = 200/1117 (17%)
Query: 16 LQKQIGCMTGIFQLFDRHPTLTARRSSSQKKFPYGNAYISGGSLDRDP------------ 63
QK+IGCM GIFQ+FDRH LT+RR K GNA+++ + +RD
Sbjct: 52 FQKKIGCMNGIFQIFDRHHILTSRR----KSLTLGNAHVNSINFERDSVDAICQQRSAFQ 107
Query: 64 --NSRHHGHTAIDTSLNKVVTEKQRIXXXXXXXXXXXXXXXXXXLDCKAEVDAPFDRIIF 121
+S + L ++ TE R+ + + E+ A DR+IF
Sbjct: 108 CQDSNLVSSNGLSEKLTRLSTECSRVSFSSSCSSSSPLSSEVNR-EVQPEISAD-DRVIF 165
Query: 122 PETPSRDTVMNQPTISSHLGRHSLDLRDVVKDSMYREARGLSLKTTAKEEGIHAMKHRDS 181
PE+P+ D VM+Q T + R LDLRDVV+DSMYREARGLS + + DS
Sbjct: 166 PESPTSDPVMSQGTGA----RVGLDLRDVVRDSMYREARGLS-------DVCRQNRREDS 214
Query: 182 PRPVLLSKSVDGAYGVGIDGKQSAPIDLKESIRVLAKLREAPWYYAETRELPRSSHEAKD 241
PRP YG+ KQS P+D ES R LAKLR ++SH
Sbjct: 215 PRP----------YGL----KQSRPVDFNESCRALAKLR-------------KTSH---- 243
Query: 242 GHWHSIPDGRETSRLSFESRDTIKSTPKLKELPRLSLDSREGSWRAYSSDSKHSQLQRNV 301
H+++ D ++TSR +SR KS KLKELPRLSLDSR+ H L+
Sbjct: 244 -HYYNEVDMKDTSRYYVDSRGKSKSGKKLKELPRLSLDSRD-----------HVDLK--- 288
Query: 302 NSGTGTSTSDEKFSSLQQ-----SRPPSVVAKLMGLEALPDSSLGGDNQSRSTETFSARN 356
SG S S + SS+ + RPPSVVAKLMGLE LP S L SR N
Sbjct: 289 -SGNKLSESFSRSSSMNKVSGSPKRPPSVVAKLMGLETLPGSPL-----SRDRFNMFDDN 342
Query: 357 NDQFPRSSSKNGFTRPLRVS-NSPKSSLKDPTSPRRKNP----DVVMKPLSSSKFPIEPA 411
+D F RS +N R LR S +SP+S KDP + + VMKPLSS ++PIEPA
Sbjct: 343 SDPFARSLRENSLNRSLRFSPSSPRSLGKDPAASSSSPRWRSSEFVMKPLSSLRYPIEPA 402
Query: 412 PWKQQDGNRRSQKLSLRPTKAPLRTSDSFPSVYSEIEKRLKDLEFKQSGRDLRALKQILE 471
PWKQ + NR SQK + R K S+ +E +LKDLE K SG+DLRALK ILE
Sbjct: 403 PWKQTERNRFSQKQACRSVK----------SLSQSMEGKLKDLEVKHSGKDLRALKDILE 452
Query: 472 AMQEKGLLESRKEEQGPNVDGNQSDHYESRATKLIQNSRSARQQNPQGNSFLSSTIKGSD 531
AMQ KGL ++RK++Q N++ R +L ++ S N + S ++G
Sbjct: 453 AMQSKGLFDTRKQQQCSNLEAQ-------RDYELADSATSKHDSIDLRNPVIPSNMRG-- 503
Query: 532 SSARGFESPIVIMKPAKLVEKTGISASSVIPIGGFSGSHKLHNSGAYVDNQNNTSSTRAA 591
PIVIMKPA+LVEK+GI +SS+IPI SG +K V+ + +++S +A
Sbjct: 504 --------PIVIMKPARLVEKSGIPSSSLIPIHSLSGLNKTCREEP-VNVRRSSTSRKAV 554
Query: 592 KDQSPKNIRRXXXXXXXXXXXXXXXXXRSAQSQSRSNQLPKDNXXXXXXXXXXXXPRLQQ 651
KD+SP N R +S+ S+Q+ K++ +LQQ
Sbjct: 555 KDRSPGNQR---------AEPCISSDKKSSSRNVMSSQVYKESTSKNSGPASS---KLQQ 602
Query: 652 QKKLELEKXXXXXXXXXXXXXXNKPRRKSSGKKATESGSPGGKLRTKVPNSQHC----DE 707
K + +K R++ S + + SPGG R P Q D
Sbjct: 603 MKP----EHDKRSRPPASPSDSSKLRKQISRQPVESTTSPGG--RRSRPRDQRSLQQNDG 656
Query: 708 QLSEISNDSLSVSCHGDELSLQSDSTAEVTSRFQSADIDDSRSQSLKAIKQLVSEAVHEK 767
QLS++SN S + + ++T + + +S + ++A K +VS + K
Sbjct: 657 QLSQMSNKSRT----------KIEATLSIENGGKSPSV-------IEAAKAVVSNLIQNK 699
Query: 768 STPRLDEDETIAELATDAPEHPSPVSVLDDSVYRDDLLSPVK-QISEVPNADDAQETEEN 826
S+P ED + EHPSPVSVL+ +YR+ SPVK Q SE E+
Sbjct: 700 SSPTFSED--------GSSEHPSPVSVLNAEIYREIEPSPVKIQASEGSVNGSINSGVEH 751
Query: 827 EAKDEWNPVVDSLSFNSTGSAEINRKKLQNIDHLVQKLRRLNSSHDEARIDYIASLCENT 886
+D+WNP ++ S E+NRKKLQN++HLVQKL+RLNSSHDE DYIASLCEN+
Sbjct: 752 CEEDQWNPAYSFSKTTTSFSPEMNRKKLQNVEHLVQKLKRLNSSHDETSQDYIASLCENS 811
Query: 887 NP--DHRYISEIXXXXXXXXXXXXXXXXTFQLHSSGHPINPELFLVLEQTKASNLLSKEE 944
+P DHRYISEI TFQLH SGHPINPELFLV+EQTK
Sbjct: 812 DPDTDHRYISEILLASGLLLRDLGSGLTTFQLHPSGHPINPELFLVIEQTKG-------- 863
Query: 945 SNNGKITCLKLNAERFHRKLIFDAVNEILGAKLA---SSPEPWFQPNCDRLTKKSLSAQK 1001
C + E+ +RKL+FDAVNE+LG KLA S +PW + R KK LSAQ
Sbjct: 864 -------CSSSSNEKINRKLVFDAVNEMLGKKLAFVESYVDPWMKQAKAR--KKVLSAQN 914
Query: 1002 LLKELCFEIEKV--QAKKPGWCXXXXXXXXXXX----KSILWEDVMHEGSESWKNFNGEI 1055
LLKELC EIE + QAKK K IL ED M SE W +F+ I
Sbjct: 915 LLKELCSEIEILQKQAKKRSENLLLLEEEEEEEEDFLKCILDED-MAIQSEKWTDFDDAI 973
Query: 1056 PGFVLDVERLIFKDLVGEIVIGEAASMRLKPSVRRRK 1092
PG VLD+ERL+FKDLV EIV GE RL+ + RR+K
Sbjct: 974 PGLVLDMERLLFKDLVKEIVHGEID--RLQGNSRRQK 1008
>AT3G02170.1 | Symbols: LNG2 | longifolia2 | chr3:396208-399393
REVERSE LENGTH=905
Length = 905
Score = 242 bits (617), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 309/1105 (27%), Positives = 459/1105 (41%), Gaps = 230/1105 (20%)
Query: 1 MAAKLLHSLADDNPDLQKQIGCMTGIFQLFDRH---PTLTARRSSSQKKFPYGNAYISGG 57
M+AKLL++L+D+NP+L KQ GCM GIFQ+F R T ++K P G S G
Sbjct: 1 MSAKLLYNLSDENPNLNKQFGCMNGIFQVFYRQHCPATPVTVSGGAEKSLPPGERRGSVG 60
Query: 58 SLDRDPNSRHHGHTAIDTSLNKVVTEKQRIXXXXXXXXXXXXXXXXXXLDCKAEVDAPFD 117
+ + + + + + + FD
Sbjct: 61 ETNMESDKETERSSTKKKKSAAKEKHRVSFESSSRPSFSSSPRSSSFSSAEVSTTASQFD 120
Query: 118 RIIFPETPSRDTVMNQPTISSHLGRHSLDLRDVVKDSMYREARGLSLKTTAKEEGIHAMK 177
+ P + + QP + DL+++VK S+ RE R T EE
Sbjct: 121 Q------PGENLIREQPNGGLMM---PYDLKELVKGSINREIR------TRGEEASFT-- 163
Query: 178 HRDSPRPVLLSKSVDGAYGVGIDGKQSAPIDLKESIRVLAKLREAPWYYAETRELPRSSH 237
+Q PI + S+ +L + + R RSS+
Sbjct: 164 ------------------------QQQQPISARSSMLLLKE--------SSLRSPCRSSN 191
Query: 238 EAKDGHWHSIPDGRETSRLSFESRDT----IKSTPKLKELPRLSLDSREGSWRAYSSDSK 293
E +G ++ +E+ RLS++ R+ + KLKE PRLSLDSR S+R+ +D+
Sbjct: 192 EWNEGRGAAM-KFKESHRLSYDEREMRNNGFRVGSKLKETPRLSLDSRSNSFRSPRADAA 250
Query: 294 HSQLQRNVNSGTGTSTSDEKFSSLQQSRPPSVVAKLMGLEALPDSSLGGDNQSRSTETFS 353
S S +E + + SVVAKLMGLE + D+S D + R F
Sbjct: 251 RS------------SCPEEPATMTHRRSSSSVVAKLMGLEVIADNS---DTEQRRENRFC 295
Query: 354 ARNNDQFPRSSSKNGFTRPLRVSNSPKSSLKDPTSPRRKNPDVVMKPLSSSKFPIEPAPW 413
PR S+ T L+ S S S + P S ++SKFP+EPAPW
Sbjct: 296 -----DSPRPMSRVEPT-ALQRSRSVDSIKRIPAS-------------AASKFPMEPAPW 336
Query: 414 KQQDGNRRSQKLSLRPTKAPLRTSDSFPSVYSEIEKRLKDLEFKQSGRDLRALKQILEAM 473
KQ ++ DS +VY EI+KRL LEFK+SG+DLRALKQILEAM
Sbjct: 337 KQ------------------MKAGDSALTVYGEIQKRLTQLEFKKSGKDLRALKQILEAM 378
Query: 474 QEKGLLESRKEEQGPNVDGNQSDHYESRATKLIQNSRSARQQNPQGNSFLSSTIKGSDSS 533
++ L +D ++ D S T L+Q R P + + S
Sbjct: 379 EKTQQL----------IDESRDDGTLS-TTTLMQ-----RTHKP---------VSAATSP 413
Query: 534 ARGFESP-IVIMKPAKLVEKTGISASSVIP---IGGFSGSHKLHNS-----------GAY 578
AR F+S IV+MK A V + + + +P +G + K+ + G Y
Sbjct: 414 ARNFKSSSIVVMKSAAPVSTSPLPQNVTLPNVKVGNSRQTRKVTSGKQNAMDLTPRPGLY 473
Query: 579 VDNQNNTSSTRAAKDQSPKNIRRXXXXXXXXXXXXXXXXXRSAQSQSRSNQLPKDNXXXX 638
++T K SPK +R +S +SQ S
Sbjct: 474 KGQLDST------KSNSPKTVR-----SRQALAADAGSMTKSGRSQQHS----------- 511
Query: 639 XXXXXXXXPRLQQQKKLELEKXXXXXXXXXXXXXXNKPRRKSSGKKATESGSPGGKLRTK 698
PR Q KKL EK ++P ++ G++ TE SP K K
Sbjct: 512 ------VSPRTQP-KKLGFEKQTRPTTPK------SEPGKRQLGRQQTEVASPRRKQMIK 558
Query: 699 VPNS--QHCDEQLSEISNDSLSVSCHGDELSLQSDSTAEVTSRFQSADIDDSRSQSLKAI 756
P+S Q D++LS+ +D S+ + +SL S+ EVTSR + D Q
Sbjct: 559 -PHSTLQQPDDRLSDARSDLRSLRSDSN-ISLGSNVDIEVTSRHRLERNCDFPEQ--HTP 614
Query: 757 KQLVSEAVHEKSTPRLDEDETIAELATDAPEHPSPVSVLDDSVYRDDLLSPVKQISEVPN 816
KQ + ++ P L + E PSPVSVLD +D SPV++IS
Sbjct: 615 KQRSPDFGIKQDRPSLKPLKVTVE-------QPSPVSVLDAVFDEEDSPSPVRKISLSFK 667
Query: 817 ADDAQETEENEAKDEWNPVVDSLSFNSTGSAEINRKKLQ-NIDHLVQKLRRLNSSHDEAR 875
+DA +EE+E ++ S+ F + NR ++ + DH E
Sbjct: 668 EEDALRSEESEWINKPTSFCRSVPF-----PQSNRGPMKPSSDHF--------ECSPEEG 714
Query: 876 IDYIASLCENTNPDHRYISEIXXXXXXXXXXXXXXXXTFQLHSSGHPINPELFLVLEQTK 935
D+ + +H+YI EI +FQLH + PINP LF +LEQ K
Sbjct: 715 ADFKSG-------NHKYILEILLASGILRDLEYSMI-SFQLHQTRLPINPGLFFILEQNK 766
Query: 936 ASNLLSKEESNNGK-ITCLKLN-AERFHRKLIFDAVNEILGAKLAS--SPEPWFQPNCDR 991
ASN+ + + G+ + N E RKL+FD VNEIL K + +P N +
Sbjct: 767 ASNVTLPDNKHRGRGFRQQQTNPTETIRRKLVFDTVNEILARKFTAEGCIKPRLIANPLK 826
Query: 992 LTKKSLSAQKLLKELCFEIEKVQAKKPGWCXXXXXXXXXXXKSILWEDVMHEGSESWKNF 1051
+K ++LL+ LC EI+++Q + I+WED+ + S + K F
Sbjct: 827 KLEKISKEEQLLQTLCSEIDRLQQNNSNCILEDDE------EDIIWEDLQSQ-SMNLKEF 879
Query: 1052 NGEIPGFVLDVERLIFKDLVGEIVI 1076
GE PG VLD+ER+IF+DLV E+
Sbjct: 880 EGETPGIVLDIERMIFRDLVNEVCF 904
>AT5G15580.1 | Symbols: LNG1 | longifolia1 | chr5:5068495-5071930
REVERSE LENGTH=927
Length = 927
Score = 238 bits (606), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 310/1116 (27%), Positives = 463/1116 (41%), Gaps = 213/1116 (19%)
Query: 1 MAAKLLHSLADDNPDLQKQIGCMTGIFQLFDRHPTLTARRSSSQ-KKFPYGNAYISGGSL 59
M+AKLL++L+D+NP+L KQIGCM GIFQ+F R R + + K P G A + G
Sbjct: 1 MSAKLLYNLSDENPNLNKQIGCMNGIFQVFYRQHYPPRRVTGDELKSLPSGKASDNVG-- 58
Query: 60 DRDPNSRHHGHTAIDTSLNKVVTEKQRIXXXXXXXXXXXXXXXXXXLDCKAEVDAPFDRI 119
D N + K EKQR A++ +
Sbjct: 59 --DTNISADKKETEKSKKKKTAKEKQRGVSSESSSRLSFSSSPCSSSFSSADISTTASQF 116
Query: 120 IFPETPSRDTVMNQPTISS-HLGRHSL--DLRDVVKDSMYREARGLSLKTTAKEEGIHAM 176
P + + + +PT S G + D+R++V+ S+++E R T EE +
Sbjct: 117 EQPGLSNGENPVREPTNGSPRWGGLMMPSDIRELVRSSIHKETR------TRDEEALSQQ 170
Query: 177 KHRDSPRPVLLSKSVDGAYGVGIDGKQSAPI----DLKESIRVLAKLREAPWYYAETREL 232
LL K+S+P + E RV+ KL+++P + + RE
Sbjct: 171 PKSARANVSLL--------------KESSPSRNSNEWSEGRRVV-KLKDSPRFSYDERET 215
Query: 233 PRSSHEAKDGHWHSIPDGRETSRLSFESRDTIKSTPKLKELPRLSLDSREGSWR-AYSSD 291
++ + K+ TP+ LSLDSR S+R A SS
Sbjct: 216 RKTGAKLKE-------------------------TPR------LSLDSRSNSFRSARSSC 244
Query: 292 SKHSQLQRNVNSGTGTSTSDEKFSSLQQSRPPSVVAKLMGLEALPDSSLGGDNQSRSTET 351
S Q + +G +TS SVVAKLMGLE +PD + T
Sbjct: 245 SPEPQ---ELVTGHRRTTS-------------SVVAKLMGLEVIPDEPV----------T 278
Query: 352 FSARNNDQFPRSSSKNGFTRPLRVSNSPKSSLKDPTSPRRKNPDVVMKPLSSSKFPIEPA 411
R N R +SP+ + + +R +K + +KFP++ +
Sbjct: 279 IQNREN----------------RFCDSPRPTSRVEVDLQRSRGFDSIKKMMPAKFPMKAS 322
Query: 412 PWKQQDGNRRSQKLSLRPTKAPLRTSDSFPSVYSEIEKRLKDLEFKQSGRDLRALKQILE 471
PW Q DG + K+ P L +VY EI+KRL LEFK+S +DLRALKQILE
Sbjct: 323 PWAQVDGAKNQVKI---PDATTL-------TVYGEIQKRLSQLEFKKSEKDLRALKQILE 372
Query: 472 AMQEKGLLESRKEEQGPNVDGNQSDHYESRATKLIQNSRSARQQNPQGNSFLSSTIKGSD 531
AM++ L S+ + D N++ L ++ R P ++ +S++
Sbjct: 373 AMEKTQQLISKDD------DDNKT---------LCSSNFMQRNNQPIPSAINTSSMNFKS 417
Query: 532 SSARGFESPIVIMKPAK--LVEKTGISASSVIPIGGFSGSH-KLHN--SGAYVDNQNNTS 586
SS IV+MK A + + TGI+ S+ FS + L N G Q
Sbjct: 418 SS-------IVVMKAATAPVFKDTGIAGSA-----SFSPRNVALPNVKVGNLRQAQKVIP 465
Query: 587 STRAAKDQSPKNIRRXXXXXXXXXXXXXXXXXRSAQSQSRSNQLPKDNXXXXXXXXXXXX 646
++A D +P R R QS+S + K
Sbjct: 466 RKQSAMDVTP----RPGYYKGQTESTMKNTSTRPLQSKS---DMAKSGKIQKPSVSLRTP 518
Query: 647 PRLQQQKKLELEKXXXXXXXXXXXXXXNKPRRKSSGKKATESGSPGGKLRTKVPNSQHCD 706
P KKL EK NK +R+ ++ TES SP K K Q +
Sbjct: 519 P-----KKLGFEK---QSRPTSPKPELNKNQRQQLSRQQTESASPRRKPGIKSRGLQQSE 570
Query: 707 EQLSEISNDSLSVSCHGDELSLQSDSTAEVTSRFQSADIDDSRSQSLKAIKQLVSEAVHE 766
++LS+ S+D S+ + +SL S+ EVTSR+ D + + +
Sbjct: 571 DRLSDESSDLRSLRSDSN-VSLASNLDTEVTSRYNYERNSD-----------ITEQHTPK 618
Query: 767 KSTPRLDEDETIAELATDAPEHPSPVSVLDDSVYRDDLLSPVKQISEVPNADDAQETEEN 826
+ +P L L E PSPVSVLD + DD SPV++IS V DD +EE+
Sbjct: 619 QRSPDLGMRSLSKPLKVTV-EQPSPVSVLDVAFDEDDSPSPVRKISIVFKEDDNLSSEES 677
Query: 827 EAKDEWNPVVDSLSFNSTGSAEINRKKLQNIDHLVQKLRRLNSSHDEARIDYIASLCENT 886
++ N + S+ + + ++ L E ++ E
Sbjct: 678 HWMNKNNNLCRSIVWPESNTS-------------------LKQPDAELTEGFMEDDAEFK 718
Query: 887 NPDHRYISEIXXXXXXXXXXXXXXXXTFQLHSSGHPINPELFLVLEQTKASNLLSKEESN 946
N DH+YISEI + QLH + PINP LF VLEQ K SN+ ++ +
Sbjct: 719 NGDHKYISEI-MLASGLLRDIDYSMISIQLHQAHLPINPSLFFVLEQNKTSNVSLQDNKH 777
Query: 947 NGKITCLKLN---AERFHRKLIFDAVNEILGAKLAS-----SPEPWFQPNCDRLTKKSLS 998
G+ + ER RKLIFD +NEIL + A+ P + R +KS
Sbjct: 778 KGRGFGQQQTVNLVERSKRKLIFDTINEILAHRFAAEGCTKQPSITLSISTQRTHEKSSR 837
Query: 999 AQKLLKELCFEIEKVQAKKPGWCXXXXXXXXXXXKSILWEDVMHEGSESWKNFNGEIPGF 1058
++LL+ LC EI+++Q + ++WED+ G +WK GE PG
Sbjct: 838 GEELLQTLCSEIDRLQ-------DNSKCILDEDDEDLIWEDLQSHGM-NWKEIEGETPGL 889
Query: 1059 VLDVERLIFKDLVGEIVIGEAASMRLKPSVRRRKLF 1094
VLD+ERLIFKDL+GE+V E A+ S + R+LF
Sbjct: 890 VLDIERLIFKDLIGEVVTSEFAAFPRMLSGQPRQLF 925
>AT3G63430.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G74160.1); Has
135 Blast hits to 119 proteins in 22 species: Archae - 0;
Bacteria - 2; Metazoa - 3; Fungi - 3; Plants - 119;
Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). |
chr3:23420414-23422201 FORWARD LENGTH=540
Length = 540
Score = 58.5 bits (140), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 79/163 (48%), Gaps = 21/163 (12%)
Query: 932 EQTKASNLLSKEESNNGKITCLKLNAERFHRKLIFDAVNEILGAKLASSPEPWFQPNCDR 991
+++ + + L K++ GK C A + R+LIFDAV EI+ A+ S P PW
Sbjct: 399 QESDSFSFLEKQQYLKGK--CASRAAAQ-ERRLIFDAVQEIV-ARRRSLP-PWMMVG--- 450
Query: 992 LTKKSLSAQKLLKELCFEIEKVQAKKPGWCXXXXXXXXXXXKSILWEDVMHEGSESWKNF 1051
A ++ + E +K++ KK +L D+ + W++F
Sbjct: 451 ------EADNKMQVIWSEFQKIRDKK---SSTEEDDLVGYVCGVLGRDL---SEDRWRDF 498
Query: 1052 NGEIPGFVLDVERLIFKDLVGEIVIGEAASMRLKPSVRRRKLF 1094
E+ VLDVERLIFKDL+GE I + A + S+RRR LF
Sbjct: 499 QVEMSEAVLDVERLIFKDLIGE-TIRQLAFLNRSDSLRRRLLF 540