Miyakogusa Predicted Gene
- Lj3g3v1981280.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v1981280.1 Non Characterized Hit- tr|H3DBX9|H3DBX9_TETNG
Uncharacterized protein (Fragment) OS=Tetraodon
nigrov,39.76,2e-18,Smg4_UPF3,Regulator of nonsense-mediated decay,
UPF3; UPF3 REGULATOR OF NONSENSE TRANSCRIPTS-LIKE PR,CUFF.43387.1
(535 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr3g005510.1 | regulator of nonsense transcripts UPF3 protein... 771 0.0
Medtr2g005600.3 | Smg-4/UPF3 family protein | HC | chr2:225129-2... 327 1e-89
Medtr2g005600.1 | Smg-4/UPF3 family protein | HC | chr2:225247-2... 327 1e-89
Medtr2g005600.2 | Smg-4/UPF3 family protein | HC | chr2:225129-2... 318 6e-87
Medtr2g005600.4 | Smg-4/UPF3 family protein | HC | chr2:225129-2... 318 6e-87
>Medtr3g005510.1 | regulator of nonsense transcripts UPF3 protein |
HC | chr3:238569-230542 | 20130731
Length = 540
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/543 (74%), Positives = 450/543 (82%), Gaps = 11/543 (2%)
Query: 1 MKVRSERERTKVVIRHLPPSLTHSDLIQQIDNHFASRYNWFSFRPGNTSHKHQRYSRAYI 60
MK RSE+ RTKVVIRHLPPSLT SDLIQ IDN F+SRY+WF FR GNTS+++Q+Y+RAY+
Sbjct: 1 MKARSEKGRTKVVIRHLPPSLTESDLIQHIDNRFSSRYHWFVFRSGNTSYRNQKYARAYL 60
Query: 61 DFKHPNDVFEFAEFFHGHVFVNEKGAQHKALVEYAPSQRVPKPGTKKDGREGSIYKDPDY 120
DF P+DVFEFAEFF+GHVFVNEKG QHKA+VEYAPSQRVPK TKKDGREG+IYKDPDY
Sbjct: 61 DFNSPDDVFEFAEFFNGHVFVNEKGVQHKAVVEYAPSQRVPKLSTKKDGREGTIYKDPDY 120
Query: 121 LEFLKLIAKPEEHLPSAEIQLERREAEQAGANKEPPVVTPLMEYVRQKRALGSGVQVSSA 180
LEFLKLI+KP+EHLPSAEIQLER+EAEQAGA+KE P+VTPLM Y+RQKRA+ SG VSSA
Sbjct: 121 LEFLKLISKPQEHLPSAEIQLERKEAEQAGASKEAPIVTPLMAYIRQKRAVDSGPLVSSA 180
Query: 181 ATKISRRARAALPGKPGSGNAKRGSEKKKYVQKDNAKIANRKELRDKSAFIAVPRREDQS 240
AT++ RRAR A+ GKPG N +RGSEKKKYVQKDN K ANRK+ +DKSAF VPRRED S
Sbjct: 181 ATRVGRRAR-AMQGKPGPSNTRRGSEKKKYVQKDNVKNANRKDSKDKSAFTVVPRREDHS 239
Query: 241 AESSAKGTSEIET--------LHGIEGPISGIPLTSDSXXXXXXXXXXXQREIPNAAEGM 292
+ESS KG EI++ +HGIEG ISGIPLTSDS QREIP A EGM
Sbjct: 240 SESSIKGVYEIDSSHVIDEFAVHGIEGSISGIPLTSDSGKKKILLLKGKQREIPKATEGM 299
Query: 293 VKQQNVQSGSSLVSTSAKQTQRREGSGRLIRSILQNNEPRQSQSASGTQPKIQILTSENG 352
VKQQN QS + + T+AKQ QRRE GRLIRSIL NNE RQSQS S Q KIQILTSENG
Sbjct: 300 VKQQNAQSANLPIPTTAKQNQRREAGGRLIRSILLNNESRQSQSTSTAQHKIQILTSENG 359
Query: 353 KRPPRPFISRSGLSDQVSSHDAGQVNSEGDSKRVSDEKFIKRDLHGSGSVSEKTERRTRN 412
+RPPRPF SRSGLSDQVSSHDAG VNSEG+SKR DEKF++RD HGSG + +KTERRTRN
Sbjct: 360 RRPPRPFGSRSGLSDQVSSHDAGHVNSEGESKRDLDEKFVRRDFHGSG-IGDKTERRTRN 418
Query: 413 KDRPDRGVWAPLRRSDVSHSGNEHPSSSWSQTTLSNPESVEGEVKESVPSGNRSPEFSAS 472
KDRPDRGVWAPLRRSD SHS NE SSS +Q+ SNPESVEGEVKE+ SGNRS EFSAS
Sbjct: 419 KDRPDRGVWAPLRRSDSSHSSNELSSSSLAQSAPSNPESVEGEVKENAYSGNRSGEFSAS 478
Query: 473 AVGRGSPSVENGSQRNFTRRGASYIVKDEGAVSLSEGKPSKKGVAGNSAHEKQVWVQKSS 532
A GR SPSVENGSQR FTRRGA YIVKD+GAVS SEGK SKKGV GNS HEKQVWVQKSS
Sbjct: 479 AGGRSSPSVENGSQRIFTRRGAPYIVKDDGAVSSSEGKLSKKGV-GNSTHEKQVWVQKSS 537
Query: 533 SGS 535
SG+
Sbjct: 538 SGT 540
>Medtr2g005600.3 | Smg-4/UPF3 family protein | HC |
chr2:225129-230626 | 20130731
Length = 510
Score = 327 bits (839), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 227/547 (41%), Positives = 310/547 (56%), Gaps = 58/547 (10%)
Query: 8 ERTKVVIRHLPPSLTHSDLIQQIDNHFASRYNWFSFRPGN-TSHKHQRYSRAYIDFKHPN 66
+RTKVVIRHLPP++T L+ ID+ FA RYNWFSF P TSH H SRAYIDF P+
Sbjct: 3 DRTKVVIRHLPPTITQDSLLPLIDSSFAGRYNWFSFHPPKITSHNHT--SRAYIDFNTPD 60
Query: 67 DVFEFAEFFHGHVFVNEKGAQHKALVEYAPSQRVPKPGTKK--DGREGSIYKDPDYLEFL 124
DV +FA FF+GH+F+N+KG K VEYAPSQRVP +KK D R+G+I+KDPDYL+FL
Sbjct: 61 DVIDFAHFFNGHLFLNQKGTHFKVTVEYAPSQRVPNHSSKKPEDARDGTIFKDPDYLQFL 120
Query: 125 KLIAKPEEHLPSAEIQLERREAEQAGANKEPPVVTPLMEYVRQKRALGSG---VQVSSAA 181
+ IAKP E+LPSAEIQL++REA K+ P+VTPLM++VR KRA +G S +
Sbjct: 121 QQIAKPVENLPSAEIQLDKREA----VRKDIPIVTPLMDFVRHKRATKNGPRQQHRSLSN 176
Query: 182 TKISRRARAALPGKPGSGNAKRGSEKKK-----YVQKDNAKIANRKELRDKSAFIAVPRR 236
K++RR+ G S ++RG K + YV +D K + ++DKS +I VPR+
Sbjct: 177 GKVTRRSLTTSNGSSTSAPSRRGYTKNRLSTTMYVARDPGKSST---VQDKSTYILVPRQ 233
Query: 237 EDQSAESSAKGTSEIETLHGIEGPISGIPLTSDSXXXXXXXXXXXQREIPNA--AEGMVK 294
DQ+ + + T+ + + +GI ++DS +RE ++ M +
Sbjct: 234 GDQNPSNKSSNTASSDGNQTFDE--NGIAGSNDSGKKKLLLLKGNERETITVSDSDSMSQ 291
Query: 295 QQNVQSGSSLVSTSAKQTQRREGSGRLIRSILQNNEPRQSQSA-SGTQPKIQILTSENGK 353
+ + L ST+ KQ QR EG GR+I+SIL N + RQSQS+ + ++ +IQ E K
Sbjct: 292 HHTSSTKTILSSTALKQNQRHEGRGRIIKSILTNKDFRQSQSSRAHSERQIQTSNLEREK 351
Query: 354 RPPRPFISRSGLSDQVSSHDAGQVNSEGDSKRVSDEKFIKRDLHGSGSVSEKTERRTRNK 413
+ RP + L D N R++ +HG SE+ ERR R+K
Sbjct: 352 QSTRPVHVQLILKGT----DGAPEN------RIT--------VHGLHVSSERQERRFRHK 393
Query: 414 DRPDRGVWAPLRRSDVSHSGNEHPSSSWSQTTLSNPESVEG---EVKESVPSGNRSPEFS 470
DRPDRGVW S + S S + S + +EG E+K PS RS E
Sbjct: 394 DRPDRGVWT---------SRSNGGGESLSSSASSQVDPLEGGHTELKHDTPSA-RSGEVK 443
Query: 471 ASAVGRGSPSVENGSQRNFTRRGASYIVKD-EGAVSLSEGK-PSKKGVAGNSAHEKQVWV 528
+ R S S ENG ++F RRG Y VKD +G LSEGK P K + ++EKQVWV
Sbjct: 444 SLGSFRASHSSENGFSKHFGRRGPIYGVKDVDGYSILSEGKHPRKSSTSAYGSNEKQVWV 503
Query: 529 QKSSSGS 535
QK+SSG+
Sbjct: 504 QKASSGT 510
>Medtr2g005600.1 | Smg-4/UPF3 family protein | HC |
chr2:225247-230505 | 20130731
Length = 510
Score = 327 bits (839), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 227/547 (41%), Positives = 310/547 (56%), Gaps = 58/547 (10%)
Query: 8 ERTKVVIRHLPPSLTHSDLIQQIDNHFASRYNWFSFRPGN-TSHKHQRYSRAYIDFKHPN 66
+RTKVVIRHLPP++T L+ ID+ FA RYNWFSF P TSH H SRAYIDF P+
Sbjct: 3 DRTKVVIRHLPPTITQDSLLPLIDSSFAGRYNWFSFHPPKITSHNHT--SRAYIDFNTPD 60
Query: 67 DVFEFAEFFHGHVFVNEKGAQHKALVEYAPSQRVPKPGTKK--DGREGSIYKDPDYLEFL 124
DV +FA FF+GH+F+N+KG K VEYAPSQRVP +KK D R+G+I+KDPDYL+FL
Sbjct: 61 DVIDFAHFFNGHLFLNQKGTHFKVTVEYAPSQRVPNHSSKKPEDARDGTIFKDPDYLQFL 120
Query: 125 KLIAKPEEHLPSAEIQLERREAEQAGANKEPPVVTPLMEYVRQKRALGSG---VQVSSAA 181
+ IAKP E+LPSAEIQL++REA K+ P+VTPLM++VR KRA +G S +
Sbjct: 121 QQIAKPVENLPSAEIQLDKREA----VRKDIPIVTPLMDFVRHKRATKNGPRQQHRSLSN 176
Query: 182 TKISRRARAALPGKPGSGNAKRGSEKKK-----YVQKDNAKIANRKELRDKSAFIAVPRR 236
K++RR+ G S ++RG K + YV +D K + ++DKS +I VPR+
Sbjct: 177 GKVTRRSLTTSNGSSTSAPSRRGYTKNRLSTTMYVARDPGKSST---VQDKSTYILVPRQ 233
Query: 237 EDQSAESSAKGTSEIETLHGIEGPISGIPLTSDSXXXXXXXXXXXQREIPNA--AEGMVK 294
DQ+ + + T+ + + +GI ++DS +RE ++ M +
Sbjct: 234 GDQNPSNKSSNTASSDGNQTFDE--NGIAGSNDSGKKKLLLLKGNERETITVSDSDSMSQ 291
Query: 295 QQNVQSGSSLVSTSAKQTQRREGSGRLIRSILQNNEPRQSQSA-SGTQPKIQILTSENGK 353
+ + L ST+ KQ QR EG GR+I+SIL N + RQSQS+ + ++ +IQ E K
Sbjct: 292 HHTSSTKTILSSTALKQNQRHEGRGRIIKSILTNKDFRQSQSSRAHSERQIQTSNLEREK 351
Query: 354 RPPRPFISRSGLSDQVSSHDAGQVNSEGDSKRVSDEKFIKRDLHGSGSVSEKTERRTRNK 413
+ RP + L D N R++ +HG SE+ ERR R+K
Sbjct: 352 QSTRPVHVQLILKGT----DGAPEN------RIT--------VHGLHVSSERQERRFRHK 393
Query: 414 DRPDRGVWAPLRRSDVSHSGNEHPSSSWSQTTLSNPESVEG---EVKESVPSGNRSPEFS 470
DRPDRGVW S + S S + S + +EG E+K PS RS E
Sbjct: 394 DRPDRGVWT---------SRSNGGGESLSSSASSQVDPLEGGHTELKHDTPSA-RSGEVK 443
Query: 471 ASAVGRGSPSVENGSQRNFTRRGASYIVKD-EGAVSLSEGK-PSKKGVAGNSAHEKQVWV 528
+ R S S ENG ++F RRG Y VKD +G LSEGK P K + ++EKQVWV
Sbjct: 444 SLGSFRASHSSENGFSKHFGRRGPIYGVKDVDGYSILSEGKHPRKSSTSAYGSNEKQVWV 503
Query: 529 QKSSSGS 535
QK+SSG+
Sbjct: 504 QKASSGT 510
>Medtr2g005600.2 | Smg-4/UPF3 family protein | HC |
chr2:225129-230626 | 20130731
Length = 540
Score = 318 bits (816), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 228/577 (39%), Positives = 312/577 (54%), Gaps = 88/577 (15%)
Query: 8 ERTKVVIRHLPPSLTHSDLIQQIDNHFASRYNWFSFRPGN-TSHKHQRYSRAYIDFKHPN 66
+RTKVVIRHLPP++T L+ ID+ FA RYNWFSF P TSH H SRAYIDF P+
Sbjct: 3 DRTKVVIRHLPPTITQDSLLPLIDSSFAGRYNWFSFHPPKITSHNHT--SRAYIDFNTPD 60
Query: 67 DVFEFAEFFHGHVFVNEKGAQHKALVEYAPSQRVPKPGTKK--DGREGSIYKDPDYLEFL 124
DV +FA FF+GH+F+N+KG K VEYAPSQRVP +KK D R+G+I+KDPDYL+FL
Sbjct: 61 DVIDFAHFFNGHLFLNQKGTHFKVTVEYAPSQRVPNHSSKKPEDARDGTIFKDPDYLQFL 120
Query: 125 KLIAKPEEHLPSAEIQLERREAEQAGANKEPPVVTPLMEYVRQKRALGSGVQV------- 177
+ IAKP E+LPSAEIQL++REA K+ P+VTPLM++VR KRA +G +V
Sbjct: 121 QQIAKPVENLPSAEIQLDKREA----VRKDIPIVTPLMDFVRHKRATKNGPRVIFLIIRV 176
Query: 178 --------------------------SSAATKISRRARAALPGKPGSGNAKRGSEKKK-- 209
S + K++RR+ G S ++RG K +
Sbjct: 177 GSCMHSFLASHPFLAYFISVQQQQHRSLSNGKVTRRSLTTSNGSSTSAPSRRGYTKNRLS 236
Query: 210 ---YVQKDNAKIANRKELRDKSAFIAVPRREDQSAESSAKGTSEIETLHGIEGPISGIPL 266
YV +D K + ++DKS +I VPR+ DQ+ + + T+ + + +GI
Sbjct: 237 TTMYVARDPGKSST---VQDKSTYILVPRQGDQNPSNKSSNTASSDGNQTFDE--NGIAG 291
Query: 267 TSDSXXXXXXXXXXXQREIPNA--AEGMVKQQNVQSGSSLVSTSAKQTQRREGSGRLIRS 324
++DS +RE ++ M + + + L ST+ KQ QR EG GR+I+S
Sbjct: 292 SNDSGKKKLLLLKGNERETITVSDSDSMSQHHTSSTKTILSSTALKQNQRHEGRGRIIKS 351
Query: 325 ILQNNEPRQSQSA-SGTQPKIQILTSENGKRPPRPFISRSGLSDQVSSHDAGQVNSEGDS 383
IL N + RQSQS+ + ++ +IQ E K+ RP + L D N
Sbjct: 352 ILTNKDFRQSQSSRAHSERQIQTSNLEREKQSTRPVHVQLILKGT----DGAPEN----- 402
Query: 384 KRVSDEKFIKRDLHGSGSVSEKTERRTRNKDRPDRGVWAPLRRSDVSHSGNEHPSSSWSQ 443
R++ +HG SE+ ERR R+KDRPDRGVW S + S S
Sbjct: 403 -RIT--------VHGLHVSSERQERRFRHKDRPDRGVWT---------SRSNGGGESLSS 444
Query: 444 TTLSNPESVEG---EVKESVPSGNRSPEFSASAVGRGSPSVENGSQRNFTRRGASYIVKD 500
+ S + +EG E+K PS RS E + R S S ENG ++F RRG Y VKD
Sbjct: 445 SASSQVDPLEGGHTELKHDTPSA-RSGEVKSLGSFRASHSSENGFSKHFGRRGPIYGVKD 503
Query: 501 -EGAVSLSEGK-PSKKGVAGNSAHEKQVWVQKSSSGS 535
+G LSEGK P K + ++EKQVWVQK+SSG+
Sbjct: 504 VDGYSILSEGKHPRKSSTSAYGSNEKQVWVQKASSGT 540
>Medtr2g005600.4 | Smg-4/UPF3 family protein | HC |
chr2:225129-230626 | 20130731
Length = 540
Score = 318 bits (816), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 228/577 (39%), Positives = 312/577 (54%), Gaps = 88/577 (15%)
Query: 8 ERTKVVIRHLPPSLTHSDLIQQIDNHFASRYNWFSFRPGN-TSHKHQRYSRAYIDFKHPN 66
+RTKVVIRHLPP++T L+ ID+ FA RYNWFSF P TSH H SRAYIDF P+
Sbjct: 3 DRTKVVIRHLPPTITQDSLLPLIDSSFAGRYNWFSFHPPKITSHNHT--SRAYIDFNTPD 60
Query: 67 DVFEFAEFFHGHVFVNEKGAQHKALVEYAPSQRVPKPGTKK--DGREGSIYKDPDYLEFL 124
DV +FA FF+GH+F+N+KG K VEYAPSQRVP +KK D R+G+I+KDPDYL+FL
Sbjct: 61 DVIDFAHFFNGHLFLNQKGTHFKVTVEYAPSQRVPNHSSKKPEDARDGTIFKDPDYLQFL 120
Query: 125 KLIAKPEEHLPSAEIQLERREAEQAGANKEPPVVTPLMEYVRQKRALGSGVQV------- 177
+ IAKP E+LPSAEIQL++REA K+ P+VTPLM++VR KRA +G +V
Sbjct: 121 QQIAKPVENLPSAEIQLDKREA----VRKDIPIVTPLMDFVRHKRATKNGPRVIFLIIRV 176
Query: 178 --------------------------SSAATKISRRARAALPGKPGSGNAKRGSEKKK-- 209
S + K++RR+ G S ++RG K +
Sbjct: 177 GSCMHSFLASHPFLAYFISVQQQQHRSLSNGKVTRRSLTTSNGSSTSAPSRRGYTKNRLS 236
Query: 210 ---YVQKDNAKIANRKELRDKSAFIAVPRREDQSAESSAKGTSEIETLHGIEGPISGIPL 266
YV +D K + ++DKS +I VPR+ DQ+ + + T+ + + +GI
Sbjct: 237 TTMYVARDPGKSST---VQDKSTYILVPRQGDQNPSNKSSNTASSDGNQTFDE--NGIAG 291
Query: 267 TSDSXXXXXXXXXXXQREIPNA--AEGMVKQQNVQSGSSLVSTSAKQTQRREGSGRLIRS 324
++DS +RE ++ M + + + L ST+ KQ QR EG GR+I+S
Sbjct: 292 SNDSGKKKLLLLKGNERETITVSDSDSMSQHHTSSTKTILSSTALKQNQRHEGRGRIIKS 351
Query: 325 ILQNNEPRQSQSA-SGTQPKIQILTSENGKRPPRPFISRSGLSDQVSSHDAGQVNSEGDS 383
IL N + RQSQS+ + ++ +IQ E K+ RP + L D N
Sbjct: 352 ILTNKDFRQSQSSRAHSERQIQTSNLEREKQSTRPVHVQLILKGT----DGAPEN----- 402
Query: 384 KRVSDEKFIKRDLHGSGSVSEKTERRTRNKDRPDRGVWAPLRRSDVSHSGNEHPSSSWSQ 443
R++ +HG SE+ ERR R+KDRPDRGVW S + S S
Sbjct: 403 -RIT--------VHGLHVSSERQERRFRHKDRPDRGVWT---------SRSNGGGESLSS 444
Query: 444 TTLSNPESVEG---EVKESVPSGNRSPEFSASAVGRGSPSVENGSQRNFTRRGASYIVKD 500
+ S + +EG E+K PS RS E + R S S ENG ++F RRG Y VKD
Sbjct: 445 SASSQVDPLEGGHTELKHDTPSA-RSGEVKSLGSFRASHSSENGFSKHFGRRGPIYGVKD 503
Query: 501 -EGAVSLSEGK-PSKKGVAGNSAHEKQVWVQKSSSGS 535
+G LSEGK P K + ++EKQVWVQK+SSG+
Sbjct: 504 VDGYSILSEGKHPRKSSTSAYGSNEKQVWVQKASSGT 540