Miyakogusa Predicted Gene
- Lj1g3v4518320.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4518320.1 Non Characterized Hit- tr|Q8LBZ5|Q8LBZ5_ARATH
Putative uncharacterized protein OS=Arabidopsis
thalia,33.05,0.00000000007,TPR_11,NULL; TPR_REGION,Tetratricopeptide
repeat-containing domain; SUBFAMILY NOT NAMED,NULL;
TETRAT,CUFF.32556.1
(368 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr7g099260.1 | TPR repeat protein | HC | chr7:39789340-397912... 320 1e-87
Medtr7g099260.2 | TPR repeat protein | HC | chr7:39789442-397912... 320 1e-87
Medtr7g098450.1 | TPR repeat protein | HC | chr7:39412668-394107... 301 8e-82
Medtr7g098450.2 | TPR repeat protein | HC | chr7:39412635-394103... 300 1e-81
Medtr1g073230.1 | O-linked N-acetylglucosamine transferase, ogt ... 138 1e-32
Medtr5g029910.1 | TPR repeat protein | HC | chr5:12584803-125828... 118 8e-27
Medtr4g114690.1 | TPR repeat protein | HC | chr4:47200290-472018... 117 2e-26
Medtr2g103795.1 | TPR repeat protein | HC | chr2:44688706-446946... 114 2e-25
Medtr4g010250.1 | TPR repeat protein | HC | chr4:2308253-2305617... 106 4e-23
Medtr2g103795.2 | TPR repeat protein | HC | chr2:44688706-446901... 87 3e-17
Medtr3g463450.1 | TPR superfamily protein | HC | chr3:25403273-2... 86 6e-17
Medtr5g082130.1 | hypothetical protein | HC | chr5:35259864-3525... 75 1e-13
>Medtr7g099260.1 | TPR repeat protein | HC | chr7:39789340-39791288
| 20130731
Length = 363
Score = 320 bits (820), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 204/379 (53%), Positives = 237/379 (62%), Gaps = 32/379 (8%)
Query: 1 MLLRSSSTPVLGSLLPSFTE-SPSNV--LHSESCHTQKHLPPTSVPQHQHRRSFSQIGSF 57
MLLRSSSTPVLG+L+ SFT+ +P+++ LH E+CH K LPP + S
Sbjct: 1 MLLRSSSTPVLGNLISSFTDNTPTHIQSLHLETCHALKPLPPPTT-------SIQHHHHH 53
Query: 58 GLSPFSCNSSPISPSIADLDRQNKGFRRVQSEGNLEDLVYNSC---SEDRFSYIDTP-KR 113
SC+SSPISPSI+DL+RQNKGFRRVQSEGNLEDL Y + +E+R SY+D+ KR
Sbjct: 54 NNHRLSCSSSPISPSISDLERQNKGFRRVQSEGNLEDLAYATTFNNNEERLSYMDSSSKR 113
Query: 114 FSGRQ-RCLTLETIPSFTLSKHRGYLXXXXXXXXXXXXXXXXXXXXXGLNFSVVNNGSGV 172
+S RQ R LETIPSF+LSK G FSV+N +
Sbjct: 114 YSARQQRGFALETIPSFSLSKRTGLREEEEDVEESDIEDEEGYDE-----FSVMNR---M 165
Query: 173 MLTEDLLGVKDGFCRVXXXXXXXXXXXXXMYLAKGLGVX---XXXXXXXXXXXXXXXXXX 229
M +E++ D CRV MYLAKGLGV
Sbjct: 166 MQSEEV----DRVCRVSFDEEGEFGDNE-MYLAKGLGVDFCGGDGIGGGCRGGGNGGGDY 220
Query: 230 XXXXXXXXXXXXXXXXVEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAI 289
VE+YYKKMV+++PG+PLFLRNYAQFLYQCK DL+GAEEYYSRAI
Sbjct: 221 NSMDSERNDGDNNNHGVEQYYKKMVQQNPGNPLFLRNYAQFLYQCKQDLEGAEEYYSRAI 280
Query: 290 LADPKDGEVLSQYGKLVWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGED 349
LADP DGEVLSQYGKLVWELHHDEERASSYFERA QASPEDSHV AAYASFLWDTEE D
Sbjct: 281 LADPNDGEVLSQYGKLVWELHHDEERASSYFERAVQASPEDSHVQAAYASFLWDTEEEND 340
Query: 350 -GCNESQCLPSHYHLGAIA 367
G N+SQCLP H+HLGA+A
Sbjct: 341 AGYNDSQCLPQHFHLGAMA 359
>Medtr7g099260.2 | TPR repeat protein | HC | chr7:39789442-39791288
| 20130731
Length = 363
Score = 320 bits (820), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 204/379 (53%), Positives = 237/379 (62%), Gaps = 32/379 (8%)
Query: 1 MLLRSSSTPVLGSLLPSFTE-SPSNV--LHSESCHTQKHLPPTSVPQHQHRRSFSQIGSF 57
MLLRSSSTPVLG+L+ SFT+ +P+++ LH E+CH K LPP + S
Sbjct: 1 MLLRSSSTPVLGNLISSFTDNTPTHIQSLHLETCHALKPLPPPTT-------SIQHHHHH 53
Query: 58 GLSPFSCNSSPISPSIADLDRQNKGFRRVQSEGNLEDLVYNSC---SEDRFSYIDTP-KR 113
SC+SSPISPSI+DL+RQNKGFRRVQSEGNLEDL Y + +E+R SY+D+ KR
Sbjct: 54 NNHRLSCSSSPISPSISDLERQNKGFRRVQSEGNLEDLAYATTFNNNEERLSYMDSSSKR 113
Query: 114 FSGRQ-RCLTLETIPSFTLSKHRGYLXXXXXXXXXXXXXXXXXXXXXGLNFSVVNNGSGV 172
+S RQ R LETIPSF+LSK G FSV+N +
Sbjct: 114 YSARQQRGFALETIPSFSLSKRTGLREEEEDVEESDIEDEEGYDE-----FSVMNR---M 165
Query: 173 MLTEDLLGVKDGFCRVXXXXXXXXXXXXXMYLAKGLGVX---XXXXXXXXXXXXXXXXXX 229
M +E++ D CRV MYLAKGLGV
Sbjct: 166 MQSEEV----DRVCRVSFDEEGEFGDNE-MYLAKGLGVDFCGGDGIGGGCRGGGNGGGDY 220
Query: 230 XXXXXXXXXXXXXXXXVEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAI 289
VE+YYKKMV+++PG+PLFLRNYAQFLYQCK DL+GAEEYYSRAI
Sbjct: 221 NSMDSERNDGDNNNHGVEQYYKKMVQQNPGNPLFLRNYAQFLYQCKQDLEGAEEYYSRAI 280
Query: 290 LADPKDGEVLSQYGKLVWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGED 349
LADP DGEVLSQYGKLVWELHHDEERASSYFERA QASPEDSHV AAYASFLWDTEE D
Sbjct: 281 LADPNDGEVLSQYGKLVWELHHDEERASSYFERAVQASPEDSHVQAAYASFLWDTEEEND 340
Query: 350 -GCNESQCLPSHYHLGAIA 367
G N+SQCLP H+HLGA+A
Sbjct: 341 AGYNDSQCLPQHFHLGAMA 359
>Medtr7g098450.1 | TPR repeat protein | HC | chr7:39412668-39410728
| 20130731
Length = 376
Score = 301 bits (770), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 206/381 (54%), Positives = 231/381 (60%), Gaps = 46/381 (12%)
Query: 1 MLLRSSSTPVLGSLLPS---FTESP----SNVLHSESCHTQKHLPPT-SVPQHQHRRSFS 52
MLLRSSSTPVLGSLL S FT+SP ++ LH ES KHLPPT S QH H
Sbjct: 24 MLLRSSSTPVLGSLLSSSGSFTDSPIHHHTHSLHPESYQALKHLPPTASSLQHHHHNH-- 81
Query: 53 QIGSFGLSPFSCNSSPISPSIADLDRQNKGF-RRVQSEGNLEDLVY-NSCSEDRFSYIDT 110
SC SSPISPSI+DL+RQNKG RRVQSEGNLEDL Y +C+ + S +
Sbjct: 82 --------KLSCTSSPISPSISDLERQNKGLIRRVQSEGNLEDLAYATNCNNNMDS---S 130
Query: 111 PKRFSGRQRCLTLETIPSFTLSKHRGYLXXXXXXXXXXXXXXXXXXXXXGLNFSVVNN-- 168
KR+S RQR LETIPSF+LSK G +FS V N
Sbjct: 131 SKRYSVRQRGFALETIPSFSLSKQTGLREEETDFEDEGYDD----------DFSSVLNST 180
Query: 169 GSGVMLTEDLLGVKDGFCRVXXXXXXXXXXXXXMYLAKGLGVXXXXXXXXXXXXXXXXXX 228
SGV++ ++ VKD RV MYLAKGLGV
Sbjct: 181 ASGVVVNDE---VKDRVFRVSFGEEGKVGNKE-MYLAKGLGVDGIGGCSGGNGGGDYNSM 236
Query: 229 XXXXXXXXXXXXXXXXXVEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRA 288
VEEYYKKMV+++PG+PLFLRNYAQFLYQCK D +GAEEYYSRA
Sbjct: 237 GSGGNDGDSNHG-----VEEYYKKMVQQNPGNPLFLRNYAQFLYQCKQDREGAEEYYSRA 291
Query: 289 ILADPKDGEVLSQYGKLVWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGE 348
ILADP DGEVLSQYGKLVWELH DEERASSYFERA QASP+DSHV AAYASFLWDTEE E
Sbjct: 292 ILADPNDGEVLSQYGKLVWELHRDEERASSYFERAVQASPDDSHVQAAYASFLWDTEEDE 351
Query: 349 D--GCNESQCLPSHYHLGAIA 367
D G N+ QCLP H+H GA+A
Sbjct: 352 DAAGSNDPQCLPQHFHFGAMA 372
>Medtr7g098450.2 | TPR repeat protein | HC | chr7:39412635-39410361
| 20130731
Length = 353
Score = 300 bits (769), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 206/381 (54%), Positives = 231/381 (60%), Gaps = 46/381 (12%)
Query: 1 MLLRSSSTPVLGSLLPS---FTESP----SNVLHSESCHTQKHLPPT-SVPQHQHRRSFS 52
MLLRSSSTPVLGSLL S FT+SP ++ LH ES KHLPPT S QH H
Sbjct: 1 MLLRSSSTPVLGSLLSSSGSFTDSPIHHHTHSLHPESYQALKHLPPTASSLQHHHHNH-- 58
Query: 53 QIGSFGLSPFSCNSSPISPSIADLDRQNKGF-RRVQSEGNLEDLVY-NSCSEDRFSYIDT 110
SC SSPISPSI+DL+RQNKG RRVQSEGNLEDL Y +C+ + S +
Sbjct: 59 --------KLSCTSSPISPSISDLERQNKGLIRRVQSEGNLEDLAYATNCNNNMDS---S 107
Query: 111 PKRFSGRQRCLTLETIPSFTLSKHRGYLXXXXXXXXXXXXXXXXXXXXXGLNFSVVNN-- 168
KR+S RQR LETIPSF+LSK G +FS V N
Sbjct: 108 SKRYSVRQRGFALETIPSFSLSKQTGLREEETDFEDEGYDD----------DFSSVLNST 157
Query: 169 GSGVMLTEDLLGVKDGFCRVXXXXXXXXXXXXXMYLAKGLGVXXXXXXXXXXXXXXXXXX 228
SGV++ ++ VKD RV MYLAKGLGV
Sbjct: 158 ASGVVVNDE---VKDRVFRVSFGEEGKVGNKE-MYLAKGLGVDGIGGCSGGNGGGDYNSM 213
Query: 229 XXXXXXXXXXXXXXXXXVEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRA 288
VEEYYKKMV+++PG+PLFLRNYAQFLYQCK D +GAEEYYSRA
Sbjct: 214 GSGGNDGDSNHG-----VEEYYKKMVQQNPGNPLFLRNYAQFLYQCKQDREGAEEYYSRA 268
Query: 289 ILADPKDGEVLSQYGKLVWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGE 348
ILADP DGEVLSQYGKLVWELH DEERASSYFERA QASP+DSHV AAYASFLWDTEE E
Sbjct: 269 ILADPNDGEVLSQYGKLVWELHRDEERASSYFERAVQASPDDSHVQAAYASFLWDTEEDE 328
Query: 349 D--GCNESQCLPSHYHLGAIA 367
D G N+ QCLP H+H GA+A
Sbjct: 329 DAAGSNDPQCLPQHFHFGAMA 349
>Medtr1g073230.1 | O-linked N-acetylglucosamine transferase, ogt
protein, putative | HC | chr1:32490728-32494156 |
20130731
Length = 376
Score = 138 bits (347), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 61/101 (60%), Positives = 80/101 (79%)
Query: 248 EYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKLVW 307
+Y KKM+ E+P +PLFL+ YAQFL+Q DL+ AE+YYSRAI ADP DGE +S+Y KL W
Sbjct: 275 DYLKKMINENPNNPLFLKKYAQFLFQSNRDLEAAEDYYSRAISADPSDGETISEYAKLQW 334
Query: 308 ELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGE 348
+LHHD+E+A S FE+A +A+P DS+V AAY FLW+TE+ E
Sbjct: 335 QLHHDQEKALSLFEQAVKATPGDSNVLAAYTCFLWETEDEE 375
Score = 136 bits (343), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 69/110 (62%), Positives = 83/110 (75%), Gaps = 4/110 (3%)
Query: 246 VEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKL 305
++EYYK MV + P PL L+ YA FL Q K +L+ AEEY+ RA LADP DGE+L Y KL
Sbjct: 129 LQEYYKIMVHDYPSHPLILKKYAHFL-QGKGELQDAEEYFHRATLADPNDGEILMHYAKL 187
Query: 306 VWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGEDGCNESQ 355
VWE HHD +RAS YFERAA+ASP+DS V AAYASFLW+TE+ E NES+
Sbjct: 188 VWENHHDRDRASVYFERAAKASPQDSDVLAAYASFLWETEDDE---NESE 234
>Medtr5g029910.1 | TPR repeat protein | HC | chr5:12584803-12582810
| 20130731
Length = 253
Score = 118 bits (296), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 73/99 (73%)
Query: 246 VEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKL 305
+ YY++M++ +P D L LRNY ++L++ + ++ AEEYY RAILA+P+D E+LS YGKL
Sbjct: 122 IGAYYEEMLKSNPADALLLRNYGKYLHEVEKNMVRAEEYYGRAILANPEDAELLSLYGKL 181
Query: 306 VWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDT 344
+WE+ DEERA SYF++A P+DS V +YA F+W+
Sbjct: 182 IWEMSRDEERAKSYFDQAIHVDPDDSTVLGSYAHFMWEA 220
>Medtr4g114690.1 | TPR repeat protein | HC | chr4:47200290-47201853
| 20130731
Length = 292
Score = 117 bits (292), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 69/94 (73%)
Query: 249 YYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKLVWE 308
YY+ M+E +PG+PLFL NYA++L + + D AEEY RAILA+P DG VLS Y L+WE
Sbjct: 165 YYRTMIEANPGNPLFLGNYAKYLKEVRKDYVKAEEYCGRAILANPNDGNVLSLYADLIWE 224
Query: 309 LHHDEERASSYFERAAQASPEDSHVHAAYASFLW 342
H D RA +YF++A +A+P+D +V A+YA FLW
Sbjct: 225 CHKDAPRAETYFDQAVKAAPDDCYVLASYAHFLW 258
>Medtr2g103795.1 | TPR repeat protein | HC | chr2:44688706-44694656
| 20130731
Length = 275
Score = 114 bits (285), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 73/117 (62%)
Query: 246 VEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKL 305
+ YY+ M+E +P + L L NYA+FL + + D AEEY RAILA P D + LS Y L
Sbjct: 139 TDAYYQNMIEANPNNSLLLGNYAKFLKEVRGDYGKAEEYVERAILASPSDADALSLYADL 198
Query: 306 VWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGEDGCNESQCLPSHYH 362
+W+ + +RA +YF+RA Q+ P D +V A+YA FLWD EE ED + + SH H
Sbjct: 199 IWQTEKNADRAEAYFDRAIQSDPNDCYVLASYAKFLWDAEEDEDNDCQHKTDKSHTH 255
>Medtr4g010250.1 | TPR repeat protein | HC | chr4:2308253-2305617 |
20130731
Length = 253
Score = 106 bits (265), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 69/118 (58%), Gaps = 2/118 (1%)
Query: 246 VEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKL 305
++ YY+ M+E P D L L NY +FL + D AEE RAILA+P DG V+S Y L
Sbjct: 124 LDAYYQNMIEAHPCDALLLGNYGKFLKEVCGDYAKAEECLERAILANPGDGHVMSIYADL 183
Query: 306 VWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGEDGCNESQCLPSHYHL 363
+WE + RA YF++A Q+ P D +V A+YA FLWD E ED + Q H HL
Sbjct: 184 IWETKKNAARAQQYFDQAIQSDPNDCYVLASYAKFLWDAENEED--KDYQIKSDHMHL 239
>Medtr2g103795.2 | TPR repeat protein | HC | chr2:44688706-44690159
| 20130731
Length = 224
Score = 86.7 bits (213), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 54/85 (63%)
Query: 246 VEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKL 305
+ YY+ M+E +P + L L NYA+FL + + D AEEY RAILA P D + LS Y L
Sbjct: 139 TDAYYQNMIEANPNNSLLLGNYAKFLKEVRGDYGKAEEYVERAILASPSDADALSLYADL 198
Query: 306 VWELHHDEERASSYFERAAQASPED 330
+W+ + +RA +YF+RA Q+ P D
Sbjct: 199 IWQTEKNADRAEAYFDRAIQSDPND 223
Score = 51.2 bits (121), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 35/66 (53%)
Query: 284 YYSRAILADPKDGEVLSQYGKLVWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWD 343
YY I A+P + +L Y K + E+ D +A Y ERA ASP D+ + YA +W
Sbjct: 142 YYQNMIEANPNNSLLLGNYAKFLKEVRGDYGKAEEYVERAILASPSDADALSLYADLIWQ 201
Query: 344 TEEGED 349
TE+ D
Sbjct: 202 TEKNAD 207
>Medtr3g463450.1 | TPR superfamily protein | HC |
chr3:25403273-25399680 | 20130731
Length = 513
Score = 85.9 bits (211), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 1/106 (0%)
Query: 246 VEEYYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKL 305
E Y+ + + P + L L NYAQFLY H+ AEEY+ RAI +P D E ++Y
Sbjct: 399 TELVYQTGLSQEPNNALLLANYAQFLYIVAHEFDRAEEYFKRAIEVEPPDAEAYNKYATF 458
Query: 306 VWELHHDEERASSYFERAAQASPEDSHVHAAYASFLWDTEEGEDGC 351
+W++ +D A + A A P +++ A YA FLW+T GED C
Sbjct: 459 LWKVKNDLWAAEETYLEAISAEPSNTYYAANYAHFLWNT-GGEDTC 503
>Medtr5g082130.1 | hypothetical protein | HC |
chr5:35259864-35257495 | 20130731
Length = 416
Score = 74.7 bits (182), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 56/96 (58%)
Query: 249 YYKKMVEESPGDPLFLRNYAQFLYQCKHDLKGAEEYYSRAILADPKDGEVLSQYGKLVWE 308
YYKK + +P + L L NYAQFL+ D GAEEYY ++++ + + E +YG +
Sbjct: 301 YYKKHINLAPYNSLLLSNYAQFLFLVMKDNDGAEEYYKQSVVVESPEAEAYCRYGDFLLW 360
Query: 309 LHHDEERASSYFERAAQASPEDSHVHAAYASFLWDT 344
+ D A + +A +A P +++ + YASFLW+T
Sbjct: 361 IRKDNWAAELRYLQALEADPGNTYYLSKYASFLWNT 396