
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC143338.3 - phase: 0
(446 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC90342 similar to PIR|C96757|C96757 hypothetical protein T18K17... 192 4e-94
BE320305 similar to GP|19347854|gb unknown protein {Arabidopsis ... 311 2e-85
TC87263 similar to GP|9759298|dbj|BAB09804.1 emb|CAB82953.1~gene... 243 8e-65
TC87744 weakly similar to GP|17104519|gb|AAL34148.1 unknown prot... 234 7e-62
TC77779 similar to GP|18377638|gb|AAL66969.1 unknown protein {Ar... 220 8e-58
TC78988 similar to GP|6862912|gb|AAF30301.1| unknown protein {Ar... 213 9e-56
TC89440 similar to PIR|T49211|T49211 hypothetical protein F27K19... 198 3e-51
BF636066 similar to GP|20197242|gb expressed protein {Arabidopsi... 111 4e-50
TC82914 similar to PIR|A84828|A84828 hypothetical protein At2g40... 110 1e-44
TC79178 similar to GP|17065212|gb|AAL32760.1 Unknown protein {Ar... 122 5e-43
TC87950 weakly similar to GP|17104519|gb|AAL34148.1 unknown prot... 166 2e-41
BG450971 similar to GP|21553616|gb unknown {Arabidopsis thaliana... 162 2e-40
TC92469 similar to GP|19347854|gb|AAL86006.1 unknown protein {Ar... 157 6e-39
TC82783 similar to GP|22136764|gb|AAM91701.1 unknown protein {Ar... 148 5e-36
BF646181 similar to GP|17104519|gb unknown protein {Arabidopsis ... 134 7e-32
TC90222 similar to GP|15451108|gb|AAK96825.1 putative protein {A... 105 2e-30
BG585335 similar to PIR|A84828|A84 hypothetical protein At2g4032... 129 2e-30
BM812987 weakly similar to GP|16226759|gb At2g30010/F23F1.7 {Ara... 125 3e-29
AL375618 weakly similar to GP|19571145|db contains ESTs AU033061... 124 6e-29
AW126374 similar to PIR|A84828|A84 hypothetical protein At2g4032... 123 1e-28
>TC90342 similar to PIR|C96757|C96757 hypothetical protein T18K17.20
[imported] - Arabidopsis thaliana, partial (80%)
Length = 1533
Score = 192 bits (489), Expect(2) = 4e-94
Identities = 83/164 (50%), Positives = 113/164 (68%)
Frame = +3
Query: 106 ESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKRWN 165
E C++F GKWV+DN SYPLY E CPY+ Q C K+GR D Y +WRWQPH+C+L R++
Sbjct: 204 EGCNIFEGKWVWDNVSYPLYEEESCPYLVKQTTCMKNGRPDSFYTNWRWQPHECNLPRFD 383
Query: 166 VKEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYNAT 225
++ LR KR+MF+GDSL RGQ+ SM+CL+QS IP K+S+ + IFR EE+NAT
Sbjct: 384 PLKLLHMLRNKRMMFIGDSLQRGQFESMICLVQSVIPEGKKSLQRIPPMKIFRVEEFNAT 563
Query: 226 VEFLWAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADI 269
+E+ WAP + ES SD NH + +R++ DS+ KH W+ DI
Sbjct: 564 IEYYWAPFMVESISDHATNHTVHKRMVMLDSIAKHGKHWQGVDI 695
Score = 170 bits (431), Expect(2) = 4e-94
Identities = 79/164 (48%), Positives = 106/164 (64%), Gaps = 1/164 (0%)
Frame = +2
Query: 276 LWWRQGPVKLLWTDEENGACEELDGRGAMELAMGAWADWVSSKVDPLKKRVFFVTMSPTH 335
+WW P + T +E + A +LA+ WA+W+ S + PL + VFF++MSPTH
Sbjct: 716 VWWMHSPF-INATHGSPDEVQEYNVTTAYKLALKTWANWLESNIQPLNQYVFFMSMSPTH 892
Query: 336 LWSREWNPESEGNCYGEKKPIDVESYWGSGSDLPTMSSVENILSSLNSKVSVLNVTQLSE 395
LWS EW P S+ NC+ E PI SYWG+GS+L M + + L L V++LN+TQLSE
Sbjct: 893 LWSWEWKPGSDENCFNESYPIQGSSYWGTGSNLEIMKILHDSLQELKIDVTLLNITQLSE 1072
Query: 396 YRKDGHPSIF-RKFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTW 438
YRKD H S++ + + L EQ SNP S++DCIHWCLPGVPDTW
Sbjct: 1073YRKDAHTSVYGERKGKLLTKEQRSNPKSFADCIHWCLPGVPDTW 1204
>BE320305 similar to GP|19347854|gb unknown protein {Arabidopsis thaliana},
partial (32%)
Length = 444
Score = 311 bits (798), Expect = 2e-85
Identities = 148/148 (100%), Positives = 148/148 (100%)
Frame = +1
Query: 176 KRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYNATVEFLWAPLLA 235
KRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYNATVEFLWAPLLA
Sbjct: 1 KRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYNATVEFLWAPLLA 180
Query: 236 ESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWWRQGPVKLLWTDEENGAC 295
ESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWWRQGPVKLLWTDEENGAC
Sbjct: 181 ESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWWRQGPVKLLWTDEENGAC 360
Query: 296 EELDGRGAMELAMGAWADWVSSKVDPLK 323
EELDGRGAMELAMGAWADWVSSKVDPLK
Sbjct: 361 EELDGRGAMELAMGAWADWVSSKVDPLK 444
>TC87263 similar to GP|9759298|dbj|BAB09804.1
emb|CAB82953.1~gene_id:MPH15.5~strong similarity to
unknown protein {Arabidopsis thaliana}, partial (65%)
Length = 2229
Score = 243 bits (621), Expect = 8e-65
Identities = 141/405 (34%), Positives = 210/405 (51%), Gaps = 10/405 (2%)
Frame = +2
Query: 49 EQASSTTYVKPNLPNHLKKSQEILDRYSRCNSTVGYSGRKIARRGGSKSSSNRRVSSESC 108
+ ++T P +PN S L N T + ++ GS + S C
Sbjct: 653 QTTNATVEGVPIVPNKNLSSDSSLKGVDLHNYTASLARKQ---NNGSNKYAELMESLMKC 823
Query: 109 DVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKRWNVKE 168
D F G+W+ D+ SYPLY C + +Q C ++GR D YQ ++W+P C L R +
Sbjct: 824 DFFDGEWIKDD-SYPLYKPGSCSIIDEQFNCIRNGRPDKDYQKYKWKPKGCSLPRLDGHR 1000
Query: 169 MWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAE-------- 220
M + LRGKRL+FVGDSLNR W S++C+L++ + K+ N + FR E
Sbjct: 1001 MLDLLRGKRLVFVGDSLNRNMWESLICILKNSVKDKKKVYEANGRVH-FRGEASYSFVFK 1177
Query: 221 EYNATVEFLWAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWWRQ 280
+Y +VE +P L + + P + + +R D V + + ++ ADI+VFNT WW
Sbjct: 1178 DYKFSVELFVSPFLVQ-EWEMPDKNGTKKETLRLDLVGRSSDQYKDADIIVFNTGHWWTH 1354
Query: 281 GPVK--LLWTDEENGACEELDGRGAMELAMGAWADWVSSKVDPLKKRVFFVTMSPTHLWS 338
+ E + +EL+ A A+ W WV + V+P K V F S +H
Sbjct: 1355 DKTSKGKDYYQEGSHVYDELNVLEAFRRAITTWGRWVDANVNPTKSIVLFRGYSASHFSG 1534
Query: 339 REWNPESEGNCYGEKKPIDVESYWGSGSDLPTMSSVENILSSLNSKVSVLNVTQLSEYRK 398
+WN S G C E PID E Y P M +E +L ++ + VS LN+T+++++RK
Sbjct: 1535 GQWN--SGGQCDHETAPIDNEKYLTEYP--PKMRVLEKVLKNMKNPVSYLNITRMTDFRK 1702
Query: 399 DGHPSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTWNELLF 443
DGHPSI+RK + L PE+ +P + DC HWCLPGVPD WNE+L+
Sbjct: 1703 DGHPSIYRK--QNLSPEERKSPLRFQDCSHWCLPGVPDAWNEILY 1831
>TC87744 weakly similar to GP|17104519|gb|AAL34148.1 unknown protein
{Arabidopsis thaliana}, partial (69%)
Length = 1387
Score = 234 bits (596), Expect = 7e-62
Identities = 121/342 (35%), Positives = 181/342 (52%), Gaps = 2/342 (0%)
Frame = +3
Query: 104 SSESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKR 163
+ CD+F GKWV+D SYPLY S CP++ + C +GR D Y +RWQP CDL R
Sbjct: 129 AKSGCDLFQGKWVYDE-SYPLYQTSQCPFIEKEFDCQNNGRPDKFYLKYRWQPTKCDLPR 305
Query: 164 WNVKEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYN 223
+N ++ + RGK ++FVGDSL+ QW S+ C+L +P ++ L+IF Y
Sbjct: 306 FNGEDFLRRYRGKSILFVGDSLSLNQWQSLTCMLHIAVPQAHYTLVRIGDLSIFTFTTYG 485
Query: 224 ATVEFLWAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWWRQGPV 283
V F L + S++ R+++ DS+ + A W+ D+L+F+++ WW
Sbjct: 486 VKVMFSRNAFLVDIFSEN------IGRVLKLDSI-QSARNWKGIDVLIFDSWHWWLHTGR 644
Query: 284 KLLW--TDEENGACEELDGRGAMELAMGAWADWVSSKVDPLKKRVFFVTMSPTHLWSREW 341
K W E N ++D A E + WA W+ VD K +VFF +SP HL SR+W
Sbjct: 645 KQPWDLIQEGNNTFRDMDRLVAYEKGLKTWAKWIDDNVDITKTKVFFQGISPDHLNSRQW 824
Query: 342 NPESEGNCYGEKKPIDVESYWGSGSDLPTMSSVENILSSLNSKVSVLNVTQLSEYRKDGH 401
C G++KP+ Y G +P ++E ++ ++ V +L++T LS+ RKDGH
Sbjct: 825 GDPKANFCEGQEKPLSGSMY--PGGPVPAQLALERVIRAMKKPVYLLDITTLSQLRKDGH 998
Query: 402 PSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTWNELLF 443
PS++ DC HWCL GVPDTWN+LL+
Sbjct: 999 PSVYG-----------HGGHRDMDCSHWCLAGVPDTWNQLLY 1091
>TC77779 similar to GP|18377638|gb|AAL66969.1 unknown protein {Arabidopsis
thaliana}, partial (85%)
Length = 1450
Score = 220 bits (561), Expect = 8e-58
Identities = 119/352 (33%), Positives = 188/352 (52%), Gaps = 3/352 (0%)
Frame = +2
Query: 95 SKSSSNRRVSSESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRW 154
S + V + C++F G WV D SYPLY +S CP++ + C K+GR D Y + W
Sbjct: 269 SSLRGKKPVVTNGCNLFIGSWVVD-PSYPLY-DSSCPFIDPEFNCQKYGRPDKQYLKYSW 442
Query: 155 QPHDCDLKRWNVKEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHL 214
+P C L ++ K+ K RGK++MFVGDSL+ W S+ C++ + +P K S
Sbjct: 443 KPDSCALPSFDGKDFLNKWRGKKIMFVGDSLSLNMWESLSCMIHASVPNVKTSFLRREAQ 622
Query: 215 TIFRAEEYNATVEFLWAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNT 274
+ ++Y T++ P L + ++ R++ DS++ + W+ D+LVFN+
Sbjct: 623 STVTFQDYGVTIQLYRTPYLVDIIRENV------GRVLTLDSIVA-GNAWKGMDMLVFNS 781
Query: 275 YLWWRQGPVKLLWTDEENGA--CEELDGRGAMELAMGAWADWVSSKVDPLKKRVFFVTMS 332
+ WW W +G+ + +D A + WA WV VDP K +VFF +S
Sbjct: 782 WHWWTHKGSSQGWDYIRDGSKLVKNMDRLVAYNKGLTTWAKWVDLNVDPTKTKVFFQGIS 961
Query: 333 PTHLWSREWNPESEGNCYGEKKPIDVESYWGSGSDLPTMSSV-ENILSSLNSKVSVLNVT 391
PTH +EWN + + +C G+ +P+ +Y + LP S++ N+L S+ S V +L++T
Sbjct: 962 PTHYMGKEWN-QPKNSCSGQLEPLSGSTY---PAGLPPSSNILNNVLKSMKSPVYLLDIT 1129
Query: 392 QLSEYRKDGHPSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTWNELLF 443
LS+ RKD HPS + S + +DC HWCLPG+PDTWN+LL+
Sbjct: 1130LLSQLRKDAHPSSY------------SGDHAGNDCSHWCLPGLPDTWNQLLY 1249
>TC78988 similar to GP|6862912|gb|AAF30301.1| unknown protein {Arabidopsis
thaliana}, partial (68%)
Length = 2203
Score = 213 bits (543), Expect = 9e-56
Identities = 124/356 (34%), Positives = 195/356 (53%), Gaps = 15/356 (4%)
Frame = +3
Query: 106 ESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKRWN 165
++CDVF G WV+D +YPLY+ S+C ++ C+++GR D Y WRWQP DC+L R++
Sbjct: 411 DNCDVFDGNWVWDE-TYPLYHSSNCSFLDQGFRCSENGRPDAFYTKWRWQPKDCNLPRFD 587
Query: 166 VKEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEI-------PADKRSMSPNAHLTIFR 218
++M E +R KRL+FVGDS+ R QW S++C+L S + + ++ + F+
Sbjct: 588 ARKMLENIRNKRLVFVGDSIGRNQWESLLCMLSSAVTNKSSVYEVNGNPITKHTGFLAFK 767
Query: 219 AEEYNATVEFLWAP-LLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLW 277
E++N TVE+ +P L+ + +R+ + +R D + + W AD+LV N W
Sbjct: 768 FEDFNCTVEYYRSPFLVVQGRPPHGAPYRV-KLTLRVDHMDWTSHRWRDADVLVLNAGHW 944
Query: 278 WR-QGPVKL-LWTDEENGACEELDGRGAMELAMGAWADWVSSKVDPLKKRVFFVTMSPTH 335
W + VK+ + + A L++ DW++ +V+ K V F T +P H
Sbjct: 945 WNYEKTVKMGCYFQIGEQVKMNMSTEDAFRLSVETVVDWIAREVNRNKTYVLFRTYAPVH 1124
Query: 336 LWSREWNPESEGNCYGEKKPIDVESYWGSGSDLPTMSSVENILSSLNSK-----VSVLNV 390
+WN + G C+ E P D+ S + SD+ S+V N+LS SK + +LN+
Sbjct: 1125FRGGDWN--TGGGCHSETLP-DLGSV-PAISDI-HFSTVTNVLSQRASKSHVLNLDLLNI 1289
Query: 391 TQLSEYRKDGHPSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTWNELLFHFL 446
TQ+S RKDGH SI+ + P++ DC HWCLPGVPD+WNE+L+ L
Sbjct: 1290TQMSARRKDGHASIYY-----IGPDKGPASMQRQDCSHWCLPGVPDSWNEILYALL 1442
>TC89440 similar to PIR|T49211|T49211 hypothetical protein F27K19.170 -
Arabidopsis thaliana, partial (48%)
Length = 1446
Score = 198 bits (504), Expect = 3e-51
Identities = 98/217 (45%), Positives = 133/217 (61%), Gaps = 7/217 (3%)
Frame = +1
Query: 106 ESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKRWN 165
+ CD+F+GKWV DN ++PLY E +C +++ Q+ C ++GR D YQ+W+WQP +
Sbjct: 793 KDCDLFNGKWVLDNVTHPLYKEDECEFLTSQVTCMRNGRRDSLYQNWKWQPKRLFYAKVQ 972
Query: 166 VK-EMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYN- 223
K + + RGKRLMFVGDSLNR QW SMVC++QS +P+DK++ I + E
Sbjct: 973 AKIVVLRRFRGKRLMFVGDSLNRNQWESMVCMVQSVVPSDKKTWYKTGSFAILKITEPGH 1152
Query: 224 --ATVEFLWAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWWRQG 281
TVEF WAP L ESNSDDP H + RII P+S+ KH W+ AD L+FNTY+WW
Sbjct: 1153 IITTVEFYWAPFLVESNSDDPNMHSILNRIIMPESIEKHGVNWKEADYLIFNTYIWWMNT 1332
Query: 282 -PVKLLWTDEENGACE--ELDGRGAMELAMGAWADWV 315
+K+L + GA E E+ A E M W+ WV
Sbjct: 1333 FNMKVLRGSFDEGATEYDEVSRPVAYERVMKTWSKWV 1443
>BF636066 similar to GP|20197242|gb expressed protein {Arabidopsis thaliana},
partial (29%)
Length = 683
Score = 111 bits (278), Expect(2) = 4e-50
Identities = 54/86 (62%), Positives = 66/86 (75%), Gaps = 2/86 (2%)
Frame = +2
Query: 161 LKRWNVKEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMS--PNAHLTIFR 218
LKR+N + E+LR KRL+FVGDSLNRGQW+SMVCL+ S IP+ +SM N L IF+
Sbjct: 425 LKRFNATALLERLRNKRLVFVGDSLNRGQWVSMVCLVDSIIPSSLKSMQSIANGSLNIFK 604
Query: 219 AEEYNATVEFLWAPLLAESNSDDPVN 244
+EYNAT+E W+PLL ESNSDDPVN
Sbjct: 605 IKEYNATIENYWSPLLVESNSDDPVN 682
Score = 105 bits (261), Expect(2) = 4e-50
Identities = 41/61 (67%), Positives = 50/61 (81%)
Frame = +1
Query: 103 VSSESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLK 162
+SS CD+FSGKWVFDN +YPLY E+ C ++SDQLAC K+GR DL YQ+WRW+PH CDL
Sbjct: 193 LSSSKCDLFSGKWVFDNETYPLYKENQCSFLSDQLACEKYGRKDLSYQNWRWKPHQCDLP 372
Query: 163 R 163
R
Sbjct: 373 R 375
>TC82914 similar to PIR|A84828|A84828 hypothetical protein At2g40320
[imported] - Arabidopsis thaliana, partial (35%)
Length = 798
Score = 110 bits (276), Expect(2) = 1e-44
Identities = 51/91 (56%), Positives = 66/91 (72%)
Frame = +2
Query: 175 GKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYNATVEFLWAPLL 234
G+ LMFVGDSLNRGQ++S++CLL IP +SM LT+F A+EYNAT+EF WAP L
Sbjct: 509 GRGLMFVGDSLNRGQYVSLICLLHHLIPQHAKSMETFDSLTVFTAKEYNATIEFYWAPFL 688
Query: 235 AESNSDDPVNHRLDERIIRPDSVLKHASLWE 265
ESNSD+ V HR+ +RI+R S+ KH W+
Sbjct: 689 LESNSDNAVIHRVTDRIVRKGSINKHGRYWK 781
Score = 87.4 bits (215), Expect(2) = 1e-44
Identities = 36/68 (52%), Positives = 45/68 (65%)
Frame = +3
Query: 106 ESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKRWN 165
+ CD+FSG WV D + PLY ES+CPY+ QL C +HGR D Y+ RWQPH CDL ++N
Sbjct: 300 KECDLFSGTWVKDELTRPLYEESECPYIQPQLTCQEHGRPDKEYRRLRWQPHGCDLPKFN 479
Query: 166 VKEMWEKL 173
M E L
Sbjct: 480 ASLMLETL 503
>TC79178 similar to GP|17065212|gb|AAL32760.1 Unknown protein {Arabidopsis
thaliana}, partial (83%)
Length = 1288
Score = 122 bits (305), Expect(2) = 5e-43
Identities = 57/124 (45%), Positives = 81/124 (64%), Gaps = 1/124 (0%)
Frame = +1
Query: 104 SSESCDVFSGKWVFDNASYPLYN-ESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLK 162
SS+ CD+F+GKWV D SYPLY S CP++ + C +GR DL Y H+RWQP C+L
Sbjct: 130 SSQQCDMFTGKWVVDE-SYPLYKPTSTCPFIEREFRCEANGRPDLIYTHYRWQPLSCNLL 306
Query: 163 RWNVKEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEY 222
R++ ++ E++RGK +MFVGDSL+R QW S+ CLL S +P +++ ++IF EY
Sbjct: 307 RFDGQDFLERMRGKSIMFVGDSLSRNQWQSLTCLLHSAVPNSSYTVARVGDVSIFTFTEY 486
Query: 223 NATV 226
V
Sbjct: 487 EVKV 498
Score = 76.6 bits (187), Expect = 2e-14
Identities = 39/105 (37%), Positives = 59/105 (56%)
Frame = +1
Query: 342 NPESEGNCYGEKKPIDVESYWGSGSDLPTMSSVENILSSLNSKVSVLNVTQLSEYRKDGH 401
N S +C +K P+ +Y G P + ++ +LS++ V++L++T LS RKDGH
Sbjct: 832 NEPSAKSCIRKKTPLTGSTY--PGGLPPAVGVLKGVLSTIKKPVTLLDITTLSLLRKDGH 1005
Query: 402 PSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTWNELLFHFL 446
PSI+ F S DC HWCL GVPDTWN++L++ +
Sbjct: 1006PSIYGLFG-----------SKGMDCSHWCLSGVPDTWNQILYNLI 1107
Score = 70.9 bits (172), Expect(2) = 5e-43
Identities = 36/101 (35%), Positives = 54/101 (52%), Gaps = 2/101 (1%)
Frame = +3
Query: 259 KHASLWEHADILVFNTYLWW-RQGPVKLL-WTDEENGACEELDGRGAMELAMGAWADWVS 316
K A LW+ D+L+FNT+ WW R+GP + + N +++D A E A+ WA WV
Sbjct: 576 KEAKLWKGIDMLIFNTWHWWYRRGPTQPWDYIQVGNQVLKDMDRMKAFERALTTWARWVD 755
Query: 317 SKVDPLKKRVFFVTMSPTHLWSREWNPESEGNCYGEKKPID 357
+ +DP K +VFF +SP+H W Y +K +D
Sbjct: 756 ANIDPAKVKVFFQGISPSHYNGTLWK*TKCKELYPKKDTVD 878
>TC87950 weakly similar to GP|17104519|gb|AAL34148.1 unknown protein
{Arabidopsis thaliana}, partial (69%)
Length = 938
Score = 166 bits (419), Expect = 2e-41
Identities = 83/251 (33%), Positives = 135/251 (53%), Gaps = 2/251 (0%)
Frame = +1
Query: 107 SCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKRWNV 166
+C++F GKWV+D ASYPLY+ S CP++ Q C KHGR D YQ +RW P C++ R+N
Sbjct: 154 TCNLFRGKWVYD-ASYPLYDPSTCPFIDPQFNCQKHGRKDKLYQKYRWMPFSCNMPRFNG 330
Query: 167 KEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYNATV 226
+ +GK++MFVGDSL+ Q+ S+ C++ + +P + + ++ EEY +
Sbjct: 331 LNFLKGNKGKKIMFVGDSLSLNQFNSLACMIHAAVPNSRSTFRQRDAISSVTFEEYGLEL 510
Query: 227 EFLWAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWWRQGPVKLL 286
L + ++H + R+++ DS+ K W D+L+FNT+ WW
Sbjct: 511 FLYRTAYLVD------LDHDKEGRVLKLDSI-KSGEAWRGMDVLIFNTWHWWTHTGSSQP 669
Query: 287 W--TDEENGACEELDGRGAMELAMGAWADWVSSKVDPLKKRVFFVTMSPTHLWSREWNPE 344
W E ++++ A + WA WV V+P + +VFF+ +SP H R+WN
Sbjct: 670 WDYIQENKKLYKDMNRFVAFYKGLQTWARWVEMNVNPAQTKVFFLGISPVHYQGRDWNQP 849
Query: 345 SEGNCYGEKKP 355
++ +C EK P
Sbjct: 850 TK-SCMSEKVP 879
>BG450971 similar to GP|21553616|gb unknown {Arabidopsis thaliana}, partial
(35%)
Length = 675
Score = 162 bits (411), Expect = 2e-40
Identities = 70/135 (51%), Positives = 95/135 (69%)
Frame = +1
Query: 106 ESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKRWN 165
E C+V +GKW+F+++ PLY++ CPY+ Q +C K+GR D Y HW WQP DC L ++N
Sbjct: 238 EECNVANGKWIFNSSIKPLYSDKSCPYIDKQFSCVKNGRNDSDYLHWEWQPEDCTLPQFN 417
Query: 166 VKEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYNAT 225
+ +KL GKRL+FVGDSL R QW S VCL+Q IP ++SM ++F+A+EYNAT
Sbjct: 418 PEIALKKLEGKRLLFVGDSLQRNQWESFVCLVQGIIPEKEKSMKRGRVRSVFKAKEYNAT 597
Query: 226 VEFLWAPLLAESNSD 240
+EF WAP L ESN+D
Sbjct: 598 IEFYWAPFLVESNTD 642
>TC92469 similar to GP|19347854|gb|AAL86006.1 unknown protein {Arabidopsis
thaliana}, partial (14%)
Length = 598
Score = 157 bits (398), Expect = 6e-39
Identities = 72/72 (100%), Positives = 72/72 (100%)
Frame = +1
Query: 375 ENILSSLNSKVSVLNVTQLSEYRKDGHPSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGV 434
ENILSSLNSKVSVLNVTQLSEYRKDGHPSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGV
Sbjct: 1 ENILSSLNSKVSVLNVTQLSEYRKDGHPSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGV 180
Query: 435 PDTWNELLFHFL 446
PDTWNELLFHFL
Sbjct: 181 PDTWNELLFHFL 216
>TC82783 similar to GP|22136764|gb|AAM91701.1 unknown protein {Arabidopsis
thaliana}, partial (36%)
Length = 823
Score = 148 bits (373), Expect = 5e-36
Identities = 81/226 (35%), Positives = 125/226 (54%), Gaps = 2/226 (0%)
Frame = +1
Query: 220 EEYNATVEFLWAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWWR 279
E+YN +V+F+ +P L ++ N + +R D + S ++ A+I+VFNT WW
Sbjct: 1 EDYNCSVDFVASPFLVRESTFKGKNGSFET--LRLDLMDHTTSRYQDANIIVFNTGHWWT 174
Query: 280 QGPVKL--LWTDEENGACEELDGRGAMELAMGAWADWVSSKVDPLKKRVFFVTMSPTHLW 337
+ E N L A A+ WA WV K++ +VFF S TH W
Sbjct: 175 HEKTSEGEEYYQEGNHVYPRLKALDAYMRALTTWAKWVDRKINANHTQVFFRGYSVTHFW 354
Query: 338 SREWNPESEGNCYGEKKPIDVESYWGSGSDLPTMSSVENILSSLNSKVSVLNVTQLSEYR 397
+WN S G C+ E +PI E+Y M ++E+++ ++ ++V +N+++L++YR
Sbjct: 355 GGQWN--SGGQCHKETEPIYNETYLQKHPS--KMRALEHVIQNMKTEVIYMNISRLTDYR 522
Query: 398 KDGHPSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTWNELLF 443
KDGHPS++RK ++ + S S Y DC HWCLPGVPDTWNELL+
Sbjct: 523 KDGHPSVYRKDYKTSMKQNSS--SLYEDCSHWCLPGVPDTWNELLY 654
>BF646181 similar to GP|17104519|gb unknown protein {Arabidopsis thaliana},
partial (41%)
Length = 656
Score = 134 bits (337), Expect = 7e-32
Identities = 66/181 (36%), Positives = 104/181 (56%)
Frame = +3
Query: 95 SKSSSNRRVSSESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRW 154
S SSSN R + C+ F GKWV+D SYPLY+ S CP++ Q C K+GR D YQ +RW
Sbjct: 138 SSSSSNERKLAGRCNWFRGKWVYD-PSYPLYDPSSCPFIDPQFNCQKYGRPDTQYQKYRW 314
Query: 155 QPHDCDLKRWNVKEMWEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHL 214
QP C + R+N + K RGK++MFVGDSL+ Q+ S+ C++ S +P + + S + +
Sbjct: 315 QPFTCSIPRFNALDFLAKYRGKKIMFVGDSLSLNQFNSLACMIHSWVPKTRYTFSKQSAI 494
Query: 215 TIFRAEEYNATVEFLWAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNT 274
+ ++Y + P L + + ++ R+++ DS+ K + W D+L FNT
Sbjct: 495 STITFQDYGLQLFLFRTPYLVDLDRENV------GRVLKLDSI-KSGNAWRGMDVLXFNT 653
Query: 275 Y 275
+
Sbjct: 654 W 656
>TC90222 similar to GP|15451108|gb|AAK96825.1 putative protein {Arabidopsis
thaliana}, partial (41%)
Length = 624
Score = 105 bits (262), Expect(2) = 2e-30
Identities = 47/97 (48%), Positives = 65/97 (66%)
Frame = +3
Query: 100 NRRVSSESCDVFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDC 159
N R SS+ CD GKWV+D SYPLY + +CPY+S + C K+GR D Y+ W+W+P C
Sbjct: 129 NNRNSSKRCDFSVGKWVYDE-SYPLY-DPNCPYLSTAVTCQKNGRPDSDYEKWKWKPSGC 302
Query: 160 DLKRWNVKEMWEKLRGKRLMFVGDSLNRGQWISMVCL 196
+ R++ + K+R KR+M VGDS+ R QW S+VCL
Sbjct: 303 SIPRFDALKFLGKMRRKRIMLVGDSIMRNQWESLVCL 413
Score = 45.4 bits (106), Expect(2) = 2e-30
Identities = 24/76 (31%), Positives = 41/76 (53%)
Frame = +1
Query: 196 LLQSEIPADKRSMSPNAHLTIFRAEEYNATVEFLWAPLLAESNSDDPVNHRLDERIIRPD 255
L+Q IP ++ ++ + F + ++ ++EFLWAPLL E P N +R++ D
Sbjct: 412 LVQGVIPTGRKRVTYHGPAMAFHSMDFETSIEFLWAPLLVELKK-SPEN----KRVLHLD 576
Query: 256 SVLKHASLWEHADILV 271
+ ++A W DILV
Sbjct: 577 LIEENAKYWRGVDILV 624
>BG585335 similar to PIR|A84828|A84 hypothetical protein At2g40320 [imported]
- Arabidopsis thaliana, partial (23%)
Length = 595
Score = 129 bits (325), Expect = 2e-30
Identities = 57/97 (58%), Positives = 67/97 (68%)
Frame = +3
Query: 347 GNCYGEKKPIDVESYWGSGSDLPTMSSVENILSSLNSKVSVLNVTQLSEYRKDGHPSIFR 406
GNCY E PI+ SYWGS S M + ++ LN+TQLS YRKD H SI++
Sbjct: 12 GNCYNETTPIEEPSYWGSDSLKSIMQVIGEEFRKSKFPITFLNITQLSNYRKDAHTSIYK 191
Query: 407 KFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTWNELLF 443
K W PL PEQL+NP+SYSDCIHWCLPG+ DTWNELLF
Sbjct: 192 KQWNPLTPEQLANPASYSDCIHWCLPGLQDTWNELLF 302
>BM812987 weakly similar to GP|16226759|gb At2g30010/F23F1.7 {Arabidopsis
thaliana}, partial (32%)
Length = 622
Score = 125 bits (315), Expect = 3e-29
Identities = 70/188 (37%), Positives = 96/188 (50%), Gaps = 13/188 (6%)
Frame = +3
Query: 269 ILVFNTYLWWRQGPVKLLWTDEENGA--CEELDGRGAMELAMGAWADWVSSKVDPLKKRV 326
+LVFNT WW W E G ++D A+E M WA+WV + +D + V
Sbjct: 3 VLVFNTGHWWSHQGSLQGWDYVELGGNFYPDMDRLVALERGMKTWANWVDANIDRSRTHV 182
Query: 327 FFVTMSPTHLWSREWNPE--------SEGNCYGEKKPIDVESYWGSGSDLPT---MSSVE 375
F +SPTH EWN + NCYGE PI + G + T M V
Sbjct: 183 LFQAISPTHYDENEWNSAVGRATSVTTTKNCYGETAPISGTTTDFGGGETYTDQQMRVVN 362
Query: 376 NILSSLNSKVSVLNVTQLSEYRKDGHPSIFRKFWEPLRPEQLSNPSSYSDCIHWCLPGVP 435
++ + +L++T LSE RKDGHPSI+ L +Q ++P +DC HWCLPG+P
Sbjct: 363 MVIREMRDPAYLLDITMLSEMRKDGHPSIYSG---ELSSQQKTDPDHSADCSHWCLPGLP 533
Query: 436 DTWNELLF 443
DTWN+LL+
Sbjct: 534 DTWNQLLY 557
>AL375618 weakly similar to GP|19571145|db contains ESTs AU033061(S2143)
AU108199(S2143)~unknown protein, partial (31%)
Length = 490
Score = 124 bits (312), Expect = 6e-29
Identities = 60/169 (35%), Positives = 98/169 (57%)
Frame = +2
Query: 110 VFSGKWVFDNASYPLYNESDCPYMSDQLACNKHGRTDLGYQHWRWQPHDCDLKRWNVKEM 169
++ G WV+D SY LY+ S CP++ + C K+GR D Y +RW+P CDL R++ +
Sbjct: 2 LYEGTWVYDE-SYSLYDSSTCPHIRLEYDCLKYGRVDKEYLKYRWKPSTCDLPRFDGQSF 178
Query: 170 WEKLRGKRLMFVGDSLNRGQWISMVCLLQSEIPADKRSMSPNAHLTIFRAEEYNATVEFL 229
KL+GK++MF+GDS++ QW S++CLL S +P +T + ++Y +V
Sbjct: 179 LTKLKGKQIMFIGDSVSLNQWQSLICLLHSAVPKANIIQQGGDPITNYTFKDYGVSVIVY 358
Query: 230 WAPLLAESNSDDPVNHRLDERIIRPDSVLKHASLWEHADILVFNTYLWW 278
+ L + + R+++ DS+ K +LW+ D+LVFNT+LWW
Sbjct: 359 HSTYLVD------IEGEKIGRVLKLDSI-KSGNLWKQMDVLVFNTWLWW 484
>AW126374 similar to PIR|A84828|A84 hypothetical protein At2g40320 [imported]
- Arabidopsis thaliana, partial (24%)
Length = 382
Score = 123 bits (309), Expect = 1e-28
Identities = 54/100 (54%), Positives = 67/100 (67%)
Frame = +1
Query: 344 ESEGNCYGEKKPIDVESYWGSGSDLPTMSSVENILSSLNSKVSVLNVTQLSEYRKDGHPS 403
E G+CY E I+ +YWGS S M + +LS ++ LN+TQLS YRKD H S
Sbjct: 19 EPGGSCYNETTLINNSTYWGSDSRKSIMQVIGEVLSKTKVPITFLNITQLSSYRKDAHTS 198
Query: 404 IFRKFWEPLRPEQLSNPSSYSDCIHWCLPGVPDTWNELLF 443
I++K W PL EQLSNP SY+DC+HWCLPG+ D WNELLF
Sbjct: 199 IYKKQWSPLTKEQLSNPVSYADCVHWCLPGLQDNWNELLF 318
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.318 0.134 0.438
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,492,916
Number of Sequences: 36976
Number of extensions: 315054
Number of successful extensions: 1848
Number of sequences better than 10.0: 96
Number of HSP's better than 10.0 without gapping: 1755
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1779
length of query: 446
length of database: 9,014,727
effective HSP length: 99
effective length of query: 347
effective length of database: 5,354,103
effective search space: 1857873741
effective search space used: 1857873741
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.7 bits)
Medicago: description of AC143338.3