
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC139747.1 - phase: 0 /pseudo
(441 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                        (bits)   Value
CB892913 similar to GP|1769897|emb lectin receptor kinase {Arabi...    57   1e-12
BI262818 weakly similar to GP|6642775|gb| gag-pol polyprotein {V...    53   2e-07
TC81992 weakly similar to PIR|T47841|T47841 hypothetical protein...    35   0.058
TC87383 similar to GP|19168656|emb|CAD26175. DNA-DIRECTED RNA PO...    32   0.37
TC84423 similar to PIR|F84730|F84730 probable myosin heavy chain...    32   0.64
TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulch...    31   0.83
TC91923 weakly similar to GP|1946369|gb|AAB63087.1| unknown prot...    31   1.1
TC87422 similar to PIR|T00840|T00840 probable senescence-related...    29   3.2
BI311504 similar to PIR|A86268|A86 hypothetical protein AAG09555...    29   4.1
TC82236 similar to GP|12056928|gb|AAG48132.1 putative resistance...    29   4.1
AW586257 weakly similar to GP|14532668|gb putative RB-binding pr...    28   5.4
TC92005 weakly similar to GP|18033111|gb|AAL56987.1 functional c...    28   7.0
TC79250 similar to PIR|T06329|T06329 symbiotic ammonium transpor...    28   9.2
TC81336 homologue to GP|20198031|gb|AAM15362.1 hypothetical prot...    28   9.2
>CB892913 similar to GP|1769897|emb lectin receptor kinase {Arabidopsis
thaliana}, partial (35%)
Length = 837
Score = 56.6 bits (135), Expect(2) = 1e-12
Identities = 27/78 (34%), Positives = 46/78 (58%)
Frame = +1
Query: 143 ESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEARDISSFKVD 202
E ++ Y ++ I+N + EK+ D +++ KILRSL +F+ V IEE +D+ + ++
Sbjct: 472 ERVRVYFSRVISISNQLKRNSEKLEDVRIMEKILRSLDPKFEHIVEKIEETKDLETMTIE 651
Query: 203 ELIGSLQNFEITVNSKND 220
+L GSLQ +E K D
Sbjct: 652 KLQGSLQAYEEKHKKKKD 705
Score = 33.9 bits (76), Expect(2) = 1e-12
Identities = 33/128 (25%), Positives = 50/128 (38%)
Frame = +3
Query: 2 VMDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTN 61
VM G PLL G YD +M L D WE Y + +
Sbjct: 93 VMAVNGAASFQVPLLKGSTYDNCFIKMMALLGAHDV---------WEVVEKGYKESQDED 245
Query: 62 VLKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTK 121
L + T + + KAL ++ +D++ F+ I AK+ E LK + +G K
Sbjct: 246 SLTKAQRDTLKDSRKR--DKKALFLIYQALDEDEFEKISNATSAKEAWEKLKTSCQGEDK 419
Query: 122 VKSAKFQL 129
VK + Q+
Sbjct: 420 VKKVRLQI 443
>BI262818 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (24%)
Length = 615
Score = 53.1 bits (126), Expect = 2e-07
Identities = 44/169 (26%), Positives = 74/169 (43%), Gaps = 1/169 (0%)
Frame = +1
Query: 3 MDKEGGFVNTP-PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTN 61
MD E + P P+ DG NY W +R+ +L+ N AV +E + L + M
Sbjct: 142 MDSETSYKAPPLPVFDGENYHIWAARIEAYLEA--NDLWEAVEEDYE-VLPLSDNPTMAQ 312
Query: 62 VLKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTK 121
+ E T SKA LF V + +F I A + LK +EG +
Sbjct: 313 IKNHKERKTR--------KSKARATLFAAVSEEIFTRIMTIKSAFEIWNFLKTEYEGDER 468
Query: 122 VKSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEK 170
++ + L ++E +M + E+I++Y ++ IAN G ++ D +
Sbjct: 469 IRGMQALNLIREFEMQKMKESETIKEYANKLISIANKVRLLGSELXDSR 615
>TC81992 weakly similar to PIR|T47841|T47841 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (3%)
Length = 952
Score = 35.0 bits (79), Expect = 0.058
Identities = 22/65 (33%), Positives = 35/65 (53%)
Frame = +3
Query: 112 LKIAHEGTTKVKSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKL 171
LK ++ T + K A+ Q L + L+M + ESI DY + I N GE++S+ +
Sbjct: 705 LKK*YQVTARGKRAQLQAL*RDW*ILQMKNGESIIDYFARTVTIVNKIRMHGEQMSNLTV 884
Query: 172 VRKIL 176
+ KIL
Sbjct: 885 IEKIL 899
>TC87383 similar to GP|19168656|emb|CAD26175. DNA-DIRECTED RNA POLYMERASE II
{Encephalitozoon cuniculi}, partial (0%)
Length = 1247
Score = 32.3 bits (72), Expect = 0.37
Identities = 26/94 (27%), Positives = 42/94 (44%)
Frame = -2
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
+C C+GYGHI I+C ++ + + + + + ERE I +AE
Sbjct: 310 KCVMCQGYGHIAIDCVNY------KAVIIVNREINNIFEEEREDIHE--------SFEAE 173
Query: 363 SCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKN 396
+ E + YDE V TDIC+ +E+ N
Sbjct: 172 TMGEPI-YDEEYV--------GTDICEVFKEEGN 98
>TC84423 similar to PIR|F84730|F84730 probable myosin heavy chain [imported]
- Arabidopsis thaliana, partial (10%)
Length = 787
Score = 31.6 bits (70), Expect = 0.64
Identities = 19/49 (38%), Positives = 31/49 (62%), Gaps = 3/49 (6%)
Frame = +2
Query: 389 KQLEEQKNITNNLEEERVGYLEKNSELNSEVR---MLNSQLSNVMKQVK 434
K+ EE+KN+ N+L +E +++K S+L S++ NSQL +K VK
Sbjct: 488 KEAEEEKNLLNSLLQE---HMDKLSQLESDLNQSTQKNSQLEEELKIVK 625
>TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulchellus},
partial (7%)
Length = 2304
Score = 31.2 bits (69), Expect = 0.83
Identities = 27/95 (28%), Positives = 45/95 (46%)
Frame = +2
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
+C C+GYGHI ++C + +K T+ +++ + + ERE + F D E
Sbjct: 1043 KCFICQGYGHIALDCVN-----QKVFTIV-NEEINNIFEEEREDVY-------ESFED-E 1180
Query: 363 SCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNI 397
+ E + YDE V DIC+ +E+ NI
Sbjct: 1181 TLGEPI-YDEEYV--------GADICEVFDEEGNI 1258
>TC91923 weakly similar to GP|1946369|gb|AAB63087.1| unknown protein
{Arabidopsis thaliana}, partial (14%)
Length = 1129
Score = 30.8 bits (68), Expect = 1.1
Identities = 51/208 (24%), Positives = 86/208 (40%), Gaps = 7/208 (3%)
Frame = +3
Query: 236 ELQVNQEDDEDMTESLTLLGRQ--FKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTD 293
EL + E E LL Q K+++++++ R RS + IR N S++ + +
Sbjct: 21 ELTIFPEKANLKAEMSKLLEEQTHLKELIREWESRGRSFEEQIR----NIQSEKIEMEAE 188
Query: 294 EKNFQYKGVQCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAAL 353
KN G+Q + E + K L VS D K + + +++ V +L
Sbjct: 189 LKN----GIQLLKAE---------IEQRENNIKDLNVSL---DNLKLEKDNLNVE-VGSL 317
Query: 354 IGRVFS-DAESCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEE----ERVGY 408
V S D S D ++L + + +L + C+Q+EE K NLEE ++
Sbjct: 318 KEDVNSRDGRIGSLDRHLNDLHIEHVQLISSLEEACRQVEEIKTKAKNLEEQVERQKTEI 497
Query: 409 LEKNSELNSEVRMLNSQLSNVMKQVKMM 436
LE E +R L L + M+
Sbjct: 498 LEAAEEKREAIRQLCFSLEHYRNNYHML 581
>TC87422 similar to PIR|T00840|T00840 probable senescence-related protein
[imported] - Arabidopsis thaliana, partial (55%)
Length = 1660
Score = 29.3 bits (64), Expect = 3.2
Identities = 16/44 (36%), Positives = 27/44 (61%), Gaps = 6/44 (13%)
Frame = +2
Query: 230 SSVELDE------LQVNQEDDEDMTESLTLLGRQFKKIVKQYDK 267
S+V+LDE L++ Q DDE + LT+ + KK++K+ D+
Sbjct: 359 STVKLDESHYFFTLKLPQSDDEVLNYGLTVAAKGSKKVLKKLDE 490
>BI311504 similar to PIR|A86268|A86 hypothetical protein AAG09555.1
[imported] - Arabidopsis thaliana, partial (76%)
Length = 774
Score = 28.9 bits (63), Expect = 4.1
Identities = 18/67 (26%), Positives = 33/67 (48%), Gaps = 9/67 (13%)
Frame = -1
Query: 379 RLNDKNTDICKQLEEQKNITNNLEEE---------RVGYLEKNSELNSEVRMLNSQLSNV 429
++ +KN ++ KQLEEQK + +E E EK +L +V+ L +L+ +
Sbjct: 681 QMKEKNANLQKQLEEQKKVIGEVEAEIKSLQSNLTMEQICEKEIDLRMQVQELEIKLNKL 502
Query: 430 MKQVKMM 436
V ++
Sbjct: 501 RGGVTLV 481
>TC82236 similar to GP|12056928|gb|AAG48132.1 putative resistance protein
{Glycine max}, partial (2%)
Length = 1014
Score = 28.9 bits (63), Expect = 4.1
Identities = 21/49 (42%), Positives = 26/49 (52%)
Frame = +1
Query: 380 LNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRMLNSQLSN 428
LN+ D CK LEE + I NL ER+ + S +S RML SQ N
Sbjct: 421 LNNLILDNCKSLEEIRGIAPNL--ERLSAMGCKSLSSSSRRMLLSQKLN 561
>AW586257 weakly similar to GP|14532668|gb putative RB-binding protein
{Arabidopsis thaliana}, partial (3%)
Length = 590
Score = 28.5 bits (62), Expect = 5.4
Identities = 17/67 (25%), Positives = 33/67 (48%), Gaps = 4/67 (5%)
Frame = -1
Query: 379 RLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRMLNS----QLSNVMKQVK 434
R D T+ CK + LEE + G+ N EL S +L S +++N+ +
Sbjct: 551 RCFD*ETNNCKSSTFRTEFDTQLEEAKYGFEFLNQELASNNALLTSSSEARITNIFSEAL 372
Query: 435 MMAARTD 441
+M+++++
Sbjct: 371 IMSSKSE 351
>TC92005 weakly similar to GP|18033111|gb|AAL56987.1 functional candidate
resistance protein KR1 {Glycine max}, partial (7%)
Length = 858
Score = 28.1 bits (61), Expect = 7.0
Identities = 19/40 (47%), Positives = 21/40 (52%)
Frame = +1
Query: 386 DICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRMLNSQ 425
D CK LEE + I NLEE E S +S RML SQ
Sbjct: 595 DCCKSLEEIRGIPPNLEELSAYKCESLS--SSSRRMLTSQ 708
>TC79250 similar to PIR|T06329|T06329 symbiotic ammonium transport protein
SAT1 - soybean, partial (34%)
Length = 1145
Score = 27.7 bits (60), Expect = 9.2
Identities = 18/75 (24%), Positives = 33/75 (44%)
Frame = +2
Query: 344 RESIKHVAALIGRVFSDAESCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEE 403
RE I + + D + + + + K+L +K +L E+KN NN+E
Sbjct: 536 REKISQKFIALSALLPDLKKMDKASVLGDAINHVKQLQEK-----VKLLEEKNQKNNVES 700
Query: 404 ERVGYLEKNSELNSE 418
+ Y+EK +S+
Sbjct: 701 VSMVYVEKTKSYSSD 745
>TC81336 homologue to GP|20198031|gb|AAM15362.1 hypothetical protein
{Arabidopsis thaliana}, partial (2%)
Length = 1542
Score = 27.7 bits (60), Expect = 9.2
Identities = 14/43 (32%), Positives = 25/43 (57%)
Frame = +3
Query: 200 KVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
+VDELI ++++ D++ G++ + SVEL E Q + E
Sbjct: 201 QVDELITVPDTVRLSLSQLKDEENVGVSESKSVELSESQNDTE 329
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.318 0.135 0.378
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,533,030
Number of Sequences: 36976
Number of extensions: 122322
Number of successful extensions: 599
Number of sequences better than 10.0: 29
Number of HSP's better than 10.0 without gapping: 594
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 599
length of query: 441
length of database: 9,014,727
effective HSP length: 99
effective length of query: 342
effective length of database: 5,354,103
effective search space: 1831103226
effective search space used: 1831103226
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.7 bits)
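
The figures in the statistics block above can be cross-checked against each other. The sketch below is not part of the BLAST output; it assumes the usual Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, E = search space * 2^-bits) and assumes the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence. The function names are illustrative only. All constants are copied from the report.

```python
import math

# Constants copied from the statistics block of this report.
LAMBDA_GAPPED = 0.267      # gapped Lambda
K_GAPPED = 0.0410          # gapped K
QUERY_LEN = 441            # length of query
DB_LEN = 9_014_727         # length of database (27,044,181 nucleotides / 3)
NUM_SEQS = 36_976          # number of sequences in database
HSP_LEN = 99               # effective HSP length


def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the length adjustment is subtracted once from the query
    and once for every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance single-HSP hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LEN, NUM_SEQS, HSP_LEN
    )
    print(eff_q, eff_db, space)
    # BI262818 is reported with raw score 126.
    bits = bit_score(126)
    print(round(bits, 1), expect_value(bits, space))
```

With these inputs the effective lengths and search space agree with the reported 342, 5,354,103 and 1831103226, and the BI262818 hit (raw score 126) comes out near the reported 53.1 bits and 2e-07. The CB892913 hit is reported as Expect(2), a combined value over two HSPs, so the single-HSP formula here is not expected to reproduce its 1e-12.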