
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149197.4 - phase: 0
(204 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BU547962 74 6e-14
TC222348 59 8e-14
TC208902 53 8e-08
CO980191 39 3e-07
TC213599 homologue to UP|EXBB_PASHA (P72202) Biopolymer transpor... 39 5e-07
CA803034 38 1e-06
TC211008 46 1e-05
AW350693 43 1e-04
TC232226 41 4e-04
TC203464 similar to GB|AAF35920.1|7025820|AC005892 L505.8 {Leish... 38 0.004
BQ253293 38 0.004
CO980802 35 0.023
CA851573 33 0.068
TC208144 similar to UP|Q8VYU3 (Q8VYU3) GTP cyclohydrolase I , p... 33 0.068
AW277722 similar to SP|O65719|HS73_ Heat shock cognate 70 kDa pr... 31 0.34
TC232296 similar to UP|Q941I4 (Q941I4) Vacuolar acid invertase, ... 29 1.7
TC234126 29 1.7
CO981985 29 1.7
TC206238 weakly similar to UP|Q7PX96 (Q7PX96) AgCP12237 (Fragmen... 28 2.2
TC231053 similar to UP|Q941I4 (Q941I4) Vacuolar acid invertase, ... 28 2.9
>BU547962
Length = 591
Score = 73.6 bits (179), Expect = 6e-14
Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 2/111 (1%)
Frame = -2
Query: 65 GIFRNRLGQFLGCFAEGLSYGNSLFAELCGAMRSIEFAQSKSWNNFWLETDSMLVVQAF- 123
G+ RN +FL +AE + NS AEL + +E WN+ WLE D+ +V+
Sbjct: 590 GVVRNHNAEFLXXYAESIGQANSTIAELTALRKGLELVLENGWNDIWLEGDAKTLVEIIV 411
Query: 124 KNSAIVPWQVRNRWNNCQIIISNM-NFLASHIYREGNSCADVLANIGLTLD 173
K + +V+ N+ I+ NF SHIYREGN AD A +G LD
Sbjct: 410 KRRKVRCTEVQRHINHINTILPEFNNFFVSHIYREGNRAADKFAQMGHHLD 258
>TC222348
Length = 995
Score = 58.5 bits (140), Expect(2) = 8e-14
Identities = 33/104 (31%), Positives = 54/104 (51%), Gaps = 2/104 (1%)
Frame = -2
Query: 43 PIHSWIKGN--CVGAFASGKAACGGIFRNRLGQFLGCFAEGLSYGNSLFAELCGAMRSIE 100
P + WIK N C G A CG +F +FLG FA ++ ++ FAE+ A+ +IE
Sbjct: 328 PDYPWIKCNTDCAARGNPGLAGCGKLFIYSNARFLGNFAYSIAINSAFFAEVMAAIIAIE 149
Query: 101 FAQSKSWNNFWLETDSMLVVQAFKNSAIVPWQVRNRWNNCQIII 144
A WN WL+TDS + + K + ++ + + N +++I
Sbjct: 148 KAVENRWNFLWLKTDSSFQLSSAKYNPVIKGLIILQERNDRLMI 17
Score = 34.7 bits (78), Expect(2) = 8e-14
Identities = 13/27 (48%), Positives = 21/27 (77%)
Frame = -1
Query: 14 ITEFTILKAFNVSIHPPRSMLVKEVLW 40
IT+F+ LK F+VS HP ++ ++K+V W
Sbjct: 419 ITDFSTLKNFSVSSHPQKAQVIKQVSW 339
>TC208902
Length = 838
Score = 53.1 bits (126), Expect = 8e-08
Identities = 39/130 (30%), Positives = 58/130 (44%), Gaps = 7/130 (5%)
Frame = +1
Query: 48 IKGNCVGAFASGKAACGGIFRNRLGQFLGCFAEGLSYGNSLFAELCGAMRSIEFAQSKSW 107
I NC GA + G AACG + RN+ G + FA L + AE+ ++ A K +
Sbjct: 232 INSNCDGAVSGGVAACGRVLRNQAGAVV-VFAHNLGMCSISQAEIWAITMGVKLAWDKGF 408
Query: 108 NNFWLETDSMLVVQAFKNSAIVPWQVRNRWNNCQIIISNM-------NFLASHIYREGNS 160
N ++E+D FK + + V NNC ++ + + HI RE N
Sbjct: 409 TNLYVESD-------FKYAIEMMDGVCEFSNNCFQLVKSAKDSV*GETYSWCHILREANK 567
Query: 161 CADVLANIGL 170
AD LA G+
Sbjct: 568 TADALAKFGV 597
>CO980191
Length = 802
Score = 38.5 bits (88), Expect(2) = 3e-07
Identities = 20/49 (40%), Positives = 30/49 (60%), Gaps = 1/49 (2%)
Frame = +3
Query: 89 FAELCGAMRSIEFAQSKSWNNFWLETDSMLVVQAFKNS-AIVPWQVRNR 136
F EL A+++I+FA W N LE + MLV +AF+N +VP + R +
Sbjct: 99 FGELVDAVKAIDFANENGWLNLRLECNFMLVTKAFQNDRRLVPRKNRKK 245
Score = 32.3 bits (72), Expect(2) = 3e-07
Identities = 15/40 (37%), Positives = 21/40 (52%)
Frame = +1
Query: 132 QVRNRWNNCQIIISNMNFLASHIYREGNSCADVLANIGLT 171
++ + C +MN SHIYREGN+ LA+ G T
Sbjct: 232 KIEKK*KKCIAFTKHMNLFVSHIYREGNTYEIKLASHGTT 351
>TC213599 homologue to UP|EXBB_PASHA (P72202) Biopolymer transport exbB
protein, partial (8%)
Length = 825
Score = 38.5 bits (88), Expect(2) = 5e-07
Identities = 23/66 (34%), Positives = 34/66 (50%)
Frame = +3
Query: 61 AACGGIFRNRLGQFLGCFAEGLSYGNSLFAELCGAMRSIEFAQSKSWNNFWLETDSMLVV 120
AACGG+ R+ G++ F L ++ AEL GA + A K N + DS++VV
Sbjct: 351 AACGGLLRDCHGRWAASFV*KLGNTSAFVAEL*GAYLGLSLAWDKGDKNLEVNIDSLVVV 530
Query: 121 QAFKNS 126
+ K S
Sbjct: 531 VSIKIS 548
Score = 31.6 bits (70), Expect(2) = 5e-07
Identities = 12/21 (57%), Positives = 14/21 (66%)
Frame = +2
Query: 153 HIYREGNSCADVLANIGLTLD 173
HIYRE N CAD LAN ++
Sbjct: 629 HIYREANGCADALANYACAME 691
>CA803034
Length = 456
Score = 37.7 bits (86), Expect(2) = 1e-06
Identities = 26/103 (25%), Positives = 49/103 (47%)
Frame = +3
Query: 17 FTILKAFNVSIHPPRSMLVKEVLWSPPIHSWIKGNCVGAFASGKAACGGIFRNRLGQFLG 76
++ LK F+VS HP ++ ++K+V W P + + G G+ + Q +
Sbjct: 60 YSTLKNFSVSSHPWKAWVIKQVSWKLPDYPF------G*NVIQTVQLRGVL---VWQVVA 212
Query: 77 CFAEGLSYGNSLFAELCGAMRSIEFAQSKSWNNFWLETDSMLV 119
+ ++ FAE+ A+ +I+ W+ FWL+T S L+
Sbjct: 213 DCSYSKGVNSAFFAEVMAAIIAIDKPVENRWDFFWLKTYSSLL 341
Score = 30.8 bits (68), Expect(2) = 1e-06
Identities = 12/15 (80%), Positives = 14/15 (93%)
Frame = +2
Query: 152 SHIYREGNSCADVLA 166
+HIYREGN+CAD LA
Sbjct: 356 THIYREGNACADKLA 400
>TC211008
Length = 516
Score = 46.2 bits (108), Expect = 1e-05
Identities = 35/144 (24%), Positives = 60/144 (41%), Gaps = 14/144 (9%)
Frame = +1
Query: 40 WSPPIHSWIKGNCVGAFASGK------AACGGIFRNRLGQFLGCFAEGLSYGNSLFAELC 93
W P + W+ N G+ +ACGG+ R+ G +LG F L + AEL
Sbjct: 64 WRCPXNGWVCLNTDGSVFENHRNGCSGSACGGLVRDSSGCYLGGFTVNLGNTSVTLAELW 243
Query: 94 GAMRSIEFAQSKSWNNFWLETDSMLVVQAFKNSAIVPWQVRNRWNNCQIIISNMNFLA-- 151
G + ++ A ++ DS + ++ + + ++S +N L
Sbjct: 244 GVVHGLKLAWDLGCKKVKVDIDSGNALGLVRHGPVAN-------DPAFALVSEINELVRK 402
Query: 152 ------SHIYREGNSCADVLANIG 169
SH++RE N AD LA++G
Sbjct: 403 EWLVEFSHVFRESNRAADKLAHLG 474
>AW350693
Length = 563
Score = 42.7 bits (99), Expect = 1e-04
Identities = 20/56 (35%), Positives = 32/56 (56%)
Frame = -3
Query: 114 TDSMLVVQAFKNSAIVPWQVRNRWNNCQIIISNMNFLASHIYREGNSCADVLANIG 169
T + VQA N + W++RNR N ++ +M+FL +I+RE N+C +L G
Sbjct: 219 TSVLCYVQASGNHRTIIWRIRNR*MNRIMLTIDMHFLVGYIFREWNTCVKILVTHG 52
>TC232226
Length = 667
Score = 40.8 bits (94), Expect = 4e-04
Identities = 18/35 (51%), Positives = 24/35 (68%)
Frame = -3
Query: 6 SKLSASSSITEFTILKAFNVSIHPPRSMLVKEVLW 40
SK SI EFT+LK+F V++HPP+ L+K V W
Sbjct: 137 SKGHLGHSILEFTLLKSFAVNVHPPKPSLIKRVDW 33
>TC203464 similar to GB|AAF35920.1|7025820|AC005892 L505.8 {Leishmania
major;} , partial (8%)
Length = 1386
Score = 37.7 bits (86), Expect = 0.004
Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 4/100 (4%)
Frame = +2
Query: 30 PRSMLVKEVLWSPPIHSWIKGNCVGA---FASGKAACGGIFRNRLGQFLGCFAEGLSYGN 86
PR W P W+K N G+ + S A CGG+ R+ ++L FA+ L+
Sbjct: 458 PREGATTVSRWKKPEIGWVKLNVDGSRDPYKSSSAGCGGVLRDASAKWLRGFAKKLNPTY 637
Query: 87 SLF-AELCGAMRSIEFAQSKSWNNFWLETDSMLVVQAFKN 125
++ EL + ++ A + +E+DS VV +N
Sbjct: 638 AVHQTELEAILTGLKVASEMNVKKLIVESDSDSVVSMVEN 757
>BQ253293
Length = 420
Score = 37.7 bits (86), Expect = 0.004
Identities = 23/69 (33%), Positives = 36/69 (51%), Gaps = 4/69 (5%)
Frame = +2
Query: 11 SSSITEFTILKAFNVSIHPPRS---MLVKEVLWSPPIHSWIKGNCVGAF-ASGKAACGGI 66
S +T+ F++S+ PP S + K + W PP +K NC A +G A+CGG+
Sbjct: 41 SEEVTDIWQAHRFSLSL-PPHS*DLAVGKSICWEPPPTVQLKLNCEAAVNRNGMASCGGV 217
Query: 67 FRNRLGQFL 75
R+ G+ L
Sbjct: 218 MRDPYGKVL 244
>CO980802
Length = 803
Score = 35.0 bits (79), Expect = 0.023
Identities = 19/60 (31%), Positives = 33/60 (54%), Gaps = 2/60 (3%)
Frame = -3
Query: 47 WIKGNCVGAF--ASGKAACGGIFRNRLGQFLGCFAEGLSYGNSLFAELCGAMRSIEFAQS 104
+ K N G++ ++ K AC GI + G+ L CF + NS++A+L G I+ A++
Sbjct: 405 FFKVNVDGSYKASTAKMACRGIILDSFGRVLSCFHSNIGVCNSIWAKLWGLKHGIKPARN 226
>CA851573
Length = 614
Score = 33.5 bits (75), Expect = 0.068
Identities = 27/101 (26%), Positives = 43/101 (41%), Gaps = 2/101 (1%)
Frame = +1
Query: 71 LGQFLGCFAEGLSYGNSLFAELCGAMRSIEFAQSKSWNNFWLETDSMLVVQAFKNSAIVP 130
+ + L CF++ L Y N + AEL I+ A + E D+ V VP
Sbjct: 43 IAEXLSCFSKRLGYCNVILAELWSLKYGIKLAIDLNTQFSLFEMDAREAVHMIDXIKSVP 222
Query: 131 WQVRNRWNNCQIIISNMNFLAS--HIYREGNSCADVLANIG 169
+R + + ++ + +S HI RE NS A+ A G
Sbjct: 223 SSLRPLAADIRRVLWANGWQSSIVHINREANSAANWAAKFG 345
>TC208144 similar to UP|Q8VYU3 (Q8VYU3) GTP cyclohydrolase I , partial (28%)
Length = 1195
Score = 33.5 bits (75), Expect = 0.068
Identities = 14/31 (45%), Positives = 18/31 (57%)
Frame = +3
Query: 152 SHIYREGNSCADVLANIGLTLDTYVYYYSLP 182
SH++RE N+CAD LAN+ L Y P
Sbjct: 972 SHVHREANACADALANLSYGLQLGFMMYEQP 1064
>AW277722 similar to SP|O65719|HS73_ Heat shock cognate 70 kDa protein 3
(Hsc70.3). [Mouse-ear cress] {Arabidopsis thaliana},
partial (8%)
Length = 432
Score = 31.2 bits (69), Expect = 0.34
Identities = 19/61 (31%), Positives = 27/61 (44%), Gaps = 6/61 (9%)
Frame = +1
Query: 40 WSPPIHSWIKGNCVGAFASGK------AACGGIFRNRLGQFLGCFAEGLSYGNSLFAELC 93
W P + W+ N G+ +ACGG+ R+ G +LG F L + AEL
Sbjct: 247 WRCPPNGWVCLNTDGSVFENHRNGCSGSACGGLVRDSSGCYLGGFTVNLGNTSVTLAELW 426
Query: 94 G 94
G
Sbjct: 427 G 429
>TC232296 similar to UP|Q941I4 (Q941I4) Vacuolar acid invertase, partial
(10%)
Length = 593
Score = 28.9 bits (63), Expect = 1.7
Identities = 12/35 (34%), Positives = 20/35 (56%)
Frame = -1
Query: 139 NCQIIISNMNFLASHIYREGNSCADVLANIGLTLD 173
NCQI+ + F++ + N A +A++G TLD
Sbjct: 164 NCQILSETLMFMSVALLNRNNLAAPYIASVG*TLD 60
>TC234126
Length = 428
Score = 28.9 bits (63), Expect = 1.7
Identities = 21/61 (34%), Positives = 28/61 (45%), Gaps = 8/61 (13%)
Frame = -2
Query: 135 NRWNNCQIIISNMN------FLASHIYREGN--SCADVLANIGLTLDTYVYYYSLPLQVR 186
N WN C+ II + + Y + S A+VL NIG T +SLPL+ R
Sbjct: 274 NEWNLCKYIIISFQKGVRKWWGLRQFYHHNSFLSFANVLPNIGCTSSFQSNLFSLPLRQR 95
Query: 187 S 187
S
Sbjct: 94 S 92
>CO981985
Length = 903
Score = 28.9 bits (63), Expect = 1.7
Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 1/79 (1%)
Frame = -1
Query: 92 LCGAMRSIEFAQSKSWNNFWLETDSMLVVQAFKNSAI-VPWQVRNRWNNCQIIISNMNFL 150
+ +R + A S + N ++DS+L +Q + + Q ++ + N+
Sbjct: 387 ITSCVRGLTLAWSAVYKNIECDSDSLLALQLLV*EGVTIHHQHASKHKQT*AVTVNV--- 217
Query: 151 ASHIYREGNSCADVLANIG 169
H+YR+ N CAD L+ G
Sbjct: 216 -PHVYRKANQCAD*LSKKG 163
>TC206238 weakly similar to UP|Q7PX96 (Q7PX96) AgCP12237 (Fragment), partial
(9%)
Length = 1259
Score = 28.5 bits (62), Expect = 2.2
Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 7/63 (11%)
Frame = +3
Query: 20 LKAFNVSIHPPRSMLVKEVLWSPPI-----HSWIKGNCVGAF--ASGKAACGGIFRNRLG 72
++A I PR++L W P+ H ++ N GA+ +S AACGGIFR+
Sbjct: 537 MRAHASHILMPRNLLK----WRRPLPLSHGHWLLRLNVSGAYDRSSDTAACGGIFRDNND 704
Query: 73 QFL 75
+F+
Sbjct: 705 RFV 713
>TC231053 similar to UP|Q941I4 (Q941I4) Vacuolar acid invertase, partial
(34%)
Length = 887
Score = 28.1 bits (61), Expect = 2.9
Identities = 12/35 (34%), Positives = 20/35 (56%)
Frame = -3
Query: 139 NCQIIISNMNFLASHIYREGNSCADVLANIGLTLD 173
NCQI+ + F++ + N A +A++G TLD
Sbjct: 654 NCQILSEALMFMSVALLNRNNLAAPYIASVG*TLD 550
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.323 0.135 0.433
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,728,340
Number of Sequences: 63676
Number of extensions: 183145
Number of successful extensions: 949
Number of sequences better than 10.0: 48
Number of HSP's better than 10.0 without gapping: 943
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 945
length of query: 204
length of database: 12,639,632
effective HSP length: 93
effective length of query: 111
effective length of database: 6,717,764
effective search space: 745671804
effective search space used: 745671804
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 56 (26.2 bits)
Medicago: description of AC149197.4