
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC119416.7 - phase: 0 /pseudo
(447 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC79089 weakly similar to PIR|T46111|T46111 probable transposase... 125 3e-29
BE204282 weakly similar to PIR|C85069|C850 hypothetical protein ... 90 2e-18
TC82295 similar to PIR|A96722|A96722 unknown protein T17F3.2 [im... 67 1e-11
TC87384 56 3e-08
BI310751 56 3e-08
BG645718 56 3e-08
BF645752 homologue to GP|22713420|gb| Unknown (protein for MGC:3... 52 4e-07
TC81829 46 3e-05
BE325071 similar to GP|3928077|gb|A unknown protein {Arabidopsis... 44 1e-04
TC80007 44 2e-04
BQ751384 39 0.004
TC90414 39 0.005
TC87385 similar to GP|23093788|gb|AAF50220.2 CG16711-PA {Drosoph... 35 0.077
TC90179 weakly similar to GP|9719456|gb|AAF97810.1| transposase ... 33 0.22
BE203595 31 0.85
BQ141683 29 3.2
TC76730 similar to GP|20521241|dbj|BAB91757. putative 60S riboso... 28 5.5
TC76729 similar to GP|20521241|dbj|BAB91757. putative 60S riboso... 28 5.5
TC81390 similar to PIR|G85255|G85255 CDP-diacylglycerol syntheta... 28 5.5
TC88553 weakly similar to GP|21327037|gb|AAM48133.1 putative fla... 28 7.2
>TC79089 weakly similar to PIR|T46111|T46111 probable transposase -
Arabidopsis thaliana, partial (36%)
Length = 942
Score = 125 bits (314), Expect = 3e-29
Identities = 79/292 (27%), Positives = 137/292 (46%), Gaps = 3/292 (1%)
Frame = +3
Query: 159 KFKIPSRVSVARDCLRLYFDEKETLKSVLSANNQMVSLTTDTWTSIQNMNYMCVTAHYID 218
+F + + ++ DC+ Y EK+ L LT D WTS Q++ Y+ +T H++D
Sbjct: 9 QFNMVTFNTIQGDCVATYLTEKQNLSKYFDELPGRFGLTLDMWTSSQSVGYVFITGHFVD 188
Query: 219 DEWVLKKKILSFNLIAD---HKGETIGIALENCMKDWGIRSICCVTVDNASANNLAINYL 275
+W L+K+IL N++ + + A+ C+ DW + N +A+ L
Sbjct: 189 SDWKLQKRIL--NVVMEPCPDSDSALSHAVSACISDWNLEGRLFTITCNQPLTKVALENL 362
Query: 276 NRGMRVWNGRTLFNGEYLHMRCCAHILNLIVLDGLKDIDASVQRIRAACKFVKSSPSRLA 335
+ V N +FNG+ L C A L+ + D L V +IR + K+VK+S S
Sbjct: 363 RPLLSVKNP-LIFNGQLLIGHCIARTLSNVAYDLLSSAQGIVNKIRESVKYVKTSESHED 539
Query: 336 TFKKCADSVGVCSKAMVTLDVVTRWNSTYLMLNIAEKYEHVFYRLEHADAAFVTSLSGEL 395
F + + V S+ + +D T+WN+TY ML A + + VF L+ +D +
Sbjct: 540 KFLDLKEHLQVPSEKSLFIDDQTKWNTTYQMLVAASELKEVFSCLDTSDP--------DY 695
Query: 396 GGCPDHDDWERSRVFVKFLKTFYDATLSFSGSLHVSANSFFKQLMEIQKTLN 447
G P DW+ + +LK YDA + + +A +FF ++ ++ L+
Sbjct: 696 KGAPSIPDWKLVEILCTYLKPLYDAANILMTTTYPTAITFFHEVXKLHLDLS 851
>BE204282 weakly similar to PIR|C85069|C850 hypothetical protein AT4g05510
[imported] - Arabidopsis thaliana, partial (16%)
Length = 563
Score = 90.1 bits (222), Expect = 2e-18
Identities = 61/175 (34%), Positives = 90/175 (50%), Gaps = 9/175 (5%)
Frame = +3
Query: 249 MKDWGI-RSICCVTVDNASAN-----NLA--INYLNRGMRVWNGRTLFNGEYLHMRCCAH 300
+++WGI ++I +T+DNASAN NL + L+ +R + H +CCAH
Sbjct: 57 LQEWGIEKNIFSITMDNASANDGNSQNLKDQLCSLDSLLRTFITSATAPSLEKHFKCCAH 236
Query: 301 ILNLIVLDGLKDIDASVQRIRAACKFVKSSPSRLATFKKCADSV-GVCSKAMVTLDVVTR 359
+L+L+V + LK + + +IR + KFV S SRL F +C + V G S + LDV +
Sbjct: 237 VLDLMVQESLKVVSDVLDKIRNSLKFVSVSNSRLKQFCQCVEEVGGGDSSDSLHLDVSGK 416
Query: 360 WNSTYLMLNIAEKYEHVFYRLEHADAAFVTSLSGELGGCPDHDDWERSRVFVKFL 414
W STYLML A KY F + D + CP ++WER +FL
Sbjct: 417 WVSTYLMLKSAIKYRSAFEHMCLKDTTY--------NHCPSSEEWERGEKICEFL 557
>TC82295 similar to PIR|A96722|A96722 unknown protein T17F3.2 [imported] -
Arabidopsis thaliana, partial (17%)
Length = 698
Score = 67.4 bits (163), Expect = 1e-11
Identities = 55/176 (31%), Positives = 86/176 (48%), Gaps = 33/176 (18%)
Frame = +1
Query: 39 KKPQTCSKVWRHFNRNKN----LAVCKYCGKEYAANSSSHGTTNLGKHLKVCLKNP---- 90
K + S VW F+R K +AVC++C K+ + +S+S GT++L HL C +
Sbjct: 142 KSSRLKSVVWNDFDRIKKGDTCVAVCRHCKKKLSGSSTS-GTSHLRNHLIRCQRRSNHGL 318
Query: 91 --YRVVDKKQK--TLAIGMGSED-DPN--------------------SVSFKLVDFSQEK 125
Y +K+K +LAI S D DPN SV+ +F Q +
Sbjct: 319 AQYITAREKRKEGSLAITNFSLDQDPNKDDTVSLVNIKFEQAQLKDESVNTGNSNFDQRR 498
Query: 126 TRLALAKMIIIDELPFKHVENEGFKMFMGEAQPKFKIPSRVSVARDCLRLYFDEKE 181
+R LA+MII+ P VE+ GF+ F+ QP F++ + V DC+ +Y E++
Sbjct: 499 SRFDLARMIILHGYPLAMVEHVGFRAFVKNLQPLFELVTLNRVEADCIEIYDKERK 666
>TC87384
Length = 740
Score = 55.8 bits (133), Expect = 3e-08
Identities = 22/53 (41%), Positives = 34/53 (63%)
Frame = +2
Query: 36 KQKKKPQTCSKVWRHFNRNKNLAVCKYCGKEYAANSSSHGTTNLGKHLKVCLK 88
K++K + S VWRHF + K+ A C YC + +N+S+H T N+ KH+ +C K
Sbjct: 512 KKRKPARPPSMVWRHFMKKKDTAFCNYCFTSFVSNASNHRTNNMLKHMMICPK 670
>BI310751
Length = 712
Score = 55.8 bits (133), Expect = 3e-08
Identities = 22/53 (41%), Positives = 34/53 (63%)
Frame = +1
Query: 36 KQKKKPQTCSKVWRHFNRNKNLAVCKYCGKEYAANSSSHGTTNLGKHLKVCLK 88
K++K + S VWRHF + K+ A C YC + +N+S+H T N+ KH+ +C K
Sbjct: 472 KKRKPARPPSMVWRHFMKKKDTAFCNYCFTSFVSNASNHRTNNMLKHMMICPK 630
>BG645718
Length = 756
Score = 55.8 bits (133), Expect = 3e-08
Identities = 22/53 (41%), Positives = 34/53 (63%)
Frame = +3
Query: 36 KQKKKPQTCSKVWRHFNRNKNLAVCKYCGKEYAANSSSHGTTNLGKHLKVCLK 88
K++K + S VWRHF + K+ A C YC + +N+S+H T N+ KH+ +C K
Sbjct: 522 KKRKPARPPSMVWRHFMKKKDTAFCNYCFTSFVSNASNHRTNNMLKHMMICPK 680
>BF645752 homologue to GP|22713420|gb| Unknown (protein for MGC:33586) {Homo
sapiens}, partial (1%)
Length = 321
Score = 52.4 bits (124), Expect = 4e-07
Identities = 23/51 (45%), Positives = 35/51 (68%)
Frame = +3
Query: 130 LAKMIIIDELPFKHVENEGFKMFMGEAQPKFKIPSRVSVARDCLRLYFDEK 180
+A+++I+ +LPF VE++ F QP + IPSR +VARDC R++ DEK
Sbjct: 153 IARLVILSKLPFSVVESDRFNRLCKLLQPLWNIPSRRTVARDCFRMFXDEK 305
>TC81829
Length = 701
Score = 46.2 bits (108), Expect = 3e-05
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 4/47 (8%)
Frame = +2
Query: 45 SKVWRHFNRNKN----LAVCKYCGKEYAANSSSHGTTNLGKHLKVCL 87
S VW FN N +AVCK+C K Y +S +HGT+++ H K+CL
Sbjct: 560 SPVWLDFNPLPNEPEPIAVCKHCHKRYRCHSKAHGTSSMLAHTKICL 700
>BE325071 similar to GP|3928077|gb|A unknown protein {Arabidopsis thaliana},
partial (5%)
Length = 616
Score = 44.3 bits (103), Expect = 1e-04
Identities = 38/148 (25%), Positives = 63/148 (41%), Gaps = 5/148 (3%)
Frame = +2
Query: 47 VWRHFNR----NKNLAVCKYCGKEYAANSSSHGTTNLGKHLKVCLKNPYRVVDKKQKTLA 102
VW +F + K L CK CGK++ + + N +H+ C P + + K
Sbjct: 206 VWNYFVKIEADGKELCECKTCGKQFISGRTGISHLNQVQHITKC---PLIMKSRIAK--- 367
Query: 103 IGMGSEDDPNSVSFKLVDFSQEKTRLALAKMIIIDELPFKHVENEGFKMFMGEAQ-PKFK 161
FKL R ++ +M I +PF+ E + F+ F A + +
Sbjct: 368 ------------HFKLNKIDHRMVRDSVTQMTIKSYIPFQFAEWDEFRAFTKFASFNEAR 511
Query: 162 IPSRVSVARDCLRLYFDEKETLKSVLSA 189
SR +V D +++Y EK+ LK L+A
Sbjct: 512 SLSRDNVVADVMKVYLLEKDKLKKQLAA 595
>TC80007
Length = 1599
Score = 43.5 bits (101), Expect = 2e-04
Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 4/56 (7%)
Frame = +3
Query: 39 KKPQTCSKVWRHFNRNK----NLAVCKYCGKEYAANSSSHGTTNLGKHLKVCLKNP 90
K+P+ SKVW R + N +CKYCGK N GT++L +HL +C K P
Sbjct: 168 KRPKMTSKVWDEMERVQTTEGNKVLCKYCGKLLQDNC---GTSHLKRHLVICPKRP 326
>BQ751384
Length = 455
Score = 38.9 bits (89), Expect = 0.004
Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 3/109 (2%)
Frame = -1
Query: 118 LVDFSQEKTRLALAKMIIIDELPFKHVENEGFKMFMGEAQPKFKIPSRVSVARDCLRLYF 177
+ +F++E+ L + +I+ + L F ++++ FK + I +R V Y
Sbjct: 329 ITEFTEEEATLKILNLIVENNLSFSILDSKSFKDLLNYYNKSNPILNRHKVKNLLEFTYS 150
Query: 178 DEKETLKSVLSANNQMV---SLTTDTWTSIQNMNYMCVTAHYIDDEWVL 223
T+ L AN + + SLT D WTS Y+ + YI+ + L
Sbjct: 149 RFLSTINYELEANIKSLGKFSLTLDIWTSKSQDTYLSIIISYINSSFKL 3
>TC90414
Length = 621
Score = 38.5 bits (88), Expect = 0.005
Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Frame = +1
Query: 45 SKVWRHFNRNKN----LAVCKYCGKEYAANSSSHGTTNLGKHLKVC 86
S WRH+ R K A+CKYC K+ +++GT++L HL +C
Sbjct: 379 SVYWRHYKRQKFDGKFKAICKYCDKK-LGGETTNGTSHLKDHLSIC 513
>TC87385 similar to GP|23093788|gb|AAF50220.2 CG16711-PA {Drosophila
melanogaster}, partial (1%)
Length = 879
Score = 34.7 bits (78), Expect = 0.077
Identities = 13/28 (46%), Positives = 18/28 (63%)
Frame = +1
Query: 36 KQKKKPQTCSKVWRHFNRNKNLAVCKYC 63
K++K + S VWRHF + K+ A C YC
Sbjct: 454 KKRKPARPPSMVWRHFMKKKDTAFCNYC 537
Score = 28.9 bits (63), Expect = 4.2
Identities = 12/36 (33%), Positives = 19/36 (52%)
Frame = +2
Query: 53 RNKNLAVCKYCGKEYAANSSSHGTTNLGKHLKVCLK 88
R K L + +N+S+H T N+ KH+ +C K
Sbjct: 506 RKKTLPFVTIASLLFVSNASNHRTNNMLKHMMICPK 613
>TC90179 weakly similar to GP|9719456|gb|AAF97810.1| transposase
{Cryphonectria parasitica}, partial (3%)
Length = 810
Score = 33.1 bits (74), Expect = 0.22
Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 1/89 (1%)
Frame = -1
Query: 167 SVARDCLRLYFDEKETLKSVLSANNQMVSLTTDTWTSIQNMNYMCVTAHYIDDEWVLKKK 226
S AR+ +LY ++ +K + V L+ D WTS + + V H+I +
Sbjct: 399 SQARELHQLYEAKRVIVKEEMRQALTKVHLSFDLWTSPNRLAIIAVFGHFISPAGNDSRY 220
Query: 227 ILSF-NLIADHKGETIGIALENCMKDWGI 254
+L+ H GE I + ++DW +
Sbjct: 219 LLALRRQPGAHSGENIASTICQVIRDWDL 133
>BE203595
Length = 514
Score = 31.2 bits (69), Expect = 0.85
Identities = 16/31 (51%), Positives = 21/31 (67%)
Frame = +1
Query: 417 FYDATLSFSGSLHVSANSFFKQLMEIQKTLN 447
++ TL S ++ NSFF QL+EIQKTLN
Sbjct: 169 YHSTTL*CSVLESLADNSFFNQLIEIQKTLN 261
>BQ141683
Length = 741
Score = 29.3 bits (64), Expect = 3.2
Identities = 14/32 (43%), Positives = 18/32 (55%)
Frame = +2
Query: 75 GTTNLGKHLKVCLKNPYRVVDKKQKTLAIGMG 106
G TNL HLKV L + ++V D + I MG
Sbjct: 425 G*TNLSHHLKVFLPHAHKVFDHVMDAIGIAMG 520
>TC76730 similar to GP|20521241|dbj|BAB91757. putative 60S ribosomal protein
{Oryza sativa (japonica cultivar-group)}, partial (90%)
Length = 974
Score = 28.5 bits (62), Expect = 5.5
Identities = 31/143 (21%), Positives = 61/143 (41%), Gaps = 20/143 (13%)
Frame = +3
Query: 83 LKVCLKNPYRVVDKKQKTLAIGMGSED----DPNSVSFKLVDFSQEKTRLALAKMIIIDE 138
+K+C+ + V++ +K IG+ S D + + KLV +K LA +I +
Sbjct: 285 MKICMLGDAQHVEEAEK---IGLESMDVEALKKLNKNKKLVKKLAKKYHAFLASEAVIKQ 455
Query: 139 LP----------------FKHVENEGFKMFMGEAQPKFKIPSRVSVARDCLRLYFDEKET 182
+P H E+ K+ +A KF++ + + + DEK+
Sbjct: 456 IPRLLGPGLNKAGKFPTLVSHQESLEAKVNETKAMVKFQLKKVLCMGVAVGNVSMDEKQI 635
Query: 183 LKSVLSANNQMVSLTTDTWTSIQ 205
++V + N +VSL W +++
Sbjct: 636 FQNVQLSVNFLVSLLKKNWQNVR 704
>TC76729 similar to GP|20521241|dbj|BAB91757. putative 60S ribosomal protein
{Oryza sativa (japonica cultivar-group)}, partial (90%)
Length = 1006
Score = 28.5 bits (62), Expect = 5.5
Identities = 31/143 (21%), Positives = 61/143 (41%), Gaps = 20/143 (13%)
Frame = +3
Query: 83 LKVCLKNPYRVVDKKQKTLAIGMGSED----DPNSVSFKLVDFSQEKTRLALAKMIIIDE 138
+K+C+ + V++ +K IG+ S D + + KLV +K LA +I +
Sbjct: 261 MKICMLGDAQHVEEAEK---IGLESMDVEALKKLNKNKKLVKKLAKKYHAFLASEAVIKQ 431
Query: 139 LP----------------FKHVENEGFKMFMGEAQPKFKIPSRVSVARDCLRLYFDEKET 182
+P H E+ K+ +A KF++ + + + DEK+
Sbjct: 432 IPRLLGPGLNKAGKFPTLVSHQESLEAKVNETKAMVKFQLKKVLCMGVAVGNVSMDEKQI 611
Query: 183 LKSVLSANNQMVSLTTDTWTSIQ 205
++V + N +VSL W +++
Sbjct: 612 FQNVQLSVNFLVSLLKKNWQNVR 680
>TC81390 similar to PIR|G85255|G85255 CDP-diacylglycerol synthetase-like
protein [imported] - Arabidopsis thaliana, partial (62%)
Length = 1207
Score = 28.5 bits (62), Expect = 5.5
Identities = 21/59 (35%), Positives = 27/59 (45%), Gaps = 8/59 (13%)
Frame = -1
Query: 47 VWRHFNR-----NKNLAV--CKYCGKEYAANSSSHGTTNLGKHLKVC-LKNPYRVVDKK 97
V HFNR NK L V CKYC A +S T + K+ LK+P+ K+
Sbjct: 730 VQNHFNR*TLRHNK*LMVYICKYCHNHLAVHSVGKSTVSWNTIAKILYLKSPFETTCKE 554
>TC88553 weakly similar to GP|21327037|gb|AAM48133.1 putative flavanone
3-hydroxylase {Saussurea medusa}, partial (26%)
Length = 1017
Score = 28.1 bits (61), Expect = 7.2
Identities = 14/54 (25%), Positives = 25/54 (45%)
Frame = +1
Query: 98 QKTLAIGMGSEDDPNSVSFKLVDFSQEKTRLALAKMIIIDELPFKHVENEGFKM 151
+ LA+GM DPN ++ + D L K I ++ P + N G+++
Sbjct: 766 EPNLALGMSKHSDPNLITILMQDDVSGLQVLKDGKWIALEARPHAFIINVGYQL 927
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.320 0.133 0.399
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,230,235
Number of Sequences: 36976
Number of extensions: 205829
Number of successful extensions: 1051
Number of sequences better than 10.0: 41
Number of HSP's better than 10.0 without gapping: 1041
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1045
length of query: 447
length of database: 9,014,727
effective HSP length: 99
effective length of query: 348
effective length of database: 5,354,103
effective search space: 1863227844
effective search space used: 1863227844
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.7 bits)
Medicago: description of AC119416.7