
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC126784.4 - phase: 0 /pseudo
(525 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC88637 similar to GP|14596217|gb|AAK68836.1 Unknown protein {Ar... 702 0.0
TC77948 similar to PIR|C85360|C85360 hypothetical protein AT4g30... 280 1e-75
TC83112 similar to GP|9758290|dbj|BAB08814.1 emb|CAB86899.1~gene... 30 2.3
BQ123584 similar to PIR|T47502|T47 hypothetical protein F9K21.20... 30 3.0
BQ157905 homologue to GP|10441918|gb| unknown {Homo sapiens}, pa... 30 3.0
TC82818 similar to GP|20259203|gb|AAM14317.1 unknown protein {Ar... 30 3.0
BG645169 similar to PIR|D86157|D86 hypothetical protein AAF02892... 28 6.6
TC79680 weakly similar to GP|21751020|dbj|BAC03887. unnamed prot... 28 6.6
>TC88637 similar to GP|14596217|gb|AAK68836.1 Unknown protein {Arabidopsis
thaliana}, partial (43%)
Length = 1207
Score = 702 bits (1812), Expect = 0.0
Identities = 351/393 (89%), Positives = 355/393 (90%)
Frame = +1
Query: 1 MVRFMGSKNPRSQWESSSATSSISPKFEIEDSIQDQHAPLNKRHKATNDILNEPSPLGLS 60
MVRFMGSKNPRSQWESSSATSSISPKFEIEDSIQDQHAPLNKRHKATNDILNEPSPLGLS
Sbjct: 118 MVRFMGSKNPRSQWESSSATSSISPKFEIEDSIQDQHAPLNKRHKATNDILNEPSPLGLS 297
Query: 61 LRKSPSLLDLIQMTLCQENSVNANTANDNLNSKANKNGRASVEKLKASNFPATHLKIGSW 120
LRKSPSLLDLIQMTLCQENSVNANTANDNLNSKANKNGRASVEKLKASNFPATHLKIGSW
Sbjct: 298 LRKSPSLLDLIQMTLCQENSVNANTANDNLNSKANKNGRASVEKLKASNFPATHLKIGSW 477
Query: 121 EYKSKYEGDLVAKCYFAKQKLVWEVLEGELKSKIEIQWSDISQLKANCPDDGPSTLTLMV 180
EYKSKYEGDLVAKCYFAKQKLVWEVLEGELKSKIEIQWSDISQLKANCPDDGPSTLTLMV
Sbjct: 478 EYKSKYEGDLVAKCYFAKQKLVWEVLEGELKSKIEIQWSDISQLKANCPDDGPSTLTLMV 657
Query: 181 ARQPLFFRETNPQPRKHTLWQSTTDFTGGQASIHRRHVLQCEQGLLIKHYEKLVQCNDRL 240
ARQPLFFRETNPQPRKHTLWQSTTDFTGGQASIHRRHVLQCEQGLLIKHYEKLVQCNDRL
Sbjct: 658 ARQPLFFRETNPQPRKHTLWQSTTDFTGGQASIHRRHVLQCEQGLLIKHYEKLVQCNDRL 837
Query: 241 KFLSQQPEIMVDSPHFDPRSAAIENPHNLKDCDLHQGNGSAVSCFQNMGSPHSSLSPSFT 300
KFLSQQPEIMVDSPHFDPRSAAIENPHNLKDCDLHQGNGSAVSCFQNMGSPHSSLSPSFT
Sbjct: 838 KFLSQQPEIMVDSPHFDPRSAAIENPHNLKDCDLHQGNGSAVSCFQNMGSPHSSLSPSFT 1017
Query: 301 TEHSDPSAITLDSVPCEAPSSSSA*IMKFSSKICQMSII*NL**LCKSVIVVLAC*FHQ* 360
TEHSDPSAITLDSVPCEAPSSSS + ++
Sbjct: 1018 TEHSDPSAITLDSVPCEAPSSSSEAM------------------------------YNSE 1107
Query: 361 FNMLGSRNWDQIKLPGLRPSMSMSDFLGHIEHH 393
+ GSRNWDQIKLP LRPSMSMSDFLGHIEHH
Sbjct: 1108 ADSKGSRNWDQIKLPXLRPSMSMSDFLGHIEHH 1206
>TC77948 similar to PIR|C85360|C85360 hypothetical protein AT4g30780
[imported] - Arabidopsis thaliana, partial (28%)
Length = 1916
Score = 280 bits (716), Expect = 1e-75
Identities = 140/205 (68%), Positives = 163/205 (79%)
Frame = +2
Query: 51 LNEPSPLGLSLRKSPSLLDLIQMTLCQENSVNANTANDNLNSKANKNGRASVEKLKASNF 110
L+EPSPLGL L+KSPSLLDLIQM L Q + + + ++ A+ A+ KLKASNF
Sbjct: 269 LDEPSPLGLRLKKSPSLLDLIQMKLSQ--TYESKKKDQKGSASASAAAAAADSKLKASNF 442
Query: 111 PATHLKIGSWEYKSKYEGDLVAKCYFAKQKLVWEVLEGELKSKIEIQWSDISQLKANCPD 170
PAT LKIG+WEYKS+YEGDLVAKCYFAK KLVWEVL+G LK+KIEI WSDI LKAN PD
Sbjct: 443 PATVLKIGTWEYKSRYEGDLVAKCYFAKHKLVWEVLDGCLKNKIEIPWSDIMALKANYPD 622
Query: 171 DGPSTLTLMVARQPLFFRETNPQPRKHTLWQSTTDFTGGQASIHRRHVLQCEQGLLIKHY 230
D P TL +++AR+PLFFRE NPQPRKHTLWQSTTDFTGGQAS+HRRH +QC QGLL +H+
Sbjct: 623 DAPGTLEVILARRPLFFREINPQPRKHTLWQSTTDFTGGQASMHRRHFVQCPQGLLGRHF 802
Query: 231 EKLVQCNDRLKFLSQQPEIMVDSPH 255
EKL+QC+ RL FLSQQ I V H
Sbjct: 803 EKLIQCDPRLNFLSQQTGICVGILH 877
Score = 120 bits (302), Expect = 1e-27
Identities = 76/171 (44%), Positives = 100/171 (58%), Gaps = 13/171 (7%)
Frame = +3
Query: 368 NWDQIKLPGLRPSMSMSDFLGHIEHHISKEMASGDPSFSAERLEYQQMMDGITQHLLNDN 427
N QIKLPGL PSMSM D + HIEH IS++M + SF+ +R +++ TQ+L ND+
Sbjct: 1146 NLSQIKLPGLHPSMSMDDLVNHIEHCISEQMGPENSSFTNDR----AVLEEFTQYLFNDS 1313
Query: 428 ----QVTTDSDEKSLMSRVNSLRCLLQMDPPAVPNSHDNTG--FIEGPNDAKVN------ 475
SDE+++MSRVNSL CLLQ DP A G + D KV+
Sbjct: 1314 IFPPVSDEQSDEQNVMSRVNSLYCLLQKDPSAPEEKQVQNGNNVFDAAEDRKVDEGKSKM 1493
Query: 476 IDIKATEENSRDVYGGNPAP-GMSRKDSFGDLLLSLPRIASLPKFLFDISE 525
D+ +++ D G G+SRK+S GDLLL+LPRIASLP FLF +SE
Sbjct: 1494 FDLGFQQDDDDDASGSQQQENGLSRKESAGDLLLNLPRIASLPHFLFPMSE 1646
>TC83112 similar to GP|9758290|dbj|BAB08814.1
emb|CAB86899.1~gene_id:MLE2.12~strong similarity to
unknown protein {Arabidopsis thaliana}, partial (34%)
Length = 631
Score = 30.0 bits (66), Expect = 2.3
Identities = 18/40 (45%), Positives = 26/40 (65%), Gaps = 1/40 (2%)
Frame = -2
Query: 274 LHQGNGS-AVSCFQNMGSPHSSLSPSFTTEHSDPSAITLD 312
LHQG GS VS N S HSS+S + +HS+ +AI+++
Sbjct: 573 LHQGGGSFVVSDLTN--SSHSSMSHMYHVDHSNYNAISIN 460
>BQ123584 similar to PIR|T47502|T47 hypothetical protein F9K21.200 -
Arabidopsis thaliana, partial (30%)
Length = 577
Score = 29.6 bits (65), Expect = 3.0
Identities = 20/56 (35%), Positives = 31/56 (54%), Gaps = 2/56 (3%)
Frame = +1
Query: 228 KHYEKLVQCNDRLKFLSQQPEIMVDSPHFD--PRSAAIENPHNLKDCDLHQGNGSA 281
K E LV ND L +L ++ + FD P SAA E+P NL++ ++ G+ +A
Sbjct: 184 KSSELLVSYNDDLIYLFEK------NSSFDSLPSSAACEDPKNLQETQVYSGHRNA 333
>BQ157905 homologue to GP|10441918|gb| unknown {Homo sapiens}, partial (3%)
Length = 811
Score = 29.6 bits (65), Expect = 3.0
Identities = 21/61 (34%), Positives = 29/61 (47%)
Frame = +2
Query: 263 IENPHNLKDCDLHQGNGSAVSCFQNMGSPHSSLSPSFTTEHSDPSAITLDSVPCEAPSSS 322
+ N N+ D DL +G A+ N G + + P S PS++TLD PC P S
Sbjct: 5 LNNSFNILDEDLPLLSGEAIFR-DNHGIQYHIVMPFI----SCPSSLTLDKRPCNLPPKS 169
Query: 323 S 323
S
Sbjct: 170 S 172
>TC82818 similar to GP|20259203|gb|AAM14317.1 unknown protein {Arabidopsis
thaliana}, partial (43%)
Length = 1176
Score = 29.6 bits (65), Expect = 3.0
Identities = 26/86 (30%), Positives = 42/86 (48%), Gaps = 5/86 (5%)
Frame = +1
Query: 13 QWESSSATSS-----ISPKFEIEDSIQDQHAPLNKRHKATNDILNEPSPLGLSLRKSPSL 67
+W+S S + +SP+ + ED D LNKR+K ND+L + + ++P+L
Sbjct: 4 KWKSISKVMAERGYRVSPQ-QCEDKFND----LNKRYKRLNDMLGRGT--SCQVVENPAL 162
Query: 68 LDLIQMTLCQENSVNANTANDNLNSK 93
LD+I+ N + LNSK
Sbjct: 163 LDVIEYL----NEKEKDDVRKILNSK 228
>BG645169 similar to PIR|D86157|D86 hypothetical protein AAF02892.1
[imported] - Arabidopsis thaliana, partial (6%)
Length = 779
Score = 28.5 bits (62), Expect = 6.6
Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Frame = +1
Query: 182 RQPLFFRETN--PQPRKHTLWQSTTDFTGGQASIHRRHVLQC 221
R PL +T+ + K+ + S T FTGG S+ R HV++C
Sbjct: 352 RMPLSSSQTSLPEENNKNNSFISGTIFTGGFNSVTRGHVIEC 477
>TC79680 weakly similar to GP|21751020|dbj|BAC03887. unnamed protein product
{Homo sapiens}, partial (13%)
Length = 2636
Score = 28.5 bits (62), Expect = 6.6
Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Frame = -3
Query: 9 NPRSQWESSSATSSISPKFEIEDSIQDQHAPLNKRHKA-TNDILNEPSPLGLSLRKS 64
N + E TSS+ K D + Q++P+ RHK + D L + + +G+ ++S
Sbjct: 2364 NNYREMEYKRETSSVILKLMGLDKVPSQNSPVRNRHKVLSEDYLQKVASIGVRKKRS 2194
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.322 0.135 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,062,667
Number of Sequences: 36976
Number of extensions: 248841
Number of successful extensions: 1216
Number of sequences better than 10.0: 16
Number of HSP's better than 10.0 without gapping: 1200
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1212
length of query: 525
length of database: 9,014,727
effective HSP length: 101
effective length of query: 424
effective length of database: 5,280,151
effective search space: 2238784024
effective search space used: 2238784024
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)
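
The search-space figures in the statistics above can be reproduced from the other values in this report. The sketch below is illustrative only and is not part of the BLAST output. It assumes that the "length of database" is the nucleotide letter count divided by three (which matches the numbers here for a translated TBLASTN search), that the effective lengths are obtained by subtracting the effective HSP length once per sequence, and that bit scores follow the standard Karlin-Altschul conversion S' = (lambda * S - ln K) / ln 2 with the gapped Lambda and K listed above. The function and constant names are hypothetical.

```python
import math

# Values copied from this report's statistics section.
QUERY_LEN = 525              # "length of query"
DB_LETTERS_NT = 27_044_181   # "Number of letters in database" (nucleotides)
DB_SEQS = 36_976             # "Number of sequences in database"
HSP_LEN = 101                # "effective HSP length"
GAPPED_LAMBDA = 0.267        # gapped Lambda (rounded as printed)
GAPPED_K = 0.0410            # gapped K (rounded as printed)


def effective_search_space(query_len, db_letters_nt, db_seqs, hsp_len):
    """Return (db length in residues, effective query length,
    effective db length, effective search space).

    Assumption: the nucleotide database is counted in translated
    residues, i.e. letters // 3, as the figures in this report suggest.
    """
    db_len_aa = db_letters_nt // 3
    eff_query = query_len - hsp_len
    eff_db = db_len_aa - db_seqs * hsp_len
    return db_len_aa, eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS_NT, DB_SEQS, HSP_LEN))
    # Raw scores taken from the alignments above (the values in parentheses).
    for raw in (1812, 716, 66, 65, 62):
        print(raw, round(bit_score(raw), 1))
```

Because Lambda and K are printed to only three significant digits, recomputed bit scores can differ slightly from the reported ones, most visibly for the high-scoring alignments.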