Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003936A_C01 KMC003936A_c01
(1034 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_567449.1| expressed protein; protein id: At4g14930.1, sup... 188 1e-46
pir||F71412 hypothetical protein - Arabidopsis thaliana gi|22448... 162 6e-39
ref|NP_177431.1| unknown protein; protein id: At1g72880.1 [Arabi... 129 6e-29
pir||H96753 hypothetical protein F3N23.8 [imported] - Arabidopsi... 120 3e-26
ref|NP_218859.1| survival protein, putative [Treponema pallidum]... 107 4e-22
>ref|NP_567449.1| expressed protein; protein id: At4g14930.1, supported by cDNA: 38042.
[Arabidopsis thaliana] gi|21593317|gb|AAM65266.1| unknown
[Arabidopsis thaliana] gi|27311791|gb|AAO00861.1|
expressed protein [Arabidopsis thaliana]
Length = 315
Score = 188 bits (477), Expect = 1e-46
Identities = 90/125 (72%), Positives = 109/125 (87%)
Frame = -1
Query: 1004 KRATILITNDDGIDAPGLRALVHSLVTTNLFNILVCAPDSEKSAVSHGITWLHPITAKQI 825
+R I++TNDDGIDAPGLR+LV LV+TNL+++ VCAPDSEKSAVSH I W P+TAK++
Sbjct: 12 ERPIIMVTNDDGIDAPGLRSLVRVLVSTNLYDVRVCAPDSEKSAVSHSIIWSRPLTAKRV 71
Query: 824 QIDGATSAFAVSGTPADCTSLGVSKALFPTLPDLVVSGINKGSNCGYHIVYSGTVAGARE 645
+IDGAT A++V GTPADCT LG+S+ALFP+ PDLV+SGIN GSNCGY+IVYSGTVAGARE
Sbjct: 72 EIDGAT-AYSVDGTPADCTGLGLSEALFPSRPDLVLSGINVGSNCGYNIVYSGTVAGARE 130
Query: 644 AFFND 630
AF D
Sbjct: 131 AFLYD 135
Score = 95.5 bits (236), Expect = 1e-18
Identities = 49/117 (41%), Positives = 75/117 (63%), Gaps = 1/117 (0%)
Frame = -1
Query: 629 GYKLTKQGKSIVRMGWKQVTSETEGRKMTSDMT-NTDTDTDTPKNVGASSVSPEHLSFVR 453
GYKLT+QGKS+ +MGW+QV E +G KM S MT +T++ + +N ++ + F R
Sbjct: 199 GYKLTRQGKSMGKMGWRQVEEEAQGPKMLSTMTMDTESGVVSSENDTSAHAGKDSRLFKR 258
Query: 452 EVRGYQLDDDDDTDHSSLQEGYISVTPLAALSHAEVDCQTYFKDWLQSVPEIPSSSA 282
E+R ++ D+ + L+EG+I+VTPL ALS +VDCQ Y+K+WL + SS+
Sbjct: 259 ELRASISEEGSDSHY--LKEGFITVTPLGALSQTDVDCQNYYKEWLPKITNQSCSSS 313
>pir||F71412 hypothetical protein - Arabidopsis thaliana
gi|2244850|emb|CAB10272.1| hypothetical protein
[Arabidopsis thaliana] gi|7268239|emb|CAB78535.1|
hypothetical protein [Arabidopsis thaliana]
Length = 275
Score = 162 bits (411), Expect = 6e-39
Identities = 89/166 (53%), Positives = 109/166 (65%), Gaps = 41/166 (24%)
Frame = -1
Query: 1004 KRATILITNDDGIDAPGLRALVHSLVTTNLFNILVCAPDS-------------EKSAVSH 864
+R I++TNDDGIDAPGLR+LV LV+TNL+++ VCAPDS EKSAVSH
Sbjct: 12 ERPIIMVTNDDGIDAPGLRSLVRVLVSTNLYDVRVCAPDSYMCNSYLFMKFSREKSAVSH 71
Query: 863 GITWLHPITAKQIQIDGATSAFAVSGTPADCTSLGVSKALFPTLPDLVVSGINKGSNCGY 684
I W P+TAK+++IDGAT A++V GTPADCT LG+S+ALFP+ PDLV+SGIN GSNCGY
Sbjct: 72 SIIWSRPLTAKRVEIDGAT-AYSVDGTPADCTGLGLSEALFPSRPDLVLSGINVGSNCGY 130
Query: 683 HI----------------------------VYSGTVAGAREAFFND 630
++ VYSGTVAGAREAF D
Sbjct: 131 NMSVNISSSVSLCFGFLFPYYVMFLYRLLGVYSGTVAGAREAFLYD 176
>ref|NP_177431.1| unknown protein; protein id: At1g72880.1 [Arabidopsis thaliana]
Length = 385
Score = 129 bits (325), Expect = 6e-29
Identities = 70/150 (46%), Positives = 97/150 (64%), Gaps = 1/150 (0%)
Frame = -1
Query: 1010 ESKRATILITNDDGIDAPGLRALVHSLVTTNLFNILVCAPDSEKSAVSHGITWLHPITAK 831
+ R +L+TN DGID+PGL +LV +LV L+N+ VCAP ++KSA +H T I
Sbjct: 57 DDSRPIVLVTNGDGIDSPGLVSLVEALVLEGLYNVHVCAPQTDKSASAHSTTPGETIAVS 116
Query: 830 QIQIDGATSAFAVSGTPADCTSLGVSKALFP-TLPDLVVSGINKGSNCGYHIVYSGTVAG 654
+++ GAT AF VSGTP DC SLG+S ALF + P LV+SGIN+GS+CG+ + YSG VAG
Sbjct: 117 SVKLKGAT-AFEVSGTPVDCISLGLSGALFAWSKPILVISGINQGSSCGHQMFYSGAVAG 175
Query: 653 AREAFFNDGYKLTKQGKSIVRMGWKQVTSE 564
REA + L+ + + WK+ S+
Sbjct: 176 GREALISGVPSLS------ISLNWKKNESQ 199
Score = 33.9 bits (76), Expect = 4.3
Identities = 12/37 (32%), Positives = 23/37 (61%)
Frame = -1
Query: 425 DDDTDHSSLQEGYISVTPLAALSHAEVDCQTYFKDWL 315
D+D D +L++G++SVTP + L + + Q +W+
Sbjct: 341 DEDLDVKALEDGFVSVTPFSLLPKTDSETQAAASEWI 377
>pir||H96753 hypothetical protein F3N23.8 [imported] - Arabidopsis thaliana
gi|5903077|gb|AAD55635.1|AC008017_8 Unknown protein
[Arabidopsis thaliana]
Length = 359
Score = 120 bits (301), Expect = 3e-26
Identities = 87/296 (29%), Positives = 135/296 (45%), Gaps = 64/296 (21%)
Frame = -1
Query: 1010 ESKRATILITNDDGIDAPGLRALVHSLVTTNLFNILVCAPDSEKSAVSHGITWLHPITAK 831
+ R +L+TN DGID+PGL +LV +LV L+N+ VCAP ++KSA +H T I
Sbjct: 57 DDSRPIVLVTNGDGIDSPGLVSLVEALVLEGLYNVHVCAPQTDKSASAHSTTPGETIAVS 116
Query: 830 QIQIDGATSAFAVSGTPADCTSLGVSKALFP-TLPDLVVSGINKGSNCGYHI-------- 678
+++ GAT AF VSGTP DC SLG+S ALF + P LV+SGIN+GS+CG+ +
Sbjct: 117 SVKLKGAT-AFEVSGTPVDCISLGLSGALFAWSKPILVISGINQGSSCGHQMKKNESQES 175
Query: 677 -----------VYSGTVAGAREAFF----------------NDGYKLTKQGKSIVRMGWK 579
+ + T+ + F N G+K+TKQ W+
Sbjct: 176 HFKDAVGVCLPLINATIRDIAKGVFPKDCSLNIEIPTSPSSNKGFKVTKQSMWRQYPSWQ 235
Query: 578 QVTSETE--------------------GRKMTSDMTNTDTDTDTPKNVGASSVSPEHLSF 459
V++ GR ++ T V SV +
Sbjct: 236 AVSANRHPGAGNFMSNQQSLGAQLAQLGRDASAAGAARRFTTQKKSIVEIESVGVAGKTD 295
Query: 458 VREVRGYQLD--------DDDDTDHSSLQEGYISVTPLAALSHAEVDCQTYFKDWL 315
R + ++L+ D+D D +L++G++SVTP + L + + Q +W+
Sbjct: 296 TRVKKFFRLEFLAKEQEHTDEDLDVKALEDGFVSVTPFSLLPKTDSETQAAASEWI 351
>ref|NP_218859.1| survival protein, putative [Treponema pallidum]
gi|7388271|sp|O83434|SURE_TREPA Acid phosphatase surE
gi|7521443|pir||A71328 probable survival protein -
syphilis spirochete gi|3322702|gb|AAC65406.1| survival
protein, putative [Treponema pallidum]
Length = 187
Score = 107 bits (266), Expect = 4e-22
Identities = 57/118 (48%), Positives = 77/118 (64%), Gaps = 1/118 (0%)
Frame = -1
Query: 992 ILITNDDGIDAPGLRALVHSL-VTTNLFNILVCAPDSEKSAVSHGITWLHPITAKQIQID 816
IL+TNDDG A G+RAL +L + + V APD ++SAVSHGIT L P+T K+++
Sbjct: 3 ILLTNDDGYQAAGIRALHAALKAAPEGYEVTVVAPDRDRSAVSHGITTLEPVTVKEVE-- 60
Query: 815 GATSAFAVSGTPADCTSLGVSKALFPTLPDLVVSGINKGSNCGYHIVYSGTVAGAREA 642
++ SGTP DC + + + T PD+VVSGIN+G N G IV+SGTVA AR+A
Sbjct: 61 --PGIWSCSGTPVDCVNRALRQVCVGTPPDVVVSGINEGENLGTDIVFSGTVAAARQA 116
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 860,797,940
Number of Sequences: 1393205
Number of extensions: 18587695
Number of successful extensions: 51972
Number of sequences better than 10.0: 127
Number of HSP's better than 10.0 without gapping: 47840
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 51323
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 60705001940
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)