FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1450, 428 aa
1>>>pF1KE1450 428 - 428 aa - 428 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.5692+/-0.00116; mu= 2.3648+/- 0.068
mean_var=212.9642+/-44.154, 0's: 0 Z-trim(109.2): 136 B-trim: 5 in 1/52
Lambda= 0.087886
statistics sampled from 10617 (10754) to 10617 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.687), E-opt: 0.2 (0.33), width: 16
Scan time: 3.250
The best scores are: opt bits E(32554)
CCDS42772.1 NOSTRIN gene_id:115677|Hs108|chr2 ( 428) 2740 360.4 1.9e-99
CCDS54416.1 NOSTRIN gene_id:115677|Hs108|chr2 ( 478) 2740 360.5 2e-99
CCDS42771.1 NOSTRIN gene_id:115677|Hs108|chr2 ( 506) 2740 360.5 2.1e-99
CCDS54415.1 NOSTRIN gene_id:115677|Hs108|chr2 ( 563) 1932 258.1 1.6e-68
>>CCDS42772.1 NOSTRIN gene_id:115677|Hs108|chr2 (428 aa)
initn: 2740 init1: 2740 opt: 2740 Z-score: 1899.0 bits: 360.4 E(32554): 1.9e-99
Smith-Waterman score: 2740; 99.8% identity (99.8% similar) in 428 aa overlap (1-428:1-428)
10 20 30 40 50 60
pF1KE1 MKSTADLHQKLGKAIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQIKAKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MKSTADLHQKLGKAIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQIKAKK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 KLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTKSTEKLEKEDENYYQKNMAGYSTR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 KLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTKSTEKLEKEDENYYQKNMAGYSTR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 LKWENTLENCYQSILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHCAISKIDIE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LKWENTLENCYQSILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHCAISKIDIE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 KDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQRDIEKASK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 KDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQRDIEKASK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 DKEGLERMLKTYSSTSSFSDAKSQKDTAALMDENNLKLDLLEANSYKLSSMLAELEQRPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DKEGLERMLKTYSSTSSFSDAKSQKDTAALMDENNLKLDLLEANSYKLSSMLAELEQRPQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE1 PSHPCSNSIFRWREKEHTHSYVKISRPFLMKRLENIVSKASSGGQSNPGSSTPAPGAAQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PSHPCSNSIFRWREKEHTHSYVKISRPFLMKRLENIVSKASSGGQSNPGSSTPAPGAAQL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE1 SSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEEGWWFGSLNGKKGHFPAAYVEELPSN
:::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::
CCDS42 SSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEGGWWFGSLNGKKGHFPAAYVEELPSN
370 380 390 400 410 420
pF1KE1 AGNTATKA
::::::::
CCDS42 AGNTATKA
>>CCDS54416.1 NOSTRIN gene_id:115677|Hs108|chr2 (478 aa)
initn: 2740 init1: 2740 opt: 2740 Z-score: 1898.4 bits: 360.5 E(32554): 2e-99
Smith-Waterman score: 2740; 99.8% identity (99.8% similar) in 428 aa overlap (1-428:51-478)
10 20 30
pF1KE1 MKSTADLHQKLGKAIELEAIKPTYQVLNVQ
::::::::::::::::::::::::::::::
CCDS54 SQNGENFCKQVTSVLQQSCVSSAWAWASEGMKSTADLHQKLGKAIELEAIKPTYQVLNVQ
30 40 50 60 70 80
40 50 60 70 80 90
pF1KE1 EKKRKSLDNEVEKTANLVISNWNQQIKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 EKKRKSLDNEVEKTANLVISNWNQQIKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRK
90 100 110 120 130 140
100 110 120 130 140 150
pF1KE1 LLNKLTKSTEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQSILELEKERIQLLCNNLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LLNKLTKSTEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQSILELEKERIQLLCNNLN
150 160 170 180 190 200
160 170 180 190 200 210
pF1KE1 QYSQHISLFGQTLTTCHTQIHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEED
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QYSQHISLFGQTLTTCHTQIHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEED
210 220 230 240 250 260
220 230 240 250 260 270
pF1KE1 PNSAMDKERRKSLLKPKLLRLQRDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PNSAMDKERRKSLLKPKLLRLQRDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAAL
270 280 290 300 310 320
280 290 300 310 320 330
pF1KE1 MDENNLKLDLLEANSYKLSSMLAELEQRPQPSHPCSNSIFRWREKEHTHSYVKISRPFLM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MDENNLKLDLLEANSYKLSSMLAELEQRPQPSHPCSNSIFRWREKEHTHSYVKISRPFLM
330 340 350 360 370 380
340 350 360 370 380 390
pF1KE1 KRLENIVSKASSGGQSNPGSSTPAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KRLENIVSKASSGGQSNPGSSTPAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIH
390 400 410 420 430 440
400 410 420
pF1KE1 EKKEEGWWFGSLNGKKGHFPAAYVEELPSNAGNTATKA
:::: :::::::::::::::::::::::::::::::::
CCDS54 EKKEGGWWFGSLNGKKGHFPAAYVEELPSNAGNTATKA
450 460 470
>>CCDS42771.1 NOSTRIN gene_id:115677|Hs108|chr2 (506 aa)
initn: 2740 init1: 2740 opt: 2740 Z-score: 1898.0 bits: 360.5 E(32554): 2.1e-99
Smith-Waterman score: 2740; 99.8% identity (99.8% similar) in 428 aa overlap (1-428:79-506)
10 20 30
pF1KE1 MKSTADLHQKLGKAIELEAIKPTYQVLNVQ
::::::::::::::::::::::::::::::
CCDS42 LQKLASKLSKALQNTRKSCVSSAWAWASEGMKSTADLHQKLGKAIELEAIKPTYQVLNVQ
50 60 70 80 90 100
40 50 60 70 80 90
pF1KE1 EKKRKSLDNEVEKTANLVISNWNQQIKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EKKRKSLDNEVEKTANLVISNWNQQIKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRK
110 120 130 140 150 160
100 110 120 130 140 150
pF1KE1 LLNKLTKSTEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQSILELEKERIQLLCNNLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LLNKLTKSTEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQSILELEKERIQLLCNNLN
170 180 190 200 210 220
160 170 180 190 200 210
pF1KE1 QYSQHISLFGQTLTTCHTQIHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEED
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 QYSQHISLFGQTLTTCHTQIHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEED
230 240 250 260 270 280
220 230 240 250 260 270
pF1KE1 PNSAMDKERRKSLLKPKLLRLQRDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PNSAMDKERRKSLLKPKLLRLQRDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAAL
290 300 310 320 330 340
280 290 300 310 320 330
pF1KE1 MDENNLKLDLLEANSYKLSSMLAELEQRPQPSHPCSNSIFRWREKEHTHSYVKISRPFLM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MDENNLKLDLLEANSYKLSSMLAELEQRPQPSHPCSNSIFRWREKEHTHSYVKISRPFLM
350 360 370 380 390 400
340 350 360 370 380 390
pF1KE1 KRLENIVSKASSGGQSNPGSSTPAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 KRLENIVSKASSGGQSNPGSSTPAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIH
410 420 430 440 450 460
400 410 420
pF1KE1 EKKEEGWWFGSLNGKKGHFPAAYVEELPSNAGNTATKA
:::: :::::::::::::::::::::::::::::::::
CCDS42 EKKEGGWWFGSLNGKKGHFPAAYVEELPSNAGNTATKA
470 480 490 500
>>CCDS54415.1 NOSTRIN gene_id:115677|Hs108|chr2 (563 aa)
initn: 1932 init1: 1932 opt: 1932 Z-score: 1343.7 bits: 258.1 E(32554): 1.6e-68
Smith-Waterman score: 2460; 87.4% identity (87.4% similar) in 460 aa overlap (26-428:104-563)
10 20 30 40 50
pF1KE1 MKSTADLHQKLGKAIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQ
::::::::::::::::::::::::::::::
CCDS54 WASEGMKSTADLHQKLGKAIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQ
80 90 100 110 120 130
60 70 80 90 100 110
pF1KE1 IKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTKSTEKLEKEDENYYQKNMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 IKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTKSTEKLEKEDENYYQKNMA
140 150 160 170 180 190
120 130
pF1KE1 GYSTRLKWENTLENCYQ-------------------------------------------
:::::::::::::::::
CCDS54 GYSTRLKWENTLENCYQVTHSICLYAFWVKRAWGKCVSDLRYQDTFLPGNLPPLWFGYDI
200 210 220 230 240 250
140 150 160 170
pF1KE1 --------------SILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHCAISKID
::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VKRLIMRLCSVCLQSILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHCAISKID
260 270 280 290 300 310
180 190 200 210 220 230
pF1KE1 IEKDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQRDIEKA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 IEKDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQRDIEKA
320 330 340 350 360 370
240 250 260 270 280 290
pF1KE1 SKDKEGLERMLKTYSSTSSFSDAKSQKDTAALMDENNLKLDLLEANSYKLSSMLAELEQR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SKDKEGLERMLKTYSSTSSFSDAKSQKDTAALMDENNLKLDLLEANSYKLSSMLAELEQR
380 390 400 410 420 430
300 310 320 330 340 350
pF1KE1 PQPSHPCSNSIFRWREKEHTHSYVKISRPFLMKRLENIVSKASSGGQSNPGSSTPAPGAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PQPSHPCSNSIFRWREKEHTHSYVKISRPFLMKRLENIVSKASSGGQSNPGSSTPAPGAA
440 450 460 470 480 490
360 370 380 390 400 410
pF1KE1 QLSSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEEGWWFGSLNGKKGHFPAAYVEELP
:::::::::::::::::::::::::::::::::::: :::::::::::::::::::::::
CCDS54 QLSSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEGGWWFGSLNGKKGHFPAAYVEELP
500 510 520 530 540 550
420
pF1KE1 SNAGNTATKA
::::::::::
CCDS54 SNAGNTATKA
560
428 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 01:27:42 2016 done: Mon Nov 7 01:27:43 2016
Total Scan time: 3.250 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]