FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1450, 428 aa 1>>>pF1KE1450 428 - 428 aa - 428 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.5692+/-0.00116; mu= 2.3648+/- 0.068 mean_var=212.9642+/-44.154, 0's: 0 Z-trim(109.2): 136 B-trim: 5 in 1/52 Lambda= 0.087886 statistics sampled from 10617 (10754) to 10617 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.687), E-opt: 0.2 (0.33), width: 16 Scan time: 3.250 The best scores are: opt bits E(32554) CCDS42772.1 NOSTRIN gene_id:115677|Hs108|chr2 ( 428) 2740 360.4 1.9e-99 CCDS54416.1 NOSTRIN gene_id:115677|Hs108|chr2 ( 478) 2740 360.5 2e-99 CCDS42771.1 NOSTRIN gene_id:115677|Hs108|chr2 ( 506) 2740 360.5 2.1e-99 CCDS54415.1 NOSTRIN gene_id:115677|Hs108|chr2 ( 563) 1932 258.1 1.6e-68 >>CCDS42772.1 NOSTRIN gene_id:115677|Hs108|chr2 (428 aa) initn: 2740 init1: 2740 opt: 2740 Z-score: 1899.0 bits: 360.4 E(32554): 1.9e-99 Smith-Waterman score: 2740; 99.8% identity (99.8% similar) in 428 aa overlap (1-428:1-428) 10 20 30 40 50 60 pF1KE1 MKSTADLHQKLGKAIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQIKAKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MKSTADLHQKLGKAIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQIKAKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 KLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTKSTEKLEKEDENYYQKNMAGYSTR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 KLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTKSTEKLEKEDENYYQKNMAGYSTR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 LKWENTLENCYQSILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHCAISKIDIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LKWENTLENCYQSILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHCAISKIDIE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 KDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQRDIEKASK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 KDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQRDIEKASK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 DKEGLERMLKTYSSTSSFSDAKSQKDTAALMDENNLKLDLLEANSYKLSSMLAELEQRPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 DKEGLERMLKTYSSTSSFSDAKSQKDTAALMDENNLKLDLLEANSYKLSSMLAELEQRPQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 PSHPCSNSIFRWREKEHTHSYVKISRPFLMKRLENIVSKASSGGQSNPGSSTPAPGAAQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PSHPCSNSIFRWREKEHTHSYVKISRPFLMKRLENIVSKASSGGQSNPGSSTPAPGAAQL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 SSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEEGWWFGSLNGKKGHFPAAYVEELPSN :::::::::::::::::::::::::::::::::: ::::::::::::::::::::::::: CCDS42 SSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEGGWWFGSLNGKKGHFPAAYVEELPSN 370 380 390 400 410 420 pF1KE1 AGNTATKA :::::::: CCDS42 AGNTATKA >>CCDS54416.1 NOSTRIN gene_id:115677|Hs108|chr2 (478 aa) initn: 2740 init1: 2740 opt: 2740 Z-score: 1898.4 bits: 360.5 E(32554): 2e-99 Smith-Waterman score: 2740; 99.8% identity (99.8% similar) in 428 aa overlap (1-428:51-478) 10 20 30 pF1KE1 MKSTADLHQKLGKAIELEAIKPTYQVLNVQ :::::::::::::::::::::::::::::: CCDS54 SQNGENFCKQVTSVLQQSCVSSAWAWASEGMKSTADLHQKLGKAIELEAIKPTYQVLNVQ 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE1 EKKRKSLDNEVEKTANLVISNWNQQIKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 EKKRKSLDNEVEKTANLVISNWNQQIKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRK 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE1 LLNKLTKSTEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQSILELEKERIQLLCNNLN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LLNKLTKSTEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQSILELEKERIQLLCNNLN 150 160 170 180 190 200 160 170 180 190 200 210 pF1KE1 QYSQHISLFGQTLTTCHTQIHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 QYSQHISLFGQTLTTCHTQIHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEED 210 220 230 240 250 260 220 230 240 250 260 270 pF1KE1 PNSAMDKERRKSLLKPKLLRLQRDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PNSAMDKERRKSLLKPKLLRLQRDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAAL 270 280 290 300 310 320 280 290 300 310 320 330 pF1KE1 MDENNLKLDLLEANSYKLSSMLAELEQRPQPSHPCSNSIFRWREKEHTHSYVKISRPFLM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MDENNLKLDLLEANSYKLSSMLAELEQRPQPSHPCSNSIFRWREKEHTHSYVKISRPFLM 330 340 350 360 370 380 340 350 360 370 380 390 pF1KE1 KRLENIVSKASSGGQSNPGSSTPAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KRLENIVSKASSGGQSNPGSSTPAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIH 390 400 410 420 430 440 400 410 420 pF1KE1 EKKEEGWWFGSLNGKKGHFPAAYVEELPSNAGNTATKA :::: ::::::::::::::::::::::::::::::::: CCDS54 EKKEGGWWFGSLNGKKGHFPAAYVEELPSNAGNTATKA 450 460 470 >>CCDS42771.1 NOSTRIN gene_id:115677|Hs108|chr2 (506 aa) initn: 2740 init1: 2740 opt: 2740 Z-score: 1898.0 bits: 360.5 E(32554): 2.1e-99 Smith-Waterman score: 2740; 99.8% identity (99.8% similar) in 428 aa overlap (1-428:79-506) 10 20 30 pF1KE1 MKSTADLHQKLGKAIELEAIKPTYQVLNVQ :::::::::::::::::::::::::::::: CCDS42 LQKLASKLSKALQNTRKSCVSSAWAWASEGMKSTADLHQKLGKAIELEAIKPTYQVLNVQ 50 60 70 80 90 100 40 50 60 70 80 90 pF1KE1 EKKRKSLDNEVEKTANLVISNWNQQIKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 EKKRKSLDNEVEKTANLVISNWNQQIKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRK 110 120 130 140 150 160 100 110 120 130 140 150 pF1KE1 LLNKLTKSTEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQSILELEKERIQLLCNNLN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LLNKLTKSTEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQSILELEKERIQLLCNNLN 170 180 190 200 210 220 160 170 180 190 200 210 pF1KE1 QYSQHISLFGQTLTTCHTQIHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 QYSQHISLFGQTLTTCHTQIHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEED 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE1 PNSAMDKERRKSLLKPKLLRLQRDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PNSAMDKERRKSLLKPKLLRLQRDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAAL 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE1 MDENNLKLDLLEANSYKLSSMLAELEQRPQPSHPCSNSIFRWREKEHTHSYVKISRPFLM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MDENNLKLDLLEANSYKLSSMLAELEQRPQPSHPCSNSIFRWREKEHTHSYVKISRPFLM 350 360 370 380 390 400 340 350 360 370 380 390 pF1KE1 KRLENIVSKASSGGQSNPGSSTPAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 KRLENIVSKASSGGQSNPGSSTPAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIH 410 420 430 440 450 460 400 410 420 pF1KE1 EKKEEGWWFGSLNGKKGHFPAAYVEELPSNAGNTATKA :::: ::::::::::::::::::::::::::::::::: CCDS42 EKKEGGWWFGSLNGKKGHFPAAYVEELPSNAGNTATKA 470 480 490 500 >>CCDS54415.1 NOSTRIN gene_id:115677|Hs108|chr2 (563 aa) initn: 1932 init1: 1932 opt: 1932 Z-score: 1343.7 bits: 258.1 E(32554): 1.6e-68 Smith-Waterman score: 2460; 87.4% identity (87.4% similar) in 460 aa overlap (26-428:104-563) 10 20 30 40 50 pF1KE1 MKSTADLHQKLGKAIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQ :::::::::::::::::::::::::::::: CCDS54 WASEGMKSTADLHQKLGKAIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQ 80 90 100 110 120 130 60 70 80 90 100 110 pF1KE1 IKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTKSTEKLEKEDENYYQKNMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 IKAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTKSTEKLEKEDENYYQKNMA 140 150 160 170 180 190 120 130 pF1KE1 GYSTRLKWENTLENCYQ------------------------------------------- ::::::::::::::::: CCDS54 GYSTRLKWENTLENCYQVTHSICLYAFWVKRAWGKCVSDLRYQDTFLPGNLPPLWFGYDI 200 210 220 230 240 250 140 150 160 170 pF1KE1 --------------SILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHCAISKID :::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VKRLIMRLCSVCLQSILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHCAISKID 260 270 280 290 300 310 180 190 200 210 220 230 pF1KE1 IEKDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQRDIEKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 IEKDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQRDIEKA 320 330 340 350 360 370 240 250 260 270 280 290 pF1KE1 SKDKEGLERMLKTYSSTSSFSDAKSQKDTAALMDENNLKLDLLEANSYKLSSMLAELEQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 SKDKEGLERMLKTYSSTSSFSDAKSQKDTAALMDENNLKLDLLEANSYKLSSMLAELEQR 380 390 400 410 420 430 300 310 320 330 340 350 pF1KE1 PQPSHPCSNSIFRWREKEHTHSYVKISRPFLMKRLENIVSKASSGGQSNPGSSTPAPGAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PQPSHPCSNSIFRWREKEHTHSYVKISRPFLMKRLENIVSKASSGGQSNPGSSTPAPGAA 440 450 460 470 480 490 360 370 380 390 400 410 pF1KE1 QLSSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEEGWWFGSLNGKKGHFPAAYVEELP :::::::::::::::::::::::::::::::::::: ::::::::::::::::::::::: CCDS54 QLSSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEGGWWFGSLNGKKGHFPAAYVEELP 500 510 520 530 540 550 420 pF1KE1 SNAGNTATKA :::::::::: CCDS54 SNAGNTATKA 560 428 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:27:42 2016 done: Mon Nov 7 01:27:43 2016 Total Scan time: 3.250 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]