FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA1159, 252 aa
1>>>pF1KA1159 252 - 252 aa - 252 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0256+/-0.000692; mu= 7.8388+/- 0.042
mean_var=126.7577+/-25.170, 0's: 0 Z-trim(114.9): 8 B-trim: 0 in 0/50
Lambda= 0.113917
statistics sampled from 15478 (15485) to 15478 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.804), E-opt: 0.2 (0.476), width: 16
Scan time: 2.600
The best scores are:                                opt  bits E(32554)
CCDS11550.1 NXPH3 gene_id:11248|Hs108|chr17 ( 252) 1794 305.0 3.1e-83
CCDS47540.1 NXPH1 gene_id:30010|Hs108|chr7  ( 271)  847 149.4 2.3e-36
CCDS46421.1 NXPH2 gene_id:11249|Hs108|chr2  ( 264)  815 144.1 8.8e-35
CCDS8933.1 NXPH4 gene_id:11247|Hs108|chr12  ( 308)  375  71.9 5.8e-13
>>CCDS11550.1 NXPH3 gene_id:11248|Hs108|chr17 (252 aa)
initn: 1794 init1: 1794 opt: 1794 Z-score: 1607.8 bits: 305.0 E(32554): 3.1e-83
Smith-Waterman score: 1794; 100.0% identity (100.0% similar) in 252 aa overlap (1-252:1-252)
               10        20        30        40        50        60
pF1KA1 MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPM
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KA1 ANSTLLGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 ANSTLLGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTG
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KA1 KIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWE
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KA1 KVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDY
              190       200       210       220       230       240
              250
pF1KA1 NYHSDTPYYPSG
       ::::::::::::
CCDS11 NYHSDTPYYPSG
              250
>>CCDS47540.1 NXPH1 gene_id:30010|Hs108|chr7 (271 aa)
initn: 906 init1: 838 opt: 847 Z-score: 766.2 bits: 149.4 E(32554): 2.3e-36
Smith-Waterman score: 847; 65.1% identity (86.9% similar) in 175 aa overlap (79-252:97-271)
50 60 70 80 90 100
pF1KA1 KRGHISPKSRPMANSTLLGLLAPPGEAWGILGQPPNRPNHSPP-PSAKVKKIFGWGDFYS
: .: : .. : ..: ::.::::::.:
CCDS47 ENDTDLDLRYDTPEPYSEQDLWDWLRNSTDLQEPRPRAKRRPIVKTGKFKKMFGWGDFHS
70 80 90 100 110 120
110 120 130 140 150 160
pF1KA1 NIKTVALNLLVTGKIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEA
::::: ::::.::::::::::::::.:.::.:::::.:.:::::.: ::: :: :.:
CCDS47 NIKTVKLNLLITGKIVDHGNGTFSVYFRHNSTGQGNVSVSLVPPTKIVEFDLAQQTVIDA
130 140 150 160 170 180
170 180 190 200 210 220
pF1KA1 KASKIFNCRMEWEKVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYS
: :: ::::.:.:::... ...::..::.: : ....:: ..: ::.::::.:.::.:::
CCDS47 KDSKSFNCRIEYEKVDKATKNTLCNYDPSKTCYQEQTQSHVSWLCSKPFKVICIYISFYS
190 200 210 220 230 240
230 240 250
pF1KA1 TDYRLVQKVCPDYNYHSDTPYYPSG
:::.:::::::::::::::::.:::
CCDS47 TDYKLVQKVCPDYNYHSDTPYFPSG
250 260 270
>>CCDS46421.1 NXPH2 gene_id:11249|Hs108|chr2 (264 aa)
initn: 826 init1: 806 opt: 815 Z-score: 737.9 bits: 144.1 E(32554): 8.8e-35
Smith-Waterman score: 822; 50.8% identity (72.3% similar) in 256 aa overlap (11-252:10-264)
10 20 30 40 50
pF1KA1 MQLTRCCFVFLVQGSLYLVICGQDDGPPGSE--DPERDDHEGQPRPRVPRKRGHISP---
.: : : :..: . . ..: : : : : : ..: :::
CCDS46 MRLRPLPLVVVPGLLQLLFCDSKEVVHATEGLDWEDKDAPGTLVGNVVHSRI-ISPLRL
10 20 30 40 50
60 70 80 90 100
pF1KA1 --KSRPMANSTLLGLLAPPGEAWGILG------QPPNRPNHSPP-PSAKVKKIFGWGDFY
:. :. . .. . : :. .: : .. : ..: ::.::::::.
CCDS46 FVKQSPVPKPGPMAYADSMENFWDWLANITEIQEPLARTKRRPIVKTGKFKKMFGWGDFH
60 70 80 90 100 110
110 120 130 140 150 160
pF1KA1 SNIKTVALNLLVTGKIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIE
:::::: ::::.::::::::::::::.:.::.:: ::.:.:::::::.:::. : .:
CCDS46 SNIKTVKLNLLITGKIVDHGNGTFSVYFRHNSTGLGNVSVSLVPPSKVVEFEVSPQSTLE
120 130 140 150 160 170
170 180 190 200 210 220
pF1KA1 AKASKIFNCRMEWEKVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFY
.: :: ::::.:.::..:...:.::. ::.::: ....:: ..: ::.::::.:.:::::
CCDS46 TKESKSFNCRIEYEKTDRAKKTALCNFDPSKICYQEQTQSHVSWLCSKPFKVICIYIAFY
180 190 200 210 220 230
230 240 250
pF1KA1 STDYRLVQKVCPDYNYHSDTPYYPSG
:.::.:::::::::::::.::: ::
CCDS46 SVDYKLVQKVCPDYNYHSETPYLSSG
240 250 260
>>CCDS8933.1 NXPH4 gene_id:11247|Hs108|chr12 (308 aa)
initn: 693 init1: 375 opt: 375 Z-score: 346.2 bits: 71.9 E(32554): 5.8e-13
Smith-Waterman score: 598; 39.7% identity (59.9% similar) in 277 aa overlap (26-249:44-307)
10 20 30 40 50
pF1KA1 MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISP
: ::.. :: :: :.
CCDS89 PWLLRKAVSAQIPESGRPQYLGLRPAAAGAGAPGQQLPE-------PRSSDGLGVGRAWS
20 30 40 50 60
60 70 80 90 100 110
pF1KA1 KSRPMANSTLLGLLAPPGEAWGILGQPPNRPNHSPP-PSAKVKKIFGWGDFYSNIKTVAL
. : .: : : :: : : : : : .: ...: .:..:::::::::: ..:. .
CCDS89 WAWP-TNHT--GALARAGAA-GAL--PAQRTKRKPSIKAARAKKIFGWGDFYFRVHTLKF
70 80 90 100 110 120
120 130 140 150 160
pF1KA1 NLLVTGKIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEF----------HQEQQIF
.:::::::::: ::::::.:.::... ::.:.:.::::: ::: : :. .
CCDS89 SLLVTGKIVDHVNGTFSVYFRHNSSSLGNLSVSIVPPSKRVEFGGVWLPGPVPHPLQSTL
130 140 150 160 170 180
170 180
pF1KA1 -IE-----------------------------------------AKASKIFNCRMEWEKV
.: :: :. :::..:.::.
CCDS89 ALEGVLPGLGPPLGMAAAAAGPGLGGSLGGALAGPLGGALGVPGAKESRAFNCHVEYEKT
190 200 210 220 230 240
190 200 210 220 230 240
pF1KA1 ERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDYNY
.:.:. : .::...: .:.::.:.: :..::::.:....: : ::.::::::::::.
CCDS89 NRARKHRPCLYDPSQVCFTEHTQSQAAWLCAKPFKVICIFVSFLSFDYKLVQKVCPDYNF
250 260 270 280 290 300
250
pF1KA1 HSDTPYYPSG
.:. ::.
CCDS89 QSEHPYFG
252 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 19:54:01 2016 done: Thu Nov 3 19:54:01 2016
Total Scan time: 2.600 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]