FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3003, 70 aa
1>>>pF1KE3003 70 - 70 aa - 70 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.9458+/-0.000504; mu= 8.0557+/- 0.030
mean_var=46.0528+/- 9.040, 0's: 0 Z-trim(112.8): 21 B-trim: 2 in 1/52
Lambda= 0.188993
statistics sampled from 13484 (13504) to 13484 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.814), E-opt: 0.2 (0.415), width: 16
Scan time: 1.120
The best scores are: opt bits E(32554)
CCDS12687.1 GNG8 gene_id:94235|Hs108|chr19 ( 70) 450 129.1 2.1e-31
CCDS32082.1 GNG2 gene_id:54331|Hs108|chr14 ( 71) 351 102.1 2.9e-23
CCDS1607.1 GNG4 gene_id:2786|Hs108|chr1 ( 75) 321 94.0 8.8e-21
CCDS8032.1 GNG3 gene_id:2785|Hs108|chr11 ( 75) 290 85.5 3.1e-18
CCDS12091.1 GNG7 gene_id:2788|Hs108|chr19 ( 68) 263 78.1 4.6e-16
CCDS30749.1 GNG12 gene_id:55970|Hs108|chr1 ( 72) 261 77.6 7.2e-16
CCDS696.1 GNG5 gene_id:2787|Hs108|chr1 ( 68) 202 61.5 4.7e-11
CCDS35107.1 GNG10 gene_id:2790|Hs108|chr9 ( 68) 201 61.2 5.7e-11
>>CCDS12687.1 GNG8 gene_id:94235|Hs108|chr19 (70 aa)
initn: 450 init1: 450 opt: 450 Z-score: 677.1 bits: 129.1 E(32554): 2.1e-31
Smith-Waterman score: 450; 100.0% identity (100.0% similar) in 70 aa overlap (1-70:1-70)
10 20 30 40 50 60
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
10 20 30 40 50 60
70
pF1KE3 RDKRLFCVLL
::::::::::
CCDS12 RDKRLFCVLL
70
>>CCDS32082.1 GNG2 gene_id:54331|Hs108|chr14 (71 aa)
initn: 358 init1: 351 opt: 351 Z-score: 531.1 bits: 102.1 E(32554): 2.9e-23
Smith-Waterman score: 351; 71.0% identity (97.1% similar) in 69 aa overlap (2-70:3-71)
10 20 30 40 50
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENP
::: :.::.::: :::::.:.::::.:::.:::.:.:.::.:::.:::.:::::.:::
CCDS32 MASNNTASIAQARKLVEQLKMEANIDRIKVSKAAADLMAYCEAHAKEDPLLTPVPASENP
10 20 30 40 50 60
60 70
pF1KE3 FRDKRLFCVLL
::.:..::..:
CCDS32 FREKKFFCAIL
70
>>CCDS1607.1 GNG4 gene_id:2786|Hs108|chr1 (75 aa)
initn: 326 init1: 300 opt: 321 Z-score: 486.5 bits: 94.0 E(32554): 8.8e-21
Smith-Waterman score: 321; 63.4% identity (94.4% similar) in 71 aa overlap (1-70:5-75)
10 20 30 40 50
pF1KE3 MSNN-MAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPA
:::: ..:..:::.:::::.:. .::.:::::::.:::.::.:...:::. ::::
CCDS16 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
10 20 30 40 50 60
60 70
pF1KE3 AENPFRDKRLFCVLL
.:::::.:..::..:
CCDS16 SENPFREKKFFCTIL
70
>>CCDS8032.1 GNG3 gene_id:2785|Hs108|chr11 (75 aa)
initn: 290 init1: 290 opt: 290 Z-score: 440.9 bits: 85.5 E(32554): 3.1e-18
Smith-Waterman score: 290; 57.4% identity (92.6% similar) in 68 aa overlap (3-70:8-75)
10 20 30 40 50
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPA
:. .:..::: :::::.:... :.:::.:::.:...:..:: .:::.::::.
CCDS80 MKGETPVNSTMSIGQARKMVEQLKIEASLCRIKVSKAAADLMTYCDAHACEDPLITPVPT
10 20 30 40 50 60
60 70
pF1KE3 AENPFRDKRLFCVLL
.:::::.:..::.::
CCDS80 SENPFREKKFFCALL
70
>>CCDS12091.1 GNG7 gene_id:2788|Hs108|chr19 (68 aa)
initn: 264 init1: 255 opt: 263 Z-score: 401.8 bits: 78.1 E(32554): 4.6e-16
Smith-Waterman score: 263; 57.1% identity (92.1% similar) in 63 aa overlap (8-70:7-68)
10 20 30 40 50 60
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
::.::: ::::..:..:.:.:::.::..:...:: ::..:::.. :::.::::
CCDS12 MSATNNIAQARKLVEQLRIEAGIERIKVSKAASDLMSYCEQHARNDPLLVGVPASENPF
10 20 30 40 50
70
pF1KE3 RDKRLFCVLL
.::. :..:
CCDS12 KDKKP-CIIL
60
>>CCDS30749.1 GNG12 gene_id:55970|Hs108|chr1 (72 aa)
initn: 258 init1: 249 opt: 261 Z-score: 398.4 bits: 77.6 E(32554): 7.2e-16
Smith-Waterman score: 261; 50.7% identity (87.7% similar) in 73 aa overlap (1-70:1-72)
10 20 30 40 50
pF1KE3 MSNNMAK---IAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAE
::.. :. ::.::.::.::.::..:.:.:::.:.:.:...:: ::..:::. .:..:
CCDS30 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTSE
10 20 30 40 50 60
60 70
pF1KE3 NPFRDKRLFCVLL
:::.::. :..:
CCDS30 NPFKDKKT-CIIL
70
>>CCDS696.1 GNG5 gene_id:2787|Hs108|chr1 (68 aa)
initn: 181 init1: 173 opt: 202 Z-score: 311.9 bits: 61.5 E(32554): 4.7e-11
Smith-Waterman score: 202; 45.7% identity (80.0% similar) in 70 aa overlap (1-70:1-68)
10 20 30 40 50 60
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
::.. ...: .:.:.::.::....:.:::::::.: :: .:. :::.: : .. :::
CCDS69 MSGS-SSVAAMKKVVQQLRLEAGLNRVKVSQAAADLKQFCLQNAQHDPLLTGVSSSTNPF
10 20 30 40 50
70
pF1KE3 RDKRLFCVLL
: ... : .:
CCDS69 RPQKV-CSFL
60
>>CCDS35107.1 GNG10 gene_id:2790|Hs108|chr9 (68 aa)
initn: 207 init1: 187 opt: 201 Z-score: 310.4 bits: 61.2 E(32554): 5.7e-11
Smith-Waterman score: 201; 50.0% identity (75.7% similar) in 70 aa overlap (1-70:1-68)
10 20 30 40 50 60
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
::.. :. . .. :::::::....:.::::::::: .: .: : :.. :::. :::
CCDS35 MSSG-ASASALQRLVEQLKLEAGVERIKVSQAAAELQQYCMQNACKDALLVGVPAGSNPF
10 20 30 40 50
70
pF1KE3 RDKRLFCVLL
:. : :.::
CCDS35 REPRS-CALL
60
70 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 13:20:15 2016 done: Sun Nov 6 13:20:16 2016
Total Scan time: 1.120 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]