FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1012, 302 aa
1>>>pF1KE1012 302 - 302 aa - 302 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.7696+/-0.000778; mu= 12.9050+/- 0.047
mean_var=64.3914+/-13.242, 0's: 0 Z-trim(107.7): 21 B-trim: 250 in 1/49
Lambda= 0.159831
statistics sampled from 9744 (9762) to 9744 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.671), E-opt: 0.2 (0.3), width: 16
Scan time: 2.630
The best scores are: opt bits E(32554)
CCDS10464.1 ECI1 gene_id:1632|Hs108|chr16 ( 302) 1966 461.8 2.8e-130
CCDS58410.1 ECI1 gene_id:1632|Hs108|chr16 ( 285) 1136 270.4 1.1e-72
CCDS6689.1 AUH gene_id:549|Hs108|chr9 ( 339) 258 68.0 1.1e-11
CCDS1721.1 HADHA gene_id:3030|Hs108|chr2 ( 763) 259 68.3 2.1e-11
>>CCDS10464.1 ECI1 gene_id:1632|Hs108|chr16 (302 aa)
initn: 1966 init1: 1966 opt: 1966 Z-score: 2452.3 bits: 461.8 E(32554): 2.8e-130
Smith-Waterman score: 1966; 100.0% identity (100.0% similar) in 302 aa overlap (1-302:1-302)
10 20 30 40 50 60
pF1KE1 MALVASVRVPARVLLRAGARLPGAALGRTERAAGGGDGARRFGSQRVLVEPDAGAGVAVM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MALVASVRVPARVLLRAGARLPGAALGRTERAAGGGDGARRFGSQRVLVEPDAGAGVAVM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 KFKNPPVNSLSLEFLTELVISLEKLENDKSFRGVILTSDRPGVFSAGLDLTEMCGRSPAH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 KFKNPPVNSLSLEFLTELVISLEKLENDKSFRGVILTSDRPGVFSAGLDLTEMCGRSPAH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 YAGYWKAVQELWLRLYQSNLVLVSAINGACPAGGCLVALTCDYRILADNPRYCIGLNETQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YAGYWKAVQELWLRLYQSNLVLVSAINGACPAGGCLVALTCDYRILADNPRYCIGLNETQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 LGIIAPFWLKDTLENTIGHRAAERALQLGLLFPPAEALQVGIVDQVVPEEQVQSTALSAI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LGIIAPFWLKDTLENTIGHRAAERALQLGLLFPPAEALQVGIVDQVVPEEQVQSTALSAI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 AQWMAIPDHARQLTKAMMRKATASRLVTQRDADVQNFVSFISKDSIQKSLQMYLERLKEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AQWMAIPDHARQLTKAMMRKATASRLVTQRDADVQNFVSFISKDSIQKSLQMYLERLKEE
250 260 270 280 290 300
pF1KE1 KG
::
CCDS10 KG
>>CCDS58410.1 ECI1 gene_id:1632|Hs108|chr16 (285 aa)
initn: 1827 init1: 1126 opt: 1136 Z-score: 1418.3 bits: 270.4 E(32554): 1.1e-72
Smith-Waterman score: 1797; 94.4% identity (94.4% similar) in 302 aa overlap (1-302:1-285)
10 20 30 40 50 60
pF1KE1 MALVASVRVPARVLLRAGARLPGAALGRTERAAGGGDGARRFGSQRVLVEPDAGAGVAVM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MALVASVRVPARVLLRAGARLPGAALGRTERAAGGGDGARRFGSQRVLVEPDAGAGVAVM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 KFKNPPVNSLSLEFLTELVISLEKLENDKSFRGVILTSDRPGVFSAGLDLTEMCGRSPAH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 KFKNPPVNSLSLEFLTELVISLEKLENDKSFRGVILTSDRPGVFSAGLDLTEMCGRSPAH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 YAGYWKAVQELWLRLYQSNLVLVSAINGACPAGGCLVALTCDYRILADNPRYCIGLNETQ
:::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 YAGYWKAVQELWLRLYQSNLVLVSAINGACPAGGCLVALTCDYRILADNPR---------
130 140 150 160 170
190 200 210 220 230 240
pF1KE1 LGIIAPFWLKDTLENTIGHRAAERALQLGLLFPPAEALQVGIVDQVVPEEQVQSTALSAI
::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 --------LKDTLENTIGHRAAERALQLGLLFPPAEALQVGIVDQVVPEEQVQSTALSAI
180 190 200 210 220
250 260 270 280 290 300
pF1KE1 AQWMAIPDHARQLTKAMMRKATASRLVTQRDADVQNFVSFISKDSIQKSLQMYLERLKEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 AQWMAIPDHARQLTKAMMRKATASRLVTQRDADVQNFVSFISKDSIQKSLQMYLERLKEE
230 240 250 260 270 280
pF1KE1 KG
::
CCDS58 KG
>>CCDS6689.1 AUH gene_id:549|Hs108|chr9 (339 aa)
initn: 270 init1: 179 opt: 258 Z-score: 322.9 bits: 68.0 E(32554): 1.1e-11
Smith-Waterman score: 269; 25.6% identity (52.9% similar) in 308 aa overlap (18-302:33-338)
10 20 30
pF1KE1 MALVASVRVPARVLLRAGARLPGAALGRTER----------AAGGGD
: ::::. :: ::::
CCDS66 AAVAAAPGALGSLHAGGARLVAACSAWLCPGLRLPGSLAGRRAGPAIWAQGWVPAAGGPA
10 20 30 40 50 60
40 50 60 70 80
pF1KE1 GARRFGSQ-------RVLVEPDAGAGVAVMKF-KNPPVNSLSLEFLTELVISLEKLENDK
: ..:. :: . . :..:. . . :::: ... : ... :..::
CCDS66 PKRGYSSEMKTEDELRVRHLEEENRGIVVLGINRAYGKNSLSKNLIKMLSKAVDALKSDK
70 80 90 100 110 120
90 100 110 120 130 140
pF1KE1 SFRGVILTSDRPGVFSAGLDLTEMCGRSPAHYAGYWKAVQELWLRLYQSNLVLVSAINGA
. : .:. :. ::.: :: :: : : .. . . . .. . . . . ..::.:
CCDS66 KVRTIIIRSEVPGIFCAGADLKERAKMSSSEVGPFVSKIRAVINDIANLPVPTIAAIDGL
130 140 150 160 170 180
150 160 170 180 190 200
pF1KE1 CPAGGCLVALTCDYRILADNPRYCIGLNETQLGIIAPFWLKDTLENTIGHRAAERALQLG
.:: .::.:: :. :.. . .:: ::.:.:: . : .:: :.. . .
CCDS66 ALGGGLELALACDIRVAASSAK--MGLVETKLAIIPGGGGTQRLPRAIGMSLAKELIFSA
190 200 210 220 230 240
210 220 230 240 250 260
pF1KE1 LLFPPAEALQVGIVDQVVPEEQ----VQSTALSAIAQWMAIPDHARQLTKAMMRKATASR
.. :: ::....:. ..: . ::. ... : ...: . ..
CCDS66 RVLDGKEAKAVGLISHVLEQNQEGDAAYRKALDLAREFLPQGPVAMRVAKLAINQGMEVD
250 260 270 280 290 300
270 280 290 300
pF1KE1 LVTQRDADVQNFVSFI-SKDSIQKSLQMYLERLKEEKG
::: . ... : .:: .. : . .: . ::
CCDS66 LVTGLAIEEACYAQTIPTKDRLEGLLAFKEKRPPRYKGE
310 320 330
>>CCDS1721.1 HADHA gene_id:3030|Hs108|chr2 (763 aa)
initn: 233 init1: 148 opt: 259 Z-score: 318.4 bits: 68.3 E(32554): 2.1e-11
Smith-Waterman score: 259; 29.1% identity (61.7% similar) in 196 aa overlap (40-227:27-222)
10 20 30 40 50 60
pF1KE1 PARVLLRAGARLPGAALGRTERAAGGGDGARRF-GSQRVLVEPDAGAGV----AVMKFKN
: : ::. .:.. . :: ::.....
CCDS17 MVACRAIGILSRFSAFRILRSRGYICRNFTGSSALLTRTHINYGVKGDVAVVRINS
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 P--PVNSLSLEFLTELVISLEKLENDKSFRGVILTSDRPGVFSAGLDLTEMCG-RSPAHY
: ::.:: :. .:. .... . ..:...: :..:: : :: :.. . . .. .
CCDS17 PNSKVNTLSKELHSEFSEVMNEIWASDQIRSAVLISSKPGCFIAGADINMLAACKTLQEV
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE1 AGYWKAVQELWLRLYQSNLVLVSAINGACPAGGCLVALTCDYRILADNPRYCIGLNETQL
. . .:.. .: .:. .:.::::.: .:: ::..:.::: . . . .: :. :
CCDS17 TQLSQEAQRIVEKLEKSTKPIVAAINGSCLGGGLEVAISCQYRIATKDRKTVLGTPEVLL
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE1 GIIAPFWLKDTLENTIGHRAAERALQLGLLFPPAEALQVGIVDQVVPEEQVQSTALSAIA
: . . : . .: :: . : . .: ..:.:::.:
CCDS17 GALPGAGGTQRLPKMVGVPAALDMMLTGRSIRADRAKKMGLVDQLVEPLGPGLKPPEERT
180 190 200 210 220 230
250 260 270 280 290 300
pF1KE1 QWMAIPDHARQLTKAMMRKATASRLVTQRDADVQNFVSFISKDSIQKSLQMYLERLKEEK
CCDS17 IEYLEEVAITFAKGLADKKISPKRDKGLVEKLTAYAMTIPFVRQQVYKKVEEKVRKQTKG
240 250 260 270 280 290
302 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 20:16:22 2016 done: Sun Nov 6 20:16:23 2016
Total Scan time: 2.630 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]