FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4784, 271 aa
1>>>pF1KB4784 271 - 271 aa - 271 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5984+/-0.000841; mu= 12.4594+/- 0.050
mean_var=58.5906+/-12.017, 0's: 0 Z-trim(105.1): 19 B-trim: 216 in 1/52
Lambda= 0.167556
statistics sampled from 8233 (8245) to 8233 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.647), E-opt: 0.2 (0.253), width: 16
Scan time: 2.350
The best scores are:                                  opt  bits E(32554)
CCDS41801.1 CTDSP2 gene_id:10106|Hs108|chr12  ( 271) 1806 445.0 2.7e-125
CCDS33734.1 CTDSPL gene_id:10217|Hs108|chr3   ( 276) 1083 270.2 1.1e-72
CCDS33735.1 CTDSPL gene_id:10217|Hs108|chr3   ( 265)  990 247.7 6.4e-66
CCDS2416.1 CTDSP1 gene_id:58190|Hs108|chr2    ( 261)  981 245.5 2.8e-65
CCDS56166.1 CTDSP1 gene_id:58190|Hs108|chr2   ( 260)  976 244.3 6.5e-65
CCDS10110.1 CTDSPL2 gene_id:51496|Hs108|chr15 ( 466)  509 131.5 1.1e-30
CCDS11093.1 CTDNEP1 gene_id:23399|Hs108|chr17 ( 244)  415 108.7 4.1e-24
CCDS33023.1 TIMM50 gene_id:92609|Hs108|chr19  ( 456)  240  66.4 4e-11
>>CCDS41801.1 CTDSP2 gene_id:10106|Hs108|chr12 (271 aa)
initn: 1806 init1: 1806 opt: 1806 Z-score: 2362.9 bits: 445.0 E(32554): 2.7e-125
Smith-Waterman score: 1806; 100.0% identity (100.0% similar) in 271 aa overlap (1-271:1-271)
               10        20        30        40        50        60
pF1KB4 MEHGSIITQARREDALVLTKQGLVSKSSPKKPRGRNIFKALFCCFRAQHVGQSSSSTELA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MEHGSIITQARREDALVLTKQGLVSKSSPKKPRGRNIFKALFCCFRAQHVGQSSSSTELA
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB4 AYKEEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLVHSSFKPI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 AYKEEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLVHSSFKPI
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB4 NNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPVTDLLDRC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 NNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPVTDLLDRC
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB4 GVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPVQSWFDDM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 GVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPVQSWFDDM
              190       200       210       220       230       240
              250       260       270
pF1KB4 ADTELLNLIPIFEELSGAEDVYTSLGQLRAP
       :::::::::::::::::::::::::::::::
CCDS41 ADTELLNLIPIFEELSGAEDVYTSLGQLRAP
              250       260       270
>>CCDS33734.1 CTDSPL gene_id:10217|Hs108|chr3 (276 aa)
initn: 1093 init1: 1033 opt: 1083 Z-score: 1418.2 bits: 270.2 E(32554): 1.1e-72
Smith-Waterman score: 1083; 64.4% identity (78.9% similar) in 275 aa overlap (1-268:1-273)
10 20 30 40 50
pF1KB4 MEHGSIITQAR--REDALVLTKQGLVSKS---SPKKPRGRNIFKALFCCFRAQHV--GQS
:. .::::. .:: : : ... : :: :.:.:....::::: .:
CCDS33 MDGPAIITQVTNPKEDEGRLPGAGEKASQCNVSLKKQRSRSILSSFFCCFRDYNVEAPPP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB4 SSSTELAAYKEEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLV
:: . : :: . . :.: : . . :. :::::: : :. :::::::::::
CCDS33 SSPSVLPPLVEENGGLQKGDQRQVIPIP--SPPAKYLLPEVTVLDYGKKCVVIDLDETLV
70 80 90 100 110
120 130 140 150 160 170
pF1KB4 HSSFKPINNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPV
:::::::.:::::::.::.:: :::::::::.:::::.:::.::::::::::::::::::
CCDS33 HSSFKPISNADFIVPVEIDGTIHQVYVLKRPHVDEFLQRMGQLFECVLFTASLAKYADPV
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB4 TDLLDRCGVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPV
.::::: :::::::::::::::.: ::::::::::.: :..:.:::::::::::::::::
CCDS33 ADLLDRWGVFRARLFRESCVFHRGNYVKDLSRLGRELSKVIIVDNSPASYIFHPENAVPV
180 190 200 210 220 230
240 250 260 270
pF1KB4 QSWFDDMADTELLNLIPIFEELSGAEDVYTSLGQLRAP
:::::::.:::::.:::.:: :: .:::. : .:
CCDS33 QSWFDDMTDTELLDLIPFFEGLSREDDVYSMLHRLCNR
240 250 260 270
>>CCDS33735.1 CTDSPL gene_id:10217|Hs108|chr3 (265 aa)
initn: 1116 init1: 984 opt: 990 Z-score: 1297.0 bits: 247.7 E(32554): 6.4e-66
Smith-Waterman score: 1055; 63.6% identity (77.1% similar) in 275 aa overlap (1-268:1-262)
10 20 30 40 50
pF1KB4 MEHGSIITQAR--REDALVLTKQGLVSKS---SPKKPRGRNIFKALFCCFRAQHV--GQS
:. .::::. .:: : : ... : :: :.:.:....::::: .:
CCDS33 MDGPAIITQVTNPKEDEGRLPGAGEKASQCNVSLKKQRSRSILSSFFCCFRDYNVEAPPP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB4 SSSTELAAYKEEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLV
:: . : :: . . : :. :::::: : :. :::::::::::
CCDS33 SSPSVLPPLVEENGGLQKP-------------PAKYLLPEVTVLDYGKKCVVIDLDETLV
70 80 90 100
120 130 140 150 160 170
pF1KB4 HSSFKPINNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPV
:::::::.:::::::.::.:: :::::::::.:::::.:::.::::::::::::::::::
CCDS33 HSSFKPISNADFIVPVEIDGTIHQVYVLKRPHVDEFLQRMGQLFECVLFTASLAKYADPV
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB4 TDLLDRCGVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPV
.::::: :::::::::::::::.: ::::::::::.: :..:.:::::::::::::::::
CCDS33 ADLLDRWGVFRARLFRESCVFHRGNYVKDLSRLGRELSKVIIVDNSPASYIFHPENAVPV
170 180 190 200 210 220
240 250 260 270
pF1KB4 QSWFDDMADTELLNLIPIFEELSGAEDVYTSLGQLRAP
:::::::.:::::.:::.:: :: .:::. : .:
CCDS33 QSWFDDMTDTELLDLIPFFEGLSREDDVYSMLHRLCNR
230 240 250 260
>>CCDS2416.1 CTDSP1 gene_id:58190|Hs108|chr2 (261 aa)
initn: 1038 init1: 973 opt: 981 Z-score: 1285.3 bits: 245.5 E(32554): 2.8e-65
Smith-Waterman score: 1049; 60.9% identity (80.1% similar) in 271 aa overlap (1-269:1-258)
10 20 30 40 50
pF1KB4 MEHGSIITQARREDAL-VLTKQGLVSKSSPKKPRGRNIFKALFCCFRAQHVGQSSSSTEL
:. ...::: .:.: : .: .... .:::.:.:...:::: . :.. .
CCDS24 MDSSAVITQISKEEARGPLRGKGDQKSAASQKPRSRGILHSLFCCV-CRDDGEALPAHSG
10 20 30 40 50
60 70 80 90 100 110
pF1KB4 AAYK-EEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLVHSSFK
: :: ..: : : : ::::. .:. .:::::::::::::::::
CCDS24 APLLVEENGAIPK------------QTPVQYLLPEAKAQDSDKICVVIDLDETLVHSSFK
60 70 80 90 100
120 130 140 150 160 170
pF1KB4 PINNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPVTDLLD
:.::::::.:.::.:..:::::::::.:::::.::::::::::::::::::::::.::::
CCDS24 PVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFTASLAKYADPVADLLD
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB4 RCGVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPVQSWFD
. :.:::::::::::::.: :::::::::::::..::::::::::.:::.::::: ::::
CCDS24 KWGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASWFD
170 180 190 200 210 220
240 250 260 270
pF1KB4 DMADTELLNLIPIFEELSGAEDVYTSLGQLRAP
.:.:::: .:.:.::.:: ..:::. : : :
CCDS24 NMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS
230 240 250 260
>>CCDS56166.1 CTDSP1 gene_id:58190|Hs108|chr2 (260 aa)
initn: 1041 init1: 976 opt: 976 Z-score: 1278.8 bits: 244.3 E(32554): 6.5e-65
Smith-Waterman score: 1017; 63.2% identity (82.4% similar) in 250 aa overlap (21-269:22-257)
10 20 30 40 50
pF1KB4 MEHGSIITQARREDALVLTKQGLVSKSSPKKPRGRNIFKALFCCFRAQHVGQSSSSTEL
.: .... .:::.:.:...:::: . :.. .
CCDS56 MVAAPWATQEQEEGRGIQPGDRGDQKSAASQKPRSRGILHSLFCCV-CRDDGEALPAHSG
10 20 30 40 50
60 70 80 90 100 110
pF1KB4 AAYK-EEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLVHSSFK
: :: ..: :. .:: ::::. .:. .:::::::::::::::::
CCDS56 APLLVEENGAIPKTP----VQY---------LLPEAKAQDSDKICVVIDLDETLVHSSFK
60 70 80 90 100
120 130 140 150 160 170
pF1KB4 PINNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPVTDLLD
:.::::::.:.::.:..:::::::::.:::::.::::::::::::::::::::::.::::
CCDS56 PVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFTASLAKYADPVADLLD
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB4 RCGVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPVQSWFD
. :.:::::::::::::.: :::::::::::::..::::::::::.:::.::::: ::::
CCDS56 KWGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASWFD
170 180 190 200 210 220
240 250 260 270
pF1KB4 DMADTELLNLIPIFEELSGAEDVYTSLGQLRAP
.:.:::: .:.:.::.:: ..:::. : : :
CCDS56 NMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS
230 240 250 260
>>CCDS10110.1 CTDSPL2 gene_id:51496|Hs108|chr15 (466 aa)
initn: 497 init1: 283 opt: 509 Z-score: 664.5 bits: 131.5 E(32554): 1.1e-30
Smith-Waterman score: 509; 41.5% identity (69.1% similar) in 217 aa overlap (51-261:238-449)
30 40 50 60 70
pF1KB4 QGLVSKSSPKKPRGRNIFKALFCCFRAQHVGQSSSSTELAAYKEEANTIAKSDLLQCL--
: ::. .: :.:.:. ... ... .
CCDS10 RPSLNNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAE-ATYEEDWEVFDPYYFIKHVPP
210 220 230 240 250 260
80 90 100 110 120 130
pF1KB4 --QYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLVHSSFKPINNADFIVPIEIEGTTH
. :. . :. : . : : . .:.:::::::: :.. ...: . :. .. . .
CCDS10 LTEEQLNRKPALPLKTRSTPE----FSLVLDLDETLVHCSLNELEDAALTFPVLFQDVIY
270 280 290 300 310 320
140 150 160 170 180 190
pF1KB4 QVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPVTDLLD-RCGVFRARLFRESCVFH
:::: ::. :::.::....: .::::: ::: . ..:: . . : ::::: ::
CCDS10 QVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFREHCVCV
330 340 350 360 370 380
200 210 220 230 240 250
pF1KB4 QGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPVQSWFDDMADTELLNLIPIFEEL
:: :.:::. ::::: ::.:.:::: .. .. :..:..::: : :.:::.:::..:.:
CCDS10 QGNYIKDLNILGRDLSKTIIIDNSPQAFAYQLSNGIPIESWFMDKNDNELLKLIPFLEKL
390 400 410 420 430 440
260 270
pF1KB4 SG-AEDVYTSLGQLRAP
:::
CCDS10 VELNEDVRPHIRDRFRLHDLLPPD
450 460
>>CCDS11093.1 CTDNEP1 gene_id:23399|Hs108|chr17 (244 aa)
initn: 402 init1: 228 opt: 415 Z-score: 546.4 bits: 108.7 E(32554): 4.1e-24
Smith-Waterman score: 454; 40.9% identity (69.9% similar) in 176 aa overlap (101-267:61-236)
80 90 100 110 120
pF1KB4 KSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLVHS--------SFKPINN
: .:.::::::.:: . .: .
CCDS11 QIRTVIQYQTVRYDILPLSPVSRNRLAQVKRKILVLDLDETLIHSHHDGVLRPTVRPGTP
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB4 ADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPVTDLLDRC-G
:::. . :. . .: :::.:: ::. ... .: :.::::. :.. :.: :: .
CCDS11 PDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKLDNSRS
100 110 120 130 140 150
190 200 210 220 230 240
pF1KB4 VFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPVQSWFDDMA
... : .:. :... : :.:::: . :: . .::::::..: ::.::.:..:::.: .
CCDS11 ILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSWFSDPS
160 170 180 190 200 210
250 260 270
pF1KB4 DTELLNLIPIFEELSGAEDVYTSLGQLRAP
:: ::::.:... : . :: . :..
CCDS11 DTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW
220 230 240
>>CCDS33023.1 TIMM50 gene_id:92609|Hs108|chr19 (456 aa)
initn: 242 init1: 217 opt: 240 Z-score: 313.2 bits: 66.4 E(32554): 4e-11
Smith-Waterman score: 245; 28.7% identity (58.0% similar) in 181 aa overlap (89-265:236-401)
60 70 80 90 100 110
pF1KB4 LAAYKEEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEED--QGRICVVIDLDETLVHSS
::::. .: : .:..: .:.:
CCDS33 DNDPILVQQLRRTYKYFKDYRQMIIEPTSPCLLPDPLQEPYYQPPYTLVLELTGVLLHPE
210 220 230 240 250 260
120 130 140 150 160 170
pF1KB4 FKPINNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPVTDL
.. .. : ::: .. ...... :.: :.::. . : :. :
CCDS33 WSLATGWRFK---------------KRPGIETLFQQLAPLYEIVIFTSETGMTAFPLIDS
270 280 290 300 310
180 190 200 210 220 230
pF1KB4 LDRCGVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPVQSW
.: : . ::::.. . .: .:::.: :.:: .....: . .. ..: :.: .. :
CCDS33 VDPHGFISYRLFRDATRYMDGHHVKDISCLNRDPARVVVVDCKKEAFRLQPYNGVALRPW
320 330 340 350 360 370
240 250 260 270
pF1KB4 FDDMADTELLNLIPIFEE--LSGAEDVYTSLGQLRAP
. : ::.: ... :.:.::: : :
CCDS33 DGNSDDRVLLDLSAFLKTIALNGVEDVRTVLEHYALEDDPLAAFKQRQSRLEQEEQQRLA
380 390 400 410 420 430
CCDS33 ELSKSNKQNLFLGSLTSRLWPRSKQP
440 450
271 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 21:42:15 2016 done: Thu Nov 3 21:42:15 2016
Total Scan time: 2.350 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]