FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4707, 362 aa
1>>>pF1KB4707 362 - 362 aa - 362 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.1803+/-0.000957; mu= 11.8773+/- 0.057
mean_var=79.3028+/-15.329, 0's: 0 Z-trim(105.5): 33 B-trim: 0 in 0/52
Lambda= 0.144022
statistics sampled from 8436 (8461) to 8436 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.638), E-opt: 0.2 (0.26), width: 16
Scan time: 2.460
The best scores are:                                      opt bits E(32554)
CCDS4324.1 MFAP3 gene_id:4238|Hs108|chr5           ( 362) 2367 501.6 4.4e-142
CCDS47319.1 MFAP3 gene_id:4238|Hs108|chr5          ( 216) 1405 301.6 4.1e-82
CCDS34103.1 MFAP3L gene_id:9848|Hs108|chr4         ( 409) 1015 220.7 1.8e-57
CCDS43281.1 MFAP3L gene_id:9848|Hs108|chr4         ( 306)  920 200.9 1.2e-51
>>CCDS4324.1 MFAP3 gene_id:4238|Hs108|chr5 (362 aa)
initn: 2367 init1: 2367 opt: 2367 Z-score: 2664.3 bits: 501.6 E(32554): 4.4e-142
Smith-Waterman score: 2367; 100.0% identity (100.0% similar) in 362 aa overlap (1-362:1-362)
               10        20        30        40        50        60
pF1KB4 MKLHCCLFTLVASIIVPAAFVLEDVDFDQMVSLEANRSSYNASFPSSFELSASSHSDDDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MKLHCCLFTLVASIIVPAAFVLEDVDFDQMVSLEANRSSYNASFPSSFELSASSHSDDDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB4 IIAKEGTSVSIECLLTASHYEDVHWHNSKGQQLDGRSRGGKWLVSDNFLNITNVAFDDRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 IIAKEGTSVSIECLLTASHYEDVHWHNSKGQQLDGRSRGGKWLVSDNFLNITNVAFDDRG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB4 LYTCFVTSPIRASYSVTLRVIFTSGDMSVYYMIVCLIAFTITLILNVTRLCMMSSHLRKT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LYTCFVTSPIRASYSVTLRVIFTSGDMSVYYMIVCLIAFTITLILNVTRLCMMSSHLRKT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB4 EKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLELAKVTQFKTMEFARYIEELARSVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 EKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLELAKVTQFKTMEFARYIEELARSVP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB4 LPPLILNCRAFVEEMFEAVRVDDPDDLGERIKERPALNAQGGIYVINPEMGRSNSPGGDS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LPPLILNCRAFVEEMFEAVRVDDPDDLGERIKERPALNAQGGIYVINPEMGRSNSPGGDS
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB4 DDGSLNEQGQEIAVQVSVHLQSETKSIDTESQGSSHFSPPDDIGSAESNCNYKDGAYENC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 DDGSLNEQGQEIAVQVSVHLQSETKSIDTESQGSSHFSPPDDIGSAESNCNYKDGAYENC
              310       320       330       340       350       360

pF1KB4 QL
       ::
CCDS43 QL
>>CCDS47319.1 MFAP3 gene_id:4238|Hs108|chr5 (216 aa)
initn: 1405 init1: 1405 opt: 1405 Z-score: 1587.6 bits: 301.6 E(32554): 4.1e-82
Smith-Waterman score: 1405; 100.0% identity (100.0% similar) in 216 aa overlap (147-362:1-216)
        120       130       140       150       160       170
pF1KB4 DDRGLYTCFVTSPIRASYSVTLRVIFTSGDMSVYYMIVCLIAFTITLILNVTRLCMMSSH
                                     ::::::::::::::::::::::::::::::
CCDS47                               MSVYYMIVCLIAFTITLILNVTRLCMMSSH
                                             10        20        30

        180       190       200       210       220       230
pF1KB4 LRKTEKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLELAKVTQFKTMEFARYIEELA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LRKTEKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLELAKVTQFKTMEFARYIEELA
               40        50        60        70        80        90

        240       250       260       270       280       290
pF1KB4 RSVPLPPLILNCRAFVEEMFEAVRVDDPDDLGERIKERPALNAQGGIYVINPEMGRSNSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RSVPLPPLILNCRAFVEEMFEAVRVDDPDDLGERIKERPALNAQGGIYVINPEMGRSNSP
              100       110       120       130       140       150

        300       310       320       330       340       350
pF1KB4 GGDSDDGSLNEQGQEIAVQVSVHLQSETKSIDTESQGSSHFSPPDDIGSAESNCNYKDGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 GGDSDDGSLNEQGQEIAVQVSVHLQSETKSIDTESQGSSHFSPPDDIGSAESNCNYKDGA
              160       170       180       190       200       210

        360
pF1KB4 YENCQL
       ::::::
CCDS47 YENCQL
>>CCDS34103.1 MFAP3L gene_id:9848|Hs108|chr4 (409 aa)
initn: 1010 init1: 717 opt: 1015 Z-score: 1145.2 bits: 220.7 E(32554): 1.8e-57
Smith-Waterman score: 1017; 55.1% identity (76.9% similar) in 325 aa overlap (22-333:13-336)
10 20 30 40 50
pF1KB4 MKLHCCLFTLVASIIVPAAFVLEDVDFDQMVS-LEANRSSYNASFPSSFELSASSH---S
: .: : .:: : . .: :... .. . .: .
CCDS34 MDRLKSHLTVCFLPSVPFLILVSTLATAKSVTNSTLNGTNVVLGSVPVIIA
10 20 30 40 50
60 70 80 90 100 110
pF1KB4 DDDVIIAKEGTSVSIECLLTASHYEDVHWHNSKGQQL----DGRSRGG-KWLVSDN-FLN
: ::.:::.:. :.: . . . .:.:: :. : : . ::: :: . :. .::
CCDS34 RTDHIIVKEGNSALINCSVYGIPDPQFKWYNSIGKLLKEEEDEKERGGGKWQMHDSGLLN
60 70 80 90 100 110
120 130 140 150 160
pF1KB4 ITNVAFDDRGLYTCFVTSPIRASY--SVTLRVIFTSGDMSVYYMIVCLIAFTITLILNVT
::.:.:.::: ::: :.: : .. .::::::::::::.::::.:::.::::...::.:
CCDS34 ITKVSFSDRGKYTC-VASNIYGTVNNTVTLRVIFTSGDMGVYYMVVCLVAFTIVMVLNIT
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB4 RLCMMSSHLRKTEKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLELAKVTQFKTMEF
:::::::::.::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 RLCMMSSHLKKTEKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLELAKVTQFKTMEF
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB4 ARYIEELARSVPLPPLILNCRAFVEEMFEAVRVDDP-DDLGERIKERPALNAQGGIYVIN
:::::::::::::::::.:::...::..:.: ... ... .. : . .:.:
CCDS34 ARYIEELARSVPLPPLIMNCRTIMEEIMEVVGLEEQGQNFVRHTPEGQEAADRDEVYTIP
240 250 260 270 280 290
290 300 310 320 330 340
pF1KB4 PEMGRSNSPGGDSDDGSLNEQGQEIAVQVSVHLQSETKSIDTESQGSSHFSPPDDIGSAE
. ::.::..::: .::.:: :.::..:::: ::. . : . :
CCDS34 NSLKRSDSPAADSDASSLHEQPQQIAIKVSVHPQSKKEHADDQEGGQFEVKDVEETELSA
300 310 320 330 340 350
350 360
pF1KB4 SNCNYKDGAYENCQL
CCDS34 EHSPETAEPSTDVTSTELTSEEPTPVEVPDKVLPPAYLEATEPAVTHDKNTCIIYESHV
360 370 380 390 400
>>CCDS43281.1 MFAP3L gene_id:9848|Hs108|chr4 (306 aa)
initn: 914 init1: 717 opt: 920 Z-score: 1040.5 bits: 200.9 E(32554): 1.2e-51
Smith-Waterman score: 920; 65.1% identity (85.2% similar) in 229 aa overlap (108-333:6-233)
80 90 100 110 120 130
pF1KB4 SHYEDVHWHNSKGQQLDGRSRGGKWLVSDNFLNITNVAFDDRGLYTCFVTSPIRASY--S
.::::.:.:.::: ::: :.: : .. .
CCDS43 MHDSGLLNITKVSFSDRGKYTC-VASNIYGTVNNT
10 20 30
140 150 160 170 180 190
pF1KB4 VTLRVIFTSGDMSVYYMIVCLIAFTITLILNVTRLCMMSSHLRKTEKAINEFFRTEGAEK
::::::::::::.::::.:::.::::...::.::::::::::.:::::::::::::::::
CCDS43 VTLRVIFTSGDMGVYYMVVCLVAFTIVMVLNITRLCMMSSHLKKTEKAINEFFRTEGAEK
40 50 60 70 80 90
200 210 220 230 240 250
pF1KB4 LQKAFEIAKRIPIITSAKTLELAKVTQFKTMEFARYIEELARSVPLPPLILNCRAFVEEM
::::::::::::::::::::::::::::::::::::::::::::::::::.:::...::.
CCDS43 LQKAFEIAKRIPIITSAKTLELAKVTQFKTMEFARYIEELARSVPLPPLIMNCRTIMEEI
100 110 120 130 140 150
260 270 280 290 300 310
pF1KB4 FEAVRVDDP-DDLGERIKERPALNAQGGIYVINPEMGRSNSPGGDSDDGSLNEQGQEIAV
.:.: ... ... .. : . .:.: . ::.::..::: .::.:: :.::.
CCDS43 MEVVGLEEQGQNFVRHTPEGQEAADRDEVYTIPNSLKRSDSPAADSDASSLHEQPQQIAI
160 170 180 190 200 210
320 330 340 350 360
pF1KB4 QVSVHLQSETKSIDTESQGSSHFSPPDDIGSAESNCNYKDGAYENCQL
.:::: ::. . : . :
CCDS43 KVSVHPQSKKEHADDQEGGQFEVKDVEETELSAEHSPETAEPSTDVTSTELTSEEPTPVE
220 230 240 250 260 270
362 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 06:01:02 2016 done: Sat Nov 5 06:01:03 2016
Total Scan time: 2.460 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]