FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8147, 268 aa
1>>>pF1KB8147 268 - 268 aa - 268 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8572+/-0.000826; mu= 8.9904+/- 0.050
mean_var=107.9156+/-21.545, 0's: 0 Z-trim(109.8): 8 B-trim: 0 in 0/51
Lambda= 0.123462
statistics sampled from 11167 (11174) to 11167 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.71), E-opt: 0.2 (0.343), width: 16
Scan time: 1.670
The best scores are:                                  opt  bits E(32554)
CCDS13208.1 MAPRE1 gene_id:22919|Hs108|chr20  ( 268) 1777 326.6 1.1e-89
CCDS45851.1 MAPRE2 gene_id:10982|Hs108|chr18  ( 284) 1018 191.4 5.9e-49
CCDS45850.1 MAPRE2 gene_id:10982|Hs108|chr18  ( 315) 1018 191.4 6.5e-49
CCDS11910.1 MAPRE2 gene_id:10982|Hs108|chr18  ( 327) 1018 191.4 6.7e-49
CCDS1731.1 MAPRE3 gene_id:22924|Hs108|chr2    ( 281)  885 167.7 7.9e-42
CCDS58619.1 MAPRE2 gene_id:10982|Hs108|chr18  ( 274)  827 157.4 1e-38
>>CCDS13208.1 MAPRE1 gene_id:22919|Hs108|chr20 (268 aa)
initn: 1777 init1: 1777 opt: 1777 Z-score: 1723.4 bits: 326.6 E(32554): 1.1e-89
Smith-Waterman score: 1777; 100.0% identity (100.0% similar) in 268 aa overlap (1-268:1-268)
10 20 30 40 50 60
pF1KB8 MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 VKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 GKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 RKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 RKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQ
190 200 210 220 230 240
250 260
pF1KB8 RIVDILYATDEGFVIPDEGGPQEEQEEY
::::::::::::::::::::::::::::
CCDS13 RIVDILYATDEGFVIPDEGGPQEEQEEY
250 260
>>CCDS45851.1 MAPRE2 gene_id:10982|Hs108|chr18 (284 aa)
initn: 1007 init1: 784 opt: 1018 Z-score: 992.3 bits: 191.4 E(32554): 5.9e-49
Smith-Waterman score: 1018; 56.3% identity (78.3% similar) in 277 aa overlap (1-267:1-270)
10 20 30 40 50 60
pF1KB8 MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKK
:::::::::.:....::::..::.:. ..:: ::.::::::::::::::::::: :.:::
CCDS45 MAVNVYSTSITQETMSRHDIIAWVNDIVSLNYTKVEQLCSGAAYCQFMDMLFPGCISLKK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 VKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYD
::::::::::::.:::.:::.::::.:::.:::.:::::.::::..:.::::::.:::::
CCDS45 VKFQAKLEHEYIHNFKLLQASFKRMNVDKVIPVEKLVKGRFQDNLDFIQWFKKFYDANYD
70 80 90 100 110 120
130 140 150 160 170
pF1KB8 GKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPG--
::.:::: :::::.. :. .: ::: ..: : . . :: : . :.
CCDS45 GKEYDPVEARQGQDAIPPPDPGEQIFNLPKKSHHANS--PTAGAAKSSPAAKPGSTPSRP
130 140 150 160 170
180 190 200 210 220 230
pF1KB8 -VVRKNPGVGNG-------DDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQE
... . :.. . .. .: .::. :::..: .::::::::::::.:::.:::
CCDS45 SSAKRASSSGSASKSDKDLETQVIQLNEQVHSLKLALEGVEKERDFYFGKLREIELLCQE
180 190 200 210 220 230
240 250 260
pF1KB8 NEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEY
. ::: ..::..:::::..: :: .: . :
CCDS45 HGQENDDLVQRLMDILYASEE-----HEGHTEEPEAEEQAHEQQPPQQEEY
240 250 260 270 280
>>CCDS45850.1 MAPRE2 gene_id:10982|Hs108|chr18 (315 aa)
initn: 1007 init1: 784 opt: 1018 Z-score: 991.7 bits: 191.4 E(32554): 6.5e-49
Smith-Waterman score: 1018; 56.3% identity (78.3% similar) in 277 aa overlap (1-267:32-301)
10 20 30
pF1KB8 MAVNVYSTSVTSDNLSRHDMLAWINESLQL
:::::::::.:....::::..::.:. ..:
CCDS45 KQNRDQKCPVSQRNSSFQQPGRKPGCSSWGMAVNVYSTSITQETMSRHDIIAWVNDIVSL
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB8 NLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKI
: ::.::::::::::::::::::: :.:::::::::::::::.:::.:::.::::.:::.
CCDS45 NYTKVEQLCSGAAYCQFMDMLFPGCISLKKVKFQAKLEHEYIHNFKLLQASFKRMNVDKV
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB8 IPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPK
:::.:::::.::::..:.::::::.:::::::.:::: :::::.. :. .: ::
CCDS45 IPVEKLVKGRFQDNLDFIQWFKKFYDANYDGKEYDPVEARQGQDAIPPPDPGEQIFNLPK
130 140 150 160 170 180
160 170 180 190 200
pF1KB8 KPLTSSSAAPQRPISTQRTAAAPKAGPG---VVRKNPGVGNG-------DDEAAELMQQV
: ..: : . . :: : . :. ... . :.. . .. .: .::
CCDS45 KSHHANS--PTAGAAKSSPAAKPGSTPSRPSSAKRASSSGSASKSDKDLETQVIQLNEQV
190 200 210 220 230
210 220 230 240 250 260
pF1KB8 NVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGG
. :::..: .::::::::::::.:::.:::. ::: ..::..:::::..: ::
CCDS45 HSLKLALEGVEKERDFYFGKLREIELLCQEHGQENDDLVQRLMDILYASEE-----HEGH
240 250 260 270 280 290
pF1KB8 PQEEQEEY
.: . :
CCDS45 TEEPEAEEQAHEQQPPQQEEY
300 310
>>CCDS11910.1 MAPRE2 gene_id:10982|Hs108|chr18 (327 aa)
initn: 1007 init1: 784 opt: 1018 Z-score: 991.4 bits: 191.4 E(32554): 6.7e-49
Smith-Waterman score: 1018; 56.3% identity (78.3% similar) in 277 aa overlap (1-267:44-313)
10 20 30
pF1KB8 MAVNVYSTSVTSDNLSRHDMLAWINESLQL
:::::::::.:....::::..::.:. ..:
CCDS11 NNNDIIQDNNGTIIPFRKHTVRGERSYSWGMAVNVYSTSITQETMSRHDIIAWVNDIVSL
20 30 40 50 60 70
40 50 60 70 80 90
pF1KB8 NLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKI
: ::.::::::::::::::::::: :.:::::::::::::::.:::.:::.::::.:::.
CCDS11 NYTKVEQLCSGAAYCQFMDMLFPGCISLKKVKFQAKLEHEYIHNFKLLQASFKRMNVDKV
80 90 100 110 120 130
100 110 120 130 140 150
pF1KB8 IPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPK
:::.:::::.::::..:.::::::.:::::::.:::: :::::.. :. .: ::
CCDS11 IPVEKLVKGRFQDNLDFIQWFKKFYDANYDGKEYDPVEARQGQDAIPPPDPGEQIFNLPK
140 150 160 170 180 190
160 170 180 190 200
pF1KB8 KPLTSSSAAPQRPISTQRTAAAPKAGPG---VVRKNPGVGNG-------DDEAAELMQQV
: ..: : . . :: : . :. ... . :.. . .. .: .::
CCDS11 KSHHANS--PTAGAAKSSPAAKPGSTPSRPSSAKRASSSGSASKSDKDLETQVIQLNEQV
200 210 220 230 240 250
210 220 230 240 250 260
pF1KB8 NVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGG
. :::..: .::::::::::::.:::.:::. ::: ..::..:::::..: ::
CCDS11 HSLKLALEGVEKERDFYFGKLREIELLCQEHGQENDDLVQRLMDILYASEE-----HEGH
260 270 280 290 300
pF1KB8 PQEEQEEY
.: . :
CCDS11 TEEPEAEEQAHEQQPPQQEEY
310 320
>>CCDS1731.1 MAPRE3 gene_id:22924|Hs108|chr2 (281 aa)
initn: 1157 init1: 838 opt: 885 Z-score: 864.4 bits: 167.7 E(32554): 7.9e-42
Smith-Waterman score: 1159; 65.4% identity (81.6% similar) in 283 aa overlap (1-268:1-281)
10 20 30 40 50 60
pF1KB8 MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKK
::::::::::::.::::::::::.:.::.:: :::::::::::::::::::::: . :.:
CCDS17 MAVNVYSTSVTSENLSRHDMLAWVNDSLHLNYTKIEQLCSGAAYCQFMDMLFPGCVHLRK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 VKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYD
::::::::::::.:::.:::.::.:::::::::.:::::::::::::.::::::::::::
CCDS17 VKFQAKLEHEYIHNFKVLQAAFKKMGVDKIIPVEKLVKGKFQDNFEFIQWFKKFFDANYD
70 80 90 100 110 120
130 140 150 160 170
pF1KB8 GKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQR-----PISTQRTAAAPK-
::::.:. :::::..: :. .:: :: . ..:.::: : . : .. .
CCDS17 GKDYNPLLARQGQDVAPPPNPGDQIFNKSKKLI--GTAVPQRTSPTGPKNMQTSGRLSNV
130 140 150 160 170
180 190 200 210 220
pF1KB8 AGPGVVRKNP-GVGNG----DDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQ
: : ..:::: .. :: : . :: ::. :::::. :::::::::.:::.::::::
CCDS17 APPCILRKNPPSARNGGHETDAQILELNQQLVDLKLTVDGLEKERDFYFSKLRDIELICQ
180 190 200 210 220 230
230 240 250 260
pF1KB8 ENEGENDPVLQRIVDILYATDEGFVIPD----EGGPQEEQEEY
:.:.::.::.. :. :::::.:::. :. : ::.:.::
CCDS17 EHESENSPVISGIIGILYATEEGFAPPEDDEIEEHQQEDQDEY
240 250 260 270 280
>>CCDS58619.1 MAPRE2 gene_id:10982|Hs108|chr18 (274 aa)
initn: 816 init1: 593 opt: 827 Z-score: 808.7 bits: 157.4 E(32554): 1e-38
Smith-Waterman score: 827; 54.2% identity (75.6% similar) in 238 aa overlap (40-267:30-260)
10 20 30 40 50 60
pF1KB8 VTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEH
.:::::::::::::: :.::::::::::::
CCDS58 MARTTTTSSRIITGPSFLSGSTQCAGSVPTGAAYCQFMDMLFPGCISLKKVKFQAKLEH
10 20 30 40 50
70 80 90 100 110 120
pF1KB8 EYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAA
:::.:::.:::.::::.:::.:::.:::::.::::..:.::::::.:::::::.:::: :
CCDS58 EYIHNFKLLQASFKRMNVDKVIPVEKLVKGRFQDNLDFIQWFKKFYDANYDGKEYDPVEA
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB8 RQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPG---VVRKNPGV
::::.. :. .: ::: . . .: . . :: : . :. ... .
CCDS58 RQGQDAIPPPDPGEQIFNLPKK--SHHANSPTAGAAKSSPAAKPGSTPSRPSSAKRASSS
120 130 140 150 160 170
190 200 210 220 230
pF1KB8 GNG-------DDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVL
:.. . .. .: .::. :::..: .::::::::::::.:::.:::. ::: ..
CCDS58 GSASKSDKDLETQVIQLNEQVHSLKLALEGVEKERDFYFGKLREIELLCQEHGQENDDLV
180 190 200 210 220 230
240 250 260
pF1KB8 QRIVDILYATDEGFVIPDEGGPQEEQEEY
::..:::::..: :: .: . :
CCDS58 QRLMDILYASEE-----HEGHTEEPEAEEQAHEQQPPQQEEY
240 250 260 270
268 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 10:00:12 2016 done: Fri Nov 4 10:00:12 2016
Total Scan time: 1.670 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]