FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7629, 319 aa
1>>>pF1KB7629 319 - 319 aa - 319 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.6613+/-0.00111; mu= 4.8912+/- 0.066
mean_var=162.2770+/-32.946, 0's: 0 Z-trim(107.0): 234 B-trim: 0 in 0/55
Lambda= 0.100681
statistics sampled from 9028 (9291) to 9028 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.649), E-opt: 0.2 (0.285), width: 16
Scan time: 2.690
The best scores are:                                  opt  bits E(32554)
CCDS7412.1 ANKRD1 gene_id:27063|Hs108|chr10   ( 319) 2070 312.8 2.2e-85
CCDS7466.1 ANKRD2 gene_id:26287|Hs108|chr10   ( 360)  846 135.1 8.2e-32
CCDS2027.1 ANKRD23 gene_id:200539|Hs108|chr2  ( 305)  773 124.4 1.1e-28
CCDS44468.1 ANKRD2 gene_id:26287|Hs108|chr10  ( 327)  571  95.1   8e-20
>>CCDS7412.1 ANKRD1 gene_id:27063|Hs108|chr10 (319 aa)
initn: 2070 init1: 2070 opt: 2070 Z-score: 1646.2 bits: 312.8 E(32554): 2.2e-85
Smith-Waterman score: 2070; 100.0% identity (100.0% similar) in 319 aa overlap (1-319:1-319)
               10        20        30        40        50        60
pF1KB7 MMVLKVEELVTGKKNGNGEAGEFLPEDFRDGEYEAAVTLEKQEDLKTLLAHPVTLGEQQW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 MMVLKVEELVTGKKNGNGEAGEFLPEDFRDGEYEAAVTLEKQEDLKTLLAHPVTLGEQQW
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 KSEKQREAELKKKKLEQRSKLENLEDLEIIIQLKKRKKYRKTKVPVVKEPEPEIITEPVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 KSEKQREAELKKKKLEQRSKLENLEDLEIIIQLKKRKKYRKTKVPVVKEPEPEIITEPVD
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 VPTFLKAALENKLPVVEKFLSDKNNPDVCDEYKRTALHRACLEGHLAIVEKLMEAGAQIE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 VPTFLKAALENKLPVVEKFLSDKNNPDVCDEYKRTALHRACLEGHLAIVEKLMEAGAQIE
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 FRDMLESTAIHWASRGGNLDVLKLLLNKGAKISARDKLLSTALHVAVRTGHYECAEHLIA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 FRDMLESTAIHWASRGGNLDVLKLLLNKGAKISARDKLLSTALHVAVRTGHYECAEHLIA
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB7 CEADLNAKDREGDTPLHDAVRLNRYKMIRLLIMYGADLNIKNCAGKTPMDLVLHWQNGTK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 CEADLNAKDREGDTPLHDAVRLNRYKMIRLLIMYGADLNIKNCAGKTPMDLVLHWQNGTK
              250       260       270       280       290       300
              310
pF1KB7 AIFDSLRENSYKTSRIATF
       :::::::::::::::::::
CCDS74 AIFDSLRENSYKTSRIATF
              310
>>CCDS7466.1 ANKRD2 gene_id:26287|Hs108|chr10 (360 aa)
initn: 1119 init1: 800 opt: 846 Z-score: 684.6 bits: 135.1 E(32554): 8.2e-32
Smith-Waterman score: 846; 48.4% identity (75.1% similar) in 285 aa overlap (20-300:50-328)
10 20 30 40
pF1KB7 MMVLKVEELVTGKKNGNGEAGEFLPEDFR-DGEYEAAVTLEKQEDLKTL
: : : .: :.. . . : :: :
CCDS74 ALWPAEAVMDGTMEDSEAVQRATALIEQRLAQEEENEKLRGDARQKLPMDLLVLEDEKHH
20 30 40 50 60 70
50 60 70 80 90 100
pF1KB7 LAHPVTLGEQQWKSEKQREAELKKKKLEQRSKLENLEDLEIIIQLKKRKKYRKTKVPVVK
:. ..: :. :.... ..: .:. : .. .. .. .:.:.:..: .: . ...
CCDS74 GAQSAAL--QKVKGQER----VRKTSLDLRREIIDVGGIQNLIELRKKRKQKKRDALAAS
80 90 100 110 120 130
110 120 130 140 150 160
pF1KB7 E---PEPEIITEPVDVPTFLKAALENKLPVVEKFLSDKNNPDVCDEYKRTALHRACLEGH
. :::: :: ::: ::::::.:.:. :.::::.: .. :.::...::::::: ::::
CCDS74 HEPPPEPEEITGPVDEETFLKAAVEGKMKVIEKFLADGGSADTCDQFRRTALHRASLEGH
140 150 160 170 180 190
170 180 190 200 210 220
pF1KB7 LAIVEKLMEAGAQIEFRDMLESTAIHWASRGGNLDVLKLLLNKGAKISARDKLLSTALHV
. :.:::.. :: ..:.: :. ::.::: :::.:.:.::: ..:: ..::::::: :::
CCDS74 MEILEKLLDNGATVDFQDRLDCTAMHWACRGGHLEVVKLLQSHGADTNVRDKLLSTPLHV
200 210 220 230 240 250
230 240 250 260 270 280
pF1KB7 AVRTGHYECAEHLIACEADLNAKDREGDTPLHDAVRLNRYKMIRLLIMYGADLNIKNCAG
:::::. : .::... ..::.:::::: :::::::::::.:.::...:::. :: ::
CCDS74 AVRTGQVEIVEHFLSLGLEINARDREGDTALHDAVRLNRYKIIKLLLLHGADMMTKNLAG
260 270 280 290 300 310
290 300 310
pF1KB7 KTPMDLVLHWQNGTKAIFDSLRENSYKTSRIATF
::: ::: :: :.
CCDS74 KTPTDLVQLWQADTRHALEHPEPGAEHNGLEGPNDSGRETPQPVPAQ
320 330 340 350 360
>>CCDS2027.1 ANKRD23 gene_id:200539|Hs108|chr2 (305 aa)
initn: 1168 init1: 678 opt: 773 Z-score: 628.3 bits: 124.4 E(32554): 1.1e-28
Smith-Waterman score: 773; 49.6% identity (73.8% similar) in 248 aa overlap (63-300:47-291)
40 50 60 70 80
pF1KB7 YEAAVTLEKQEDLKTLLAHPVTLGEQQWKSEKQREAELKKKKLEQ----RSKLENLEDLE
:: . : ::::::. : .:.:: :::
CCDS20 GKVLGFGHGVPDPGAWPSDWRRGPQEAVAREKLKLEEEKKKKLERFNSTRFNLDNLADLE
20 30 40 50 60 70
90 100 110 120 130 140
pF1KB7 IIIQLKKRKKYRKTKVPVVKEPEPEII------TEPVDVPTFLKAALENKLPVVEKFLSD
..: .::: . .:: ..::: . .::: . ::::: ::. ...:.:.:
CCDS20 NLVQ--RRKKRLRHRVPP-RKPEPLVKPQSQAQVEPVGLEMFLKAAAENQEYLIDKYLTD
80 90 100 110 120 130
150 160 170 180 190 200
pF1KB7 KNNPDVCDEYKRTALHRACLEGHLAIVEKLMEAGAQIEFRDMLESTAIHWASRGGNLDVL
..:.. :. .::::: :::.:: .:.::. ::: .. ::.:. : . :: :::.: .:
CCDS20 GGDPNAHDKLHRTALHWACLKGHSQLVNKLLVAGATVDARDLLDRTPVFWACRGGHLVIL
140 150 160 170 180 190
210 220 230 240 250 260
pF1KB7 KLLLNKGAKISARDKLLSTALHVAVRTGHYECAEHLIACEADLNAKDREGDTPLHDAVRL
: :::.::...::::. :: ::::::: : .: :::: : : :::.:.:::: ::.:::
CCDS20 KQLLNQGARVNARDKIGSTPLHVAVRTRHPDCLEHLIECGAHLNAQDKEGDTALHEAVRH
200 210 220 230 240 250
270 280 290 300 310
pF1KB7 NRYKMIRLLIMYGADLNIKNCAGKTPMDLVLHWQNGTKAIFDSLRENSYKTSRIATF
. :: ..::..:::.:...: :. ::..:. :: : .
CCDS20 GSYKAMKLLLLYGAELGVRNAASVTPVQLARDWQRGIREALQAHVAHPRTRC
260 270 280 290 300
>>CCDS44468.1 ANKRD2 gene_id:26287|Hs108|chr10 (327 aa)
initn: 947 init1: 521 opt: 571 Z-score: 469.4 bits: 95.1 E(32554): 8e-20
Smith-Waterman score: 658; 41.8% identity (66.0% similar) in 285 aa overlap (20-300:50-295)
10 20 30 40
pF1KB7 MMVLKVEELVTGKKNGNGEAGEFLPEDFR-DGEYEAAVTLEKQEDLKTL
: : : .: :.. . . : :: :
CCDS44 ALWPAEAVMDGTMEDSEAVQRATALIEQRLAQEEENEKLRGDARQKLPMDLLVLEDEKHH
20 30 40 50 60 70
50 60 70 80 90 100
pF1KB7 LAHPVTLGEQQWKSEKQREAELKKKKLEQRSKLENLEDLEIIIQLKKRKKYRKTKVPVVK
:. ..: :. :.... ..: .:. : .. .. .. .:.:.:..: .: . ...
CCDS44 GAQSAAL--QKVKGQER----VRKTSLDLRREIIDVGGIQNLIELRKKRKQKKRDALAAS
80 90 100 110 120 130
110 120 130 140 150 160
pF1KB7 E---PEPEIITEPVDVPTFLKAALENKLPVVEKFLSDKNNPDVCDEYKRTALHRACLEGH
. :::: :: ::: ::::::.:.:. :.::::.: .. :.::...::::::: ::::
CCDS44 HEPPPEPEEITGPVDEETFLKAAVEGKMKVIEKFLADGGSADTCDQFRRTALHRASLEGH
140 150 160 170 180 190
170 180 190 200 210 220
pF1KB7 LAIVEKLMEAGAQIEFRDMLESTAIHWASRGGNLDVLKLLLNKGAKISARDKLLSTALHV
. :.:::.. :: ..:.: :. ::.::: :::.:.:.::: ..::
CCDS44 MEILEKLLDNGATVDFQDRLDCTAMHWACRGGHLEVVKLLQSHGA---------------
200 210 220 230
230 240 250 260 270 280
pF1KB7 AVRTGHYECAEHLIACEADLNAKDREGDTPLHDAVRLNRYKMIRLLIMYGADLNIKNCAG
: :..:.:::: :::::::::::.:.::...:::. :: ::
CCDS44 ------------------DTNVRDKEGDTALHDAVRLNRYKIIKLLLLHGADMMTKNLAG
240 250 260 270 280
290 300 310
pF1KB7 KTPMDLVLHWQNGTKAIFDSLRENSYKTSRIATF
::: ::: :: :.
CCDS44 KTPTDLVQLWQADTRHALEHPEPGAEHNGLEGPNDSGRETPQPVPAQ
290 300 310 320
319 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 18:08:07 2016 done: Sat Nov 5 18:08:07 2016
Total Scan time: 2.690 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]