FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6477, 326 aa
1>>>pF1KB6477 326 - 326 aa - 326 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.4890+/-0.000799; mu= 7.0316+/- 0.048
mean_var=141.7295+/-27.743, 0's: 0 Z-trim(112.4): 20 B-trim: 14 in 1/52
Lambda= 0.107732
statistics sampled from 13166 (13181) to 13166 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.76), E-opt: 0.2 (0.405), width: 16
Scan time: 2.630
The best scores are:                                      opt bits E(32554)
CCDS13301.1 RPRD1B gene_id:58490|Hs108|chr20       ( 326) 2082 334.6 6.5e-92
CCDS77178.1 RPRD1A gene_id:55197|Hs108|chr18       ( 276)  729 124.3 1.1e-28
CCDS11917.1 RPRD1A gene_id:55197|Hs108|chr18       ( 312)  680 116.7 2.5e-26
CCDS44216.1 RPRD2 gene_id:23248|Hs108|chr1         (1461)  373  69.4   2e-11
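
The hit list above has a regular layout (accession, description, library sequence length in parentheses, opt score, bit score, E-value), so it can be read into records with a small parser. The following is a minimal sketch, not part of the FASTA package: the function name parse_hit_line and the field names are illustrative assumptions, and the regular expression is only fitted to the four lines shown in this report.

```python
import re

# Pattern for one line of the "best scores" list, e.g.
#   CCDS13301.1 RPRD1B gene_id:58490|Hs108|chr20 ( 326) 2082 334.6 6.5e-92
# Assumed layout: accession, free-text description, "(length)", opt, bits, E-value.
HIT_LINE = re.compile(
    r"^(\S+)\s+(.+?)\s+\(\s*(\d+)\)\s+(\d+)\s+([\d.]+)\s+(\S+)\s*$"
)

def parse_hit_line(line):
    """Return a dict for one hit line, or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    accession, description, length, opt, bits, evalue = m.groups()
    return {
        "accession": accession,
        "description": description,
        "length": int(length),      # library sequence length in residues
        "opt": int(opt),            # optimized alignment score
        "bits": float(bits),        # bit score
        "evalue": float(evalue),    # E() for the 32554-sequence library
    }

if __name__ == "__main__":
    hits = [
        "CCDS13301.1 RPRD1B gene_id:58490|Hs108|chr20 ( 326) 2082 334.6 6.5e-92",
        "CCDS44216.1 RPRD2 gene_id:23248|Hs108|chr1 (1461) 373 69.4 2e-11",
    ]
    for h in hits:
        print(parse_hit_line(h))
```

The pattern tolerates any amount of whitespace between columns, so it should work on both the column-aligned and the whitespace-collapsed form of the list.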
>>CCDS13301.1 RPRD1B gene_id:58490|Hs108|chr20 (326 aa)
initn: 2082 init1: 2082 opt: 2082 Z-score: 1763.6 bits: 334.6 E(32554): 6.5e-92
Smith-Waterman score: 2082; 100.0% identity (100.0% similar) in 326 aa overlap (1-326:1-326)
               10        20        30        40        50        60
pF1KB6 MSSFSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MSSFSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 YLANDVIQNSKRKGPEFTREFESVLVDAFSHVAREADEGCKKPLERLLNIWQERSVYGGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 YLANDVIQNSKRKGPEFTREFESVLVDAFSHVAREADEGCKKPLERLLNIWQERSVYGGE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 FIQQLKLSMEDSKSPPPKATEEKKSLKRTFQQIQEEEDDDYPGSYSPQDPSAGPLLTEEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FIQQLKLSMEDSKSPPPKATEEKKSLKRTFQQIQEEEDDDYPGSYSPQDPSAGPLLTEEL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB6 IKALQDLENAASGDATVRQKIASLPQEVQDVSLLEKITDKEAAERLSKTVDEACLLLAEY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 IKALQDLENAASGDATVRQKIASLPQEVQDVSLLEKITDKEAAERLSKTVDEACLLLAEY
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB6 NGRLAAELEDRRQLARMLVEYTQNQKDVLSEKEKKLEEYKQKLARVTQVRKELKSHIQSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 NGRLAAELEDRRQLARMLVEYTQNQKDVLSEKEKKLEEYKQKLARVTQVRKELKSHIQSL
              250       260       270       280       290       300

              310       320
pF1KB6 PDLSLLPNVTGGLAPLPSAGDLFSTD
       ::::::::::::::::::::::::::
CCDS13 PDLSLLPNVTGGLAPLPSAGDLFSTD
              310       320
>>CCDS77178.1 RPRD1A gene_id:55197|Hs108|chr18 (276 aa)
initn: 1078 init1: 657 opt: 729 Z-score: 628.1 bits: 124.3 E(32554): 1.1e-28
Smith-Waterman score: 1053; 61.2% identity (84.1% similar) in 276 aa overlap (51-326:15-276)
30 40 50 60 70 80
pF1KB6 QSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFLYLANDVIQNSKRKGPEFTRE
.: :::::::::::::::::::::::::..
CCDS77 MRNSNWRFCQTGIYSKPNRKLTFLYLANDVIQNSKRKGPEFTKD
10 20 30 40
90 100 110 120 130 140
pF1KB6 FESVLVDAFSHVAREADEGCKKPLERLLNIWQERSVYGGEFIQQLKLSMEDSKSPPPKAT
: :.:.::.::. :.::.::: : :.:.::.::::: .. ..::: .. .:.:
CCDS77 FAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLEQLKQALYGDKKPR----
50 60 70 80 90 100
150 160 170 180 190 200
pF1KB6 EEKKSLKRTFQQIQEEEDDDYPGSYSPQDPSAGPLLTEELIKALQDLENAASGDATVRQK
:::..::. .:... . ::..: : : .:..:::::::::::::.:.:.
CCDS77 ------KRTYEQIKVDENENCSSLGSPSEP---PQ-TLDLVRALQDLENAASGDAAVHQR
110 120 130 140 150
210 220 230 240 250 260
pF1KB6 IASLPQEVQDVSLLEKITDKEAAERLSKTVDEACLLLAEYNGRLAAELEDRRQLARMLVE
::::: :::.::::.::::::..::::: :..::.:::.::::::::..::.::.:::..
CCDS77 IASLPVEVQEVSLLDKITDKESGERLSKMVEDACMLLADYNGRLAAEIDDRKQLTRMLAD
160 170 180 190 200 210
270 280 290 300 310 320
pF1KB6 YTQNQKDVLSEKEKKLEEYKQKLARVTQVRKELKSHIQSLPDLSLLPNVTGGLAPLPSAG
. . ::..:.:::.::::::.:::::. :::::.:.:::::::: ::::::. :: ::
CCDS77 FLRCQKEALAEKEHKLEEYKRKLARVSLVRKELRSRIQSLPDLSRLPNVTGSHMHLPFAG
220 230 240 250 260 270
pF1KB6 DLFSTD
:..: :
CCDS77 DIYSED
>>CCDS11917.1 RPRD1A gene_id:55197|Hs108|chr18 (312 aa)
initn: 1371 init1: 668 opt: 680 Z-score: 586.2 bits: 116.7 E(32554): 2.5e-26
Smith-Waterman score: 1346; 65.6% identity (86.2% similar) in 326 aa overlap (1-326:1-312)
10 20 30 40 50 60
pF1KB6 MSSFSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFL
::.:::.:::::::::::::::::::::::::::::. :::.::.::::::: :::::::
CCDS11 MSAFSEAALEKKLSELSNSQQSVQTLSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 YLANDVIQNSKRKGPEFTREFESVLVDAFSHVAREADEGCKKPLERLLNIWQERSVYGGE
::::::::::::::::::..: :.:.::.::. :.::.::: : :.:.::.::::: ..
CCDS11 YLANDVIQNSKRKGPEFTKDFAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYEND
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 FIQQLKLSMEDSKSPPPKATEEKKSLKRTFQQIQEEEDDDYPGSYSPQDPSAGPLLTEEL
..::: .. .:.: :::..::. .:... . ::..: : : .:
CCDS11 VLEQLKQALYGDKKPR----------KRTYEQIKVDENENCSSLGSPSEP---PQ-TLDL
130 140 150 160
190 200 210 220 230 240
pF1KB6 IKALQDLENAASGDATVRQKIASLPQEVQDVSLLEKITDKEAAERLSKTVDEACLLLAEY
..:::::::::::::.:.:.::::: :::.::::.::::::..::::: :..::.:::.:
CCDS11 VRALQDLENAASGDAAVHQRIASLPVEVQEVSLLDKITDKESGERLSKMVEDACMLLADY
170 180 190 200 210 220
250 260 270 280 290 300
pF1KB6 NGRLAAELEDRRQLARMLVEYTQNQKDVLSEKEKKLEEYKQKLARVTQVRKELKSHIQSL
:::::::..::.::.:::... . ::..:.:::.::::::.:::::. :::::.:.::::
CCDS11 NGRLAAEIDDRKQLTRMLADFLRCQKEALAEKEHKLEEYKRKLARVSLVRKELRSRIQSL
230 240 250 260 270 280
310 320
pF1KB6 PDLSLLPNVTGGLAPLPSAGDLFSTD
:::: ::::::. :: :::..: :
CCDS11 PDLSRLPNVTGSHMHLPFAGDIYSED
290 300 310
>>CCDS44216.1 RPRD2 gene_id:23248|Hs108|chr1 (1461 aa)
initn: 346 init1: 218 opt: 373 Z-score: 318.6 bits: 69.4 E(32554): 2e-11
Smith-Waterman score: 375; 27.0% identity (61.5% similar) in 322 aa overlap (6-321:24-335)
10 20 30 40
pF1KB6 MSSFSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVS
::.:..:.. ..:...:.: :: : :...:: . ::
CCDS44 MAAGGGGGSSKASSSSASSAGALESSLDRKFQSVTNTMESIQGLSSWCIENKKHHSTIVY
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB6 VWHRELRKAKSNRKLTFLYLANDVIQNSKRKGPEFTRE-FESVLVDAFSHVAREADEGCK
: . ::.. ..:...::::::::: :::. . :: : .:: .: . : : . .
CCDS44 HWMKWLRRSAYPHRLNLFYLANDVIQNCKRKNAIIFRESFADVLPEAAALVK---DPSVS
70 80 90 100 110
110 120 130 140 150 160
pF1KB6 KPLERLLNIWQERSVYGGEFIQQLKLSMEDSKSPPPKATEEKKSLKRTFQQIQEEEDDDY
: .::...::..:.:: :.: :. .. . . .:.::..... ...
CCDS44 KSVERIFKIWEDRNVYPEEMIVALREALSTT-------FKTQKQLKENLNKQPNKQWKKS
120 130 140 150 160 170
170 180 190 200 210
pF1KB6 PGSYSPQDPSAGPLLTEELIKALQD---LENAASGDATVRQK-IASLPQEVQDVSLLEKI
: .:. . ...: .:: . : . . . ...: .... .: .. :. .
CCDS44 QTSTNPKAALKSKIVAEFRSQALIEELLLYKRSEDQIELKEKQLSTMRVDVCSTETLKCL
180 190 200 210 220 230
220 230 240 250 260 270
pF1KB6 TDKEAAERLSKTVDEACLLLAEYNGRLAAELEDRRQLARMLVEYTQNQKDVLSEKEKKLE
:: .....:: .:: : :. . : .... .:.. : . . .: . .
CCDS44 KDKTGGKKFSKEFEEASSKLEEFVNGLDKQVKNGPSLTEALENAGIFYEAQYKEVKVVAN
240 250 260 270 280 290
280 290 300 310 320
pF1KB6 EYKQKLARVTQVRKELKSHIQSLPDLSLLPNVTGGL-APLPSAGDLFSTD
:: ::....:.: . ..::: : . .. :: :....
CCDS44 AYKTFANRVNNLKKKLDQLKSTLPDPEESPVPSPSMDAPSPTGSESPFQGMGGEESQSPT
300 310 320 330 340 350
CCDS44 MESEKSATPEPVTDNRDVEDMELSDVEDDGSKIIVEDRKEKPAEKSAVSTSVPTKPTENI
360 370 380 390 400 410
326 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 20:59:14 2016 done: Fri Nov 4 20:59:15 2016
Total Scan time: 2.630 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
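
The E() values in this report can be cross-checked against the Z-scores printed with each alignment. The sketch below rests on assumptions that are not stated in the output itself: that the Z-scores are normalized to mean 50 and standard deviation 10, and that unrelated-sequence scores follow an extreme value distribution, so that P(Z >= z) = 1 - exp(-exp(-(pi/sqrt(6)) * (z - 50)/10 - 0.5772)) and E = N * P, with N = 32554 library sequences. The function name is illustrative and is not taken from the FASTA source.

```python
import math

EULER_GAMMA = 0.5772156649  # Euler-Mascheroni constant

def evalue_from_zscore(z, n_library_seqs):
    """Estimate E() from a Z-score.

    Assumption: z is scaled to mean 50, s.d. 10, and follows an
    extreme value (Gumbel) distribution for unrelated sequences.
    """
    z_std = (z - 50.0) / 10.0  # back to mean 0, s.d. 1
    x = math.exp(-(math.pi / math.sqrt(6.0)) * z_std - EULER_GAMMA)
    # 1 - exp(-x), computed with expm1 so tiny x does not round to 0
    p = -math.expm1(-x)
    return n_library_seqs * p

if __name__ == "__main__":
    # (Z-score, reported E()) pairs copied from the four hits in this report
    reported = [(1763.6, 6.5e-92), (628.1, 1.1e-28), (586.2, 2.5e-26), (318.6, 2e-11)]
    for z, e_reported in reported:
        print(z, evalue_from_zscore(z, 32554), e_reported)
```

Under these assumptions the computed values fall within a few percent of the reported E() for all four hits; the remaining differences are consistent with rounding of the printed Z-scores and E-values.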