FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2647, 492 aa
1>>>pF1KE2647 492 - 492 aa - 492 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.1060+/-0.000937; mu= 10.3025+/- 0.057
mean_var=290.7228+/-60.656, 0's: 0 Z-trim(114.3): 37 B-trim: 0 in 0/55
Lambda= 0.075220
statistics sampled from 14798 (14830) to 14798 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.737), E-opt: 0.2 (0.456), width: 16
Scan time: 4.000
The best scores are: opt bits E(32554)
CCDS12679.1 NOVA2 gene_id:4858|Hs108|chr19 ( 492) 3120 351.9 9.4e-97
CCDS32060.1 NOVA1 gene_id:4857|Hs108|chr14 ( 483) 1735 201.6 1.6e-51
CCDS32061.1 NOVA1 gene_id:4857|Hs108|chr14 ( 507) 1281 152.3 1.1e-36
CCDS9635.1 NOVA1 gene_id:4857|Hs108|chr14 ( 181) 717 90.5 1.6e-18
>>CCDS12679.1 NOVA2 gene_id:4858|Hs108|chr19 (492 aa)
initn: 3120 init1: 3120 opt: 3120 Z-score: 1850.5 bits: 351.9 E(32554): 9.4e-97
Smith-Waterman score: 3120; 100.0% identity (100.0% similar) in 492 aa overlap (1-492:1-492)
10 20 30 40 50 60
pF1KE2 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYAAGSIIGKGGQTIVQLQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYAAGSIIGKGGQTIVQLQK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 ETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKVREIPQAMTKPEVVNILQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 ETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKVREIPQAMTKPEVVNILQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 PQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAWVQLSQKPEGINLQERVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAWVQLSQKPEGINLQERVV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 TVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVANSNPTGSPYASPADVLPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 TVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVANSNPTGSPYASPADVLPA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 AAAASAAAASGLLGPAGLAGVGAFPAALPAFSGTDLLAISTALNTLASYGYNTNSLGLGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 AAAASAAAASGLLGPAGLAGVGAFPAALPAFSGTDLLAISTALNTLASYGYNTNSLGLGL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE2 NSAAASGVLAAVAAGANPAAAAAANLLASYAGEAGAGPAGGAAPPPPPPPGALGSFALAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 NSAAASGVLAAVAAGANPAAAAAANLLASYAGEAGAGPAGGAAPPPPPPPGALGSFALAA
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE2 AANGYLGAGAGGGAGGGGGPLVAAAAAAGAAGGFLTAEKLAAESAKELVEIAVPENLVGA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 AANGYLGAGAGGGAGGGGGPLVAAAAAAGAAGGFLTAEKLAAESAKELVEIAVPENLVGA
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE2 ILGKGGKTLVEYQELTGARIQISKKGEFLPGTRNRRVTITGSPAATQAAQYLISQRVTYE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 ILGKGGKTLVEYQELTGARIQISKKGEFLPGTRNRRVTITGSPAATQAAQYLISQRVTYE
430 440 450 460 470 480
490
pF1KE2 QGVRASNPQKVG
::::::::::::
CCDS12 QGVRASNPQKVG
490
>>CCDS32060.1 NOVA1 gene_id:4857|Hs108|chr14 (483 aa)
initn: 1959 init1: 1341 opt: 1735 Z-score: 1038.3 bits: 201.6 E(32554): 1.6e-51
Smith-Waterman score: 2245; 75.2% identity (89.4% similar) in 492 aa overlap (4-492:21-483)
10 20 30 40
pF1KE2 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYA
. :::::::::.:::. :::.::::.:.:::::::::::
CCDS32 MMAAAPIQQNGTHTGVPIDLDPPDSRKRPLEAPPEAGSTKRTNTGEDGQYFLKVLIPSYA
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE2 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKV
:::::::::::::::::::::::::::::::::::::::::.:::.:::::::.:::::.
CCDS32 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLIQGTVEALNAVHGFIAEKI
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE2 REIPQAMTKPEVVNILQPQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAW
::.:: ..: : :.:::::::.:::: ::.:.::::::::::::::::::::::::::::
CCDS32 REMPQNVAKTEPVSILQPQTTVNPDRIKQVKIIVPNSTAGLIIGKGGATVKAVMEQSGAW
130 140 150 160 170 180
170 180 190 200 210 220
pF1KE2 VQLSQKPEGINLQERVVTVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVAN
:::::::.::::::::::::::::: .::: :.::.::::::.::::::::::.:::::
CCDS32 VQLSQKPDGINLQERVVTVSGEPEQNRKAVELIIQKIQEDPQSGSCLNISYANVTGPVAN
190 200 210 220 230 240
230 240 250 260 270 280
pF1KE2 SNPTGSPYASPADVLPAAAAASAAAASGLLGPAGLAGVGAFPAALPAFSGTDLLAISTAL
:::::::::. :.::: .::::.:::: :.::::.::::.: .:.:.::.::..::
CCDS32 SNPTGSPYANTAEVLP-----TAAAAAGLLGHANLAGVAAFPAVLSGFTGNDLVAITSAL
250 260 270 280 290
290 300 310 320 330 340
pF1KE2 NTLASYGYNTNSLGLGLNSAAASGVLAAVAAGANPAAAAAANLLASYAGEAGAG--PAGG
::::::::: :.:::::..:::.:.:::.::.:::::::: ::::.::.::.:. :::
CCDS32 NTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAA-NLLATYASEASASGSTAGG
300 310 320 330 340 350
350 360 370 380 390 400
pF1KE2 AAPPPPPPPGALGSFALAAAA-NGYLGAGAGGGAGGGGGPLVAAAAAAGAAGGFLTAEKL
.: ::::.: :.:: :::.::.. ::.:.: .: .::
CCDS32 TAGTF-----ALGSLAAATAATNGYFGAAS---------PLAASA--------ILGTEK-
360 370 380 390
410 420 430 440 450 460
pF1KE2 AAESAKELVEIAVPENLVGAILGKGGKTLVEYQELTGARIQISKKGEFLPGTRNRRVTIT
.....:..::::::::::::::::::::::::::::::::::::::::.::::::.::::
CCDS32 STDGSKDVVEIAVPENLVGAILGKGGKTLVEYQELTGARIQISKKGEFVPGTRNRKVTIT
400 410 420 430 440 450
470 480 490
pF1KE2 GSPAATQAAQYLISQRVTYEQGVRASNPQKVG
:.:::::::::::.::.::::::::.::::::
CCDS32 GTPAATQAAQYLITQRITYEQGVRAANPQKVG
460 470 480
>>CCDS32061.1 NOVA1 gene_id:4857|Hs108|chr14 (507 aa)
initn: 1573 init1: 715 opt: 1281 Z-score: 771.8 bits: 152.3 E(32554): 1.1e-36
Smith-Waterman score: 2187; 71.7% identity (85.3% similar) in 516 aa overlap (4-492:21-507)
10 20 30 40
pF1KE2 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYA
. :::::::::.:::. :::.::::.:.:::::::::::
CCDS32 MMAAAPIQQNGTHTGVPIDLDPPDSRKRPLEAPPEAGSTKRTNTGEDGQYFLKVLIPSYA
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE2 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKV
:::::::::::::::::::::::::::::::::::::::::.:::.:::::::.:::::.
CCDS32 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLIQGTVEALNAVHGFIAEKI
70 80 90 100 110 120
110 120 130
pF1KE2 REIPQAMTKPEVVNILQPQTTMNPDRAKQA------------------------KLIVPN
::.:: ..: : :.:::::::.:::: ::. :.::::
CCDS32 REMPQNVAKTEPVSILQPQTTVNPDRIKQTLPSSPTTTKSSPSDPMTTSRANQVKIIVPN
130 140 150 160 170 180
140 150 160 170 180 190
pF1KE2 STAGLIIGKGGATVKAVMEQSGAWVQLSQKPEGINLQERVVTVSGEPEQVHKAVSAIVQK
:::::::::::::::::::::::::::::::.::::::::::::::::: .::: :.::
CCDS32 STAGLIIGKGGATVKAVMEQSGAWVQLSQKPDGINLQERVVTVSGEPEQNRKAVELIIQK
190 200 210 220 230 240
200 210 220 230 240 250
pF1KE2 VQEDPQSSSCLNISYANVAGPVANSNPTGSPYASPADVLPAAAAASAAAASGLLGPAGLA
.::::::.::::::::::.::::::::::::::. :.::: .::::.:::: :.::
CCDS32 IQEDPQSGSCLNISYANVTGPVANSNPTGSPYANTAEVLP-----TAAAAAGLLGHANLA
250 260 270 280 290
260 270 280 290 300 310
pF1KE2 GVGAFPAALPAFSGTDLLAISTALNTLASYGYNTNSLGLGLNSAAASGVLAAVAAGANPA
::.::::.: .:.:.::.::..::::::::::: :.:::::..:::.:.:::.::.::::
CCDS32 GVAAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPA
300 310 320 330 340 350
320 330 340 350 360 370
pF1KE2 AAAAANLLASYAGEAGAG--PAGGAAPPPPPPPGALGSFALAAAA-NGYLGAGAGGGAGG
:::: ::::.::.::.:. :::.: ::::.: :.:: :::.::..
CCDS32 AAAA-NLLATYASEASASGSTAGGTAGTF-----ALGSLAAATAATNGYFGAAS------
360 370 380 390 400
380 390 400 410 420 430
pF1KE2 GGGPLVAAAAAAGAAGGFLTAEKLAAESAKELVEIAVPENLVGAILGKGGKTLVEYQELT
::.:.: .: .:: .....:..::::::::::::::::::::::::::::
CCDS32 ---PLAASA--------ILGTEK-STDGSKDVVEIAVPENLVGAILGKGGKTLVEYQELT
410 420 430 440 450
440 450 460 470 480 490
pF1KE2 GARIQISKKGEFLPGTRNRRVTITGSPAATQAAQYLISQRVTYEQGVRASNPQKVG
::::::::::::.::::::.:::::.:::::::::::.::.::::::::.::::::
CCDS32 GARIQISKKGEFVPGTRNRKVTITGTPAATQAAQYLITQRITYEQGVRAANPQKVG
460 470 480 490 500
>>CCDS9635.1 NOVA1 gene_id:4857|Hs108|chr14 (181 aa)
initn: 710 init1: 710 opt: 717 Z-score: 445.9 bits: 90.5 E(32554): 1.6e-18
Smith-Waterman score: 717; 69.6% identity (87.0% similar) in 161 aa overlap (4-164:21-180)
10 20 30 40
pF1KE2 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYA
. :::::::::.:::. :::.::::.:.:::::::::::
CCDS96 MMAAAPIQQNGTHTGVPIDLDPPDSRKRPLEAPPEAGSTKRTNTGEDGQYFLKVLIPSYA
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE2 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKV
:::::::::::::::::::::::::::::::::::::::::.:::.:::::::.:::::.
CCDS96 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLIQGTVEALNAVHGFIAEKI
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE2 REIPQAMTKPEVVNILQPQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAW
::.:: ..: : :.:::::::.:::: ::. :..: . . .: .: .... .:
CCDS96 REMPQNVAKTEPVSILQPQTTVNPDRIKQTLPSSPTTTKSSP-SDPMTTSRANQKHNISW
130 140 150 160 170
170 180 190 200 210 220
pF1KE2 VQLSQKPEGINLQERVVTVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVAN
.
CCDS96 IS
180
492 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 11 17:00:51 2016 done: Fri Nov 11 17:00:52 2016
Total Scan time: 4.000 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]