FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2647, 492 aa 1>>>pF1KE2647 492 - 492 aa - 492 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.1060+/-0.000937; mu= 10.3025+/- 0.057 mean_var=290.7228+/-60.656, 0's: 0 Z-trim(114.3): 37 B-trim: 0 in 0/55 Lambda= 0.075220 statistics sampled from 14798 (14830) to 14798 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.737), E-opt: 0.2 (0.456), width: 16 Scan time: 4.000 The best scores are: opt bits E(32554) CCDS12679.1 NOVA2 gene_id:4858|Hs108|chr19 ( 492) 3120 351.9 9.4e-97 CCDS32060.1 NOVA1 gene_id:4857|Hs108|chr14 ( 483) 1735 201.6 1.6e-51 CCDS32061.1 NOVA1 gene_id:4857|Hs108|chr14 ( 507) 1281 152.3 1.1e-36 CCDS9635.1 NOVA1 gene_id:4857|Hs108|chr14 ( 181) 717 90.5 1.6e-18 >>CCDS12679.1 NOVA2 gene_id:4858|Hs108|chr19 (492 aa) initn: 3120 init1: 3120 opt: 3120 Z-score: 1850.5 bits: 351.9 E(32554): 9.4e-97 Smith-Waterman score: 3120; 100.0% identity (100.0% similar) in 492 aa overlap (1-492:1-492) 10 20 30 40 50 60 pF1KE2 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYAAGSIIGKGGQTIVQLQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYAAGSIIGKGGQTIVQLQK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 ETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKVREIPQAMTKPEVVNILQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 ETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKVREIPQAMTKPEVVNILQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 PQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAWVQLSQKPEGINLQERVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAWVQLSQKPEGINLQERVV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 TVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVANSNPTGSPYASPADVLPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 TVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVANSNPTGSPYASPADVLPA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 AAAASAAAASGLLGPAGLAGVGAFPAALPAFSGTDLLAISTALNTLASYGYNTNSLGLGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AAAASAAAASGLLGPAGLAGVGAFPAALPAFSGTDLLAISTALNTLASYGYNTNSLGLGL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 NSAAASGVLAAVAAGANPAAAAAANLLASYAGEAGAGPAGGAAPPPPPPPGALGSFALAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 NSAAASGVLAAVAAGANPAAAAAANLLASYAGEAGAGPAGGAAPPPPPPPGALGSFALAA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 AANGYLGAGAGGGAGGGGGPLVAAAAAAGAAGGFLTAEKLAAESAKELVEIAVPENLVGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AANGYLGAGAGGGAGGGGGPLVAAAAAAGAAGGFLTAEKLAAESAKELVEIAVPENLVGA 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 ILGKGGKTLVEYQELTGARIQISKKGEFLPGTRNRRVTITGSPAATQAAQYLISQRVTYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 ILGKGGKTLVEYQELTGARIQISKKGEFLPGTRNRRVTITGSPAATQAAQYLISQRVTYE 430 440 450 460 470 480 490 pF1KE2 QGVRASNPQKVG :::::::::::: CCDS12 QGVRASNPQKVG 490 >>CCDS32060.1 NOVA1 gene_id:4857|Hs108|chr14 (483 aa) initn: 1959 init1: 1341 opt: 1735 Z-score: 1038.3 bits: 201.6 E(32554): 1.6e-51 Smith-Waterman score: 2245; 75.2% identity (89.4% similar) in 492 aa overlap (4-492:21-483) 10 20 30 40 pF1KE2 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYA . :::::::::.:::. :::.::::.:.::::::::::: CCDS32 MMAAAPIQQNGTHTGVPIDLDPPDSRKRPLEAPPEAGSTKRTNTGEDGQYFLKVLIPSYA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE2 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKV :::::::::::::::::::::::::::::::::::::::::.:::.:::::::.:::::. CCDS32 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLIQGTVEALNAVHGFIAEKI 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE2 REIPQAMTKPEVVNILQPQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAW ::.:: ..: : :.:::::::.:::: ::.:.:::::::::::::::::::::::::::: CCDS32 REMPQNVAKTEPVSILQPQTTVNPDRIKQVKIIVPNSTAGLIIGKGGATVKAVMEQSGAW 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE2 VQLSQKPEGINLQERVVTVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVAN :::::::.::::::::::::::::: .::: :.::.::::::.::::::::::.::::: CCDS32 VQLSQKPDGINLQERVVTVSGEPEQNRKAVELIIQKIQEDPQSGSCLNISYANVTGPVAN 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE2 SNPTGSPYASPADVLPAAAAASAAAASGLLGPAGLAGVGAFPAALPAFSGTDLLAISTAL :::::::::. :.::: .::::.:::: :.::::.::::.: .:.:.::.::..:: CCDS32 SNPTGSPYANTAEVLP-----TAAAAAGLLGHANLAGVAAFPAVLSGFTGNDLVAITSAL 250 260 270 280 290 290 300 310 320 330 340 pF1KE2 NTLASYGYNTNSLGLGLNSAAASGVLAAVAAGANPAAAAAANLLASYAGEAGAG--PAGG ::::::::: :.:::::..:::.:.:::.::.:::::::: ::::.::.::.:. ::: CCDS32 NTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAA-NLLATYASEASASGSTAGG 300 310 320 330 340 350 350 360 370 380 390 400 pF1KE2 AAPPPPPPPGALGSFALAAAA-NGYLGAGAGGGAGGGGGPLVAAAAAAGAAGGFLTAEKL .: ::::.: :.:: :::.::.. ::.:.: .: .:: CCDS32 TAGTF-----ALGSLAAATAATNGYFGAAS---------PLAASA--------ILGTEK- 360 370 380 390 410 420 430 440 450 460 pF1KE2 AAESAKELVEIAVPENLVGAILGKGGKTLVEYQELTGARIQISKKGEFLPGTRNRRVTIT .....:..::::::::::::::::::::::::::::::::::::::::.::::::.:::: CCDS32 STDGSKDVVEIAVPENLVGAILGKGGKTLVEYQELTGARIQISKKGEFVPGTRNRKVTIT 400 410 420 430 440 450 470 480 490 pF1KE2 GSPAATQAAQYLISQRVTYEQGVRASNPQKVG :.:::::::::::.::.::::::::.:::::: CCDS32 GTPAATQAAQYLITQRITYEQGVRAANPQKVG 460 470 480 >>CCDS32061.1 NOVA1 gene_id:4857|Hs108|chr14 (507 aa) initn: 1573 init1: 715 opt: 1281 Z-score: 771.8 bits: 152.3 E(32554): 1.1e-36 Smith-Waterman score: 2187; 71.7% identity (85.3% similar) in 516 aa overlap (4-492:21-507) 10 20 30 40 pF1KE2 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYA . :::::::::.:::. :::.::::.:.::::::::::: CCDS32 MMAAAPIQQNGTHTGVPIDLDPPDSRKRPLEAPPEAGSTKRTNTGEDGQYFLKVLIPSYA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE2 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKV :::::::::::::::::::::::::::::::::::::::::.:::.:::::::.:::::. CCDS32 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLIQGTVEALNAVHGFIAEKI 70 80 90 100 110 120 110 120 130 pF1KE2 REIPQAMTKPEVVNILQPQTTMNPDRAKQA------------------------KLIVPN ::.:: ..: : :.:::::::.:::: ::. :.:::: CCDS32 REMPQNVAKTEPVSILQPQTTVNPDRIKQTLPSSPTTTKSSPSDPMTTSRANQVKIIVPN 130 140 150 160 170 180 140 150 160 170 180 190 pF1KE2 STAGLIIGKGGATVKAVMEQSGAWVQLSQKPEGINLQERVVTVSGEPEQVHKAVSAIVQK :::::::::::::::::::::::::::::::.::::::::::::::::: .::: :.:: CCDS32 STAGLIIGKGGATVKAVMEQSGAWVQLSQKPDGINLQERVVTVSGEPEQNRKAVELIIQK 190 200 210 220 230 240 200 210 220 230 240 250 pF1KE2 VQEDPQSSSCLNISYANVAGPVANSNPTGSPYASPADVLPAAAAASAAAASGLLGPAGLA .::::::.::::::::::.::::::::::::::. :.::: .::::.:::: :.:: CCDS32 IQEDPQSGSCLNISYANVTGPVANSNPTGSPYANTAEVLP-----TAAAAAGLLGHANLA 250 260 270 280 290 260 270 280 290 300 310 pF1KE2 GVGAFPAALPAFSGTDLLAISTALNTLASYGYNTNSLGLGLNSAAASGVLAAVAAGANPA ::.::::.: .:.:.::.::..::::::::::: :.:::::..:::.:.:::.::.:::: CCDS32 GVAAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPA 300 310 320 330 340 350 320 330 340 350 360 370 pF1KE2 AAAAANLLASYAGEAGAG--PAGGAAPPPPPPPGALGSFALAAAA-NGYLGAGAGGGAGG :::: ::::.::.::.:. :::.: ::::.: :.:: :::.::.. CCDS32 AAAA-NLLATYASEASASGSTAGGTAGTF-----ALGSLAAATAATNGYFGAAS------ 360 370 380 390 400 380 390 400 410 420 430 pF1KE2 GGGPLVAAAAAAGAAGGFLTAEKLAAESAKELVEIAVPENLVGAILGKGGKTLVEYQELT ::.:.: .: .:: .....:..:::::::::::::::::::::::::::: CCDS32 ---PLAASA--------ILGTEK-STDGSKDVVEIAVPENLVGAILGKGGKTLVEYQELT 410 420 430 440 450 440 450 460 470 480 490 pF1KE2 GARIQISKKGEFLPGTRNRRVTITGSPAATQAAQYLISQRVTYEQGVRASNPQKVG ::::::::::::.::::::.:::::.:::::::::::.::.::::::::.:::::: CCDS32 GARIQISKKGEFVPGTRNRKVTITGTPAATQAAQYLITQRITYEQGVRAANPQKVG 460 470 480 490 500 >>CCDS9635.1 NOVA1 gene_id:4857|Hs108|chr14 (181 aa) initn: 710 init1: 710 opt: 717 Z-score: 445.9 bits: 90.5 E(32554): 1.6e-18 Smith-Waterman score: 717; 69.6% identity (87.0% similar) in 161 aa overlap (4-164:21-180) 10 20 30 40 pF1KE2 MEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYA . :::::::::.:::. :::.::::.:.::::::::::: CCDS96 MMAAAPIQQNGTHTGVPIDLDPPDSRKRPLEAPPEAGSTKRTNTGEDGQYFLKVLIPSYA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE2 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKV :::::::::::::::::::::::::::::::::::::::::.:::.:::::::.:::::. CCDS96 AGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLIQGTVEALNAVHGFIAEKI 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE2 REIPQAMTKPEVVNILQPQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAW ::.:: ..: : :.:::::::.:::: ::. :..: . . .: .: .... .: CCDS96 REMPQNVAKTEPVSILQPQTTVNPDRIKQTLPSSPTTTKSSP-SDPMTTSRANQKHNISW 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 VQLSQKPEGINLQERVVTVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVAN . CCDS96 IS 180 492 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 11 17:00:51 2016 done: Fri Nov 11 17:00:52 2016 Total Scan time: 4.000 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]