FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7931, 365 aa
  1>>>pF1KB7931 365 - 365 aa - 365 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.4820+/-0.000813; mu= 5.6340+/- 0.049
 mean_var=253.2491+/-51.727, 0's: 0 Z-trim(116.8): 148 B-trim: 11 in 1/53
 Lambda= 0.080594
 statistics sampled from 17339 (17508) to 17339 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.538), width: 16
 Scan time: 2.530

The best scores are:                                opt  bits E(32554)
CCDS13168.1 VSX1 gene_id:30813|Hs108|chr20  ( 365) 2504 303.4   2e-82
CCDS58767.1 VSX1 gene_id:30813|Hs108|chr20  ( 301) 1866 229.2 3.7e-60
CCDS13169.1 VSX1 gene_id:30813|Hs108|chr20  ( 239) 1434 178.8 4.2e-45
CCDS58766.1 VSX1 gene_id:30813|Hs108|chr20  ( 236) 1429 178.2 6.2e-45
CCDS9827.1 VSX2 gene_id:338917|Hs108|chr14  ( 361)  764 101.1 1.6e-21

>>CCDS13168.1 VSX1 gene_id:30813|Hs108|chr20 (365 aa)
 initn: 2504 init1: 2504 opt: 2504 Z-score: 1593.3 bits: 303.4 E(32554): 2e-82
Smith-Waterman score: 2504; 100.0% identity (100.0% similar) in 365 aa overlap (1-365:1-365)

               10        20        30        40        50        60
pF1KB7 MTGRDSLSDGRTSSRALVPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEGPAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MTGRDSLSDGRTSSRALVPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEGPAV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 APCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCLLLADVPFLPPRGPEPAAPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 APCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCLLLADVPFLPPRGPEPAAPL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 APSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKASPTLGKRKKRRHRTVFTAHQLEELE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 APSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKASPTLGKRKKRRHRTVFTAHQLEELE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 KAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEYGLYGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 KAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEYGLYGA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 MVRHCIPLPDSVLNSAEGGLLGSCAPWLLGMHKKSMGMIRKPGSEDKLAGLWGSDHFKEG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MVRHCIPLPDSVLNSAEGGLLGSCAPWLLGMHKKSMGMIRKPGSEDKLAGLWGSDHFKEG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 SSQSESGSQRGSDKVSPENGLEDVAIDLSSSARQETKKVHPGAGAQGGSNSTALEGPQPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SSQSESGSQRGSDKVSPENGLEDVAIDLSSSARQETKKVHPGAGAQGGSNSTALEGPQPG
              310       320       330       340       350       360

pF1KB7 KVGAT
       :::::
CCDS13 KVGAT

>>CCDS58767.1 VSX1 gene_id:30813|Hs108|chr20 (301 aa)
 initn: 1866 init1: 1866 opt: 1866 Z-score: 1193.4 bits: 229.2 E(32554): 3.7e-60
Smith-Waterman score: 1866; 100.0% identity (100.0% similar) in 269 aa overlap (1-269:1-269)

               10        20        30        40        50        60
pF1KB7 MTGRDSLSDGRTSSRALVPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEGPAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MTGRDSLSDGRTSSRALVPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEGPAV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 APCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCLLLADVPFLPPRGPEPAAPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 APCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCLLLADVPFLPPRGPEPAAPL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 APSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKASPTLGKRKKRRHRTVFTAHQLEELE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 APSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKASPTLGKRKKRRHRTVFTAHQLEELE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 KAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEYGLYGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 KAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEYGLYGA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 MVRHCIPLPDSVLNSAEGGLLGSCAPWLLGMHKKSMGMIRKPGSEDKLAGLWGSDHFKEG
       :::::::::::::::::::::::::::::
CCDS58 MVRHCIPLPDSVLNSAEGGLLGSCAPWLLVQTSAPGGSRSLDFAGDTQAPQTPWWCLMTF
              250       260       270       280       290       300

>>CCDS13169.1 VSX1 gene_id:30813|Hs108|chr20 (239 aa)
 initn: 1434 init1: 1434 opt: 1434 Z-score: 923.2 bits: 178.8 E(32554): 4.2e-45
Smith-Waterman score: 1434; 100.0% identity (100.0% similar) in 210 aa overlap (1-210:1-210)

               10        20        30        40        50        60
pF1KB7 MTGRDSLSDGRTSSRALVPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEGPAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MTGRDSLSDGRTSSRALVPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEGPAV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 APCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCLLLADVPFLPPRGPEPAAPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 APCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCLLLADVPFLPPRGPEPAAPL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 APSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKASPTLGKRKKRRHRTVFTAHQLEELE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 APSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKASPTLGKRKKRRHRTVFTAHQLEELE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 KAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEYGLYGA
       ::::::::::::::::::::::::::::::
CCDS13 KAFSEAHYPDVYAREMLAVKTELPEDRIQVSGVPFLRSKDTTENVSFPHSVSQSAVPSL
              190       200       210       220       230

>>CCDS58766.1 VSX1 gene_id:30813|Hs108|chr20 (236 aa)
 initn: 1429 init1: 1429 opt: 1429 Z-score: 920.1 bits: 178.2 E(32554): 6.2e-45
Smith-Waterman score: 1429; 100.0% identity (100.0% similar) in 209 aa overlap (1-209:1-209)

               10        20        30        40        50        60
pF1KB7 MTGRDSLSDGRTSSRALVPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEGPAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MTGRDSLSDGRTSSRALVPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEGPAV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 APCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCLLLADVPFLPPRGPEPAAPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 APCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCLLLADVPFLPPRGPEPAAPL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 APSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKASPTLGKRKKRRHRTVFTAHQLEELE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 APSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKASPTLGKRKKRRHRTVFTAHQLEELE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 KAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEYGLYGA
       :::::::::::::::::::::::::::::
CCDS58 KAFSEAHYPDVYAREMLAVKTELPEDRIQCKLLLLEAPVHWTLQETHRLPRPRGGA
              190       200       210       220       230

>>CCDS9827.1 VSX2 gene_id:338917|Hs108|chr14 (361 aa)
 initn: 776 init1: 701 opt: 764 Z-score: 500.0 bits: 101.1 E(32554): 1.6e-21
Smith-Waterman score: 816; 45.8% identity (65.6% similar) in 369 aa overlap (1-359:1-337)

                 10         20        30        40        50
pF1KB7 MTGR--DSLSDGRTSSRAL-VPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEG
       :::.  ..::  .. . :  . ::.:  .:  ::.: ..:::. : :  . :   .. .:
CCDS98 MTGKAGEALSKPKSETVAKSTSGGAP--ARCTGFGIQEILGLNKE-PPSSHPR--AALDG
               10        20          30        40         50

        60        70        80        90       100        110
pF1KB7 PAVAPCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCL-LLADVPFLPPRGPEP
        :    ::  : . :.   :   :.::: : :  :   ..   : .:.:     :.. .
CCDS98 LA----PGHLLAARSVLSPAGVGGMGLL-GPGGLPGFYTQPTFLEVLSD-----PQSVH-
              60        70         80        90            100

        120       130       140       150        160       170
pF1KB7 AAPLAPSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKAS-PTLGKRKKRRHRTVFTAHQ
         ::. .  :        :...:...:: .: ::.  :..    .:::::::::.::..:
CCDS98 LQPLGRASGP-----LDTSQTASSDSEDVSSSDRKMSKSALNQTKKRKKRRHRTIFTSYQ
          110            120       130       140       150

         180       190       200       210       220       230
pF1KB7 LEELEKAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEY
       ::::::::.::::::::::::::.::::::::::::::::::::::::: :: :::::::
CCDS98 LEELEKAFNEAHYPDVYAREMLAMKTELPEDRIQVWFQNRRAKWRKREKCWGRSSVMAEY
     160       170       180       190       200       210

         240       250       260       270           280        290
pF1KB7 GLYGAMVRHCIPLPDSVLNSAEGGLLGSCAPWLLGMHKKSMGMI----RKP-GSEDKLAG
       ::::::::: ::::.:.:.::. :.. :::::::::::::.       ::: : .. :
CCDS98 GLYGAMVRHSIPLPESILKSAKDGIMDSCAPWLLGMHKKSLEAAAESGRKPEGERQALPK
     220       230       240       250       260       270

              300       310       320       330       340       350
pF1KB7 LWGSDHFKEGSSQSESGSQRGSDKVSPENGLEDVAIDLSSSARQETKKVHPGAGAQGGSN
       :   :...    :.: : . ..  .: :.  :.    : ..:.... ::    :. .: .
CCDS98 L---DKME----QDERGPD-AQAAISQEELRENSIAVLRAKAQEHSTKV---LGTVSGPD
     280              290        300       310       320

              360
pF1KB7 STALEGPQPGKVGAT
       : :    .:
CCDS98 SLARSTEKPEEEEAMDEDRPAERLSPPQLEDMA
      330       340       350       360

365 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 01:38:53 2016 done: Tue Nov 8 01:38:53 2016
 Total Scan time: 2.530 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]