FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7616, 361 aa 1>>>pF1KB7616 361 - 361 aa - 361 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.6833+/-0.000742; mu= 12.1742+/- 0.045 mean_var=145.2232+/-29.780, 0's: 0 Z-trim(113.8): 185 B-trim: 10 in 1/51 Lambda= 0.106428 statistics sampled from 14168 (14379) to 14168 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.776), E-opt: 0.2 (0.442), width: 16 Scan time: 3.100 The best scores are: opt bits E(32554) CCDS9827.1 VSX2 gene_id:338917|Hs108|chr14 ( 361) 2387 377.6 9.1e-105 CCDS13168.1 VSX1 gene_id:30813|Hs108|chr20 ( 365) 764 128.4 9.5e-30 CCDS58767.1 VSX1 gene_id:30813|Hs108|chr20 ( 301) 697 118.0 1e-26 CCDS31468.1 ALX4 gene_id:60529|Hs108|chr11 ( 411) 396 71.9 1.1e-12 CCDS819.1 ALX3 gene_id:257|Hs108|chr1 ( 343) 357 65.9 5.9e-11 >>CCDS9827.1 VSX2 gene_id:338917|Hs108|chr14 (361 aa) initn: 2387 init1: 2387 opt: 2387 Z-score: 1994.4 bits: 377.6 E(32554): 9.1e-105 Smith-Waterman score: 2387; 100.0% identity (100.0% similar) in 361 aa overlap (1-361:1-361) 10 20 30 40 50 60 pF1KB7 MTGKAGEALSKPKSETVAKSTSGGAPARCTGFGIQEILGLNKEPPSSHPRAALDGLAPGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 MTGKAGEALSKPKSETVAKSTSGGAPARCTGFGIQEILGLNKEPPSSHPRAALDGLAPGH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 LLAARSVLSPAGVGGMGLLGPGGLPGFYTQPTFLEVLSDPQSVHLQPLGRASGPLDTSQT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 LLAARSVLSPAGVGGMGLLGPGGLPGFYTQPTFLEVLSDPQSVHLQPLGRASGPLDTSQT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 ASSDSEDVSSSDRKMSKSALNQTKKRKKRRHRTIFTSYQLEELEKAFNEAHYPDVYAREM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 ASSDSEDVSSSDRKMSKSALNQTKKRKKRRHRTIFTSYQLEELEKAFNEAHYPDVYAREM 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 LAMKTELPEDRIQVWFQNRRAKWRKREKCWGRSSVMAEYGLYGAMVRHSIPLPESILKSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 LAMKTELPEDRIQVWFQNRRAKWRKREKCWGRSSVMAEYGLYGAMVRHSIPLPESILKSA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 KDGIMDSCAPWLLGMHKKSLEAAAESGRKPEGERQALPKLDKMEQDERGPDAQAAISQEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 KDGIMDSCAPWLLGMHKKSLEAAAESGRKPEGERQALPKLDKMEQDERGPDAQAAISQEE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 LRENSIAVLRAKAQEHSTKVLGTVSGPDSLARSTEKPEEEEAMDEDRPAERLSPPQLEDM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 LRENSIAVLRAKAQEHSTKVLGTVSGPDSLARSTEKPEEEEAMDEDRPAERLSPPQLEDM 310 320 330 340 350 360 pF1KB7 A : CCDS98 A >>CCDS13168.1 VSX1 gene_id:30813|Hs108|chr20 (365 aa) initn: 776 init1: 701 opt: 764 Z-score: 647.5 bits: 128.4 E(32554): 9.5e-30 Smith-Waterman score: 816; 46.1% identity (65.3% similar) in 369 aa overlap (1-337:1-359) 10 20 30 40 50 pF1KB7 MTGKAGEALSKPKSETVAKSTSGGAP--ARCTGFGIQEILGLNKE-PPSSHPR--AALDG :::. ..:: .. . : . ::.: .: ::.: ..:::. : : . : .. .: CCDS13 MTGR--DSLSDGRTSSRAL-VPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEG 10 20 30 40 50 60 70 80 90 100 pF1KB7 LA----PGHLLAARSVLSPAGVGGMGLL-GPGGLPGFYTQPTFLEVLSD-----PQSVH- : :: : . :. : :.::: : : : .. : .:.: :.. . CCDS13 PAVAPCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCL-LLADVPFLPPRGPEP 60 70 80 90 100 110 110 120 130 140 150 pF1KB7 LQPLGRASGP-----LDTSQTASSDSEDVSSSDRKMSKSALNQTKKRKKRRHRTIFTSYQ ::. . : :...:...:: .: ::. : : :::::::::.::..: CCDS13 AAPLAPSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLK-ASPTLGKRKKRRHRTVFTAHQ 120 130 140 150 160 170 160 170 180 190 200 210 pF1KB7 LEELEKAFNEAHYPDVYAREMLAMKTELPEDRIQVWFQNRRAKWRKREKCWGRSSVMAEY ::::::::.::::::::::::::.::::::::::::::::::::::::: :: ::::::: CCDS13 LEELEKAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEY 180 190 200 210 220 230 220 230 240 250 260 270 pF1KB7 GLYGAMVRHSIPLPESILKSAKDGIMDSCAPWLLGMHKKSLEAAAESGRKPEGERQALPK ::::::::: ::::.:.:.::. :.. :::::::::::::. ::: : .. : CCDS13 GLYGAMVRHCIPLPDSVLNSAEGGLLGSCAPWLLGMHKKSMGMI----RKP-GSEDKLAG 240 250 260 270 280 290 280 290 300 310 320 pF1KB7 L---DKME----QDERGPD-AQAAISQEELRENSIAVLRAKAQEHSTKV---LGTVSGPD : :... :.: : . .. .: :. :. : ..:.... :: :. .: . CCDS13 LWGSDHFKEGSSQSESGSQRGSDKVSPENGLEDVAIDLSSSARQETKKVHPGAGAQGGSN 300 310 320 330 340 350 330 340 350 360 pF1KB7 SLARSTEKPEEEEAMDEDRPAERLSPPQLEDMA : : .: CCDS13 STALEGPQPGKVGAT 360 >>CCDS58767.1 VSX1 gene_id:30813|Hs108|chr20 (301 aa) initn: 731 init1: 656 opt: 697 Z-score: 593.0 bits: 118.0 E(32554): 1e-26 Smith-Waterman score: 749; 51.5% identity (69.7% similar) in 274 aa overlap (1-253:1-269) 10 20 30 40 50 pF1KB7 MTGKAGEALSKPKSETVAKSTSGGAP--ARCTGFGIQEILGLNKE-PPSSHPR--AALDG :::. ..:: .. . : . ::.: .: ::.: ..:::. : : . : .. .: CCDS58 MTGR--DSLSDGRTSSRAL-VPGGSPRGSRPRGFAITDLLGLEAELPAPAGPGQGSGCEG 10 20 30 40 50 60 70 80 90 100 pF1KB7 LA----PGHLLAARSVLSPAGVGGMGLL-GPGGLPGFYTQPTFLEVLSD-----PQSVH- : :: : . :. : :.::: : : : .. : .:.: :.. . CCDS58 PAVAPCPGPGLDGSSLARGALPLGLGLLCGFGTQPPAAARAPCL-LLADVPFLPPRGPEP 60 70 80 90 100 110 110 120 130 140 150 pF1KB7 LQPLGRASGP-----LDTSQTASSDSEDVSSSDRKMSKSALNQTKKRKKRRHRTIFTSYQ ::. . : :...:...:: .: ::. :.. :::::::::.::..: CCDS58 AAPLAPSRPPPALGRQKRSDSVSTSDEDSQSEDRNDLKAS-PTLGKRKKRRHRTVFTAHQ 120 130 140 150 160 170 160 170 180 190 200 210 pF1KB7 LEELEKAFNEAHYPDVYAREMLAMKTELPEDRIQVWFQNRRAKWRKREKCWGRSSVMAEY ::::::::.::::::::::::::.::::::::::::::::::::::::: :: ::::::: CCDS58 LEELEKAFSEAHYPDVYAREMLAVKTELPEDRIQVWFQNRRAKWRKREKRWGGSSVMAEY 180 190 200 210 220 230 220 230 240 250 260 270 pF1KB7 GLYGAMVRHSIPLPESILKSAKDGIMDSCAPWLLGMHKKSLEAAAESGRKPEGERQALPK ::::::::: ::::.:.:.::. :.. ::::::: CCDS58 GLYGAMVRHCIPLPDSVLNSAEGGLLGSCAPWLLVQTSAPGGSRSLDFAGDTQAPQTPWW 240 250 260 270 280 290 280 290 300 310 320 330 pF1KB7 LDKMEQDERGPDAQAAISQEELRENSIAVLRAKAQEHSTKVLGTVSGPDSLARSTEKPEE CCDS58 CLMTFS 300 >>CCDS31468.1 ALX4 gene_id:60529|Hs108|chr11 (411 aa) initn: 452 init1: 355 opt: 396 Z-score: 341.5 bits: 71.9 E(32554): 1.1e-12 Smith-Waterman score: 402; 38.2% identity (61.4% similar) in 220 aa overlap (6-208:58-274) 10 20 30 pF1KB7 MTGKAGEALSKPK----SETVAKSTSGGAPARCT- :.: :. . .. .: .:: :: . CCDS31 REGSSPFRAFPGGDKFGTTFLSAAAKAQGFGDAKSRARYGAGQQDLATPLESGAGARGSF 30 40 50 60 70 80 40 50 60 70 80 pF1KB7 -GFGIQEILGLNKEPPSSHPRAALDGLAPG---HLLAARSVLSPAGVGGMGLL-GPGGLP : : . ::. .:. : :: :.. . :.. : : .: CCDS31 NKFQPQPSTPQPQPPPQPQPQQQQPQPQPPAQPHLYLQRGACKTPPDGSLKLQEGSSGHS 90 100 110 120 130 140 90 100 110 120 130 pF1KB7 GFYTQPTFLE--VLSDPQSVHLQP----LGRASGPLDTSQTASSDSEDVSSSDRKMS-KS . : . . :..:. : : .: :. :...... . .: .::: .. CCDS31 AALQVPCYAKESSLGEPE---LPPDSDTVGMDSSYLSVKEAGVKGPQDRASSDLPSPLEK 150 160 170 180 190 200 140 150 160 170 180 190 pF1KB7 ALNQTKKRKKRRHRTIFTSYQLEELEKAFNEAHYPDVYAREMLAMKTELPEDRIQVWFQN : ....: ::::.:: :::::::::::.:...:::::::::.:::.:.: : :.:::::: CCDS31 ADSESNKGKKRRNRTTFTSYQLEELEKVFQKTHYPDVYAREQLAMRTDLTEARVQVWFQN 210 220 230 240 250 260 200 210 220 230 240 250 pF1KB7 RRAKWRKREKCWGRSSVMAEYGLYGAMVRHSIPLPESILKSAKDGIMDSCAPWLLGMHKK :::::::::. CCDS31 RRAKWRKRERFGQMQQVRTHFSTAYELPLLTRAENYAQIQNPSWLGNNGAASPVPACVVP 270 280 290 300 310 320 >>CCDS819.1 ALX3 gene_id:257|Hs108|chr1 (343 aa) initn: 331 init1: 331 opt: 357 Z-score: 310.1 bits: 65.9 E(32554): 5.9e-11 Smith-Waterman score: 369; 43.6% identity (62.0% similar) in 179 aa overlap (79-243:75-253) 50 60 70 80 90 100 pF1KB7 PRAALDGLAPGHLLAARSVLSPAGVGGMGLLGPG-GLPG--FYTQPTFLEV-LSDPQSVH :::: .: : :: :. : : : CCDS81 GPRLTRFPACGPLEPYLPEPAKPPAKYLQDLGPGPALNGGHFYEGPAEAEEKTSKAASFP 50 60 70 80 90 100 110 120 130 140 150 pF1KB7 LQPLGRASGPLD-TSQTASSDSEDVSSSDRKMSKS---ALNQTK-KRKKRRHRTIFTSYQ :: .:: : :. .: . ..: .: . ... .: : ::::.:: :...: CCDS81 QLPLDCRGGPRDGPSNLQGSPGPCLASLHLPLSPGLPDSMELAKNKSKKRRNRTTFSTFQ 110 120 130 140 150 160 160 170 180 190 200 210 pF1KB7 LEELEKAFNEAHYPDVYAREMLAMKTELPEDRIQVWFQNRRAKWRKREKCW----GRSSV ::::::.:...:::::::::.::..:.: : :.:::::::::::::::. ::. CCDS81 LEELEKVFQKTHYPDVYAREQLALRTDLTEARVQVWFQNRRAKWRKRERYGKIQEGRNPF 170 180 190 200 210 220 220 230 240 250 260 270 pF1KB7 MAEYGLYGAMVRHSIP-LPESILKSAKDGIMDSCAPWLLGMHKKSLEAAAESGRKPEGER : : . : : : .:. : .: CCDS81 TAAYDISVLPRTDSHPQLQNSLWASPGSGSPGGPCLVSPEGIPSPCMSPYSHPHGSVAGF 230 240 250 260 270 280 361 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:41:29 2016 done: Sun Nov 6 04:41:30 2016 Total Scan time: 3.100 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]