FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2642, 397 aa 1>>>pF1KE2642 397 - 397 aa - 397 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0774+/-0.00095; mu= 13.1897+/- 0.057 mean_var=82.7059+/-16.237, 0's: 0 Z-trim(106.2): 37 B-trim: 2 in 1/50 Lambda= 0.141028 statistics sampled from 8848 (8875) to 8848 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.659), E-opt: 0.2 (0.273), width: 16 Scan time: 2.580 The best scores are: opt bits E(32554) CCDS46724.1 PARVB gene_id:29780|Hs108|chr22 ( 397) 2597 538.3 4.7e-153 CCDS14056.1 PARVB gene_id:29780|Hs108|chr22 ( 364) 2102 437.5 9.1e-123 CCDS58808.1 PARVB gene_id:29780|Hs108|chr22 ( 327) 2098 436.7 1.5e-122 CCDS44541.2 PARVA gene_id:55742|Hs108|chr11 ( 412) 1612 337.9 1e-92 CCDS74874.1 PARVB gene_id:29780|Hs108|chr22 ( 289) 1325 279.4 2.9e-75 CCDS14057.1 PARVG gene_id:64098|Hs108|chr22 ( 331) 863 185.4 6.4e-47 >>CCDS46724.1 PARVB gene_id:29780|Hs108|chr22 (397 aa) initn: 2597 init1: 2597 opt: 2597 Z-score: 2861.3 bits: 538.3 E(32554): 4.7e-153 Smith-Waterman score: 2597; 99.7% identity (100.0% similar) in 397 aa overlap (1-397:1-397) 10 20 30 40 50 60 pF1KE2 MHHVFKDHQRGEKRGFLSPENKNCRRLELRRGCSCSWGLCSQALMASLAGSLLPGSDRSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MHHVFKDHQRGEKRGFLSPENKNCRRLELRRGCSCSWGLCSQALMASLAGSLLPGSDRSG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 VETSEYAQGGVSDLQEEGKNAINSPMSPALADVHPEDTQLEENEERTMIDPTSKEDPKFK ::::::::::::::::::::::::::::::.::::::::::::::::::::::::::::: CCDS46 VETSEYAQGGVSDLQEEGKNAINSPMSPALVDVHPEDTQLEENEERTMIDPTSKEDPKFK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 ELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEKLAGCKLNVAEVTQSEIGQKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 ELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEKLAGCKLNVAEVTQSEIGQKQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 KLQTVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLVSLAMHFRAPIRLPEHVTVQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 KLQTVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLVSLAMHFRAPIRLPEHVTVQV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 VVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTLFDHAPDKLSVVKKSLITFVNKH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTLFDHAPDKLSVVKKSLITFVNKH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 LNKLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHFYLTPESFDQKVHNVSFAFELML :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LNKLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHFYLTPESFDQKVHNVSFAFELML 310 320 330 340 350 360 370 380 390 pF1KE2 DGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE ::::::::::::::::::::::::::::::::::::: CCDS46 DGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE 370 380 390 >>CCDS14056.1 PARVB gene_id:29780|Hs108|chr22 (364 aa) initn: 2102 init1: 2102 opt: 2102 Z-score: 2317.6 bits: 437.5 E(32554): 9.1e-123 Smith-Waterman score: 2102; 99.7% identity (100.0% similar) in 327 aa overlap (71-397:38-364) 50 60 70 80 90 100 pF1KE2 SQALMASLAGSLLPGSDRSGVETSEYAQGGVSDLQEEGKNAINSPMSPALADVHPEDTQL ::::::::::::::::::::.::::::::: CCDS14 PTPRPRRMKKDESFLGKLGGTLARKRRAREVSDLQEEGKNAINSPMSPALVDVHPEDTQL 10 20 30 40 50 60 110 120 130 140 150 160 pF1KE2 EENEERTMIDPTSKEDPKFKELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 EENEERTMIDPTSKEDPKFKELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEK 70 80 90 100 110 120 170 180 190 200 210 220 pF1KE2 LAGCKLNVAEVTQSEIGQKQKLQTVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 LAGCKLNVAEVTQSEIGQKQKLQTVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLV 130 140 150 160 170 180 230 240 250 260 270 280 pF1KE2 SLAMHFRAPIRLPEHVTVQVVVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTLFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SLAMHFRAPIRLPEHVTVQVVVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTLFD 190 200 210 220 230 240 290 300 310 320 330 340 pF1KE2 HAPDKLSVVKKSLITFVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHFYL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 HAPDKLSVVKKSLITFVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHFYL 250 260 270 280 290 300 350 360 370 380 390 pF1KE2 TPESFDQKVHNVSFAFELMLDGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 TPESFDQKVHNVSFAFELMLDGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE 310 320 330 340 350 360 >>CCDS58808.1 PARVB gene_id:29780|Hs108|chr22 (327 aa) initn: 2098 init1: 2098 opt: 2098 Z-score: 2313.9 bits: 436.7 E(32554): 1.5e-122 Smith-Waterman score: 2098; 99.4% identity (100.0% similar) in 327 aa overlap (71-397:1-327) 50 60 70 80 90 100 pF1KE2 SQALMASLAGSLLPGSDRSGVETSEYAQGGVSDLQEEGKNAINSPMSPALADVHPEDTQL .:::::::::::::::::::.::::::::: CCDS58 MSDLQEEGKNAINSPMSPALVDVHPEDTQL 10 20 30 110 120 130 140 150 160 pF1KE2 EENEERTMIDPTSKEDPKFKELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 EENEERTMIDPTSKEDPKFKELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEK 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE2 LAGCKLNVAEVTQSEIGQKQKLQTVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 LAGCKLNVAEVTQSEIGQKQKLQTVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLV 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE2 SLAMHFRAPIRLPEHVTVQVVVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTLFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 SLAMHFRAPIRLPEHVTVQVVVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTLFD 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE2 HAPDKLSVVKKSLITFVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHFYL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 HAPDKLSVVKKSLITFVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHFYL 220 230 240 250 260 270 350 360 370 380 390 pF1KE2 TPESFDQKVHNVSFAFELMLDGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 TPESFDQKVHNVSFAFELMLDGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE 280 290 300 310 320 >>CCDS44541.2 PARVA gene_id:55742|Hs108|chr11 (412 aa) initn: 1611 init1: 1611 opt: 1612 Z-score: 1777.9 bits: 337.9 E(32554): 1e-92 Smith-Waterman score: 1612; 76.0% identity (91.2% similar) in 329 aa overlap (71-397:86-412) 50 60 70 80 90 100 pF1KE2 SQALMASLAGSLLPGSDRSGVETSEYAQGGVSDLQEEGKNAINSPMSPALADVHPEDTQL ::.::::: :::: :.:: .. ::::.: CCDS44 TPKSPPSRKKDDSFLGKLGGTLARRKKAKEVSELQEEGMNAINLPLSPIPFELDPEDTML 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 EENEERTMIDPTSKEDPKFKELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEK :::: :::.::.:. :::..::.:::.:::::::: ::::::.: ::::::::::::.:: CCDS44 EENEVRTMVDPNSRSDPKLQELMKVLIDWINDVLVGERIIVKDLAEDLYDGQVLQKLFEK 120 130 140 150 160 170 170 180 190 200 210 pF1KE2 LAGCKLNVAEVTQSEIGQKQKLQTVLEAVHDLLR--PRGWALRWSVDSIHGKNLVAILHL : . ::::::::::::.:::::::::: ... :. :: ...:.:::.:.:.::::::: CCDS44 LESEKLNVAEVTQSEIAQKQKLQTVLEKINETLKLPPR--SIKWNVDSVHAKSLVAILHL 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE2 LVSLAMHFRAPIRLPEHVTVQVVVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTL ::.:...::::::::.::..:::::.::::.:.: .:.::.: .:: . :: :::::::: CCDS44 LVALSQYFRAPIRLPDHVSIQVVVVQKREGILQSRQIQEEITGNTEALSGRHERDAFDTL 240 250 260 270 280 290 280 290 300 310 320 330 pF1KE2 FDHAPDKLSVVKKSLITFVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHF ::::::::.::::.::::::::::::::::::::::::::::::::::::: :::::: : CCDS44 FDHAPDKLNVVKKTLITFVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEGYFVPLHSF 300 310 320 330 340 350 340 350 360 370 380 390 pF1KE2 YLTPESFDQKVHNVSFAFELMLDGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE .:::.::.::: ::::::::: ::::.::: ::::.:: ::::::::::::::::.::: CCDS44 FLTPDSFEQKVLNVSFAFELMQDGGLEKPKPRPEDIVNCDLKSTLRVLYNLFTKYRNVE 360 370 380 390 400 410 >>CCDS74874.1 PARVB gene_id:29780|Hs108|chr22 (289 aa) initn: 1325 init1: 1325 opt: 1325 Z-score: 1464.7 bits: 279.4 E(32554): 2.9e-75 Smith-Waterman score: 1815; 92.3% identity (92.6% similar) in 312 aa overlap (86-397:1-289) 60 70 80 90 100 110 pF1KE2 SDRSGVETSEYAQGGVSDLQEEGKNAINSPMSPALADVHPEDTQLEENEERTMIDPTSKE :::::.:::::::::::::::::::::::: CCDS74 MSPALVDVHPEDTQLEENEERTMIDPTSKE 10 20 30 120 130 140 150 160 170 pF1KE2 DPKFKELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEKLAGCKLNVAEVTQSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 DPKFKELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEKLAGCKLNVAEVTQSE 40 50 60 70 80 90 180 190 200 210 220 230 pF1KE2 IGQKQKLQTVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLVSLAMHFRAPIRLPEH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 IGQKQKLQTVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLVSLAMHFRAPIRLPEH 100 110 120 130 140 150 240 250 260 270 280 290 pF1KE2 VTVQVVVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTLFDHAPDKLSVVKKSLIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 VTVQVVVVRKREGLLHSSHISEELTTTTEMMMGRFERDAFDTLFDHAPDKLSVVKK---- 160 170 180 190 200 300 310 320 330 340 350 pF1KE2 FVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHFYLTPESFDQKVHNVSFA ::::::::::::::::::::::::::::::::::::::::: CCDS74 -------------------FADGVYLVLLMGLLEDYFVPLHHFYLTPESFDQKVHNVSFA 210 220 230 240 360 370 380 390 pF1KE2 FELMLDGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE :::::::::::::::::::::::::::::::::::::::::: CCDS74 FELMLDGGLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE 250 260 270 280 >>CCDS14057.1 PARVG gene_id:64098|Hs108|chr22 (331 aa) initn: 862 init1: 465 opt: 863 Z-score: 955.8 bits: 185.4 E(32554): 6.4e-47 Smith-Waterman score: 863; 45.2% identity (78.7% similar) in 301 aa overlap (95-393:19-316) 70 80 90 100 110 120 pF1KE2 EYAQGGVSDLQEEGKNAINSPMSPALADVHPEDTQLEENEERTMIDPTSKEDPKFKELVK : . .: .. .. .. :::..::::.:: : CCDS14 MEPEFLYDLLQLPKGVEPPAEEELSKGGKKKYLPPTSRKDPKFEELQK 10 20 30 40 130 140 150 160 170 180 pF1KE2 VLLDWINDVLVEERIIVKQLEEDLYDGQVLQKLLEKLAGCKLNVAEVTQSEIGQKQKLQT ::..::: .:. :.:.:..::::..:: .:..:...::. ::.. ... . .::.:: . CCDS14 VLMEWINATLLPEHIVVRSLEEDMFDGLILHHLFQRLAALKLEAEDIALTATSQKHKLTV 50 60 70 80 90 100 190 200 210 220 230 240 pF1KE2 VLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLVSLAMHFRAPIRLPEHVTVQVVVVR :::::. :. . : .:::.:: .:.:.. :::::.:: .:. . :: .: :.:.... CCDS14 VLEAVNRSLQLEEWQAKWSVESIFNKDLLSTLHLLVALAKRFQPDLSLPTNVQVEVITIE 110 120 130 140 150 160 250 260 270 280 290 300 pF1KE2 KREGLLHSSHISEELTTTTEMMMGRFE--RDAFDTLFDHAPDKLSVVKKSLITFVNKHLN . .. :.: .. :.:: :. . : .:.:: :: ::.:...::.....:::..:. CCDS14 STKSGLKSEKLVEQLT---EYSTDKDEPPKDVFDELFKLAPEKVNAVKEAIVNFVNQKLD 170 180 190 200 210 220 310 320 330 340 350 360 pF1KE2 KLNLEVTELETQFADGVYLVLLMGLLEDYFVPLHHFYLTPESFDQKVHNVSFAFELMLDG .:.: : .:.::::::: :.::.: :: .:. :..:::::.: . .:::..:.::. : CCDS14 RLGLSVQNLDTQFADGVILLLLIGQLEGFFLHLKEFYLTPNSPAEMLHNVTLALELLKDE 230 240 250 260 270 280 370 380 390 pF1KE2 GLKKPKARPEDVVNLDLKSTLRVLYNLFTKYKNVE :: . . :::.:: : ::::::::.:: :. CCDS14 GLLSCPVSPEDIVNKDAKSTLRVLYGLFCKHTQKAHRDRTPHGAPN 290 300 310 320 330 397 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 17:43:22 2016 done: Tue Nov 8 17:43:22 2016 Total Scan time: 2.580 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]