FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE9514, 395 aa
  1>>>pF1KE9514 395 - 395 aa - 395 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.0076+/-0.000935; mu= 18.4155+/- 0.056
 mean_var=61.9703+/-13.067, 0's: 0 Z-trim(104.1): 54 B-trim: 424 in 1/48
 Lambda= 0.162923
 statistics sampled from 7688 (7733) to 7688 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.604), E-opt: 0.2 (0.238), width: 16
 Scan time: 2.680

The best scores are:                                       opt bits E(32554)
CCDS12862.1 VN1R2 gene_id:317701|Hs108|chr19       ( 395) 2673 637.1 8.4e-183
CCDS33099.1 VN1R4 gene_id:317703|Hs108|chr19       ( 301) 1130 274.3   1e-73
CCDS12951.1 VN1R1 gene_id:57191|Hs108|chr19        ( 353)  827 203.1 3.2e-52

>>CCDS12862.1 VN1R2 gene_id:317701|Hs108|chr19 (395 aa)
 initn: 2673 init1: 2673 opt: 2673 Z-score: 3395.3 bits: 637.1 E(32554): 8.4e-183
Smith-Waterman score: 2673; 100.0% identity (100.0% similar) in 395 aa overlap (1-395:1-395)

               10        20        30        40        50        60
pF1KE9 MTHTLYPTPFALYPINISAAWHLGPLPVSCFVSNKYQCSLAFGATTGLRVLVVVVPQTQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MTHTLYPTPFALYPINISAAWHLGPLPVSCFVSNKYQCSLAFGATTGLRVLVVVVPQTQL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 SFLSSLCLVSLFLHSLVSAHGEKPTKPVGLDPTLFQVVVGILGNFSLLYYYMFLYFRGYK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SFLSSLCLVSLFLHSLVSAHGEKPTKPVGLDPTLFQVVVGILGNFSLLYYYMFLYFRGYK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 PRSTDLILRHLTVADSLVILSKRIPETMATFGLKHFDNYFGCKFLLYAHRVGRGVSIGST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PRSTDLILRHLTVADSLVILSKRIPETMATFGLKHFDNYFGCKFLLYAHRVGRGVSIGST
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE9 CLLSVFQVITINPRNSRWAEMKVKAPTYIGLSNILCWAFHMLVNAIFPIYTTGKWSNNNI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 CLLSVFQVITINPRNSRWAEMKVKAPTYIGLSNILCWAFHMLVNAIFPIYTTGKWSNNNI
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE9 TKKGDLGYCSAPLSDEVTKSVYAALTSFHDVLCLGLMLWASSSIVLVLYRHKQQVQHICR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 TKKGDLGYCSAPLSDEVTKSVYAALTSFHDVLCLGLMLWASSSIVLVLYRHKQQVQHICR
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE9 NNLYPNSSPGNRAIQSILALVSTFALCYALSFITYVYLALFDNSSWWLVNTAALIIACFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 NNLYPNSSPGNRAIQSILALVSTFALCYALSFITYVYLALFDNSSWWLVNTAALIIACFP
              310       320       330       340       350       360

              370       380       390
pF1KE9 TISPFVLMCRDPSRSRLCSICCRRNRRFFHDFRKM
       :::::::::::::::::::::::::::::::::::
CCDS12 TISPFVLMCRDPSRSRLCSICCRRNRRFFHDFRKM
              370       380       390

>>CCDS33099.1 VN1R4 gene_id:317703|Hs108|chr19 (301 aa)
 initn: 1111 init1: 1111 opt: 1130 Z-score: 1437.0 bits: 274.3 E(32554): 1e-73
Smith-Waterman score: 1130; 57.1% identity (80.6% similar) in 294 aa overlap (85-378:3-296)

           60        70        80        90       100       110
pF1KE9 VPQTQLSFLSSLCLVSLFLHSLVSAHGEKPTKPVGLDPTLFQVVVGILGNFSLLYYYMFL
                                     .. :..   : :.:::.::.::.: .:. .
CCDS33                             MASRYVAVGMILSQTVVGVLGSFSVLLHYLSF
                                           10        20        30

          120       130       140       150       160       170
pF1KE9 YFRGYKPRSTDLILRHLTVADSLVILSKRIPETMATFGLKHFDNYFGCKFLLYAHRVGRG
       :  : . ::::::..:: ::. :..  : .:.:::.::...: : .:::...: ::::::
CCDS33 YCTGCRLRSTDLIVKHLIVANFLALRCKGVPQTMAAFGVRYFLNALGCKLVFYLHRVGRG
             40        50        60        70        80        90

          180       190       200       210       220       230
pF1KE9 VSIGSTCLLSVFQVITINPRNSRWAEMKVKAPTYIGLSNILCWAFHMLVNAIFPIYTTGK
       ::::.:::::::::::.. :.::::..: ::: ..:.: .:::   :::: :::.:.:::
CCDS33 VSIGTTCLLSVFQVITVSSRKSRWAKLKEKAPKHVGFSVLLCWIVCMLVNIIFPMYVTGK
            100       110       120       130       140       150

          240       250       260       270       280       290
pF1KE9 WSNNNITKKGDLGYCSAPLSDEVTKSVYAALTSFHDVLCLGLMLWASSSIVLVLYRHKQQ
       :. .::: . ::::::.  ........ : : :: ::::::::::.:::.: .:.::::.
CCDS33 WNYTNITVNEDLGYCSGGGNNKIAQTLRAMLLSFPDVLCLGLMLWVSSSMVCILHRHKQR
            160       170       180       190       200       210

          300       310       320       330       340       350
pF1KE9 VQHICRNNLYPNSSPGNRAIQSILALVSTFALCYALSFITYVYLALFDNSSWWLVNTAAL
       :::: :.:: : .:: ::: :::: :::::.  :.:: .  : .::.:: .  ::::.::
CCDS33 VQHIDRSNLSPRASPENRATQSILILVSTFVSSYTLSCLFQVCMALLDNPNSLLVNTSAL
            220       230       240       250       260       270

          360       370       380       390
pF1KE9 IIACFPTISPFVLMCRDPSRSRLCSICCRRNRRFFHDFRKM
       . .::::.::::::  :::  :.:
CCDS33 MSVCFPTLSPFVLMSCDPSVYRFCFAWKR
            280       290       300

>>CCDS12951.1 VN1R1 gene_id:57191|Hs108|chr19 (353 aa)
 initn: 773 init1: 507 opt: 827 Z-score: 1051.0 bits: 203.1 E(32554): 3.2e-52
Smith-Waterman score: 827; 43.2% identity (71.0% similar) in 303 aa overlap (94-395:53-352)

            70        80        90       100       110       120
pF1KE9 SSLCLVSLFLHSLVSAHGEKPTKPVGLDPTLFQVVVGILGNFSLLYYYMFLYFRGYKPRS
                                     :.:. ::::::  :: .: .. : :.: :
CCDS12 STDSSDLNENQHPLDFDEMAFGKVKSGISFLIQTGVGILGNSFLLCFYNLILFTGHKLRP
             30        40        50        60        70        80

           130       140       150       160       170       180
pF1KE9 TDLILRHLTVADSLVILSKRIPETMATFGLKHFDNYFGCKFLLYAHRVGRGVSIGSTCLL
       ::::: .:..:.:.:.. : ::.:::.::::.. :  ::::..: ::::  ::... :::
CCDS12 TDLILSQLALANSMVLFFKGIPQTMAAFGLKYLLNDTGCKFVFYYHRVGTRVSLSTICLL
             90       100       110       120       130       140

           190       200       210       220       230       240
pF1KE9 SVFQVITINPRNSRWAEMKVKAPTYIGLSNILCWAFHMLVNAIFPIYTTGKWSNNNITKK
       . ::.: .::   :: :.:...: .: .  .:::: :.:.::   . ..:  ...: . :
CCDS12 NGFQAIKLNPSICRWMEIKIRSPRFIDFCCLLCWAPHVLMNASVLLLVNGPLNSKNSSAK
            150       160       170       180       190       200

           250       260       270       280       290       300
pF1KE9 GDLGYCSAPLSDEVTKSVYAALTSFHDVLCLGLMLWASSSIVLVLYRHKQQVQHICRNNL
       .. ::::   : . . :..:.:    : . ::.:.:::.:.:. ::::::::::   : :
CCDS12 NNYGYCSYKASKRFS-SLHAVLYFSPDFMSLGFMVWASGSMVFFLYRHKQQVQHNHSNRL
            210        220       230       240       250       260

           310       320       330        340       350       360
pF1KE9 YPNSSPGNRAIQSILALVSTFALCYAL-SFITYVYLALFDNSSWWLVNTAALIIACFPTI
           :   :: ..:..:::.: . :.. ::.: .. ..  : . :.:....:. .:::.
CCDS12 SCRPSQEARATHTIMVLVSSFFVFYSVHSFLT-IWTTVVANPGQWIVTNSVLVASCFPAR
             270       280       290        300       310       320

            370       380       390
pF1KE9 SPFVLMCRDPSRSRLCSICCRRNRRFFHDFRKM
       :::::.  :   :..: . ::  . .: ..  :
CCDS12 SPFVLIMSDTHISQFC-FACRTRKTLFPNLVVMP
              330        340       350

395 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 13:13:47 2016 done: Sun Nov 6 13:13:48 2016
 Total Scan time: 2.680 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]