FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3649, 437 aa
  1>>>pF1KE3649 437 - 437 aa - 437 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1252+/-0.00072; mu= 18.6174+/- 0.044
 mean_var=66.1326+/-13.144, 0's: 0 Z-trim(109.8): 12 B-trim: 4 in 1/50
 Lambda= 0.157713
 statistics sampled from 11166 (11171) to 11166 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.715), E-opt: 0.2 (0.343), width: 16
 Scan time: 3.130

The best scores are:                                      opt  bits E(32554)
CCDS9769.1 FNTB gene_id:2342|Hs108|chr14           ( 437) 3048 702.1 2.7e-202
CCDS669.1 RABGGTB gene_id:5876|Hs108|chr1          ( 331)  421 104.3 1.9e-22
CCDS4116.1 PGGT1B gene_id:5229|Hs108|chr5          ( 377)  337  85.2 1.2e-16

>>CCDS9769.1 FNTB gene_id:2342|Hs108|chr14              (437 aa)
 initn: 3048 init1: 3048 opt: 3048 Z-score: 3745.2 bits: 702.1 E(32554): 2.7e-202
Smith-Waterman score: 3048; 100.0% identity (100.0% similar) in 437 aa overlap (1-437:1-437)

               10        20        30        40        50        60
pF1KE3 MASPSSFTYYCPPSSSPVWSEPLYSLRPEHARERLQDDSVETVTSIEQAKVEEKIQEVFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 MASPSSFTYYCPPSSSPVWSEPLYSLRPEHARERLQDDSVETVTSIEQAKVEEKIQEVFS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 SYKFNHLVPRLVLQREKHFHYLKRGLRQLTDAYECLDASRPWLCYWILHSLELLDEPIPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 SYKFNHLVPRLVLQREKHFHYLKRGLRQLTDAYECLDASRPWLCYWILHSLELLDEPIPQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 IVATDVCQFLELCQSPEGGFGGGPGQYPHLAPTYAAVNALCIIGTEEAYDIINREKLLQY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 IVATDVCQFLELCQSPEGGFGGGPGQYPHLAPTYAAVNALCIIGTEEAYDIINREKLLQY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 LYSLKQPDGSFLMHVGGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97
LYSLKQPDGSFLMHVGGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 GVPGMEAHGGYTFCGLAALVILKRERSLNLKSLLQWVTSRQMRFEGGFQGRCNKLVDGCY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 GVPGMEAHGGYTFCGLAALVILKRERSLNLKSLLQWVTSRQMRFEGGFQGRCNKLVDGCY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 SFWQAGLLPLLHRALHAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 SFWQAGLLPLLHRALHAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDF 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 YHTCYCLSGLSIAQHFGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 YHTCYCLSGLSIAQHFGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPV 370 380 390 400 410 420 430 pF1KE3 PGFEELKDETSAEPATD ::::::::::::::::: CCDS97 PGFEELKDETSAEPATD 430 >>CCDS669.1 RABGGTB gene_id:5876|Hs108|chr1 (331 aa) initn: 490 init1: 209 opt: 421 Z-score: 516.6 bits: 104.3 E(32554): 1.9e-22 Smith-Waterman score: 521; 29.9% identity (59.5% similar) in 338 aa overlap (76-410:21-321) 50 60 70 80 90 100 pF1KE3 IEQAKVEEKIQEVFSSYKFNHLVPRLVLQREKHFHYL-KRGLRQLTDAYE-CLDAS-RPW ::: :. . : .. : :: :.. : CCDS66 MGTPQKDVIIKSDAPDTLLLEKHADYIASYGSKK--DDYEYCMSEYLRMS 10 20 30 40 110 120 130 140 150 160 pF1KE3 LCYWILHSLELLDEPIPQIVATDVCQFLELCQSPEGGFGGGPGQYPHLAPTYAAVNALCI :: : ..:. . . .. .. :.. :: ::.... :. ::: : .::. : . CCDS66 GIYWGLTVMDLMGQ-LHRMNREEILAFIKSCQHECGGISASIGHDPHLLYTLSAVQILTL 50 60 70 80 90 100 170 180 190 200 210 220 pF1KE3 IGTEEAYDIINREKLLQYLYSLKQPDGSFLMHVGGEVDVRSAYCAASVASLTNIITPDLF .. ..:. .:...:. .:.. :::: . ::.:.: ..::... .: . . CCDS66 Y---DSINVIDVNKVVEYVKGLQKEDGSFAGDIWGEIDTRFSFCAVATLALLGKLDAINV 110 120 130 140 150 160 230 240 250 260 270 280 pF1KE3 EGTAEWIARCQNWEGGIGGVPGMEAHGGYTFCGLAALVILKRERSLNLKSLLQWVTSRQM : . :.. :.:..::.: :: :.:.: .: . :.: .. ...: : :. ::. 
CCDS66 EKAIEFVLSCMNFDGGFGCRPGSESHAGQIYCCTGFLAITSQLHQVNSDLLGWWLCERQL 170 180 190 200 210 220 290 300 310 320 330 340 pF1KE3 RFEGGFQGRCNKLVDGCYSFWQAGLLPLLHRALHAQGDPALSMSHWMFHQQALQEYILMC ::..:: .:: : :::.: . : .. : ::. .. :...:: : CCDS66 P-SGGLNGRPEKLPDVCYSWWVLASLKIIGRL------------HWI-DREKLRNFILAC 230 240 250 260 270 350 360 370 380 390 400 pF1KE3 CQCPAGGLLDKPGKSRDFYHTCYCLSGLSIAQHFGSGAMLHDVVLGVPENALQPTHPVYN . .::. :.:: : .:: . ..:::. :: :. ..:..::. CCDS66 QDEETGGFADRPGDMVDPFHTLFGIAGLSL--------------LG--EEQIKPVNPVFC 280 290 300 310 410 420 430 pF1KE3 IGPDKVIQATTYFLQKPVPGFEELKDETSAEPATD . :..:.: CCDS66 M-PEEVLQRVNVQPELVS 320 330 >>CCDS4116.1 PGGT1B gene_id:5229|Hs108|chr5 (377 aa) initn: 324 init1: 132 opt: 337 Z-score: 412.5 bits: 85.2 E(32554): 1.2e-16 Smith-Waterman score: 392; 28.4% identity (55.6% similar) in 338 aa overlap (70-377:17-336) 40 50 60 70 80 90 pF1KE3 VETVTSIEQAKVEEKIQEVFSSYKFNHLVPRLVLQREKHFHYLKRGLRQLTDAYECLDAS :: . :..: ....: :. : . : :..: CCDS41 MAATEDERLAGSGEGERLDFLRDRHVRFFQRCLQVLPERYSSLETS 10 20 30 40 100 110 120 130 140 pF1KE3 RPWLCYWILHSLELLD--------EPIPQIVATDVCQFLELCQSPEGGFGGG-------- : . .. : .:..:: . : : . .: . . . :: :. CCDS41 RLTIAFFALSGLDMLDSLDVVNKDDIIEWIYSLQVLPTEDRSNLNRCGFRGSSYLGIPFN 50 60 70 80 90 100 150 160 170 180 190 pF1KE3 PGQYP---------HLAPTYAAVNALCIIGTEEAYDIINREKLLQYLYSLKQPDGSFL-M :.. : :.: ::.... : :.: . . .:.: : : .:. :::: . CCDS41 PSKAPGTAHPYDSGHIAMTYTGLSCLVILGDDLSR--VNKEACLAGLRALQLEDGSFCAV 110 120 130 140 150 160 200 210 220 230 240 250 pF1KE3 HVGGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIGGVPGMEAHGGYTF :.: :.: .:::. . . : . .. . .: : .....:.. :.:.::: :: CCDS41 PEGSENDMRFVYCASCICYMLNNWSGMDMKKAITYIRRSMSYDNGLAQGAGLESHGGSTF 170 180 190 200 210 220 260 270 280 290 300 pF1KE3 CGLAALVIL-KRERSLNLKSL---LQWVTSRQMRFEGGFQGRCNKLVDGCYSFWQAGLLP ::.:.: .. : :. .. : : .: :: ..:..:: :: :: ::::: .. 
: CCDS41 CGIASLCLMGKLEEVFSEKELNRIKRWCIMRQ---QNGYHGRPNKPVDTCYSFWVGATLK 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE3 LLHRALHAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDFYHTCYCLSG ::. ... :... ..::: . .::. : . : :. . . : CCDS41 LLK-----------IFQYTNFEKN--RNYILSTQDRLVGGFAKWPDSHPDALHAYFGICG 290 300 310 320 370 380 390 400 410 420 pF1KE3 LSIAQHFGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPVPGFEELKDE ::. .. : CCDS41 LSLMEESGICKVHPALNVSTRTSERLLDLHQSWKTKDSKQCSENVHIST 330 340 350 360 370 437 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 00:46:11 2016 done: Mon Nov 7 00:46:11 2016 Total Scan time: 3.130 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]