FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5413, 310 aa 1>>>pF1KE5413 310 - 310 aa - 310 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3636+/-0.000337; mu= 15.2190+/- 0.021 mean_var=68.0103+/-13.962, 0's: 0 Z-trim(115.2): 15 B-trim: 1221 in 1/54 Lambda= 0.155520 statistics sampled from 25539 (25553) to 25539 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.692), E-opt: 0.2 (0.3), width: 16 Scan time: 7.110 The best scores are: opt bits E(85289) NP_775745 (OMIM: 615367) protein N-terminal aspara ( 310) 2090 477.7 1.3e-134 NP_001257696 (OMIM: 615367) protein N-terminal asp ( 205) 1392 321.0 1.2e-87 NP_001257695 (OMIM: 615367) protein N-terminal asp ( 205) 1392 321.0 1.2e-87 XP_011520657 (OMIM: 615367) PREDICTED: protein N-t ( 227) 1301 300.6 1.9e-81 >>NP_775745 (OMIM: 615367) protein N-terminal asparagine (310 aa) initn: 2090 init1: 2090 opt: 2090 Z-score: 2537.9 bits: 477.7 E(85289): 1.3e-134 Smith-Waterman score: 2090; 100.0% identity (100.0% similar) in 310 aa overlap (1-310:1-310) 10 20 30 40 50 60 pF1KE5 MPLLVEGRRVRLPQSAGDLVRAHPPLEERARLLRGQSVQQVGPQGLLYVQQRELAVTSPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_775 MPLLVEGRRVRLPQSAGDLVRAHPPLEERARLLRGQSVQQVGPQGLLYVQQRELAVTSPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 DGSISILGSDDATTCHIVVLRHTGNGATCLTHCDGTDTKAEVPLIMNSIKSFSDHAQCGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_775 DGSISILGSDDATTCHIVVLRHTGNGATCLTHCDGTDTKAEVPLIMNSIKSFSDHAQCGR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LEVHLVGGFSDDRQLSQKLTHQLLSEFDRQEDDIHLVTLCVTELNDREENENHFPVIYGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_775 LEVHLVGGFSDDRQLSQKLTHQLLSEFDRQEDDIHLVTLCVTELNDREENENHFPVIYGI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AVNIKTAEIYRASFQDRGPEEQLRAARTLAGGPMISIYDAETEQLRIGPYSWTPFPHVDF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_775 AVNIKTAEIYRASFQDRGPEEQLRAARTLAGGPMISIYDAETEQLRIGPYSWTPFPHVDF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 WLHQDDKQILENLSTSPLAEPPHFVEHIRSTLMFLKKHPSPAHTLFSGNKALLYKKNEDG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_775 WLHQDDKQILENLSTSPLAEPPHFVEHIRSTLMFLKKHPSPAHTLFSGNKALLYKKNEDG 250 260 270 280 290 300 310 pF1KE5 LWEKISSPGS :::::::::: NP_775 LWEKISSPGS 310 >>NP_001257696 (OMIM: 615367) protein N-terminal asparag (205 aa) initn: 1392 init1: 1392 opt: 1392 Z-score: 1694.2 bits: 321.0 E(85289): 1.2e-87 Smith-Waterman score: 1392; 100.0% identity (100.0% similar) in 205 aa overlap (106-310:1-205) 80 90 100 110 120 130 pF1KE5 HIVVLRHTGNGATCLTHCDGTDTKAEVPLIMNSIKSFSDHAQCGRLEVHLVGGFSDDRQL :::::::::::::::::::::::::::::: NP_001 MNSIKSFSDHAQCGRLEVHLVGGFSDDRQL 10 20 30 140 150 160 170 180 190 pF1KE5 SQKLTHQLLSEFDRQEDDIHLVTLCVTELNDREENENHFPVIYGIAVNIKTAEIYRASFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SQKLTHQLLSEFDRQEDDIHLVTLCVTELNDREENENHFPVIYGIAVNIKTAEIYRASFQ 40 50 60 70 80 90 200 210 220 230 240 250 pF1KE5 DRGPEEQLRAARTLAGGPMISIYDAETEQLRIGPYSWTPFPHVDFWLHQDDKQILENLST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DRGPEEQLRAARTLAGGPMISIYDAETEQLRIGPYSWTPFPHVDFWLHQDDKQILENLST 100 110 120 130 140 150 260 270 280 290 300 310 pF1KE5 SPLAEPPHFVEHIRSTLMFLKKHPSPAHTLFSGNKALLYKKNEDGLWEKISSPGS ::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SPLAEPPHFVEHIRSTLMFLKKHPSPAHTLFSGNKALLYKKNEDGLWEKISSPGS 160 170 180 190 200 >>NP_001257695 (OMIM: 615367) protein N-terminal asparag (205 aa) initn: 1392 init1: 1392 opt: 1392 Z-score: 1694.2 bits: 321.0 E(85289): 1.2e-87 Smith-Waterman score: 1392; 100.0% identity (100.0% similar) in 205 aa overlap (106-310:1-205) 80 90 100 110 120 130 pF1KE5 HIVVLRHTGNGATCLTHCDGTDTKAEVPLIMNSIKSFSDHAQCGRLEVHLVGGFSDDRQL :::::::::::::::::::::::::::::: NP_001 MNSIKSFSDHAQCGRLEVHLVGGFSDDRQL 10 20 30 140 150 160 170 180 190 pF1KE5 SQKLTHQLLSEFDRQEDDIHLVTLCVTELNDREENENHFPVIYGIAVNIKTAEIYRASFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SQKLTHQLLSEFDRQEDDIHLVTLCVTELNDREENENHFPVIYGIAVNIKTAEIYRASFQ 40 50 60 70 80 90 200 210 220 230 240 250 pF1KE5 DRGPEEQLRAARTLAGGPMISIYDAETEQLRIGPYSWTPFPHVDFWLHQDDKQILENLST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DRGPEEQLRAARTLAGGPMISIYDAETEQLRIGPYSWTPFPHVDFWLHQDDKQILENLST 100 110 120 130 140 150 260 270 280 290 300 310 pF1KE5 SPLAEPPHFVEHIRSTLMFLKKHPSPAHTLFSGNKALLYKKNEDGLWEKISSPGS ::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SPLAEPPHFVEHIRSTLMFLKKHPSPAHTLFSGNKALLYKKNEDGLWEKISSPGS 160 170 180 190 200 >>XP_011520657 (OMIM: 615367) PREDICTED: protein N-termi (227 aa) initn: 1301 init1: 1301 opt: 1301 Z-score: 1583.2 bits: 300.6 E(85289): 1.9e-81 Smith-Waterman score: 1301; 100.0% identity (100.0% similar) in 192 aa overlap (119-310:36-227) 90 100 110 120 130 140 pF1KE5 CLTHCDGTDTKAEVPLIMNSIKSFSDHAQCGRLEVHLVGGFSDDRQLSQKLTHQLLSEFD :::::::::::::::::::::::::::::: XP_011 QELGGGRRHRPAVKTTVHTPVPSPCSLWRPGRLEVHLVGGFSDDRQLSQKLTHQLLSEFD 10 20 30 40 50 60 150 160 170 180 190 200 pF1KE5 RQEDDIHLVTLCVTELNDREENENHFPVIYGIAVNIKTAEIYRASFQDRGPEEQLRAART :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 RQEDDIHLVTLCVTELNDREENENHFPVIYGIAVNIKTAEIYRASFQDRGPEEQLRAART 70 80 90 100 110 120 210 220 230 240 250 260 pF1KE5 LAGGPMISIYDAETEQLRIGPYSWTPFPHVDFWLHQDDKQILENLSTSPLAEPPHFVEHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LAGGPMISIYDAETEQLRIGPYSWTPFPHVDFWLHQDDKQILENLSTSPLAEPPHFVEHI 130 140 150 160 170 180 270 280 290 300 310 pF1KE5 RSTLMFLKKHPSPAHTLFSGNKALLYKKNEDGLWEKISSPGS :::::::::::::::::::::::::::::::::::::::::: XP_011 RSTLMFLKKHPSPAHTLFSGNKALLYKKNEDGLWEKISSPGS 190 200 210 220 310 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 00:30:22 2016 done: Tue Nov 8 00:30:23 2016 Total Scan time: 7.110 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]