FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4542, 626 aa 1>>>pF1KE4542 626 - 626 aa - 626 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 13.3819+/-0.000496; mu= -20.3645+/- 0.030 mean_var=468.2960+/-100.311, 0's: 0 Z-trim(120.3): 76 B-trim: 1597 in 1/60 Lambda= 0.059267 statistics sampled from 35234 (35320) to 35234 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.737), E-opt: 0.2 (0.414), width: 16 Scan time: 12.920 The best scores are: opt bits E(85289) NP_000164 (OMIM: 153670,177820,231200,258660,60667 ( 652) 2929 265.5 4.1e-70 NP_443204 (OMIM: 611289) leucine-rich alpha-2-glyc ( 347) 370 46.5 0.00018 NP_001073982 (OMIM: 603104) carboxypeptidase N sub ( 545) 341 44.2 0.0015 NP_001278917 (OMIM: 603104) carboxypeptidase N sub ( 545) 341 44.2 0.0015 XP_005269337 (OMIM: 603104) PREDICTED: carboxypept ( 545) 341 44.2 0.0015 NP_004479 (OMIM: 173511) platelet glycoprotein V p ( 560) 330 43.2 0.0029 >>NP_000164 (OMIM: 153670,177820,231200,258660,606672) p (652 aa) initn: 2890 init1: 2890 opt: 2929 Z-score: 1379.7 bits: 265.5 E(85289): 4.1e-70 Smith-Waterman score: 4136; 95.9% identity (96.0% similar) in 652 aa overlap (1-626:1-652) 10 20 30 40 50 60 pF1KE4 MPLLLLLLLLPSPLHPHPICEVSKVASHLEVNCDKRNLTALPPDLPKDTTILHLSENLLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 MPLLLLLLLLPSPLHPHPICEVSKVASHLEVNCDKRNLTALPPDLPKDTTILHLSENLLY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSHNQLQSLPLLGQTLPALTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 TFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSHNQLQSLPLLGQTLPALTV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEKLSLANNNLTELP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 LDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEKLSLANNNLTELP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 AGLLNGLENLDTLLLQENSLYTIPKGFFGSHLLPFAFLHGNPWLCNCEILYFRRWLQDNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 AGLLNGLENLDTLLLQENSLYTIPKGFFGSHLLPFAFLHGNPWLCNCEILYFRRWLQDNA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCPTLGDEGDTDLYDYYPEEDT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 ENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCPTLGDEGDTDLYDYYPEEDT 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 EGDKVRATRTVVKFPTKAHTTPWGLFYSWSTASLDSQMPSSLHPTQESTKEQTTFPPRWT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 EGDKVRATRTVVKFPTKAHTTPWGLFYSWSTASLDSQMPSSLHPTQESTKEQTTFPPRWT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 PNFTLHMESITFSKTPKSTTEPTPSPTTSEPVPEPAPNMTTLEPTPSPTTPEPTSEPAPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 PNFTLHMESITFSKTPKSTTEPTPSPTTSEPVPEPAPNMTTLEPTPSPTTPEPTSEPAPS 370 380 390 400 410 420 430 440 450 pF1KE4 PTTPEPT--------------------------PIPTIATSPTILVSATSLITPKSTFLT ::::::: ::::::::::::::::::::::::::: NP_000 PTTPEPTSEPAPSPTTPEPTSEPAPSPTTPEPTPIPTIATSPTILVSATSLITPKSTFLT 430 440 450 460 470 480 460 470 480 490 500 510 pF1KE4 TTKPVSLLESTKKTIPELDQPPKLRGVLQGHLESSRNDPFLHPDFCCLLPLGFYVLGLFW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 TTKPVSLLESTKKTIPELDQPPKLRGVLQGHLESSRNDPFLHPDFCCLLPLGFYVLGLFW 490 500 510 520 530 540 520 530 540 550 560 570 pF1KE4 LLFASVVLILLLSWVGHVKPQALDSGQGAALTTATQTTHLELQRGRQVTVPRAWLLFLRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 LLFASVVLILLLSWVGHVKPQALDSGQGAALTTATQTTHLELQRGRQVTVPRAWLLFLRG 550 560 570 580 590 600 580 590 600 610 620 pF1KE4 SLPTFRSSLFLWVRPNGRLGPLVAGRRPSALSQGRGQDLLSTVSIRYSGHSL ::::::::::::::::::.::::::::::::::::::::::::::::::::: NP_000 SLPTFRSSLFLWVRPNGRVGPLVAGRRPSALSQGRGQDLLSTVSIRYSGHSL 610 620 630 640 650 >>NP_443204 (OMIM: 611289) leucine-rich alpha-2-glycopro (347 aa) initn: 499 init1: 229 opt: 370 Z-score: 201.1 bits: 46.5 E(85289): 0.00018 Smith-Waterman score: 370; 34.8% identity (66.1% similar) in 233 aa overlap (26-244:91-322) 10 20 30 40 50 pF1KE4 MPLLLLLLLLPSPLHPHPICEVSKVASHL-EVNCDKRNLTALPPDLPKDTT---I ::.: :.. .. .: .: :.. . . . NP_443 EIPGYLPADTVHLAVEFFNLTHLPANLLQGASKLQELHLSSNGLESLSPEFLRPVPQLRV 70 80 90 100 110 120 60 70 80 90 100 pF1KE4 LHLSENLLYTFSLATLMPYTRLTQLNLDRCELTKLQVDGT--LPVLGTLDLSHNQLQSLP : :..: : . . .. . : : : . .: :.:. : .:: :::: :.:..:: NP_443 LDLTRNALTGLPPGLFQASATLDTLVLKENQLEVLEVSWLHGLKALGHLDLSGNRLRKLP 130 140 150 160 170 180 110 120 130 140 150 160 pF1KE4 --LLGQTLPALTVLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLE ::.. . : .::.. :.: .:: ::: .:..:.:.::.:..: :: : : :. NP_443 PGLLAN-FTLLRTLDLGENQLETLPPDLLRGPLQLERLHLEGNKLQVLGKDLLLPQPDLR 190 200 210 220 230 170 180 190 200 210 220 pF1KE4 KLSLANNNLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGSHLLPF-----AF-LHGN : : .:.:... :: ..::..:: : :..::: ..:.:...: : .: . :: NP_443 YLFLNGNKLARVAAGAFQGLRQLDMLDLSNNSLASVPEGLWASLGQPNWDMRDGFDISGN 240 250 260 270 280 290 230 240 250 260 270 280 pF1KE4 PWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCP ::.:. .. . :::: . .... NP_443 PWICDQNLSDLYRWLQAQKDKMFSQNDTRCAGPEAVKGQTLLAVAKSQ 300 310 320 330 340 >>NP_001073982 (OMIM: 603104) carboxypeptidase N subunit (545 aa) initn: 586 init1: 227 opt: 341 Z-score: 184.9 bits: 44.2 E(85289): 0.0015 Smith-Waterman score: 341; 32.7% identity (61.7% similar) in 248 aa overlap (7-240:12-252) 10 20 30 40 pF1KE4 MPLLLLLLLLPSPLHPHPI---CEVSKVASHLEVNCDKRNLTALPPDLP---KDT :::: : .: :. : :. :: :. ..:...: :.: :. NP_001 MLPGAWLLWTSLLLLARPAQPCPMGCDCFVQ------EVFCSDEELATVPLDIPPYTKNI 10 20 30 40 50 50 60 70 80 90 100 pF1KE4 TILHLSENLLYTFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSHNQLQSLP ... : . : : .... :... :: . :.. . .. : :: : :... ... .: NP_001 IFVETSFTTLETRAFGSNPNLTKVVFLNTQLCQF-RPDAFGGLPRLEDLEVTGSSFLNLS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE4 L-LGQTLPALTVLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEK . ..: .: : ..:: : .:: : .. :. :. :.:.::.:..:: :. : .:. NP_001 TNIFSNLTSLGKLTLNFNMLEALPEGLFQHLAALESLHLQGNQLQALPRRLFQPLTHLKT 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 LSLANNNLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGS-HLLPFAFLHGN------ :.::.: :..:: :.. : .:.:: :..:.: .:.: ::. : :: .: NP_001 LNLAQNLLAQLPEELFHPLTSLQTLKLSNNALSGLPQGVFGKLGSLQELFLDSNNISELP 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE4 PWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCP : . . . : ::: :: NP_001 PQVFSQLFCLERLWLQRNAITHLPLSIFASLGNLTFLSLQWNMLRVLPAGLFAHTPCLVG 240 250 260 270 280 290 >>NP_001278917 (OMIM: 603104) carboxypeptidase N subunit (545 aa) initn: 586 init1: 227 opt: 341 Z-score: 184.9 bits: 44.2 E(85289): 0.0015 Smith-Waterman score: 341; 32.7% identity (61.7% similar) in 248 aa overlap (7-240:12-252) 10 20 30 40 pF1KE4 MPLLLLLLLLPSPLHPHPI---CEVSKVASHLEVNCDKRNLTALPPDLP---KDT :::: : .: :. : :. :: :. ..:...: :.: :. NP_001 MLPGAWLLWTSLLLLARPAQPCPMGCDCFVQ------EVFCSDEELATVPLDIPPYTKNI 10 20 30 40 50 50 60 70 80 90 100 pF1KE4 TILHLSENLLYTFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSHNQLQSLP ... : . : : .... :... :: . :.. . .. : :: : :... ... .: NP_001 IFVETSFTTLETRAFGSNPNLTKVVFLNTQLCQF-RPDAFGGLPRLEDLEVTGSSFLNLS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE4 L-LGQTLPALTVLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEK . ..: .: : ..:: : .:: : .. :. :. :.:.::.:..:: :. : .:. NP_001 TNIFSNLTSLGKLTLNFNMLEALPEGLFQHLAALESLHLQGNQLQALPRRLFQPLTHLKT 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 LSLANNNLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGS-HLLPFAFLHGN------ :.::.: :..:: :.. : .:.:: :..:.: .:.: ::. : :: .: NP_001 LNLAQNLLAQLPEELFHPLTSLQTLKLSNNALSGLPQGVFGKLGSLQELFLDSNNISELP 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE4 PWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCP : . . . : ::: :: NP_001 PQVFSQLFCLERLWLQRNAITHLPLSIFASLGNLTFLSLQWNMLRVLPAGLFAHTPCLVG 240 250 260 270 280 290 >>XP_005269337 (OMIM: 603104) PREDICTED: carboxypeptidas (545 aa) initn: 586 init1: 227 opt: 341 Z-score: 184.9 bits: 44.2 E(85289): 0.0015 Smith-Waterman score: 341; 32.7% identity (61.7% similar) in 248 aa overlap (7-240:12-252) 10 20 30 40 pF1KE4 MPLLLLLLLLPSPLHPHPI---CEVSKVASHLEVNCDKRNLTALPPDLP---KDT :::: : .: :. : :. :: :. ..:...: :.: :. XP_005 MLPGAWLLWTSLLLLARPAQPCPMGCDCFVQ------EVFCSDEELATVPLDIPPYTKNI 10 20 30 40 50 50 60 70 80 90 100 pF1KE4 TILHLSENLLYTFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSHNQLQSLP ... : . : : .... :... :: . :.. . .. : :: : :... ... .: XP_005 IFVETSFTTLETRAFGSNPNLTKVVFLNTQLCQF-RPDAFGGLPRLEDLEVTGSSFLNLS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE4 L-LGQTLPALTVLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEK . ..: .: : ..:: : .:: : .. :. :. :.:.::.:..:: :. : .:. XP_005 TNIFSNLTSLGKLTLNFNMLEALPEGLFQHLAALESLHLQGNQLQALPRRLFQPLTHLKT 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 LSLANNNLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGS-HLLPFAFLHGN------ :.::.: :..:: :.. : .:.:: :..:.: .:.: ::. : :: .: XP_005 LNLAQNLLAQLPEELFHPLTSLQTLKLSNNALSGLPQGVFGKLGSLQELFLDSNNISELP 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE4 PWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCP : . . . : ::: :: XP_005 PQVFSQLFCLERLWLQRNAITHLPLSIFASLGNLTFLSLQWNMLRVLPAGLFAHTPCLVG 240 250 260 270 280 290 >>NP_004479 (OMIM: 173511) platelet glycoprotein V precu (560 aa) initn: 412 init1: 227 opt: 330 Z-score: 179.6 bits: 43.2 E(85289): 0.0029 Smith-Waterman score: 330; 34.4% identity (61.5% similar) in 218 aa overlap (30-239:222-439) 10 20 30 40 50 pF1KE4 MPLLLLLLLLPSPLHPHPICEVSKVASHLEVNCDKRNLTALPP---DLPKDTTILHLSE :.. . .. .. : : . . : ::. NP_004 AQAKLERLLLHSNRLVSLDSGLLNSLGALTELQFHRNHIRSIAPGAFDRLPNLSSLTLSR 200 210 220 230 240 250 60 70 80 90 100 110 pF1KE4 NLLYTFSLATLMPYTRLTQLNLDRCELTKLQ--VDGTLPVLGTLDLSHNQLQSLPLLG-Q : : . : .. :: :.: . :..: . : . : : :...::..:: . . NP_004 NHLAFLPSALFLHSHNLTLLTLFENPLAELPGVLFGEMGGLQELWLNRTQLRTLPAAAFR 260 270 280 290 300 310 120 130 140 150 160 170 pF1KE4 TLPALTVLDVSFN-RLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEKLSLA .: : : :... ::..:: ::..:::::: : :..: : .:: ::: ::...:: NP_004 NLSRLRYLGVTLSPRLSALPQGAFQGLGELQVLALHSNGLTALPDGLLRGLGKLRQVSLR 320 330 340 350 360 370 180 190 200 210 220 230 pF1KE4 NNNLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGS-HLLPFAFLHGNPWLCNCEILY : : :: .:. .: .:... :..:.: :.: ::. : ..: : : :.: . NP_004 RNRLRALPRALFRNLSSLESVQLDHNQLETLPGDVFGALPRLTEVLLGHNSWRCDCGLGP 380 390 400 410 420 430 240 250 260 270 280 290 pF1KE4 FRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCPTLGDEGDTDL : ::... NP_004 FLGWLRQHLGLVGGEEPPRCAGPGAHAGLPLWALPGGDAECPGPRGPPPRPAADSSSEAP 440 450 460 470 480 490 626 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:07:10 2016 done: Sun Nov 6 00:07:12 2016 Total Scan time: 12.920 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]