FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4317, 323 aa 1>>>pF1KE4317 323 - 323 aa - 323 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1715+/-0.000321; mu= 17.0135+/- 0.020 mean_var=76.6285+/-14.494, 0's: 0 Z-trim(117.3): 51 B-trim: 47 in 1/53 Lambda= 0.146514 statistics sampled from 29068 (29119) to 29068 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.718), E-opt: 0.2 (0.341), width: 16 Scan time: 7.830 The best scores are: opt bits E(85289) NP_002344 (OMIM: 137290,204870) tumor-associated c ( 323) 2176 469.1 5.5e-132 NP_002345 (OMIM: 185535,613217,613244) epithelial ( 314) 1013 223.3 5.4e-58 XP_011542497 (OMIM: 131390) PREDICTED: nidogen-1 i (1205) 159 43.2 0.0033 NP_002499 (OMIM: 131390) nidogen-1 precursor [Homo (1247) 159 43.2 0.0033 >>NP_002344 (OMIM: 137290,204870) tumor-associated calci (323 aa) initn: 2176 init1: 2176 opt: 2176 Z-score: 2490.6 bits: 469.1 E(85289): 5.5e-132 Smith-Waterman score: 2176; 100.0% identity (100.0% similar) in 323 aa overlap (1-323:1-323) 10 20 30 40 50 60 pF1KE4 MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVCSPDGPGGRCQCRALGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVCSPDGPGGRCQCRALGS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 GMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDGLYDPDCDPEGRFKARQCN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 GMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDGLYDPDCDPEGRFKARQCN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 QTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRHRPTAGAFNHSDLDAELRRLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 QTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRHRPTAGAFNHSDLDAELRRLF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 RERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGDVDIGDAAYYFERDIKGESLFQGRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 RERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGDVDIGDAAYYFERDIKGESLFQGRG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 GLDLRVRGEPLQVERTLIYYLDEIPPKFSMKRLTAGLIAVIVVVVVALVAGMAVLVITNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 GLDLRVRGEPLQVERTLIYYLDEIPPKFSMKRLTAGLIAVIVVVVVALVAGMAVLVITNR 250 260 270 280 290 300 310 320 pF1KE4 RKSGKYKKVEIKELGELRKEPSL ::::::::::::::::::::::: NP_002 RKSGKYKKVEIKELGELRKEPSL 310 320 >>NP_002345 (OMIM: 185535,613217,613244) epithelial cell (314 aa) initn: 923 init1: 416 opt: 1013 Z-score: 1162.2 bits: 223.3 E(85289): 5.4e-58 Smith-Waterman score: 1013; 49.7% identity (78.8% similar) in 316 aa overlap (7-320:1-311) 10 20 30 40 50 60 pF1KE4 MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVCSPDGPGGRCQCRALGS .::: . : . ::. ::.. .:::..:.: . :..: . . .::: ..:. NP_002 MAPPQV-LAFGLLLAAATATFAAAQEECVCENYKLAVNCFVNNNRQCQCTSVGA 10 20 30 40 50 70 80 90 100 110 120 pF1KE4 GMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDGLYDPDCDPEGRFKARQCN .: :: :..:::..::.:.. : .: ..: : :: .:::::::::: : :::.::: NP_002 QNTVICSKLAAKCLVMKAEMNGSKLGRR-AKP-EGALQNNDGLYDPDCDESGLFKAKQCN 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE4 QTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRHRPTAGAFNHSDLDAELRRLF ::.:::::..::::::: : . :.: :::. :.:.:.:. .. ..: . :.. . NP_002 GTSMCWCVNTAGVRRTDK-DTEITCSERVRTYWIIIELKHKAREKPYDSKSLRTALQKEI 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE4 RERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGDVDIGDAAYYFERDIKGESLFQGRG ::.: :::.... ::. .: :.: ::.:::. .::::.:.:::::.:.::::::... NP_002 TTRYQLDPKFITSILYENNVITIDLVQNSSQKTQNDVDIADVAYYFEKDVKGESLFHSKK 180 190 200 210 220 230 250 260 270 280 290 pF1KE4 GLDLRVRGEPLQVE--RTLIYYLDEIPPKFSMKRLTAGLIAVIVVVVVALVAGMAVLVIT .:: : :: :... .:::::.:: :.:::. : ::.::::::::.:.:::..::::. NP_002 -MDLTVNGEQLDLDPGQTLIYYVDEKAPEFSMQGLKAGVIAVIVVVVIAVVAGIVVLVIS 240 250 260 270 280 300 310 320 pF1KE4 NRRKSGKYKKVEIKELGELRKEPSL ... .::.:.::::.::...: NP_002 RKKRMAKYEKAEIKEMGEMHRELNA 290 300 310 >>XP_011542497 (OMIM: 131390) PREDICTED: nidogen-1 isofo (1205 aa) initn: 103 init1: 77 opt: 159 Z-score: 178.7 bits: 43.2 E(85289): 0.0033 Smith-Waterman score: 159; 28.1% identity (49.1% similar) in 114 aa overlap (29-132:757-861) 10 20 30 40 50 pF1KE4 MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVCSPDG-----PGG-R : :: : .. : ::. ::. XP_011 CDIPQRAQCIYTGGSSYTCSCLPGFSGDGQACQDVDECQPSR---CHPDAFCYNTPGSFT 730 740 750 760 770 780 60 70 80 90 100 pF1KE4 CQCRALGSGMAVDC---STLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDGLYDPDCD :::. .: . : . ..: . .. . .: :: ::. :.:: XP_011 CQCKPGYQGDGFRCVPGEVEKTRCQHEREHILGAAGATDPQRPIP------PGLFVPECD 790 800 810 820 830 110 120 130 140 150 160 pF1KE4 PEGRFKARQCN-QTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRHRPTAGAFN .:.. ::. .:. ::::. : XP_011 AHGHYAPTQCHGSTGYCWCVDRDGREVEGTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIP 840 850 860 870 880 890 >>NP_002499 (OMIM: 131390) nidogen-1 precursor [Homo sap (1247 aa) initn: 103 init1: 77 opt: 159 Z-score: 178.5 bits: 43.2 E(85289): 0.0033 Smith-Waterman score: 159; 28.1% identity (49.1% similar) in 114 aa overlap (29-132:799-903) 10 20 30 40 50 pF1KE4 MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVCSPDG-----PGG-R : :: : .. : ::. ::. NP_002 CDIPQRAQCIYTGGSSYTCSCLPGFSGDGQACQDVDECQPSR---CHPDAFCYNTPGSFT 770 780 790 800 810 820 60 70 80 90 100 pF1KE4 CQCRALGSGMAVDC---STLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDGLYDPDCD :::. .: . : . ..: . .. . .: :: ::. :.:: NP_002 CQCKPGYQGDGFRCVPGEVEKTRCQHEREHILGAAGATDPQRPIP------PGLFVPECD 830 840 850 860 870 110 120 130 140 150 160 pF1KE4 PEGRFKARQCN-QTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRHRPTAGAFN .:.. ::. .:. ::::. : NP_002 AHGHYAPTQCHGSTGYCWCVDRDGREVEGTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIP 880 890 900 910 920 930 323 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 23:13:01 2016 done: Sat Nov 5 23:13:02 2016 Total Scan time: 7.830 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]