GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:19:51

Sequence gi568815583f:72039624_72240259 : 200636 bp : 44.33% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   1570   1565    6                               1.05
 1.02 Term -   2114   1662  453  2  0   22   36   348 0.729  18.16
 1.01 Init -   6940   6101  840  1  0   59   84   436 0.821  34.99
 1.00 Prom -   7566   7527   40                              -8.96

 2.00 Prom +   8021   8060   40                             -10.35
 2.01 Init +   9914   9980   67  1  1   86   82   116 0.990  12.03
 2.02 Intr +  14610  14715  106  1  1   54  108    73 0.357   5.17
 2.03 Term +  63557  64076  520  2  1    3   43   912 0.632  71.97
 2.04 PlyA +  64687  64692    6                               1.05

 3.00 Prom +  72900  72939   40                              -5.06
 3.01 Init +  88328  88373   46  1  1   92   81     1 0.264   0.68
 3.02 Term +  99954 100639  686  1  2   77   37   339 0.410  21.50
 3.03 PlyA + 103676 103681    6                               1.05

 4.18 PlyA - 105789 105784    6                               1.05
 4.17 Term - 109355 109336   20  0  2  143   49   -12 0.095  -1.12
 4.16 Intr - 119917 119835   83  0  2   55   64    64 0.176  -0.02
 4.15 Intr - 121454 121293  162  1  0   82   39    79 0.540   1.49
 4.14 Intr - 121761 121633  129  2  0   15   97    83 0.697   1.71
 4.13 Intr - 122754 122650  105  2  0   75   94    87 0.989   7.33
 4.12 Intr - 123770 123643  128  2  2   39   87   118 0.748   6.28
 4.11 Intr - 124134 124102   33  0  0   75   91    26 0.396   0.02
 4.10 Intr - 125787 125731   57  0  0  117   85    32 0.806   4.88
 4.09 Intr - 127469 127371   99  2  0   71   86   102 0.979   8.61
 4.08 Intr - 128216 128080  137  0  2   48   48   128 0.640   5.09
 4.07 Intr - 128943 128868   76  0  1   64  105    71 0.687   5.49
 4.06 Intr - 129498 129316  183  0  0   83   98     7 0.194   1.18
 4.05 Intr - 133292 133102  191  0  2  108   80    36 0.756   4.10
 4.04 Intr - 136204 136086  119  0  2   66  103    84 0.901   7.81
 4.03 Intr - 140182 140022  161  1  2   55   42    79 0.768  -1.21
 4.02 Intr - 141178 140989  190  0  1  108   90    60 0.816   7.79
 4.01 Init - 148200 148193    8  0  2   51   92    11 0.313  -2.48
 4.00 Prom - 152454 152415   40                              -0.46

 5.15 PlyA - 154786 154781    6                               1.05
 5.14 Term - 160133 160027  107  0  2  137   45   154 0.999  14.57
 5.13 Intr - 161032 160851  182  0  2  100   84   216 0.997  21.91
 5.12 Intr - 161457 161305  153  2  0  100   98    22 0.918   3.59
 5.11 Intr - 162997 162831  167  1  2   82   91   189 0.998  17.36
 5.10 Intr - 163336 163264   73  0  1   81   30     6 0.611  -6.49
 5.09 Intr - 163565 163399  167  2  2   91   91   201 0.959  19.46
 5.08 Intr - 167257 167105  153  1  0  102  105   196 0.944  22.97
 5.07 Intr - 169268 169006  263  0  2  120  -19   432 0.514  32.91
 5.06 Intr - 170236 170050  187  1  1  117  102   365 0.999  40.06
 5.05 Intr - 170855 170724  132  2  0   95   69   220 0.999  21.64
 5.04 Intr - 177877 177786   92  2  2   74   83    79 0.986   5.71
 5.03 Intr - 179487 179321  167  2  2   88   97   246 0.915  25.10
 5.02 Intr - 181661 181545  117  1  0   78   86    62 0.650   4.58
 5.01 Init - 189003 188954   50  0  2   84   92    29 0.261   3.32

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  43675  43460  216  1  0   58   47   133 0.818   3.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:72039624_72240259|GENSCAN_predicted_peptide_1|430_aa
MNINDGGRRRFEDNEHTLRIYPGAISEGTIYCPIPARKNSTAAEVIESLINKLHLDKTKC
YVLAEVKEFGGEEWILNPTDCPVQRMMLWPRMALENRLSGEDYRFLLREKNLDGSIHYGS
LQSWLRVTEERRRMMERGFLPQPQQKDFDDLCSLPDLNEKTLLENLRNRFKHEKIYTYVG
SILIVINPFKFLPIYNPKYVKMYDNHQLGKLEPHIYAVADVAYHAMLQRKKNQCIVISGE
SGSGKTQSTNFLIHHLTALSQKGFASGVEQIILGAGPVLEGTCAGTQVKGALCLLKVPTE
KLPSPQLTMVYEIDLTTGDAGASSTHPMQCSTLRKSDFMMFKGQPCKIVKMLTSKTGKLG
HAKVHLVGIAVFMGKKEEEDICPSVHNVDVPNIQRHDYQLICIQDSCLSLPTEADEVPED
LKLPESELGK
>gi568815583f:72039624_72240259|GENSCAN_predicted_CDS_1|1293_bp
atgaatataaatgatggaggaagacgacgctttgaagataatgaacatacattacggata
tatcctggggctatttcagaagggacaatctactgtccgattcctgccagaaaaaactcc
acagctgctgaggtgattgagtctcttataaacaaacttcatcttgacaaaacaaaatgt
tatgttctagcagaggtaaaggaatttggtggagaagaatggattctcaatccaacagat
tgtccagttcagcgaatgatgctgtggccccgaatggctctggaaaatcgcttaagtgga
gaggactaccgcttccttctgagagagaaaaaccttgatggatcaatccattatggtagc
ctgcagtcatggctacgggtaacagaagaacgtcgcaggatgatggaacggggttttctt
ccacagcctcaacagaaagactttgatgatttatgtagtttacctgatttgaatgagaaa
actctcttagaaaacctacgaaatcgctttaagcatgaaaaaatttatacctatgttggc
agtattctaatagttattaacccattcaagtttcttcctatttataaccccaaatatgtc
aaaatgtatgataaccaccaactgggaaaacttgagccccacatttatgctgtggctgat
gtagcttatcatgccatgcttcagcgcaaaaagaatcagtgcatcgtgatttcaggagag
agtggttctgggaagactcaaagcacaaactttcttattcaccaccttactgctctcagt
cagaaaggatttgccagtggagtagaacagattattcttggagctggaccagtacttgag
gggacctgtgcaggtacgcaggtgaaaggagctctttgcctgctgaaagttcccacggaa
aaactaccatctccccagctcaccatggtgtacgaaattgatttaactactggagatgct
ggggcttccagcactcaccctatgcagtgctcaaccttgcggaaaagtgacttcatgatg
ttcaaaggacaaccatgcaaaatagtgaagatgttaacttccaaaactggaaaacttggt
catgccaaggttcaccttgttggaattgctgttttcatgggaaaaaaagaagaagaagat
atttgtccttctgttcacaacgtggatgttccaaatattcagagacatgattatcaactg
atatgcattcaagatagttgcctttccctgccgacagaagctgatgaagttcctgaggat
cttaaactgccagaaagtgaactaggcaaataa
>gi568815583f:72039624_72240259|GENSCAN_predicted_peptide_2|230_aa
MDQCQSMACQKLGRTAEDEQQASLGLRERSLDITGILEKKYKTEKDYKKLRRKPQSGKVR
LSLKKKKKQQQKQKQQQKQQQQKEKQRQQQKQQQAAAAEAEAAASSSSRSRSSSKQRQKQ
QQKQKQQQQKQQQAASSSRSSSSRSRSSSRRRSSSRSRSSAEAEAVAAEAAEAVEAEAAE
AVEAEAAEAVEAEAAEAVEAEEATEEAEEETEEAEVTEEAAAEAEEFHKY
>gi568815583f:72039624_72240259|GENSCAN_predicted_CDS_2|693_bp
atggaccagtgccagtccatggcctgtcagaaactgggccgcacagcagaagatgagcag
caggcaagcttaggcctgagagaaaggtctctggatatcactggtattcttgaaaagaag
tacaaaacagaaaaagattacaagaagttaaggaggaagccacagagtggcaaagtaaga
ctaagtctcaagaaaaaaaaaaagcagcagcagaagcagaagcagcagcagaagcagcag
cagcagaaggagaagcagaggcagcagcagaagcagcagcaagcagcagcagcagaagca
gaagcagcagcaagcagcagcagcagaagcagaagcagcagcaagcagagacagaagcag
cagcagaagcagaagcagcagcagcagaagcagcagcaagcagcaagcagcagcagaagc
agcagcagcagaagcagaagcagcagcagaagaagaagcagcagcagaagcagaagcagc
gcagaagcagaagcagtggcagcagaagcagcagaagcagtggaagcagaagcagcagaa
gcagtggaagcagaagcagcagaagcagtggaagcagaagcagcagaagcagtggaagca
gaagaagcaacagaagaagcagaagaagaaacagaagaagcagaagtaacagaagaagca
gcagcagaagcagaagaatttcacaaatactga
>gi568815583f:72039624_72240259|GENSCAN_predicted_peptide_3|243_aa
MVVCTYSAYSYLEGQALVQLLEFLSSPRQYKMDPVVLSYMDSLLRQSDVSLLDPPSWLND
HIIGFAFEYFANSQFHDCSDHVSFISPEVTQFIKCTSNPAEIAMFLEPLDLPNKRVVFLA
INDNSNQAAGGTHWSLLVYLQDKNSFFHYDSHSRSNSVHAKQVAEKLEAFLGRKGDKLAF
VEEKAPAQQNSYDCGMYVICNTEALCQNFFRQQTESLLQLLTPAYITKKRGEWKDLITTL
AKK
>gi568815583f:72039624_72240259|GENSCAN_predicted_CDS_3|732_bp
atggtggtatgcacgtatagtgcatatagctacttggaaggccaagctcttgttcagctt
ctggaatttctgagcagccctcgtcagtacaagatggaccccgtagtcttgagttacatg
gacagtctactgcggcaatcagatgtctcactattggatccgccaagctggctcaatgac
catattattgggtttgcgtttgagtactttgccaacagtcagtttcatgactgctctgat
cacgtcagtttcatcagccctgaagtcacccagttcatcaagtgcactagcaacccagca
gagattgccatgttccttgaaccactggacctccccaacaagagagttgtatttttagcc
atcaatgataactccaaccaggcagctggaggaacccactggagtttattggtctacctc
caagataaaaatagcttttttcattatgattcccatagcaggagcaactcagttcacgca
aagcaggtagcagagaaactggaggctttcttaggcagaaaaggagacaaactggccttt
gtggaagagaaagcccctgcccaacaaaacagctatgactgtgggatgtacgtgatatgt
aacactgaggccttgtgtcagaacttctttaggcaacagacagaatcactgctgcagcta
ctcacccctgcatacatcacaaagaagaggggagaatggaaagatctcattaccacactt
gctaaaaagtag
>gi568815583f:72039624_72240259|GENSCAN_predicted_peptide_4|626_aa
MARGVTEAQKGRVAYSKPLSSEKQFQACIPVFLSPTPGSFRTPRPFTLACGPEKSLMDPQ
SQSNSEVSFKDVMGQRLTQRSPVLVGDVLLNGQGYGCPQPHPRLSQDLLQILTVALTICS
KHWVLAVCWGYRDVRDTPLLELRVNWGTQMCKQTSTLWCGLSCERTETKRETYMEEIDGI
SHAVITHRHSRNTTDKRETTNWGKHHKNDKLRINVLNKEFTGQRRVLVSDLCLKLLAQGQ
GRIGTEDAWAGLGFLVPEMECLLCSLHWPEGLKGEEIKKCGREGITLNKYNQQYHKLFKD
VPLEEVVLKVCSCALQRDFLLQGRLYISPNWLCFHASLFGKDIKVVMSGEGCHDPVVIPV
VSVQMIKKHKMARLLPNGLAITTNTSQKPSSKKSLSVREFSGEPESLEVLIPEMKWRKKM
PNCSPTAKNAVYEEDELEEEPRSTGELRLWDYRLLKVFFVLICFLVMSSSYLAFRISRLE
QQLCSLSWDDPVPGHRALCRPLMRWLAALWGDAAHGGDGGSGCEEQRYLNSVSSAVTFRH
GQQQPVEGGLSGQLRVMVLSHHRAEVTCVTPIVIGLQAMEEGPPLHGAYSPLRIMPLIQE
LTYGVQSQKARVAFSMTQSLCLILFL
>gi568815583f:72039624_72240259|GENSCAN_predicted_CDS_4|1881_bp
atggccagaggagtcactgaggcccagaaaggaagagtggcttactcaaagcccctcagc
agtgagaagcagttccaggcctgcattcctgtcttcctgtccccaactccaggctctttc
aggaccccaagacccttcacacttgcctgtggtcctgagaagagcctgatggatccacaa
agccagagtaattctgaggtatccttcaaggatgtgatgggacagaggctaacacagcgc
tcaccagtgctagtcggggacgtcctgctcaatgggcagggttatgggtgtccccagccc
caccccagactgagtcaagacctcttgcagatcctcacagtagccctgacgatctgcagc
aaacactgggtgctggctgtatgctggggatacagagatgtacgggacacgcccttgctg
gagctcagagtcaactgggggacacagatgtgtaaacagacaagcacactctggtgtggc
ctgtcctgtgagaggactgagaccaagcgagaaacatacatggaagaaatagatggaatc
tcccatgcagtaattacacacaggcacagcagaaataccacagacaaaagagaaacgaca
aactggggaaaacatcacaaaaatgataaactaaggattaacgttctcaataaagaattt
acaggccagagaagggtcctagtctctgatctctgcctcaagcttctggctcaaggccag
ggcagaattggcactgaggatgcatgggcaggactggggttccttgttcccgaaatggaa
tgtctcctttgcagtctgcactggccagaaggcttgaagggtgaagagataaagaagtgt
ggccgagaagggataacactgaataaatacaaccagcaataccacaagctgtttaaggat
gttcccttggaggaagtggttctcaaagtgtgttcctgtgccctccagagggacttcctc
ctccagggccggctctacatctcccccaactggctctgcttccatgccagcctctttggc
aaggatatcaaggtagtaatgagtggtgagggctgccatgaccctgtggtcattcctgtg
gtgtctgtgcaaatgatcaaaaaacacaagatggcacggctccttcccaatggactggcc
atcaccaccaacaccagccagaagccttccagcaagaagagtctgagtgtaagagaattt
tcaggggaacctgagtctctggaagtcctcatccctgagatgaagtggagaaagaagatg
ccgaactgctctcccactgcaaagaatgctgtctatgaggaggatgagctggaggaggag
cccaggagcactggggagctgaggctctgggattaccggctcctcaaggtcttctttgtg
ctgatctgcttcctggtcatgtcctcatcctacctggcgttccgtatttctcggctagag
cagcagttatgctccttgagttgggatgacccagtccctgggcacagggctctgtgccgg
cccctcatgaggtggctggctgcattatggggagatgctgcccacggcggagatggtggc
agtggctgtgaagaacagaggtacctgaattctgtgtcctcagctgtcactttccggcat
ggccaacagcagcccgttgaggggggcttgtctggacagttgcgtgtgatggtgttgagc
catcacagggcagaggtcacttgtgtcacccccatagtcattgggttgcaggccatggag
gagggacctccactccatggggcctattctccattaaggattatgcctctgatccaggaa
ttaacatacggagttcagagccagaaagctcgtgtggccttttctatgacccagagccta
tgcctcattctctttctctag
>gi568815583f:72039624_72240259|GENSCAN_predicted_peptide_5|669_aa
MTQPFNYSAFERIIFAGSPGHTVFSSERSLLVRPRSHPEPKGEHYVTGSPTPENQRTSAA
MSKPHSEAGTAFIQTQQLHAAMADTFLEHMCRLDIDSPPITARNTGIICTIGPASRSVET
LKEMIKSGMNVARLNFSHGTHEYHAETIKNVRTATESFASDPILYRPVAVALDTKGPEIR
TGLIKGSGTAEVELKKGATLKITLDNAYMEKCDENILWLDYKNICKVVEVGSKIYVDDGL
ISLQVKQKGADFLVTEVENGGSLGSKKGVNLPGAAVDLPAVSEKDIQDLKFGVEQDVDMV
FASFIRKASDVHEVRKVLGEKGKNIKIISKIENHEGMLESMIKKPRPTRAEGSDVANAVL
DGADCIMLSGETAKGDYPLEAVRMQHLIAREAEAAMFHRKLFEELVRASSHSTDLMEAMA
MGSVEASYKCLAAALIVLTESGRRRQPGPTWHLGTEPLLVCRNTAREIAREAEAAIYHLQ
LFEELRRLAPITSDPTEATAVGAVEASFKCCSGAIIVLTKSGRCQESGASHVLLGLSLSL
AARGMGAGDIGPGLPLCVRVQGVGGVLPTMGCVGSAHQVARYRPRAPIIAVTRNPQTARQ
AHLYRGIFPVLCKDPVQEAWAEDVDLRVNFAMNVGKARGFFKKGDVVIVLTGWRPGSGFT
NTMRVVPVP
>gi568815583f:72039624_72240259|GENSCAN_predicted_CDS_5|2010_bp
atgactcaacctttcaattattctgcttttgagagaattatcttcgctggctctccaggg
cacaccgtattcagctctgagcggtctttgctagtgaggccaaggagccaccctgagcca
aaaggggagcattatgtcaccggaagcccaaccccagagaaccaaaggacctcagcagcc
atgtcgaagccccatagtgaagccgggactgccttcattcagacccagcagctgcacgca
gccatggctgacacattcctggagcacatgtgccgcctggacattgattcaccacccatc
acagcccggaacactggcatcatctgtaccattggcccagcttcccgatcagtggagacg
ttgaaggagatgattaagtctggaatgaatgtggctcgtctgaacttctctcatggaact
catgagtaccatgcggagaccatcaagaatgtgcgcacagccacggaaagctttgcttct
gaccccatcctctaccggcccgttgctgtggctctagacactaaaggacctgagatccga
actgggctcatcaagggcagcggcactgcagaggtggagctgaagaagggagccactctc
aaaatcacgctggataacgcctacatggaaaagtgtgacgagaacatcctgtggctggac
tacaagaacatctgcaaggtggtggaagtgggcagcaagatctacgtggatgatgggctt
atttctctccaggtgaagcagaaaggtgccgacttcctggtgacggaggtggaaaatggt
ggctccttgggcagcaagaagggtgtgaaccttcctggggctgctgtggacttgcctgct
gtgtcggagaaggacatccaggatctgaagtttggggtcgagcaggatgttgatatggtg
tttgcgtcattcatccgcaaggcatctgatgtccatgaagttaggaaggtcctgggagag
aagggaaagaacatcaagattatcagcaaaatcgagaatcatgaggggatgctggagagc
atgatcaagaagccccgccccactcgggctgaaggcagtgatgtggccaatgcagtcctg
gatggagccgactgcatcatgctgtctggagaaacagccaaaggggactatcctctggag
gctgtgcgcatgcagcacctgatagctcgtgaggctgaggcagccatgttccaccgcaag
ctgtttgaagaacttgtgcgagcctcaagtcactccacagacctcatggaagccatggcc
atgggcagcgtggaggcttcttataagtgtttagcagcagctttgatagttctgacggag
tctggcaggagaagacagccaggcccaacctggcatctgggcacagagcctcttctcgtc
tgtaggaacaccgccagggagattgcccgtgaggcagaggctgccatctaccacttgcaa
ttatttgaggaactccgccgcctggcgcccattaccagcgaccccacagaagccaccgcc
gtgggtgccgtggaggcctccttcaagtgctgcagtggggccataatcgtcctcaccaag
tctggcagatgtcaagagtcaggtgctagtcacgtgctgcttggcttgtcactgtcattg
gcagcgagaggaatgggtgctggtgacattgggccagggctgcctctctgtgtcagagtt
cagggtgtaggaggggttctgccaaccatgggctgtgtggggtctgctcaccaggtggcc
agataccgcccacgtgcccccatcattgctgtgacccggaatccccagacagctcgtcag
gcccacctgtaccgtggcatcttccctgtgctgtgcaaggacccagtccaggaggcctgg
gctgaggacgtggacctccgggtgaactttgccatgaatgttggcaaggcccgaggcttc
ttcaagaagggagatgtggtcattgtgctgaccggatggcgccctggctccggcttcacc
aacaccatgcgtgttgttcctgtgccgtga