GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:53:23 Sequence gi568815592r:73495080_73753886 : 258807 bp : 42.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2657 2817 161 0 2 34 107 132 0.755 8.49 1.02 Term + 5495 5656 162 1 0 61 33 180 0.997 6.75 1.03 PlyA + 5938 5943 6 1.05 2.13 PlyA - 6866 6861 6 1.05 2.12 Term - 22855 22731 125 2 2 84 42 184 0.999 10.97 2.11 Intr - 23185 22951 235 1 1 50 82 210 0.995 12.94 2.10 Intr - 23531 23275 257 0 2 53 92 423 0.999 35.44 2.09 Intr - 23769 23619 151 0 1 62 103 151 0.788 12.81 2.08 Intr - 24149 23853 297 2 0 59 111 388 0.887 34.35 2.07 Intr - 24437 24258 180 2 0 89 77 167 0.887 14.84 2.06 Intr - 25386 25283 104 1 2 -40 100 165 0.011 3.87 2.05 Intr - 25998 25921 78 1 0 21 111 76 0.009 1.80 2.04 Intr - 26792 26583 210 0 0 -11 38 264 0.007 9.46 2.03 Intr - 28795 28646 150 2 0 97 60 152 0.083 12.51 2.02 Intr - 34069 33985 85 1 1 79 77 60 0.394 2.47 2.01 Init - 49801 49733 69 1 0 71 80 43 0.050 2.90 2.00 Prom - 58444 58405 40 -6.35 3.00 Prom + 63207 63246 40 -7.25 3.01 Init + 68621 68672 52 1 1 62 90 30 0.677 -0.02 3.02 Intr + 74117 74339 223 0 1 92 41 185 0.711 10.46 3.03 Intr + 75275 75344 70 2 1 35 108 28 0.727 -2.23 3.04 Intr + 76616 76762 147 1 0 69 101 101 0.496 9.01 3.05 Intr + 77908 78051 144 0 0 12 -9 184 0.063 0.66 3.06 Intr + 80773 80912 140 0 2 71 93 87 0.233 5.84 3.07 Term + 98166 98358 193 0 1 41 42 154 0.080 2.01 3.08 PlyA + 98378 98383 6 1.05 4.10 PlyA - 98943 98938 6 1.05 4.09 Term - 99464 99336 129 2 0 77 50 130 0.662 5.30 4.08 Intr - 105362 105272 91 1 1 67 95 47 0.679 2.38 4.07 Intr - 115468 115321 148 2 1 62 69 104 0.291 4.27 4.06 Intr - 126883 126725 159 2 0 75 94 57 0.277 4.04 4.05 Intr - 143420 143333 88 2 1 73 94 -10 0.317 -3.28 4.04 Intr - 146845 146612 234 1 0 86 76 191 0.998 14.66 4.03 Intr - 149524 149328 197 2 2 82 90 87 0.873 6.51 4.02 Intr - 158837 158714 124 2 1 88 119 168 0.009 19.14 4.01 Init - 181404 181297 108 0 0 63 26 109 0.015 2.47 4.00 Prom - 186408 186369 40 -3.85 5.00 Prom + 186830 186869 40 -6.95 5.01 Init + 201137 201210 74 1 2 85 57 221 0.999 17.29 5.02 Intr + 202321 202493 173 1 2 41 116 155 0.985 12.26 5.03 Intr + 214994 215400 407 0 2 61 49 155 0.023 2.04 5.04 Intr + 215632 215812 181 0 1 55 -21 169 0.002 1.22 5.05 Intr + 228172 228200 29 2 2 126 89 7 0.012 1.52 5.06 Intr + 235265 235495 231 1 0 123 115 215 0.999 24.75 5.07 Intr + 238457 238591 135 1 0 76 36 114 0.461 4.84 5.08 Intr + 241304 241429 126 1 0 95 82 38 0.515 3.86 5.09 Term + 247525 247641 117 0 0 63 48 111 0.370 2.16 5.10 PlyA + 252783 252788 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 25591 25283 309 1 0 20 100 301 0.954 19.86 S.002 Init + 26118 26195 78 2 0 51 76 148 0.882 11.01 S.003 Term - 28795 28443 353 2 2 97 39 258 0.861 15.46 S.004 Init - 150252 150210 43 0 1 75 94 10 0.918 0.94 S.005 Term + 180585 180705 121 0 1 53 38 155 0.893 3.97 S.006 Term + 215632 215820 189 0 0 55 41 176 0.844 6.07 S.007 Init + 233058 233060 3 2 0 98 53 0 0.882 -2.45 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:73495080_73753886|GENSCAN_predicted_peptide_1|107_aa XTYESVLFHQLQEIKGVQQDEALQLPKDLDYLTIRDVSLSHEVREKLHFSRPQTIGAASR IPGVTPAAIINLLRFVKTTQRRQSAMNESSKTDQYLCDADRLQEREL >gi568815592r:73495080_73753886|GENSCAN_predicted_CDS_1|324_bp nccacttatgaatcagtgttgttccatcaactacaagaaataaagggagttcagcaagat gaagctctccaactgccaaaagacctagattatttgactatcagggatgtgtctttgtcc catgaagttcgagagaaactacattttagtcgtccacagacgatcggggctgctagtcgc atacccggagtaacacctgccgccatcatcaatctgctgagatttgtgaagaccactcaa cgaagacagtcggctatgaatgaatcatccaagactgatcaatacttatgtgatgcagac agacttcaagagagagagttatag >gi568815592r:73495080_73753886|GENSCAN_predicted_peptide_2|646_aa MKMREDEEKERRRGRRKKKGESQTRDLKLKQTKHLAQSHMFSKWWTRDLDPGPSFRSQKA EEKLPQGGASGRADPGDLTLRRLERRIHTTQLFAAGTRRPAAWTSGVPGGDDVLRASIKA AGFEWKEPRGQHRSGFVAFTAIEPNLSQVREPLRRPPVPTLRPKPLGVPGPGWGRTVYKC SSRRERSFSQRVCRQNTGWPGRHQLRERKDGRFPALLQGAQNGGRGARESGRMGKGSFKY AWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPGHRDFIKNMITGTSQADCAVLIV AAGVGEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEPPYSQKRYEEIVKEVST YIKKIGYNPDTVAFVPISGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEALDCIL PPTRPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVTFAPVNVTTEVKSVEMHH EALSEALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDPPMEAAGFTAQVIILNHPGQISAG YAPVLDCHTAHIACKFAELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFS DYPPLGRFAVRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK >gi568815592r:73495080_73753886|GENSCAN_predicted_CDS_2|1941_bp atgaagatgagagaagacgaggaaaaagaaagaaggagggggaggagaaagaagaaagga gaaagccagacacgggatctgaagcttaaacagaccaaacaccttgcccagagccacatg ttcagcaagtggtggacgcgagacttggatccaggcccttccttccgcagtcagaaggcg gaagaaaagttacctcaaggtggggcgagtggcagggcggaccctggcgacctgacgctg cggaggctcgagcggcgtatccataccacccagctgttcgccgcgggaacccgccggccc gcagcctggacatctggagtaccagggggagatgacgtgttacgggcttccataaaagca gctggctttgaatggaaggagccaagaggccagcacaggagcggattcgtcgctttcacg gccatcgagccgaacctctcgcaagtccgtgagccgttaaggaggcccccagtcccgacc cttcgccccaagcccctcggggtccccgggcctgggtgggggagaaccgtatataagtgc agtagtcgccgtgaacgttctttttcgcaacgggtttgccgccagaacacaggctggccc ggtcggcaccagttgcgtgagcggaaagatggccgcttcccggccctgctgcagggagct caaaatggaggacgcggcgctcgggagagcgggcggatgggaaagggctccttcaagtat gcctgggtcttggataaactgaaagctgagcgtgaacgtggtatcaccattgatatctcc ttgtggaaatttgagaccagcaagtactatgtgactatcattgatgccccaggacacaga gactttatcaaaaacatgattacagggacatctcaggctgactgtgctgtcctgattgtt gctgctggtgttggtgaatttgaagctggtatctccaagaatgggcagacccgagagcat gcccttctggcttacacactgggtgtgaaacaactaattgtcggtgttaacaaaatggat tccactgagccaccctacagccagaagagatatgaggaaattgttaaggaagtcagcact tacattaagaaaattggctacaaccccgacacagtagcatttgtgccaatttctggttgg aatggtgacaacatgctggagccaagtgctaacatgccttggttcaagggatggaaagtc acccgtaaggatggcaatgccagtggaaccacgctgcttgaggctctggactgcatccta ccaccaactcgtccaactgacaagcccttgcgcctgcctctccaggatgtctacaaaatt ggtggtattggtactgttcctgttggccgagtggagactggtgttctcaaacccggtatg gtggtcacctttgctccagtcaacgttacaacggaagtaaaatctgtcgaaatgcaccat gaagctttgagtgaagctcttcctggggacaatgtgggcttcaatgtcaagaatgtgtct gtcaaggatgttcgtcgtggcaacgttgctggtgacagcaaaaatgacccaccaatggaa gcagctggcttcactgctcaggtgattatcctgaaccatccaggccaaataagcgccggc tatgcccctgtattggattgccacacggctcacattgcatgcaagtttgctgagctgaag gaaaagattgatcgccgttctggtaaaaagctggaagatggccctaaattcttgaagtct ggtgatgctgccattgttgatatggttcctggcaagcccatgtgtgttgagagcttctca gactatccacctttgggtcgctttgctgttcgtgatatgagacagacagttgcggtgggt gtcatcaaagcagtggacaagaaggctgctggagctggcaaggtcaccaagtctgcccag aaagctcagaaggctaaatga >gi568815592r:73495080_73753886|GENSCAN_predicted_peptide_3|322_aa MLGAVAHASDPSTLGGRVFSGKTCGNRAVWEPNSRIWARDDEEAGFQLRQSFIIVKITTA ALILLQTPEAHSSRDSEGPRTVAWKPSCVWHTCQHCAGQVLEVCFCEPGTRGTEQVRGLT GSEVKLQTFTVSVTALKVAHLELFVPPGGLVILLASGVKLQTFACSAGLKDSSSAAKVGA QAEEAPRASEGCEDCQHAVTSQADIGSCEDPPDTVMIALTAQIVEHVCLNNMKSEHLEKE QDNSNVQGTREITLNSDRWRPKFLGFVVAEMCVEKTMARAGCRKDVKESLWTLGNITELS DPKINSQRFILMIYKRNIAIAI >gi568815592r:73495080_73753886|GENSCAN_predicted_CDS_3|969_bp atgctgggcgcagtggctcacgcctctgatcccagcactttgggaggccgagtgttctct ggaaaaacttgtgggaacagagcagtctgggaaccgaacagcaggatttgggcaagggat gatgaggaagcaggctttcaactgaggcagtcgttcatcattgtaaaaattactaccgct gcactgatcttgctccaaactccagaggcacatagctccagggactcagaagggccaaga accgtagcatggaagcccagctgtgtttggcacacttgtcagcattgtgcaggacaggta cttgaggtctgtttttgtgaaccagggacccgaggcacagagcaagttcgtggtctcact ggctcagaagtgaagctgcagaccttcacggtgagtgttacagctcttaaggtggcgcat ctggagttgttcgttcctcctggtgggctcgtgatcttgctggcttcaggagtgaagctg cagaccttcgcgtgcagcgctgggctgaaggactcctcaagtgccgccaaagtgggagcc caggcagaggaggcgccgagagcgagcgagggctgtgaggactgccagcacgctgtcacc tctcaagctgacatcggcagctgtgaggacccacctgacactgtgatgattgcgttaact gcacaaattgtagagcatgtgtgtttgaacaatatgaaatctgagcaccttgaaaaagaa caggataatagcaatgttcagggaacaagagagataaccttaaactctgaccgctggagg cccaagttcctcggctttgtggtagcagaaatgtgtgtagagaagacgatggctcgagca ggatgcagaaaagatgtcaaagagtcactgtggacactaggaaacatcacagaactgtca gatccaaaaatcaattcacaaaggtttattttgatgatttataaaagaaacattgccata gcaatttaa >gi568815592r:73495080_73753886|GENSCAN_predicted_peptide_4|425_aa MKDKSWGGRREGRRESGKIQEPEAVAQPDNNVTPMEVASTPAHVGVMRSPVRDLARNDGE ESTDRTPLLPGAPRAEAAPVCCSARYNLAILAFFGFFIVYALRVNLSVALVDMVDSNTTL EDNRTSKACPEHSAPIKVHHNQTGKKYQWDAETQGWILGSFFYGYIITQIPGGYVASKIG GKMLLGFGILGTAVLTLFTPIAADLGVGPLIVLRALEGLGEGVTFPAMHAMWSSWAPPLE RSKLLSISYAAFFTEVSAVGTHFKIPATLGYRSCTLFLQLDFLYFIDIIAYLYEGDPKVQ CSRGMIGPAVFLVAAGFIGCDYSLAVAFLTISTTLGGFCSSGFSINHLDIAPSYAGILLG ITNTFATIPGMVGPVIAKSLTPDGQSAQRLSEAASKPRASVDSSAGAFPLRGYQCVDKAL SRQEQ >gi568815592r:73495080_73753886|GENSCAN_predicted_CDS_4|1278_bp atgaaagacaagagctggggtggaaggagagagggaaggagagagtctgggaagattcaa gaaccagaggcagtagctcaaccagacaacaacgtcactcccatggaggtggcgagtaca cctgctcacgtaggcgtcatgaggtctccggttcgagacctggcccggaacgatggcgag gagagcacggaccgcacgcctcttctaccgggcgccccacgggccgaagccgctccagtg tgctgctctgctcgttacaacttagcaattttggccttttttggtttcttcattgtgtat gcattacgtgtgaatctgagtgttgcgttagtggatatggtagattcaaatacaacttta gaagataatagaacttccaaggcgtgtccagagcattctgctcccataaaagttcatcat aatcaaacgggtaagaagtaccaatgggatgcagaaactcaaggatggattctcggttcc tttttttatggctacatcatcacacagattcctggaggatatgttgccagcaaaataggg gggaaaatgctgctaggatttgggatccttggcactgctgtcctcaccctgttcactccc attgctgcagatttaggagttggaccactcattgtactcagagcactagaaggactagga gagggtgttacatttccagccatgcatgccatgtggtcttcttgggctccccctcttgaa agaagcaaacttcttagcatttcatatgcagctttcttcacagaagtcagtgccgtgggt acccattttaaaatccctgccactttgggctatcgtagttgcacacttttcttacaactg gactttttatactttattgacattattgcctacttatatgaaggagatcctaaggttcaa tgttcaagaggaatgattggacctgcagtattcctggtagctgctggcttcattggctgt gattattctttggccgttgctttcctaactatatcaacaacactgggaggcttttgctct tctggatttagcatcaaccatctggatattgctccttcgtatgctggtatcctcctgggc atcacaaatacatttgccactattccaggaatggttgggcccgtcattgctaaaagtctg acccctgatggtcagagtgcccagcgtttatcagaggcagcatccaagcccagagccagt gtcgactcttcggctggtgcctttcctctgaggggctatcaatgtgtagataaagccctg agtaggcaagagcagtga >gi568815592r:73495080_73753886|GENSCAN_predicted_peptide_5|490_aa MQGPPLLTAAHLLCVCTAALAVAPGPRFLVTAPGIIRPGGNVTIGVELLEHCPSQVTVKA ELLKTASNLTVSVLEAEGVFEKVLEVLARAIRQEKEIKGIQLGKEEVKLSLFVDDMIVYL ENPIVSAQNLLKLISNFSKVSGYKINVQKSPAFLYTNNRQTESQIMSELPFTIASKRIKY LGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVDLHPYPLSLEEEENLPKESHR GRQNFCKGPDVKHVGFANLQVFITAAQLCLYSSEAAVDGSFKTLTLPSLPLNSADEIYEL RVTGRTQDEILFSNSTRLSFETKRISVFIQTDKALYKPKQEVKFRIVTLFSDFKPYKTSL NILIKPIWPDTMELSASDTAMAAGMAGFCSDLSIQKLLVLVRVNDLVGEQDPKSNLIQQW LSQQSDLGVISKTFQLSSHPILGDWSIQVQVNSPKEEPESLRDSWSFDDGSPVLSFLIVR WKSLLLSRVA >gi568815592r:73495080_73753886|GENSCAN_predicted_CDS_5|1473_bp atgcagggcccaccgctcctgaccgccgcccacctcctctgcgtgtgcaccgccgcgctg gccgtggctcccgggcctcggtttctggtgacagccccagggatcatcaggcccggagga aatgtgactattggggtggagcttctggaacactgcccttcacaggtgactgtgaaggcg gagctgctcaagacagcatcaaacctcactgtctctgtcctggaagcagaaggagtcttt gaaaaagtgttggaggttctggctagggcaatcaggcaggagaaagaaataaaaggtatt caattaggaaaagaggaagtcaaattgtccctgtttgtagatgacatgattgtatatcta gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtgcaaaaatcaccagcattcttatacaccaataacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg ctcaacgaaataaaggaggacacaaacaaatggaagaacattccatgctcatgggtagat cttcacccttatcccttatcccttgaggaagaagaaaatctccctaaagagagccacagg ggtcggcaaaatttctgtaaagggccagatgttaagcatgtaggttttgccaaccttcag gtctttattacagctgctcaactctgcctttatagctcggaagcagctgtagacggctct tttaagacacttactcttccatcactacctctgaacagtgcagatgagatttatgagcta cgtgtaaccggacgtacccaggatgagattttattctctaatagtacccgcttatcattt gagaccaagagaatatctgtcttcattcaaacagacaaggccttatacaagccaaagcaa gaagtgaagtttcgcattgttacactcttctcagattttaagccttacaaaacctcttta aacattctcattaagccaatttggcctgacacaatggaattgagtgcctctgacactgcg atggcagctggcatggctggcttctgctctgacctctccatccagaagttgttggtgcta gtacgggtgaatgatcttgtaggggaacaggaccccaaatcaaatttgatccaacagtgg ttgtcacaacaaagtgatcttggagtcatttccaaaacttttcagctatcttcccatcca atacttggtgactggtctattcaagttcaagtgaatagcccgaaagaagagcctgagtca ctgagagattcctggtcatttgatgacggcagccctgtgctgtctttcctcattgtcaga tggaagtctcttcttctttctagagtggcttag