GENSCAN 1.0	Date run:  3-Nov-116	Time: 12:50:21

Sequence gi568815595r:44985641_45246287 : 260647 bp : 46.29% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3500   3601  102  1  0   35  105   157 0.940  12.47
 1.02 Intr +   3910   4004   95  0  2   75   65   103 0.997   5.46
 1.03 Intr +  11447  11612  166  2  1   74  105   168 0.999  17.06
 1.04 Intr +  15898  15968   71  0  2   85  115    37 0.999   4.08
 1.05 Intr +  19651  19774  124  1  1   59   70   137 0.916   9.59
 1.06 Intr +  21780  21935  156  2  0   96   78   212 0.994  21.21
 1.07 Term +  25595  25699  105  1  0  100   34    63 0.688   0.51
 1.08 PlyA +  27001  27006    6                               1.05

 2.04 PlyA -  27348  27343    6                               1.05
 2.03 Term -  30964  30888   77  2  2  114   42    63 0.583   2.30
 2.02 Intr -  36939  36741  199  0  1   73   53    84 0.371   2.32
 2.01 Init -  37190  37122   69  2  0   52  115    30 0.488   3.15
 2.00 Prom -  38379  38340   40                              -6.46

 3.00 Prom +  39640  39679   40                              -5.96
 3.01 Init +  40723  40831  109  0  1   93  116   208 0.997  22.48
 3.02 Intr +  45187  45285   99  2  0  117  100   198 0.999  23.98
 3.03 Term +  49884  50284  401  1  2   66   42   819 0.996  70.38
 3.04 PlyA +  50402  50407    6                               1.05

 4.00 Prom +  53411  53450   40                              -5.86
 4.01 Init +  59721  59950  230  2  2   39   64   235 0.838  14.04
 4.02 Intr +  62158  62245   88  1  1   57   65    24 0.573  -3.03
 4.03 Term +  62588  62710  123  1  0  102   48    94 0.697   5.18
 4.04 PlyA +  63132  63137    6                               1.05

 5.05 PlyA -  63893  63888    6                              -0.45
 5.04 Term -  64589  64431  159  2  0   98   42    72 0.329   1.54
 5.03 Intr -  67805  67679  127  1  1   91   37    96 0.087   5.48
 5.02 Intr -  91893  91752  142  1  1  105   80    22 0.541   2.51
 5.01 Init -  93174  93147   28  0  1   99  109    43 0.839   5.27
 5.00 Prom -  97698  97659   40                              -7.46

 6.12 PlyA -  97837  97832    6                               1.05
 6.11 Term - 100427  99998  430  1  1   95   32   377 0.994  27.57
 6.10 Intr - 103501 103414   88  2  1  107   50    34 0.945   0.53
 6.09 Intr - 105898 105533  366  2  0   89   82   375 0.996  32.12
 6.08 Intr - 108017 107637  381  0  0   62  116   430 0.989  38.08
 6.07 Intr - 109928 109707  222  0  0   96   33   266 0.937  20.00
 6.06 Intr - 125201 124833  369  0  0   70   73   527 0.730  44.48
 6.05 Intr - 126805 126443  363  2  0   73   55   223 0.146  12.66
 6.04 Intr - 132981 132772  210  1  0  101  111    73 0.938   9.78
 6.03 Intr - 142291 142210   82  1  1  106  107    16 0.305   4.51
 6.02 Intr - 154415 154276  140  0  2    4   83    87 0.047  -0.02
 6.01 Init - 160647 160566   82  0  1   97  109   149 0.806  17.03
 6.00 Prom - 162258 162219   40                              -6.66

 7.00 Prom + 164660 164699   40                              -3.66
 7.01 Init + 180482 180487    6  1  0   84   87     0 0.333   0.38
 7.02 Intr + 181877 181966   90  1  0   75   58    68 0.409   2.69
 7.03 Intr + 188997 189223  227  2  2   85   72   170 0.126  11.88
 7.04 Intr + 201350 201513  164  2  2  109   66    13 0.006   0.82
 7.05 Intr + 214424 214557  134  0  2   78  119    69 0.635   9.36
 7.06 Intr + 216699 216768   70  2  1   -4  116    52 0.231  -2.45
 7.07 Intr + 218484 218614  131  1  2   61  114    65 0.878   6.81
 7.08 Intr + 229519 229572   54  0  0   82   84    25 0.289   0.68
 7.09 Term + 230575 230697  123  0  0   60   53    66 0.251  -1.32
 7.10 PlyA + 231114 231119    6                               1.05

 8.02 PlyA - 232288 232283    6                               1.05
 8.01 Sngl - 240387 239485  903  0  0   70   46  1727 0.999 160.92

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 126729 126443  287  2  2   81   55   220 0.841  14.99

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:44985641_45246287|GENSCAN_predicted_peptide_1|272_aa
EDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILVGVKAEMGTPKLEKPNEGYL
EFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVDLKTLCISPREHCWVLYVDV
LLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDIELSDDPYDCIRLSVENVPC
IVTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVTCMRKVGKGSLDPESIFEMMETG
KRVGKVLHASLQSVVHKEESLGPKRQKVGFLG

>gi568815595r:44985641_45246287|GENSCAN_predicted_CDS_1|819_bp
gaagacctccgtgtggatggccgtggctgtgaggactaccgatgtgtcgaagtggaaact
gatgtggtgtccaacactagtgggtccgccagggtcaagctgggtcacacagacatcttg
gtgggagtgaaagcagaaatggggacgccgaagctggagaaaccaaatgaaggctacttg
gagttctttgttgactgttcagccagtgctacccctgaatttgaaggtagaggaggtgat
gaccttggcaccgagatcgctaacaccctctatcggatatttaacaataaaagcagtgtc
gacttaaagaccctctgcattagtcctcgggagcactgctgggttctctatgtggatgtg
ctgcttctggaatgtggtggaaatttgtttgatgccatttccattgctgtaaaggctgct
ctcttcaatacaaggataccaagggttcgagttttggaggatgaagaggggtcgaaggac
attgaattgtcagatgacccttatgactgcatacgactaagtgtggagaatgtcccctgc
attgtcactctgtgcaagattggctatcggcatgtggtggatgctactcttcaggaggag
gcctgctcgctggccagcttgctggtgtcggtgaccagcaagggagttgtgacgtgcatg
aggaaagtggggaagggcagcctggacccagagagcatcttcgagatgatggagactggc
aagcgtgtgggcaaggtactgcatgcctccttgcagagtgttgtgcacaaggaagaaagc
ctggggcccaagagacagaaagttggattcctgggatga

>gi568815595r:44985641_45246287|GENSCAN_predicted_peptide_2|114_aa
MSTFTHHSLTHSEQLPVLQAPFMVSSSITFSWEHKPYCELCMRRSRLHTPYENLTPNDLR
WNSFILKPSPQPRIRGKIVFHETSPWCQKGPCPTSQVTPYYPGIHVYPYPFVQT

>gi568815595r:44985641_45246287|GENSCAN_predicted_CDS_2|345_bp
atgtccacgttcactcaccactcactgactcactcagagcaacttccagtcctgcaagct
ccattcatggtcagcagcagcattacattctcatgggagcacaagccctattgtgaactg
tgcatgagaaggtctaggttgcacactccttatgagaatctaacgcccaatgatctgagg
tggaacagtttcatcctgaaaccatcaccccaaccccgcatccgtgggaaaattgtcttc
catgaaaccagtccctggtgccaaaaaggcccttgtccaactagccaagttactccctac
tatccgggaatccacgtgtacccttatccctttgtgcaaacttaa

>gi568815595r:44985641_45246287|GENSCAN_predicted_peptide_3|202_aa
MELWGAYLLLCLFSLLTQVTTEPPTQKPKKIVNAKKDVVNTKMFEELKSRLDTLAQEVAL
LKEQQALQTVCLKGTKVHMKCFLAFTQTKTFHEASEDCISRGGTLGTPQTGSENDALYEY
LRQSVGNEAEIWLGLNDMAAEGTWVDMTGARIAYKNWETEITAQPDGGKTENCAVLSGAA
NGKWFDKRCRDQLPYICQFGIV

>gi568815595r:44985641_45246287|GENSCAN_predicted_CDS_3|609_bp
atggagctctggggggcctacctcctcctctgcctcttctccctcctgacccaggtcacc
accgagccaccaacccagaagcccaagaagattgtaaatgccaagaaagatgttgtgaac
acaaagatgtttgaggagctcaagagccgtctggacaccctggcccaggaggtggccctg
ctgaaggagcagcaggccctgcagacggtctgcctgaaggggaccaaggtgcacatgaaa
tgctttctggccttcacccagacgaagaccttccacgaggccagcgaggactgcatctcg
cgcgggggcaccctgggcacccctcagactggctcggagaacgacgccctgtatgagtac
ctgcgccagagcgtgggcaacgaggccgagatctggctgggcctcaacgacatggcggcc
gagggcacctgggtggacatgaccggcgcccgcatcgcctacaagaactgggagactgag
atcaccgcgcaacccgatggcggcaagaccgagaactgcgcggtcctgtcaggcgcggcc
aacggcaagtggttcgacaagcgctgccgcgatcagctgccctacatctgccagttcggg
atcgtgtag

>gi568815595r:44985641_45246287|GENSCAN_predicted_peptide_4|146_aa
MELEKKKLTVGRWMLLLFEVLDMQPEERSDRELIYINDNSACGKKDKDVLEEMMPAKNHE
KEFLEIFCDIESKKDKICDSQKCLQTSPNVPWGEKLPPVEKLRTTEENLNGFKDSKKVVV
AAEANMAMEGTTQIPLTEPAARSRVD

>gi568815595r:44985641_45246287|GENSCAN_predicted_CDS_4|441_bp
atggagttagaaaagaaaaagctgactgtgggaaggtggatgctgctgctgtttgaagtt
ctggatatgcaaccagaggaacgtagtgaccgtgagcttatctacataaatgacaacagt
gcctgtggcaaaaaggataaagatgtcctggaggaaatgatgccagcaaaaaaccacgaa
aaggaattcttagagatattttgtgacattgaaagcaagaaagataaaatctgtgatagc
caaaaatgtctgcagaccagtccaaatgtcccctggggggaaaaattacccccagttgag
aagctgagaaccactgaggagaacttaaatggattcaaagactcaaaaaaagtggtagtg
gcagctgaagccaatatggccatggagggcaccactcagatccctttaacagaacctgct
gcaaggagcagagtggactga

>gi568815595r:44985641_45246287|GENSCAN_predicted_peptide_5|151_aa
MARLKLLTSGSTPCGSCQALGLAPFEATAHALHWPLSAIAEVAGMQGTKSLDCTQHRNAE
GQSSSQVIDFASVRDKHTSAFTFSPNLCIENYQTFSNSNFSGPSVFSSTHYKSNSPVSPT
RAACVFRMLATVHGLFSVLIGLELVLQPCEG

>gi568815595r:44985641_45246287|GENSCAN_predicted_CDS_5|456_bp
atggccaggctcaaactcctgacctcaggctcaacaccatgtggaagctgccaagccctg
ggacttgcaccctttgaagccacagcccacgctctacattggcccctttcagccatagct
gaagtggctgggatgcagggcaccaagtccctagactgcacacagcacaggaatgctgaa
ggccagagcagctcacaggttatagattttgcatctgtaagagacaagcacacctcagca
ttcaccttcagcccaaacctctgcatagagaactaccagacctttagcaacagcaacttc
tcaggaccttcggtcttctcctcaactcactataagtccaactcaccagtgtctcccacc
cgtgcagcatgtgttttccgtatgctggccactgtacatggcttgttctcggtcttgatt
ggtttggaacttgttttacagccctgtgaaggttaa

>gi568815595r:44985641_45246287|GENSCAN_predicted_peptide_6|910_aa
MAGLNCGVSIALLGVLLLGAARLPRGAEMPSVTDLEAGTSGHFKQGSLAKTMFNPENSLE
QTSSFWDITENAEVTCNSSLKAFPQLGLTAVPCPSFFLGESEAFEIALPRESNITVLIKL
GTPTLLAKPCYIVISKRHITMLSIKSGERIVFTFSCQSPENHFVIEIQKNIDCMSGPCPF
GEVQLQPSTSLLPTLNRTFIWDVKAHKSIGLELQFSIPRLRQIGPGESCPDGVTHSISGR
IDATVVRIGTFCSNGTVSRIKMQEGVKMALHLPWFHPRNVSGFSIANRSSIKRLCIIESV
FEGEGSATLMSANYPEGFPEDELMTWQFVVPAHLRASVSFLNFNLSNCERKEERVEYYIP
GSTTNPEVFKLEDKQPGNMAGNFNLSLQGCDQDAQSPGILRLQFQVLVQHPQNESNKIYV
VDLSNERAMSLTIEPRPVKQSRKFVPGCFVCLESRTCSSNLTLTSGSKHKISFLCDDLTR
LWMNVEKTISCTDHRYCQRKSYSLQVPSDILHLPVELHDFSWKLLVPKDRLSLVLVPAQK
LQQHTHEKPCNTSFSYLVASAIPSQDLYFGSFCPGGSIKQIQVKQNISVTLRTFAPSFQQ
EASRQGLTVSFIPYFKEEGVFTVTPDTKSKVYLRTPNWDRGLPSLTSVSWNISVPRDQVA
CLTFFKERSGVVCQTGRAFMIIQEQRTRAEEIFSLDEDVLPKPSFHHHSFWVNISNCSPT
SGKQLDLLFSVTLTPRTVDLTVILIAAVGGGVLLLSALGLIICCVKKKKKKTNKGPAVGI
YNDNINTEMPRQPKKFQKGRKDNDSHVYAVIEDTMVYGHLLQDSSGSFLQPEVDTYRPFQ
GTMGVCPPSPPTICSRAPTAKLATEEPPPRSPPESESEPYTFSHPNNGDVSSKDTDIPLL
NTQEPMEPAE

>gi568815595r:44985641_45246287|GENSCAN_predicted_CDS_6|2733_bp
atggccggcctgaactgcggggtctctatcgcactgctaggggttctgctgctgggtgcg
gcgcgcctgccgcgcggggcagagatgccttctgttactgatctggaggcgggaacatct
ggccatttcaaacaaggaagtttggctaaaacaatgtttaaccccgagaacagcttagag
cagacaagcagtttctgggacatcacagagaatgcagaggtgacctgcaacagtagcctg
aaagcatttcctcagcttggcctgacggctgtaccttgccccagcttctttctaggggag
tctgaagcttttgagattgctctgccacgagaaagcaacattacagttctcataaagctg
gggaccccgactctgctggcaaaaccctgttacatcgtcatttctaaaagacatataacc
atgttgtccatcaagtctggagaaagaatagtctttacctttagctgccagagtcctgag
aatcactttgtcatagagatccagaaaaatattgactgtatgtcaggcccatgtcctttt
ggggaggttcagcttcagccctcgacatcgttgttgcctaccctcaacagaactttcatc
tgggatgtcaaagctcataagagcatcggtttagagctgcagttttccatccctcgcctg
aggcagatcggtccgggtgagagctgcccagacggagtcactcactccatcagcggccga
atcgatgccaccgtggtcaggatcggaaccttctgcagcaatggcactgtgtcccggatc
aagatgcaagaaggagtgaaaatggccttacacctcccatggttccaccccagaaatgtc
tccggcttcagcattgcaaaccgctcatctataaaacgtctgtgcatcatcgagtctgtg
tttgagggtgaaggctcagcaaccctgatgtctgccaactacccagaaggcttccctgag
gatgagctcatgacgtggcagtttgtcgttcctgcacacctgcgggccagcgtctccttc
ctcaacttcaacctctccaactgtgagaggaaggaggagcgggttgaatactacatcccg
ggctccaccaccaaccccgaggtgttcaagctggaggacaagcagcctgggaacatggcg
gggaacttcaacctctctctgcaaggctgtgaccaagatgcccaaagtccagggatcctc
cggctgcagttccaagttttggtccaacatccacaaaatgaaagcaataaaatctacgtg
gttgacttgagtaatgagcgagccatgtcactcaccatcgagccacggcccgtcaaacag
agccgcaagtttgtccctggctgtttcgtgtgtctagaatctcggacctgcagtagcaac
ctcaccctgacatctggctccaaacacaaaatctccttcctttgtgatgatctgacacgt
ctgtggatgaatgtggaaaaaaccataagctgcacagaccaccggtactgccaaaggaaa
tcctactcactccaggtgcccagtgacatcctccacctgcctgtggagctgcatgacttc
tcctggaagctgctggtgcccaaggacaggctcagcctggtgctggtgccagcccagaag
ctgcagcagcatacacacgagaagccctgcaacaccagcttcagctacctcgtggccagt
gccatacccagccaggacctgtacttcggctccttctgcccgggaggctctatcaagcag
atccaggtgaagcagaacatctcggtgacccttcgcacctttgcccccagcttccaacaa
gaggcctccaggcagggtctgacggtgtcctttataccttatttcaaagaggaaggcgtt
ttcacggtgacccctgacacaaaaagcaaggtctacctgaggacccccaactgggaccgg
ggcctgccatccctcacctctgtgtcctggaacatcagcgtgcccagagaccaggtggcc
tgcctgactttctttaaggagcggagcggcgtggtctgccagacagggcgcgcattcatg
atcatccaggagcagcggacccgggctgaggagatcttcagcctggacgaggatgtgctc
cccaagccaagcttccaccatcacagcttctgggtcaacatctctaactgcagccccacg
agcggcaagcagctagacctgctcttctcggtgacacttaccccaaggactgtggacttg
actgtcatcctcatcgcagcggtgggaggtggagtcttactgctgtctgccctcgggctc
atcatttgctgtgtgaaaaagaagaaaaagaagacaaacaagggccccgctgtgggtatc
tacaatgacaacatcaatactgagatgccgaggcagccaaaaaagtttcagaaagggcga
aaggacaatgactcccatgtgtatgcagtcatcgaggacaccatggtatatgggcatctg
ctacaggattccagcggctccttcctgcagccagaggtggacacctaccggccgttccag
ggcaccatgggggtctgtcctccctccccacccaccatatgctccagggccccaactgca
aagttggccactgaggagccacctcctcgctcccctcctgagtctgagagtgaaccgtac
accttctcccatcccaacaatggggatgtaagcagcaaggacacagacattcccttactg
aacactcaggagcccatggagccagcagaataa

>gi568815595r:44985641_45246287|GENSCAN_predicted_peptide_7|332_aa
MIEAASLVFHVLRIEAHFLLLFVPPAGPGPQMIPLIPEGLLVHKPKWLGPLYCSLDLLHG
SFLLRDPLKLALGLPISTWFQAVFTSVSYAAFESEEADQALYFLLRERNESGSVHGPSRG
WGAGSSVAVRLGTECGEPNSMGWLPVQTCCDGHLWSSGAAVPGKQTGGQMLCKVGERKED
EWTSTQGKALREYFPSPQECKDTPTSLALNIRGNFDKLGYTKFKNFQPSEADLDRTLPNG
ALSSPEGIFLRLIEEAKELRAAETRNRGRVRKSRRRRRKLKLQPNSSCMAGQLLEQHYTG
AISFLLIGSQNSFSDQRHQINPGNTKPFAVMT

>gi568815595r:44985641_45246287|GENSCAN_predicted_CDS_7|999_bp
atgattgaagcggcatcacttgtttttcatgtcctgcgcatcgaggcacactttctgctc
ctctttgtgccgccagccggcccaggcccacagatgatacctctaataccagagggtctg
ctggtccataaacccaagtggctgggccccttgtactgcagcctggacctgctgcacggc
tctttcctgctcagggacccgctcaagctggctctgggcctgcccatcagtacctggttc
caagcagtattcacaagtgtgtcatatgcagccttcgagtctgaagaagctgaccaggca
ttatacttccttcttagggagaggaatgagtctggtagtgtccatggcccaagtcgggga
tggggagctggttccagcgtggctgtgaggttgggcactgagtgtggggaacccaacagc
atggggtggctccctgtgcagacctgctgtgatgggcatctctggtcttcaggagctgca
gtgccaggtaagcagacaggaggacagatgttgtgtaaggttggggaacgcaaggaagat
gaatggacttcaacacaaggaaaggctctcagggaatactttccaagcccgcaggaatgt
aaggacactcccacatcgctggcattaaacatcaggggaaattttgataaattgggctac
actaaatttaagaacttccaaccatcagaagctgatttggatcgcacactccctaatggg
gccctttcttctcctgaaggcatcttcctgagacttatagaagaagccaaggagctgaga
gcagcagagacaagaaacagagggagagtgagaaagtcccggaggaggagacggaagctc
aagctgcagccaaacagctcatgcatggcagggcagctcttggagcagcattacacggga
gcaatatccttcctgctaattggatctcagaattcattttcggaccaaagacatcaaata
aacccaggaaatacgaagccgtttgcagttatgacttga

>gi568815595r:44985641_45246287|GENSCAN_predicted_peptide_8|300_aa
MLPLLAALLAAACPLPPVRGGAADAPGLLGVPSNASVNASSADEPIAPRLLASAAPGPPE
RPGPEEAAAAAAPCNISVQRQMLSSLLVRWGRPRGFQCDLLLFSTNAHGRAFFAAAFHRV
GPPLLIEHLGLAAGGAQQDLRLCVGCGWVRGRRTGRLRPAAAPSAAAATAGAPTALPAYP
AAEPPGPLWLQGEPLHFCCLDFSLEELQGEPGWRLNRKPIESTLVACFMTLVIVVWSVAA
LIWPVPIIAGFLPNGMEQRRTTASTTAATPAAVPAGTTAAAAAAAAAAAAAAVTSGVATK

>gi568815595r:44985641_45246287|GENSCAN_predicted_CDS_8|903_bp
atgctgcccctgctcgccgcgctcctggccgccgcctgcccgctgccgcccgtccgcggc
ggggccgcggacgcgcccggcctcctcggggtgccctccaatgcttcagtcaacgcgtcc
tccgcggacgagcccatcgccccgcggctgctggcctcggcggcccccgggccccccgag
cgcccgggcccggaggaggcggcggcggcggcggcgccgtgcaacatcagcgtgcagcgg
cagatgctgagctcgctgctcgtgcgctggggccgcccgcggggcttccagtgcgaccta
ctgctcttctccaccaacgcgcacggccgcgctttcttcgccgccgccttccaccgcgtc
gggccgccgctgctcatcgagcacctggggctggcggcgggcggcgcgcagcaggacctg
cgcctctgcgtgggctgcggctgggtgcgcggtcgccgcaccggccgcctccggcccgcc
gccgcccccagcgccgccgccgccaccgccggggcgcccaccgcgctgccagcctacccc
gcggccgagccgcccgggccgctgtggctgcagggcgagccgctgcatttctgctgccta
gacttcagcctggaggagctgcagggcgagccgggctggcggctgaaccgtaagcccatt
gagtccacgctggtggcctgcttcatgaccctggtcatcgtggtgtggagcgtggccgcc
ctcatctggccggtgcccatcatcgccggcttcctgcccaacggcatggaacagcgccgg
accaccgccagcaccaccgcagccacccccgccgcagtgcccgcagggaccaccgcagcc
gccgccgccgccgccgctgccgccgccgccgcggccgtcacttcgggggtggcgaccaag
tga