GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:26:44 Sequence gi568815589f:76298736_76499583 : 200848 bp : 42.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2608 2638 31 0 1 77 115 41 0.741 5.66 1.02 Intr + 17588 17717 130 0 1 43 87 46 0.037 -1.17 1.03 Intr + 19432 19584 153 1 0 83 49 75 0.078 1.37 1.04 Intr + 22687 22904 218 1 2 116 77 125 0.007 11.42 1.05 Intr + 24317 24553 237 0 0 101 88 139 0.008 11.86 1.06 Intr + 29274 29504 231 1 0 99 70 88 0.043 4.92 1.07 Intr + 33698 33875 178 0 1 54 110 104 0.036 7.26 1.08 Intr + 39495 39712 218 0 2 95 100 83 0.613 7.42 1.09 Intr + 40314 40389 76 1 1 81 34 61 0.016 -2.35 1.10 Intr + 52622 52946 325 2 1 75 75 103 0.032 2.55 1.11 Intr + 55298 55484 187 1 1 56 76 82 0.072 2.14 1.12 Term + 59778 60187 410 1 2 87 41 748 0.788 64.39 1.13 PlyA + 60998 61003 6 1.05 2.03 PlyA - 62123 62118 6 1.05 2.02 Term - 63891 63841 51 0 0 88 55 14 0.215 -5.45 2.01 Init - 64273 64121 153 1 0 68 57 140 0.201 8.93 2.00 Prom - 69225 69186 40 -2.25 3.06 PlyA - 70114 70109 6 1.05 3.05 Term - 88794 88664 131 1 2 87 45 159 0.978 8.86 3.04 Intr - 89921 89819 103 2 1 55 97 104 0.982 6.83 3.03 Intr - 93834 93683 152 1 2 72 103 126 0.966 11.46 3.02 Intr - 95549 95355 195 0 0 44 94 92 0.817 3.86 3.01 Init - 95819 95567 253 2 1 99 66 75 0.508 2.44 3.00 Prom - 96865 96826 40 -10.45 4.00 Prom + 97557 97596 40 -6.55 4.01 Init + 99964 100473 510 0 0 60 20 586 0.755 43.97 4.02 Term + 100492 100851 360 0 0 1 43 356 0.750 15.95 4.03 PlyA + 100901 100906 6 1.05 5.17 PlyA - 101010 101005 6 1.05 5.16 Term - 103422 103280 143 1 2 77 42 103 0.575 1.71 5.15 Intr - 107469 107272 198 1 0 -6 48 196 0.012 4.60 5.14 Intr - 116162 116042 121 2 1 36 -6 179 0.019 2.45 5.13 Intr - 116672 116621 52 1 1 55 77 57 0.052 -0.71 5.12 Intr - 134566 134385 182 1 2 69 72 149 0.120 9.14 5.11 Intr - 135533 135427 107 0 2 49 95 82 0.562 4.01 5.10 Intr - 138529 138501 29 0 2 31 100 40 0.053 -3.66 5.09 Intr - 145614 145466 149 0 2 120 98 24 0.143 4.71 5.08 Intr - 151864 151648 217 0 1 51 73 79 0.027 0.28 5.07 Intr - 154457 154341 117 1 0 112 60 74 0.029 5.66 5.06 Intr - 171149 171120 30 1 0 95 101 28 0.050 1.13 5.05 Intr - 188638 188510 129 0 0 28 100 138 0.074 7.89 5.04 Intr - 194371 194217 155 1 2 31 87 69 0.018 -0.95 5.03 Intr - 194653 194461 193 0 1 1 -6 211 0.062 1.57 5.02 Intr - 194903 194725 179 2 2 83 44 101 0.685 3.00 5.01 Intr - 195514 195399 116 2 2 50 109 81 0.681 5.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 24521 24280 242 0 2 87 41 211 0.931 12.65 S.002 Init - 25896 25830 67 0 1 73 81 106 0.982 9.69 S.003 Term - 44058 43919 140 1 2 35 44 186 0.849 6.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:76298736_76499583|GENSCAN_predicted_peptide_1|797_aa MALEKKAPDNGNSKGFEVSRRERPDLSLGEVKFFTMLTGTMNPSNFHSIRRCDWDMDEAG NLHSQQTNTGTENQTLHVLTHKWELNNENTWTQGGEHHTPGPVGGGSYAEDGICERCSSP CRTCEGNATNCHSCEGGHVLHHGVCQENCPERHVAVKGVCKHCPEMCQDCIHEKTCKECT PEFFLHDDMCHQSCPRGFYADSRHCVPCHKDCLECSGPKADDCELCLESSWVLYDGLCLE ECPAGTYYEKETKECRDCHKSCLTCSSSGTCTTCQKGLIMNPRGSCMANEKCSPSEYWDE DAPGCKPCHVKCFHCMGPAEDQCQTCPMNSLLLNTTCVKDCPEGYYADEDSNRCAHCHSS CRTCEGRHSRQCHSCRPGWFQLGKECLLQCREGYYADNSTGRCERCNRSCKGCQGPRPTD CLSCDRFFFLLRSKGECHRSCPDHYYVEQSTQTCERCHPTCDQCKVSSITPFEAASRSPQ LSILPPHQVICVLRRIHLQCYSSLEFLVSTRNRPPLQCERKERKKERKKERKKERKKERK KERKKERKKGRKERERREREKEGKGGRRERKKEKKEKERERKREKEERKRERKKERKKEG EKFNCEKCHESCMECKGPGAKNCTLCPANLVLHMDDSHCLHCCNTSDPPSAQECCDCQDT TDECILRTSKVRPATEHFKTALFITSSMMLVLLLGAAVVVWKKSRGRVQPAAKAGYEKLA DPNKSYSSYKSSYRESTSFEEDQVIEYRDRDYDEDDDDDIVYMGQDGTVYRKFKYGLLDD DDIDELEYDDESYSYYQ >gi568815589f:76298736_76499583|GENSCAN_predicted_CDS_1|2394_bp atggccctggaaaaaaaggcaccagataatggaaattccaaaggttttgaggttagccga agagaaagaccagacctctctttgggtgaggtcaaattctttactatgcttacaggaaca atgaatccatcaaattttcattcgattagaagatgtgattgggacatggacgaagctgga aaccttcattctcagcaaactaacacaggaacagaaaatcaaacactgcatgttctcact cataagtgggagttgaacaatgagaacacatggacacagggaggggaacatcacacacca gggcctgtcggagggggctcttatgcagaagacggcatatgtgaacgctgtagctctcct tgcagaacatgtgaaggaaacgccaccaactgccattcttgtgaaggaggccacgtcctg caccacggagtgtgccaggaaaactgccccgagaggcacgtggctgtgaagggggtatgc aagcattgcccagagatgtgtcaggactgcatccatgagaaaacatgcaaagagtgcacg cctgagttcttcctgcacgatgatatgtgccaccagtcctgtccccgtggcttctatgca gactcgcgccactgtgtcccctgccataaagactgtctggagtgcagtggccccaaagcc gacgactgcgagctctgtcttgagagttcctgggtcctctatgatggactgtgcttggag gagtgtccagcaggaacctattatgaaaaggagactaaggagtgcagagattgccacaag tcctgcttgacctgctcatcatctgggacctgcaccacctgtcagaaaggcctgatcatg aaccctcgtgggagctgcatggccaacgagaagtgctcaccctccgagtactgggatgag gatgctcccgggtgcaagccctgccatgttaagtgcttccactgcatggggccggcggag gaccagtgtcaaacatgccccatgaacagccttcttctcaacacaacctgtgtgaaggac tgcccagagggctattatgccgatgaggacagcaaccggtgtgcccactgccacagctct tgcaggacatgtgaagggagacacagcaggcagtgccactcctgccgaccgggctggttc cagctaggaaaagagtgcctgctccagtgcagggaaggatattacgcagacaactccact ggccggtgtgagaggtgcaacaggagctgcaaggggtgccagggcccacggcccacagac tgcctgtcttgcgatagatttttctttctgctccgctccaaaggagagtgtcatcgctcc tgcccagaccattactatgtagagcaaagcacacagacctgtgagagatgccatccgact tgtgatcaatgcaaagtatctagtatcacaccttttgaagcagccagcaggtcaccccaa ctctcaatcttgcccccacaccaggtcatctgtgtattaagaagaattcatttgcagtgt tacagctctttagaatttctggtttctaccagaaaccgcccccccctgcaatgtgaaaga aaggaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaag aaagaaagaaagaaagaaagaaagaaaggaaggaaagaaagagaaagaagagagagagag aaggaagggaagggaggaaggagagagagaaagaaagagaagaaagaaaaagaaagagaa agaaagagagagaaagaagaaagaaagagagagagaaagaaagaaagaaagaaagaggga gagaagtttaactgtgaaaaatgccacgagagctgcatggaatgcaagggaccaggggcc aagaactgcaccttgtgccctgccaacctggtgctgcacatggacgacagccactgcctc cactgctgcaacacctctgatccccccagtgcccaggagtgctgtgactgccaggacacc acggacgaatgcatccttcgaacaagcaaggttaggcctgcaactgagcatttcaagaca gctctgttcatcacctcctccatgatgctggtgcttctgctcggggcagctgtggtagtg tggaagaaatctcgtggccgagtccagccagcagcaaaggccggctatgaaaaactggcc gaccccaacaagtcttactcctcctataagagcagctatagagagagcaccagctttgaa gaggatcaggtgattgagtacagggatcgggactatgatgaggatgatgatgatgacatc gtctacatgggccaggatggcacagtctaccggaaatttaaatatgggctgctggatgac gatgacatagatgagctggaatatgatgacgagagttactcctactaccagtaa >gi568815589f:76298736_76499583|GENSCAN_predicted_peptide_2|67_aa MSRRQNSSDTELKKEQFIRPGASARLLSQEPSSPSEQFLSLLRAHNSKGVHFLLLLSLEA EIGHKTI >gi568815589f:76298736_76499583|GENSCAN_predicted_CDS_2|204_bp atgagccgcagacaaaactcctcagacaccgagttaaagaaggagcagtttattcggcca ggagcatcggcaagactcctgtctcaagagccgagctccccgagtgagcaattcctgtcc cttttaagggctcacaactctaagggggtccactttttacttcttctttctttggaggca gaaattgggcataagacaatatga >gi568815589f:76298736_76499583|GENSCAN_predicted_peptide_3|277_aa MGRGYGSRAGSRRSAEEGSARRRRGGRLCASGRPLPPPRSHPALTSAVSAALQRRRSSSG ARRYLLGRRRQGFAVACGRRRPATGRAHAPVPRLVRGLGAASTAAPQDAQTGPQPIPRAD CIMRHLPYFCRGQVVRGFGRGSKQLGIPTANFPEQVVDNLPADISTGIYYGWASVGSGDV HKMVVSIGWNPYYKNTKKSMETHIMHTFKEDFYGEILNVAIVGYLRPEKNFDSLESLISA IQGDIEEAKKRLELPEHLKIKEDNFFQVSKSKIMNGH >gi568815589f:76298736_76499583|GENSCAN_predicted_CDS_3|834_bp atggggcggggctacggaagccgagcgggctcgaggcgttcggctgaggaagggagcgca cgccggcggcgcggaggccggctctgcgcttcgggccgccccctccccccaccccgctca cacccggcacttacttcggctgtctccgctgccctccagcggagacgcagctcctcaggc gcccggcggtatttgttgggtcggcggcgtcagggattcgcagtggcctgtggtcggcgt cgtccggccactggcagagctcacgctcctgtcccccggctggtccggggtctgggcgcc gcgtcgacggcggctccgcaggacgcgcagaccgggccgcagcccattccccgagcggac tgcattatgaggcacctgccttacttctgccggggtcaagtggtgcggggcttcggccgc ggctccaagcagctgggcatccccacagctaattttcctgagcaagtggtagataatctt ccagctgatatatccactggtatttactatggttgggccagtgttggaagtggagatgtc cataagatggtggtgagcataggatggaacccatattacaagaatacgaagaagtctatg gaaacacatatcatgcataccttcaaagaggacttctatggggaaatcctcaatgtggcc attgttggctacctgagaccagaaaagaactttgattctttagagtcacttatttcagca attcaaggtgatattgaagaagctaagaaacgactagagttaccagaacatttgaaaatc aaagaagacaatttcttccaggtttctaaaagcaaaataatgaatggccactga >gi568815589f:76298736_76499583|GENSCAN_predicted_peptide_4|289_aa MSGALDVLQMKEEDVLKFLAAGTHLGGTNLDFQMEQYICKRKSNGIYIINLKRTWEKLLL AASAIVAIENPADVSVISSRNTGQRAVLKFAAATGATPIAGHFTPGTFTNQIQAAFREPW LLVVTDPRADHQPLTEASYVNLPTIALCNTDSPMRYVDIAIPCNNKGAHSMLAREVLRMC STISYDHPWEVMPDLYFYRDPEDIEKEEQAAAEKAVTKEEFQGEWTAPAPEFTATQPEVA DWSEGVQVPSVPIQQFPTEDWSAQPATEDWSAAPTAQATEWVGATTDWS >gi568815589f:76298736_76499583|GENSCAN_predicted_CDS_4|870_bp atgtccggagcccttgatgtcctgcaaatgaaggaggaggatgtccttaagttccttgca gcaggaacccacttaggtggcaccaatcttgacttccagatggaacagtacatctgtaaa aggaaaagtaatggcatctatatcataaatctgaagaggacctgggagaagcttctgctg gcagctagtgctattgttgccattgaaaaccctgctgatgtcagtgttatatcctccagg aatactggccagagggctgtgctgaagtttgctgctgccactggagccactccaattgct ggccacttcactcctggaaccttcactaaccagatccaggcagccttccgggagccatgg cttcttgtggttactgaccccagggctgaccaccagcctctcacggaggcatcttatgtt aacctacctaccattgctctgtgtaacacagattctcctatgcgctatgtggacattgcc atcccatgcaacaacaagggagctcactcaatgctggctcgggaagttctgcgcatgtgt agcacgatttcctatgaccacccgtgggaggtcatgcctgatctctacttctacagagat cctgaagatattgaaaaagaagagcaggctgctgctgaaaaggcagtgaccaaggaggaa tttcagggtgaatggactgctccagctcctgagttcactgctactcagcctgaggttgca gactggtctgaaggtgtacaggtgccctctgtgcctattcagcagttccctactgaagac tggagcgctcagcctgccacggaagactggtctgcagctcccactgctcaggccactgaa tgggtaggagcaaccactgactggtcttaa >gi568815589f:76298736_76499583|GENSCAN_predicted_peptide_5|705_aa XEYARVQVFKNASARATKSDLPRSSLWSSRKTSVSAAVSKETSKEISKGPQKPPGYQLCP LQAVGGGEFGPTQVHVPFSLSDLKQIKADLGKFSDDPDRSNSGPLNERNVASAAAREFGD TWYLSQVNDRMTADERDKFPTGQQAIPCMDPTGTSTQIMGTGVGKEENPSAFLKRLREAL RKYTPLSPDSLEGQLILKDKFITQSAADVRRKLQKTTVQCPRGCCDWSKIPTTVFDVKAM PVSRRTVTQVREPKRHPRFQGMESWAMRTGEAGISCAKQEMAEQMTPNTGQMRLATVFSV TYSQPKGLAKLKKIDKHNIDEGVGIQTPVLLSVVEISTNFLEGIWTTWILMDNYSSRNLS YVGNFTNTQRFQNRNTVNNPFCMASSNLMLLSTSRFNLDIALISISSLDLSSELQTHISN CLVNISLYRMLNRLGMQPTSVDDDNHTTLVPPYPQEIRSKTPSGYSKSLIVLNPPTGLGM TWPCCFPLRGATAELRRGGRAEGYSVTVVFVPAFGGVPSSWSASKKNEIMLTIEGWGEKS KGQRQYVKFTVKDKTGGEQDWYHTSEVVKTLLMKQDVVKKPAKTHRNQDGDETKDLNRHF TKDDIQMANKHMKRCSTSYVIREMQITTLGLLEGPKSRTLTTPNAGKDVQQQELSLIAVA GHRFTKTKVPFFPLYPEGQKLTTEDNCSPHLAGDGTRGIYVNKLC >gi568815589f:76298736_76499583|GENSCAN_predicted_CDS_5|2118_bp ngagaatacgcaagggtgcaggttttcaagaatgcgtcagcaagggccactaaatctgac cttcctcggtcatccttgtggtctagcaggaaaactagtgtttctgctgctgtgtcgaag gaaacaagcaaagaaatctccaaaggaccacaaaaacccccaggctatcagttatgtccc cttcaagctgtagggggaggggaatttggcccaacccaggtacatgtccccttctccctc tctgacttaaagcagatcaaggcagacctggggaagttttcagatgatcctgatagatca aactctgggccattaaatgaaaggaatgtggcttcagctgcagcccgagagtttggagat acctggtatcttagtcaagtaaatgatagaatgacagctgacgaaagggacaaattccct accggtcagcaagccatcccctgtatggatcccactgggacctcaactcagatcatgggg actggagtcggcaaggaagagaatccttctgccttcctcaagcggctacgggaggcctta agaaaatatactcccctgtcacccgactccctcgagggtcaattgatcctaaaagataag tttattacccaatcagctgcagatgtcaggagaaagctccaaaagactactgttcagtgt cccagaggctgctgtgactggagcaaaataccaactactgtgtttgatgtcaaagccatg ccagtatctcggaggactgtgacccaagtgagggagcccaaacgccacccaagattccaa ggaatggaatcttgggccatgcgcactggtgaagcagggatctcctgtgctaaacaggag atggctgaacagatgacacccaatactgggcagatgagattggctacagttttttcagtt acatactcacagcctaagggattggcaaagttgaaaaagattgataaacataatatagat gaaggtgtaggaatacagactcctgtgctgctctctgttgtagagatttctacaaacttt ctagaaggcatttggacaacatggatacttatggacaactattcatctaggaatttatcc tatgtaggaaattttacaaatacacaaagatttcagaatcgaaacactgtaaacaatccc ttctgtatggcctcatccaacctcatgctgttaagtacatctagatttaacctcgacatc gctctgatttctatctctagtctggacctgtcctcagaactccagactcatatttccaac tgcttagtaaacatctccctttacagaatgctgaaccgtctgggaatgcagcccacttct gttgatgatgataatcatactacattagtccccccttatccacaggagatacgttccaag acccccagtggatactcgaaatccttaatagtactgaaccctcccacaggtttggggatg acctggccctgctgctttccgttacgtggggcaactgctgagctccgaagagggggcagg gcagagggctacagtgttactgtcgtctttgtacctgcattcggaggggtcccaagttct tggtccgcatccaagaagaatgagattatgctgactattgaagggtggggggaaaaatcc aaggggcagagacagtatgtgaagttcaccgtcaaggataagactggaggtgaacaggac tggtatcacacgtcagaggttgtaaagaccctactgatgaaacaggatgtggtaaagaag ccagccaaaactcaccgaaaccaagatggagatgaaacaaaagacctgaacagacacttc accaaagatgatatacagatggcaaataagcatatgaaaagatgttccacatcatatgtc atcagagaaatgcaaattaccactctaggcctattggaagggcccaaatccagaacactg accacaccaaatgctggcaaggatgtgcagcaacaggagctctccttaattgctgtggca ggacacagatttacaaagacaaaggtgcccttcttccctctgtacccggaaggacaaaag ttaaccactgaagacaactgtagccctcatctggctggagatggcacaagaggaatctac gttaataagctttgctaa