GENSCAN 1.0	Date run: 2-Nov-116	Time: 21:04:24

Sequence gi568815587f:78099596_78313810 : 214215 bp : 45.03% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  37118  37154   37  1  1   69  101    37 0.901   3.27
 1.02 Intr +  39875  39979  105  0  0   59   42    99 0.766   2.59
 1.03 Intr +  42000  42107  108  1  0   42   93   215 0.947  17.66
 1.04 Term +  48331  48341   11  2  2  106   54     0 0.213  -3.24
 1.05 PlyA +  53404  53409    6                               1.05

 2.04 PlyA -  53885  53880    6                               1.05
 2.03 Term -  69636  69498  139  2  1   93   43    43 0.216  -2.26
 2.02 Intr -  71018  70929   90  1  0   54  100    42 0.571   1.11
 2.01 Init -  74959  74181  779  1  2   68   98  1312 0.876 123.17
 2.00 Prom -  75890  75851   40                              -1.86

 3.00 Prom +  76500  76539   40                              -6.26
 3.01 Init +  77215  77276   62  0  2  116   37   105 0.914   8.92
 3.02 Intr +  78857  78956  100  2  1   55   67    60 0.209   0.71
 3.03 Intr +  83546  83603   58  1  1   49  113    57 0.021   2.76
 3.04 Term +  86911  86993   83  2  2   50   39    88 0.003  -2.04
 3.05 PlyA +  87550  87555    6                              -0.45

 4.00 Prom +  88062  88101   40                              -4.26
 4.01 Init +  92948  93120  173  1  2   98   43   148 0.976   8.21
 4.02 Intr +  96641  97323  683  2  2  144   96  1216 0.964 119.23
 4.03 Intr +  98341  98473  133  2  1  105   75   209 0.990  21.00
 4.04 Intr + 100000 100129  130  1  1   86   81   154 0.969  15.20
 4.05 Intr + 100538 100639  102  1  0   43   95   136 0.920  10.17
 4.06 Intr + 101055 101213  159  2  0   80  105   186 0.921  19.68
 4.07 Intr + 106247 106440  194  1  2  119   30   255 0.991  21.09
 4.08 Intr + 107935 108028   94  1  1   45  109    54 0.517   3.17
 4.09 Intr + 109262 109368  107  1  2   94   99   143 0.999  15.01
 4.10 Intr + 109853 111149 1297  2  1   85   99  1513 0.765 139.40
 4.11 Intr + 114051 114180  130  2  1   97   38   120 0.702   8.17
 4.12 Term + 115383 115456   74  1  2  106   43    95 0.941   4.87
 4.13 PlyA + 115615 115620    6                               1.05

 5.12 PlyA - 115727 115722    6                               1.05
 5.11 Term - 119820 119677  144  0  0  100   45   264 0.970  21.21
 5.10 Intr - 120849 120724  126  0  0   84   46    98 0.962   6.08
 5.09 Intr - 122184 122082  103  2  1   84  101   125 0.993  13.58
 5.08 Intr - 122600 122510   91  0  1   84  108    92 0.999   9.85
 5.07 Intr - 124081 123817  265  1  1   93  115   342 0.999  34.49
 5.06 Intr - 125607 125513   95  1  2   75  105    87 0.990   8.78
 5.05 Intr - 127456 126870  587  0  2  141   78   439 0.938  40.49
 5.04 Intr - 132834 132748   87  2  0   84   76    27 0.163   0.19
 5.03 Intr - 150754 150562  193  2  1   92   97   128 0.466  12.65
 5.02 Intr - 181306 181006  301  1  1   61   80   476 0.922  40.41
 5.01 Init - 186165 186121   45  0  0   84   90    38 0.619   2.28
 5.00 Prom - 194805 194766   40                              -2.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  89224  88978  247  2  1   60   90   121 0.856   6.02
S.002 Intr -  89456  89297  160  2  1  106   92    41 0.868   5.86
S.003 Init -  91930  91925    6  1  0  123   92     0 0.880   4.77
S.004 Term + 171134 171188   55  2  1   98   53   107 0.804   5.23

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:78099596_78313810|GENSCAN_predicted_peptide_1|86_aa
MAQLDVIDGIFQSPGRASPLLCGDEKAFEKSHPERQSRKPIASTRGNSTVKWAAEDDDDD
DLDTEKQKTNEDDQTAKKDKLKEGEK

>gi568815587f:78099596_78313810|GENSCAN_predicted_CDS_1|261_bp
atggctcaactagatgtcatagatggcatcttccaatcccctggccgtgcgagtccctta
ctatgtggggatgagaaggcatttgagaagagtcaccccgagcgccaaagccgaaaacca
attgccagtacccgtggcaattctacagtcaaatgggcagctgaagatgatgatgatgat
gatcttgacaccgagaagcagaagaccaatgaagatgaccagacagcaaaaaaggataag
ttaaaagaaggtgaaaaatga

>gi568815587f:78099596_78313810|GENSCAN_predicted_peptide_2|335_aa
MSDPITLNVGGKLYTTSLATLTSFPDSMLGAMFSGKMPTKRDSQGNCFIDRDGKVFRYIL
NFLRTSHLDLPEDFQEMGLLRREADFYQVQPLIEALQEKEVELSKAEKNAMLNITLNQRV
QTVHFTVREAPQIYSLSSSSMEVFNANIFSTSCLFLKLLGSKLFYCSNGNLSSITSHLQD
PNHLTLDWVANVEGLPEEEYTKQNLKRLWVVPANKQINSFQVFVEEVLKIALSDGFCIDS
SHPHALDFMNNKIIRLIRYRLGELSLLLGKSMRKTENKRSAVMAAPSTDGGLWADGGGWL
GNVACDQLQEQNNPDNRPIKGRDESLAALSAQTAW

>gi568815587f:78099596_78313810|GENSCAN_predicted_CDS_2|1008_bp
atgtccgaccccatcacgctgaacgtcggggggaagctctatacaacctcactggcgacc
ctgaccagcttccctgactccatgctaggcgccatgttcagcgggaagatgcccaccaag
agggacagccagggcaactgcttcattgaccgtgacggcaaagtgttccgctatatcctc
aacttcctgcggacctcccaccttgacctgcctgaggacttccaggagatggggctgctc
cgcagggaggccgacttctaccaggtgcagcccctgattgaggccctgcaggagaaggaa
gtggagctctccaaggccgagaagaatgccatgctcaacatcacactgaaccagcgtgtg
cagacggtccacttcactgtgcgcgaggcaccccagatctacagcctctcctcttccagc
atggaggtcttcaacgccaacatcttcagcacctcctgcctcttcctcaagctccttggc
tctaagctcttctactgctccaatggcaatctctcctccatcaccagccacttgcaggac
cccaaccacctgactctggactgggtggccaatgtggagggcctgccagaggaggagtac
accaagcagaacctcaagaggctctgggtggtgcccgccaacaagcagatcaacagcttc
caggtcttcgtggaagaggtactgaaaatcgctctgagcgatggcttctgcatcgattct
tctcacccacatgctctggattttatgaacaataagattattcgattaatacggtacagg
ttaggtgaactcagccttctgcttggcaaaagcatgagaaagacggagaacaaacgttca
gcagtgatggcagcaccatcgacagacggggggctctgggctgatgggggtggctggctg
ggaaacgtggcctgtgatcagctccaggagcaaaacaaccctgataacaggcccatcaaa
gggagggacgagagtctggcagctctgtctgcccagacagcctggtga

>gi568815587f:78099596_78313810|GENSCAN_predicted_peptide_3|100_aa
MTKSCLYNNNNNNNNNNNKAGATVSIITNPRPCYKRQNDMTRGEFHESAEVCSQKPGLPG
ETDDSWTFTGKVQAVQPWARSWADDDDHNNNKHYNGNNNG

>gi568815587f:78099596_78313810|GENSCAN_predicted_CDS_3|303_bp
atgacgaaatcctgtctctacaacaacaacaacaacaacaacaacaacaacaacaaagca
ggggccacagtgtccatcatcacaaatccacggccctgctacaaaaggcaaaatgacatg
acacgaggggagtttcatgagtcggcggaggtttgcagtcagaaaccagggcttcctgga
gaaactgatgattcctggactttcacaggaaaagttcaagctgtgcagccctgggcacgt
tcctgggcagatgatgatgatcataacaacaacaaacactacaacggcaataataatggc
taa

>gi568815587f:78099596_78313810|GENSCAN_predicted_peptide_4|1091_aa
MVTAVLRLPAWELGQVGRACLLWPGYGLSRGCRLALEIRALMEQLYLNCQLPGLCGVSAG
AMDKILEAVVTSSYPVSVKQGLVRRVLEAARQPLEREQCLALLALGARLYVGGAEELPRR
VGCQLLHVAGRHHPDVFAEFFSARRVLRLLQGGAGPPGPRALACVQLGLQLLPEGPAADE
VFALLRREVLRTVCERPGPAACAQVARLLARHPRCVPDGPHRLLFCQQLVRCLGRFRCPA
EGEEGAVEFLEQAQQVSGLLAQLWRAQPAAILPCLKELFAVISCAEEEPPSSALASVVQH
LPLELMDGVVRNLSNDDSVTDSQMLTAISRMIDWVSWPLGKNIDKWIIALLKGLAAVKKF
SILIEVSLTKIEKVFSKLLYPIVRGAALSVLKYMLLTFQHSHEAFHLLLPHIPPMVASLV
KEDSNSGTSCLEQLAELVHCMVFRFPGFPDLYEPVMEAIKDLHVPNEDRIKQLLGQDAWT
SQKSELAGFYPRLMAKSDTGKIGLINLGNTCYVNSILQALFMASDFRHCVLRLTENNSQP
LMTKLQWLFGFLEHSQRPAISPENFLSASWTPWFSPGTQQDCSEYLKYLLDRLHEEEKTG
TRICQKLKQSSSPSPPEEPPAPSSTSVEKMFGGKIVTRICCLCCLNVSSREEAFTDLSLA
FPPPERCRRRRLGSVMRPTEDITARELPPPTSAQGPGRVGPRRQRKHCITEDTPPTSLYI
EGLDSKEAGGQSSQEERIEREEEGKEERTEKEEVGEEEESTRGEGEREKEEEVEEEEEKV
EKETEKEAEQEKEEDSLGAGTHPDAAIPSGERTCGSEGSRSVLDLVNYFLSPEKLTAENR
YYCESCASLQDAEKVVELSQGPCYLILTLLRFSFDLRTMRRRKILDDVSIPLLLRLPLAG
GRGQAYDLCSVVVHSGVSSESGHYYCYAREGAARPAASLGTADRPEPENQWYLFNDTRVS
FSSFESVSNVTSFFPKDTAYVLFYRQRPREGPEAELGSSRVRTEPTLHKDLMEAISKDNI
LYLQEQEKEARSRAAYISALPTSPHWGRGFDEDKDEDEGSPGGCNPAGAACCDGGVEFGP
NVTDPQDVTSK

>gi568815587f:78099596_78313810|GENSCAN_predicted_CDS_4|3276_bp
atggtgacagctgtcctcagactgcctgcgtgggagttggggcaggtgggcagggcttgc
ttgctgtggccaggctacggactgagcagaggctgccggctggccttggagatcagagct
ctcatggagcagctctatttgaactgtcagctgccaggcttatgtggtgttagcgcgggc
gccatggacaagatcttggaggcggtggtgacgtcgtcatacccggtcagcgtgaagcag
gggctggttcggcgcgtgctggaggcggcgcggcagccgctggagcgtgagcagtgcctg
gcgctgctggcgctgggcgcgcgcctctacgtgggcggcgcggaggagctgccgcgccgc
gtgggctgccagctgctgcacgtggccggccgccaccaccccgacgtcttcgccgagttc
ttcagcgcgcgtcgcgtgctgcgcctgctgcagggtggcgccggccccccgggcccccgc
gcgctcgcctgcgtgcagctgggtctgcagctgctgcccgaggggcctgcggccgacgag
gtgttcgcgctgctgcggcgcgaggtgctgcgcaccgtgtgcgagcgcccgggccccgcg
gcctgcgcgcaggtggcacggctgctggctcgccacccgcgctgtgtgcccgacggaccc
caccgcctgctcttctgccagcagctggtgcgttgcctcggccgcttccgctgcccagcc
gaaggcgaggagggcgccgtggagttcctagagcaggcccagcaggtgagcgggctcctg
gcgcagctgtggcgcgcacagcccgccgccatcctgccctgcctcaaagagctgttcgca
gtcatctcctgcgcagaggaggagccaccatctagcgccctggccagcgtggtccagcac
ctcccattggagctcatggatggtgttgtccggaacctcagcaatgatgacagtgtgaca
gactcgcagatgctgactgccattagcaggatgattgactgggtgtcctggcccctgggg
aagaatattgacaagtggatcattgcactgctgaagggcctggctgctgttaagaagttc
agcatcttgatcgaggtttcgctcaccaaaattgagaaggttttctctaagctgctgtac
cccatcgtccggggagctgccttgtctgtgctcaagtacatgctcctgaccttccagcac
tcccacgaagccttccacctgctcctccctcacatcccccccatggtggcctctctggtc
aaggaggactcgaactcggggaccagctgcctggagcagctggcggagctggtccactgc
atggtgttccggttcccgggcttcccggatctgtatgagcctgtcatggaggccatcaag
gacctccatgttcccaatgaggaccgcatcaagcagctgctggggcaggatgcctggact
tcgcagaagagcgagctggcgggtttctatccccggctcatggccaagtcagacacgggc
aagattggtctcatcaacctgggcaacacatgctatgtcaacagcatccttcaggcctta
ttcatggcgtctgacttcagacattgtgtgctccgcttgactgagaacaactcacagccc
ctgatgaccaagctgcagtggctctttggcttcctagaacacagccagcggcctgccatt
tccccagagaacttcctctccgcatcctggacgccctggttcagccctggcacccagcag
gactgctcggagtatctgaagtacctgctggatcggctgcacgaagaggagaaaacgggc
acaaggatctgccagaaactcaagcagtccagctcgccctctccgcccgaggagcccccg
gccccaagttcaacctctgtggaaaaaatgtttggaggcaagatagtgactcggatctgc
tgtctctgctgcctcaacgtctcctcccgggaggaggccttcacggacctctctctcgcc
ttccctcctcctgagcgctgtcgccgccgccgcctgggctctgtgatgcgccccacagaa
gacatcacagcccgggagttgcccccaccaaccagtgcacaggggccaggcagggtgggt
cctcggaggcaaaggaaacactgcatcacagaggacaccccccccaccagcctgtacatc
gaaggcctggactccaaggaagctggtgggcagagcagtcaggaggaaaggatagagagg
gaggaagaagggaaggaggagagaacggagaaggaagaagtgggggaggaggaggaaagc
accagaggggaaggagagagggagaaagaggaggaggtggaagaggaagaagagaaggtg
gagaaggagacagaaaaggaggctgagcaggaaaaggaagaagacagcctgggagcgggg
acccacccggatgctgccatcccctccggggagcggacatgtggctctgagggctcccgc
tccgtcctggacctggttaactacttcctgtcccccgagaagctgacagcagaaaaccgc
tactactgcgagtcgtgtgcctccctgcaggatgccgagaaggtggtggagctgagccaa
gggccgtgctacctcatcctcacactgctgcgcttctctttcgacctgcgcaccatgcgg
cgccgcaagatcctggatgacgtctccatccccctgctgctccgcctgccactggctggt
ggccgtggccaggcctatgacctctgcagtgtggtggtgcactctggagtgtcttcggag
agtggtcactactactgctatgcccgtgagggcgctgcccgccctgccgcttctctggga
actgccgataggccagagcccgagaaccagtggtacctgttcaatgacactcgggtgtcc
ttctcttccttcgaatctgtcagcaacgtcacctccttcttccctaaggacacagcctat
gtgctgttttaccggcagcggcccagggaggggcccgaggctgagttgggctcttctaga
gtccggacagagcccaccctgcacaaggacttgatggaagccatttccaaagacaacatc
ctttacctacaggagcaggagaaggaggcccggagcagggcggcctacatctctgcactc
cccacatctccgcactgggggaggggctttgatgaagacaaggatgaggatgaaggctct
ccagggggctgcaatcctgcaggggctgcctgctgtgatggtggtgtggagtttgggccc
aatgtcacagaccctcaagatgtcacatctaagtga

>gi568815587f:78099596_78313810|GENSCAN_predicted_peptide_5|678_aa
MPWLTPVIPALWEAEAWKKRWFILRSGRMSGDPDVLEYYKNDHSKKPLRIINLNFCEQVD
AGLTFNKKELQDSFVFDIKTSERTFYLVAETEEDMNKWVQSICQICGFNQAEESTAELSS
SSQHLLRERKSSAPSHSSQPTLFTFEPPVSNHMQPTLSTSAPQEYLYLHQCISRRAENAS
LWGLIPDSSQCLSSYFKKAVKPVTDDTLMSASFSQGTRASFLMRSDTAVQKLAQGNGHCV
NGISGQVHGFYSLPKPSRHNTEFRDSTYDLPRSLASHGHTKGSLTGSETDNEDVYTFKTP
SNTLCREFGDLLVDNMDVPATPLSAYQIPRTFTLDKNHNAMTVATPGDSAIAPPPRPPKP
SQAETPRWGSPQQRPPISENSRSVAATIPRRNTLPAMDNSRLHRASSCETYEYPQRGGES
AGRSAESMSDGVGSFLPGKMIVGRSDSTNSEDNYVPMNPGSSTLLAMERAGDNSQSVYIP
MSPGAHHFDSLGYPSTTLPVHRGPSRGSEIQPPPVNRNLKPDRKAKPTPLDLRNNTVIDE
LPFKSPITKSWSRANHTFNSSSSQYCRPISTQSITSTDSGDSEENYVPMQNPVSASPVPS
GTNSPAPKKSTGSVDYLALDFQPSSPSPHRKPSTSSVTSDEKVDYVQVDKEKTQALQNTM
QEWTDVRQSSEPSKGAKL

>gi568815587f:78099596_78313810|GENSCAN_predicted_CDS_5|2037_bp
atgccgtggctcacgcctgtaatcccagcactttgggaggctgaggcctggaagaaacgc
tggtttatcctgcggagtggccggatgagcggtgacccagatgttctggaatactacaag
aacgatcactccaagaagcctctgcggatcatcaacctgaacttctgtgagcaggtagat
gcaggcctgacctttaacaagaaggagctgcaggatagttttgtgtttgacatcaagacc
agtgaacgcaccttttacctggtggctgagacagaagaggacatgaataagtgggtccag
agcatctgccagatctgtggcttcaatcaggctgaggagagcacagctgagctcagcagc
tctagccagcaccttctccgagagcgcaagtcctcagccccatcacactccagccagcca
actctgttcacgtttgaaccccctgtgtcaaaccacatgcagcccaccttgtccaccagc
gcacctcaggagtatctctacttgcaccagtgcataagccgaagagcagaaaatgcaagc
ctttggggcctcattccagattctagccagtgtctgtcttcttatttcaagaaggctgtt
aaacctgtaacagatgatacattgatgagtgccagcttctctcagggcaccagagcctct
tttctcatgaggagtgacacagctgtacaaaaacttgcccagggcaatggacactgtgtc
aacgggatcagtggtcaagtccatggcttctatagccttcccaagccgagccggcacaat
acagaattcagagacagtacctacgacctcccccgcagcctggcctcccatggccacacc
aagggcagcctcacaggctccgagacagataatgaggatgtgtacaccttcaagacgccc
agcaacaccctgtgcagggagttcggggacctcctggtagacaatatggatgttccggcc
accccactctcagcctaccagatccctaggacattcactctggacaaaaaccacaatgcc
atgacagtggccactcctggggactcagccatagctcccccaccccgcccccccaagcca
agtcaggcagaaacacctcgatggggcagtcctcagcagagaccgccaatcagtgaaaat
agcagatctgtcgctgccaccatccccagacgcaacaccctccctgcaatggacaacagc
cgacttcaccgagcttcttcctgtgagacctacgagtacccacagcgtggtggagagagt
gcaggccggtctgctgaatccatgagtgatggagttggctctttcctgccagggaaaatg
attgtgggccgatcggacagcaccaattctgaagacaactatgtgcccatgaatccaggt
tcttccaccctgttggccatggaacgagcaggtgataattcccagagcgtctacatccca
atgagcccaggggcccatcactttgactcacttggctacccatcaacaacccttcctgtg
caccgaggccccagcagaggaagtgagattcagccaccccctgtcaaccgcaacctcaaa
cctgatcggaaagcaaagccaacaccacttgacctgaggaacaacaccgtcatcgatgaa
ctccccttcaagtcacctatcaccaagtcttggtctagggccaaccacaccttcaactcc
agctcctcccagtactgccgccccatctccacccagagcatcaccagcacagactcagga
gacagcgaagagaactatgtccctatgcaaaacccagtgtctgcatctcccgttcccagt
ggcacgaacagtcctgcccctaagaagagcaccggcagcgttgattatctggccctggac
ttccagccgagctccccaagcccccaccgcaagccatctacttcatccgtcacctctgat
gagaaggtggactacgttcaggtggacaaggagaagacccaggccctgcagaacaccatg
caggagtggacagacgtgcggcagtcctcagagccttccaagggtgccaagctgtga