GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:50:45

Sequence gi568815587r:78073775_78274554 : 200780 bp : 45.79% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   5970   5805  166  0  1   94  105   288 0.952  29.19
 1.00 Prom -  16916  16877   40                               -4.06

 2.00 Prom +  28295  28334   40                               -4.96
 2.01 Init +  62939  62975   37  1  1   69  101    37 0.900   3.27
 2.02 Intr +  65696  65800  105  0  0   59   42    99 0.765   2.59
 2.03 Intr +  67821  67928  108  1  0   42   93   215 0.947  17.66
 2.04 Term +  74152  74162   11  2  2  106   54     0 0.213  -3.24
 2.05 PlyA +  79225  79230    6                                1.05

 3.04 PlyA -  79706  79701    6                                1.05
 3.03 Term -  95457  95319  139  2  1   93   43    43 0.216  -2.26
 3.02 Intr -  96839  96750   90  1  0   54  100    42 0.571   1.11
 3.01 Init - 100780 100002  779  1  2   68   98  1312 0.876 123.17
 3.00 Prom - 101711 101672   40                               -1.86

 4.00 Prom + 102321 102360   40                               -6.26
 4.01 Init + 103036 103097   62  0  2  116   37   105 0.914   8.92
 4.02 Intr + 104678 104777  100  2  1   55   67    60 0.209   0.71
 4.03 Intr + 109367 109424   58  1  1   49  113    57 0.021   2.76
 4.04 Term + 112732 112814   83  2  2   50   39    88 0.003  -2.04
 4.05 PlyA + 113371 113376    6                               -0.45

 5.00 Prom + 113883 113922   40                               -4.26
 5.01 Init + 118769 118941  173  1  2   98   43   148 0.976   8.21
 5.02 Intr + 122462 123144  683  2  2  144   96  1216 0.964 119.23
 5.03 Intr + 124162 124294  133  2  1  105   75   209 0.990  21.00
 5.04 Intr + 125821 125950  130  1  1   86   81   154 0.969  15.20
 5.05 Intr + 126359 126460  102  1  0   43   95   136 0.920  10.17
 5.06 Intr + 126876 127034  159  2  0   80  105   186 0.921  19.68
 5.07 Intr + 132068 132261  194  1  2  119   30   255 0.991  21.09
 5.08 Intr + 133756 133849   94  1  1   45  109    54 0.517   3.17
 5.09 Intr + 135083 135189  107  1  2   94   99   143 0.999  15.01
 5.10 Intr + 135674 136970 1297  2  1   85   99  1513 0.765 139.40
 5.11 Intr + 139872 140001  130  2  1   97   38   120 0.702   8.17
 5.12 Term + 141204 141277   74  1  2  106   43    95 0.941   4.87
 5.13 PlyA + 141436 141441    6                                1.05

 6.11 PlyA - 141548 141543    6                                1.05
 6.10 Term - 145641 145498  144  0  0  100   45   264 0.970  21.21
 6.09 Intr - 146670 146545  126  0  0   84   46    98 0.962   6.08
 6.08 Intr - 148005 147903  103  2  1   84  101   125 0.993  13.58
 6.07 Intr - 148421 148331   91  0  1   84  108    92 0.999   9.85
 6.06 Intr - 149902 149638  265  1  1   93  115   342 0.999  34.49
 6.05 Intr - 151428 151334   95  1  2   75  105    87 0.990   8.78
 6.04 Intr - 153277 152691  587  0  2  141   78   439 0.938  40.49
 6.03 Intr - 158655 158569   87  2  0   84   76    27 0.163   0.19
 6.02 Intr - 176575 176383  193  2  1   92   97   128 0.466  12.65
 6.01 Init - 197098 197056   43  1  1   61  103    48 0.034   2.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +   9722   9758   37  1  1   92  110    49 0.861   7.70
S.002 Intr - 115045 114799  247  2  1   60   90   121 0.856   6.02
S.003 Intr - 115277 115118  160  2  1  106   92    41 0.868   5.86
S.004 Init - 117751 117746    6  1  0  123   92     0 0.880   4.77

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:78073775_78274554|GENSCAN_predicted_peptide_1|56_aa
MIARRNPEPLRFLPDEARSLPPPKLTDPRLLYIGFLGYCSGLIDNLIRRRPIATAX

>gi568815587r:78073775_78274554|GENSCAN_predicted_CDS_1|168_bp
atgatcgcacggcggaacccagaacccttacggtttctgccggatgaggcccggagcctg
cccccgcccaagctgaccgacccgcggctcctctacatcggcttcttgggctactgctcc
ggcctgattgataacctaatccggcggaggccgatcgcgacggctgnn

>gi568815587r:78073775_78274554|GENSCAN_predicted_peptide_2|86_aa
MAQLDVIDGIFQSPGRASPLLCGDEKAFEKSHPERQSRKPIASTRGNSTVKWAAEDDDDD
DLDTEKQKTNEDDQTAKKDKLKEGEK

>gi568815587r:78073775_78274554|GENSCAN_predicted_CDS_2|261_bp
atggctcaactagatgtcatagatggcatcttccaatcccctggccgtgcgagtccctta
ctatgtggggatgagaaggcatttgagaagagtcaccccgagcgccaaagccgaaaacca
attgccagtacccgtggcaattctacagtcaaatgggcagctgaagatgatgatgatgat
gatcttgacaccgagaagcagaagaccaatgaagatgaccagacagcaaaaaaggataag
ttaaaagaaggtgaaaaatga

>gi568815587r:78073775_78274554|GENSCAN_predicted_peptide_3|335_aa
MSDPITLNVGGKLYTTSLATLTSFPDSMLGAMFSGKMPTKRDSQGNCFIDRDGKVFRYIL
NFLRTSHLDLPEDFQEMGLLRREADFYQVQPLIEALQEKEVELSKAEKNAMLNITLNQRV
QTVHFTVREAPQIYSLSSSSMEVFNANIFSTSCLFLKLLGSKLFYCSNGNLSSITSHLQD
PNHLTLDWVANVEGLPEEEYTKQNLKRLWVVPANKQINSFQVFVEEVLKIALSDGFCIDS
SHPHALDFMNNKIIRLIRYRLGELSLLLGKSMRKTENKRSAVMAAPSTDGGLWADGGGWL
GNVACDQLQEQNNPDNRPIKGRDESLAALSAQTAW

>gi568815587r:78073775_78274554|GENSCAN_predicted_CDS_3|1008_bp
atgtccgaccccatcacgctgaacgtcggggggaagctctatacaacctcactggcgacc
ctgaccagcttccctgactccatgctaggcgccatgttcagcgggaagatgcccaccaag
agggacagccagggcaactgcttcattgaccgtgacggcaaagtgttccgctatatcctc
aacttcctgcggacctcccaccttgacctgcctgaggacttccaggagatggggctgctc
cgcagggaggccgacttctaccaggtgcagcccctgattgaggccctgcaggagaaggaa
gtggagctctccaaggccgagaagaatgccatgctcaacatcacactgaaccagcgtgtg
cagacggtccacttcactgtgcgcgaggcaccccagatctacagcctctcctcttccagc
atggaggtcttcaacgccaacatcttcagcacctcctgcctcttcctcaagctccttggc
tctaagctcttctactgctccaatggcaatctctcctccatcaccagccacttgcaggac
cccaaccacctgactctggactgggtggccaatgtggagggcctgccagaggaggagtac
accaagcagaacctcaagaggctctgggtggtgcccgccaacaagcagatcaacagcttc
caggtcttcgtggaagaggtactgaaaatcgctctgagcgatggcttctgcatcgattct
tctcacccacatgctctggattttatgaacaataagattattcgattaatacggtacagg
ttaggtgaactcagccttctgcttggcaaaagcatgagaaagacggagaacaaacgttca
gcagtgatggcagcaccatcgacagacggggggctctgggctgatgggggtggctggctg
ggaaacgtggcctgtgatcagctccaggagcaaaacaaccctgataacaggcccatcaaa
gggagggacgagagtctggcagctctgtctgcccagacagcctggtga

>gi568815587r:78073775_78274554|GENSCAN_predicted_peptide_4|100_aa
MTKSCLYNNNNNNNNNNNKAGATVSIITNPRPCYKRQNDMTRGEFHESAEVCSQKPGLPG
ETDDSWTFTGKVQAVQPWARSWADDDDHNNNKHYNGNNNG

>gi568815587r:78073775_78274554|GENSCAN_predicted_CDS_4|303_bp
atgacgaaatcctgtctctacaacaacaacaacaacaacaacaacaacaacaacaaagca
ggggccacagtgtccatcatcacaaatccacggccctgctacaaaaggcaaaatgacatg
acacgaggggagtttcatgagtcggcggaggtttgcagtcagaaaccagggcttcctgga
gaaactgatgattcctggactttcacaggaaaagttcaagctgtgcagccctgggcacgt
tcctgggcagatgatgatgatcataacaacaacaaacactacaacggcaataataatggc
taa

>gi568815587r:78073775_78274554|GENSCAN_predicted_peptide_5|1091_aa
MVTAVLRLPAWELGQVGRACLLWPGYGLSRGCRLALEIRALMEQLYLNCQLPGLCGVSAG
AMDKILEAVVTSSYPVSVKQGLVRRVLEAARQPLEREQCLALLALGARLYVGGAEELPRR
VGCQLLHVAGRHHPDVFAEFFSARRVLRLLQGGAGPPGPRALACVQLGLQLLPEGPAADE
VFALLRREVLRTVCERPGPAACAQVARLLARHPRCVPDGPHRLLFCQQLVRCLGRFRCPA
EGEEGAVEFLEQAQQVSGLLAQLWRAQPAAILPCLKELFAVISCAEEEPPSSALASVVQH
LPLELMDGVVRNLSNDDSVTDSQMLTAISRMIDWVSWPLGKNIDKWIIALLKGLAAVKKF
SILIEVSLTKIEKVFSKLLYPIVRGAALSVLKYMLLTFQHSHEAFHLLLPHIPPMVASLV
KEDSNSGTSCLEQLAELVHCMVFRFPGFPDLYEPVMEAIKDLHVPNEDRIKQLLGQDAWT
SQKSELAGFYPRLMAKSDTGKIGLINLGNTCYVNSILQALFMASDFRHCVLRLTENNSQP
LMTKLQWLFGFLEHSQRPAISPENFLSASWTPWFSPGTQQDCSEYLKYLLDRLHEEEKTG
TRICQKLKQSSSPSPPEEPPAPSSTSVEKMFGGKIVTRICCLCCLNVSSREEAFTDLSLA
FPPPERCRRRRLGSVMRPTEDITARELPPPTSAQGPGRVGPRRQRKHCITEDTPPTSLYI
EGLDSKEAGGQSSQEERIEREEEGKEERTEKEEVGEEEESTRGEGEREKEEEVEEEEEKV
EKETEKEAEQEKEEDSLGAGTHPDAAIPSGERTCGSEGSRSVLDLVNYFLSPEKLTAENR
YYCESCASLQDAEKVVELSQGPCYLILTLLRFSFDLRTMRRRKILDDVSIPLLLRLPLAG
GRGQAYDLCSVVVHSGVSSESGHYYCYAREGAARPAASLGTADRPEPENQWYLFNDTRVS
FSSFESVSNVTSFFPKDTAYVLFYRQRPREGPEAELGSSRVRTEPTLHKDLMEAISKDNI
LYLQEQEKEARSRAAYISALPTSPHWGRGFDEDKDEDEGSPGGCNPAGAACCDGGVEFGP
NVTDPQDVTSK

>gi568815587r:78073775_78274554|GENSCAN_predicted_CDS_5|3276_bp
atggtgacagctgtcctcagactgcctgcgtgggagttggggcaggtgggcagggcttgc
ttgctgtggccaggctacggactgagcagaggctgccggctggccttggagatcagagct
ctcatggagcagctctatttgaactgtcagctgccaggcttatgtggtgttagcgcgggc
gccatggacaagatcttggaggcggtggtgacgtcgtcatacccggtcagcgtgaagcag
gggctggttcggcgcgtgctggaggcggcgcggcagccgctggagcgtgagcagtgcctg
gcgctgctggcgctgggcgcgcgcctctacgtgggcggcgcggaggagctgccgcgccgc
gtgggctgccagctgctgcacgtggccggccgccaccaccccgacgtcttcgccgagttc
ttcagcgcgcgtcgcgtgctgcgcctgctgcagggtggcgccggccccccgggcccccgc
gcgctcgcctgcgtgcagctgggtctgcagctgctgcccgaggggcctgcggccgacgag
gtgttcgcgctgctgcggcgcgaggtgctgcgcaccgtgtgcgagcgcccgggccccgcg
gcctgcgcgcaggtggcacggctgctggctcgccacccgcgctgtgtgcccgacggaccc
caccgcctgctcttctgccagcagctggtgcgttgcctcggccgcttccgctgcccagcc
gaaggcgaggagggcgccgtggagttcctagagcaggcccagcaggtgagcgggctcctg
gcgcagctgtggcgcgcacagcccgccgccatcctgccctgcctcaaagagctgttcgca
gtcatctcctgcgcagaggaggagccaccatctagcgccctggccagcgtggtccagcac
ctcccattggagctcatggatggtgttgtccggaacctcagcaatgatgacagtgtgaca
gactcgcagatgctgactgccattagcaggatgattgactgggtgtcctggcccctgggg
aagaatattgacaagtggatcattgcactgctgaagggcctggctgctgttaagaagttc
agcatcttgatcgaggtttcgctcaccaaaattgagaaggttttctctaagctgctgtac
cccatcgtccggggagctgccttgtctgtgctcaagtacatgctcctgaccttccagcac
tcccacgaagccttccacctgctcctccctcacatcccccccatggtggcctctctggtc
aaggaggactcgaactcggggaccagctgcctggagcagctggcggagctggtccactgc
atggtgttccggttcccgggcttcccggatctgtatgagcctgtcatggaggccatcaag
gacctccatgttcccaatgaggaccgcatcaagcagctgctggggcaggatgcctggact
tcgcagaagagcgagctggcgggtttctatccccggctcatggccaagtcagacacgggc
aagattggtctcatcaacctgggcaacacatgctatgtcaacagcatccttcaggcctta
ttcatggcgtctgacttcagacattgtgtgctccgcttgactgagaacaactcacagccc
ctgatgaccaagctgcagtggctctttggcttcctagaacacagccagcggcctgccatt
tccccagagaacttcctctccgcatcctggacgccctggttcagccctggcacccagcag
gactgctcggagtatctgaagtacctgctggatcggctgcacgaagaggagaaaacgggc
acaaggatctgccagaaactcaagcagtccagctcgccctctccgcccgaggagcccccg
gccccaagttcaacctctgtggaaaaaatgtttggaggcaagatagtgactcggatctgc
tgtctctgctgcctcaacgtctcctcccgggaggaggccttcacggacctctctctcgcc
ttccctcctcctgagcgctgtcgccgccgccgcctgggctctgtgatgcgccccacagaa
gacatcacagcccgggagttgcccccaccaaccagtgcacaggggccaggcagggtgggt
cctcggaggcaaaggaaacactgcatcacagaggacaccccccccaccagcctgtacatc
gaaggcctggactccaaggaagctggtgggcagagcagtcaggaggaaaggatagagagg
gaggaagaagggaaggaggagagaacggagaaggaagaagtgggggaggaggaggaaagc
accagaggggaaggagagagggagaaagaggaggaggtggaagaggaagaagagaaggtg
gagaaggagacagaaaaggaggctgagcaggaaaaggaagaagacagcctgggagcgggg
acccacccggatgctgccatcccctccggggagcggacatgtggctctgagggctcccgc
tccgtcctggacctggttaactacttcctgtcccccgagaagctgacagcagaaaaccgc
tactactgcgagtcgtgtgcctccctgcaggatgccgagaaggtggtggagctgagccaa
gggccgtgctacctcatcctcacactgctgcgcttctctttcgacctgcgcaccatgcgg
cgccgcaagatcctggatgacgtctccatccccctgctgctccgcctgccactggctggt
ggccgtggccaggcctatgacctctgcagtgtggtggtgcactctggagtgtcttcggag
agtggtcactactactgctatgcccgtgagggcgctgcccgccctgccgcttctctggga
actgccgataggccagagcccgagaaccagtggtacctgttcaatgacactcgggtgtcc
ttctcttccttcgaatctgtcagcaacgtcacctccttcttccctaaggacacagcctat
gtgctgttttaccggcagcggcccagggaggggcccgaggctgagttgggctcttctaga
gtccggacagagcccaccctgcacaaggacttgatggaagccatttccaaagacaacatc
ctttacctacaggagcaggagaaggaggcccggagcagggcggcctacatctctgcactc
cccacatctccgcactgggggaggggctttgatgaagacaaggatgaggatgaaggctct
ccagggggctgcaatcctgcaggggctgcctgctgtgatggtggtgtggagtttgggccc
aatgtcacagaccctcaagatgtcacatctaagtga

>gi568815587r:78073775_78274554|GENSCAN_predicted_peptide_6|577_aa
MEMMLLTALLGGCTAELSSSSQHLLRERKSSAPSHSSQPTLFTFEPPVSNHMQPTLSTSA
PQEYLYLHQCISRRAENASLWGLIPDSSQCLSSYFKKAVKPVTDDTLMSASFSQGTRASF
LMRSDTAVQKLAQGNGHCVNGISGQVHGFYSLPKPSRHNTEFRDSTYDLPRSLASHGHTK
GSLTGSETDNEDVYTFKTPSNTLCREFGDLLVDNMDVPATPLSAYQIPRTFTLDKNHNAM
TVATPGDSAIAPPPRPPKPSQAETPRWGSPQQRPPISENSRSVAATIPRRNTLPAMDNSR
LHRASSCETYEYPQRGGESAGRSAESMSDGVGSFLPGKMIVGRSDSTNSEDNYVPMNPGS
STLLAMERAGDNSQSVYIPMSPGAHHFDSLGYPSTTLPVHRGPSRGSEIQPPPVNRNLKP
DRKAKPTPLDLRNNTVIDELPFKSPITKSWSRANHTFNSSSSQYCRPISTQSITSTDSGD
SEENYVPMQNPVSASPVPSGTNSPAPKKSTGSVDYLALDFQPSSPSPHRKPSTSSVTSDE
KVDYVQVDKEKTQALQNTMQEWTDVRQSSEPSKGAKL

>gi568815587r:78073775_78274554|GENSCAN_predicted_CDS_6|1734_bp
atggagatgatgctcctcacagcgttgttgggaggttgcacagctgagctcagcagctct
agccagcaccttctccgagagcgcaagtcctcagccccatcacactccagccagccaact
ctgttcacgtttgaaccccctgtgtcaaaccacatgcagcccaccttgtccaccagcgca
cctcaggagtatctctacttgcaccagtgcataagccgaagagcagaaaatgcaagcctt
tggggcctcattccagattctagccagtgtctgtcttcttatttcaagaaggctgttaaa
cctgtaacagatgatacattgatgagtgccagcttctctcagggcaccagagcctctttt
ctcatgaggagtgacacagctgtacaaaaacttgcccagggcaatggacactgtgtcaac
gggatcagtggtcaagtccatggcttctatagccttcccaagccgagccggcacaataca
gaattcagagacagtacctacgacctcccccgcagcctggcctcccatggccacaccaag
ggcagcctcacaggctccgagacagataatgaggatgtgtacaccttcaagacgcccagc
aacaccctgtgcagggagttcggggacctcctggtagacaatatggatgttccggccacc
ccactctcagcctaccagatccctaggacattcactctggacaaaaaccacaatgccatg
acagtggccactcctggggactcagccatagctcccccaccccgcccccccaagccaagt
caggcagaaacacctcgatggggcagtcctcagcagagaccgccaatcagtgaaaatagc
agatctgtcgctgccaccatccccagacgcaacaccctccctgcaatggacaacagccga
cttcaccgagcttcttcctgtgagacctacgagtacccacagcgtggtggagagagtgca
ggccggtctgctgaatccatgagtgatggagttggctctttcctgccagggaaaatgatt
gtgggccgatcggacagcaccaattctgaagacaactatgtgcccatgaatccaggttct
tccaccctgttggccatggaacgagcaggtgataattcccagagcgtctacatcccaatg
agcccaggggcccatcactttgactcacttggctacccatcaacaacccttcctgtgcac
cgaggccccagcagaggaagtgagattcagccaccccctgtcaaccgcaacctcaaacct
gatcggaaagcaaagccaacaccacttgacctgaggaacaacaccgtcatcgatgaactc
cccttcaagtcacctatcaccaagtcttggtctagggccaaccacaccttcaactccagc
tcctcccagtactgccgccccatctccacccagagcatcaccagcacagactcaggagac
agcgaagagaactatgtccctatgcaaaacccagtgtctgcatctcccgttcccagtggc
acgaacagtcctgcccctaagaagagcaccggcagcgttgattatctggccctggacttc
cagccgagctccccaagcccccaccgcaagccatctacttcatccgtcacctctgatgag
aaggtggactacgttcaggtggacaaggagaagacccaggccctgcagaacaccatgcag
gagtggacagacgtgcggcagtcctcagagccttccaagggtgccaagctgtga