GENSCAN 1.0    Date run: 2-Nov-116    Time: 19:20:15

Sequence gi568815587r:78119275_78380862 : 261588 bp : 43.93% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  17439  17475   37  2  1   69  101    37 0.922   3.27
 1.02 Intr +  20196  20300  105  1  0   59   42    99 0.785   2.59
 1.03 Intr +  22321  22428  108  2  0   42   93   215 0.948  17.66
 1.04 Term +  28652  28662   11  0  2  106   54     0 0.213  -3.24
 1.05 PlyA +  33725  33730    6                               1.05

 2.04 PlyA -  34206  34201    6                               1.05
 2.03 Term -  49957  49819  139  0  1   93   43    43 0.216  -2.26
 2.02 Intr -  51339  51250   90  2  0   54  100    42 0.571   1.11
 2.01 Init -  55280  54502  779  2  2   68   98  1312 0.876 123.17
 2.00 Prom -  56211  56172   40                              -1.86

 3.00 Prom +  56821  56860   40                              -6.26
 3.01 Init +  57536  57597   62  1  2  116   37   105 0.914   8.92
 3.02 Intr +  59178  59277  100  0  1   55   67    60 0.209   0.71
 3.03 Intr +  63867  63924   58  2  1   49  113    57 0.021   2.76
 3.04 Term +  67232  67314   83  0  2   50   39    88 0.003  -2.04
 3.05 PlyA +  67871  67876    6                              -0.45

 4.00 Prom +  68383  68422   40                              -4.26
 4.01 Init +  73269  73441  173  2  2   98   43   148 0.976   8.21
 4.02 Intr +  76962  77644  683  0  2  144   96  1216 0.964 119.23
 4.03 Intr +  78662  78794  133  0  1  105   75   209 0.990  21.00
 4.04 Intr +  80321  80450  130  2  1   86   81   154 0.969  15.20
 4.05 Intr +  80859  80960  102  2  0   43   95   136 0.920  10.17
 4.06 Intr +  81376  81534  159  0  0   80  105   186 0.921  19.68
 4.07 Intr +  86568  86761  194  2  2  119   30   255 0.991  21.09
 4.08 Intr +  88256  88349   94  2  1   45  109    54 0.517   3.17
 4.09 Intr +  89583  89689  107  2  2   94   99   143 0.999  15.01
 4.10 Intr +  90174  91470 1297  0  1   85   99  1513 0.765 139.40
 4.11 Intr +  94372  94501  130  0  1   97   38   120 0.702   8.17
 4.12 Term +  95704  95777   74  2  2  106   43    95 0.941   4.87
 4.13 PlyA +  95936  95941    6                               1.05

 5.12 PlyA -  96048  96043    6                               1.05
 5.11 Term - 100141  99998  144  1  0  100   45   264 0.970  21.21
 5.10 Intr - 101170 101045  126  1  0   84   46    98 0.962   6.08
 5.09 Intr - 102505 102403  103  0  1   84  101   125 0.993  13.58
 5.08 Intr - 102921 102831   91  1  1   84  108    92 0.999   9.85
 5.07 Intr - 104402 104138  265  2  1   93  115   342 0.999  34.49
 5.06 Intr - 105928 105834   95  2  2   75  105    87 0.990   8.78
 5.05 Intr - 107777 107191  587  1  2  141   78   439 0.938  40.49
 5.04 Intr - 113155 113069   87  0  0   84   76    27 0.163   0.19
 5.03 Intr - 131075 130883  193  0  1   92   97   128 0.466  12.65
 5.02 Intr - 161627 161327  301  2  1   61   80   476 0.922  40.41
 5.01 Init - 166486 166442   45  1  0   84   90    38 0.619   2.28
 5.00 Prom - 175126 175087   40                              -2.46

 6.00 Prom + 192014 192053   40                              -2.66
 6.01 Init + 200787 200859   73  2  1   85   21   110 0.233   3.23
 6.02 Intr + 210591 210674   84  1  0   95   95    35 0.637   4.69
 6.03 Term + 212762 212970  209  0  2   99   42    73 0.652   1.30
 6.04 PlyA + 213324 213329    6                               1.05

 7.00 Prom + 238699 238738   40                              -2.46
 7.01 Init + 243537 243581   45  2  0   43   93    39 0.642   0.58
 7.02 Term + 248882 249049  168  1  0   90   31   128 0.878   5.18
 7.03 PlyA + 249280 249285    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  69545  69299  247  0  1   60   90   121 0.856   6.02
S.002 Intr -  69777  69618  160  0  1  106   92    41 0.868   5.86
S.003 Init -  72251  72246    6  2  0  123   92     0 0.880   4.77
S.004 Term + 151455 151509   55  0  1   98   53   107 0.804   5.23

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:78119275_78380862|GENSCAN_predicted_peptide_1|86_aa
MAQLDVIDGIFQSPGRASPLLCGDEKAFEKSHPERQSRKPIASTRGNSTVKWAAEDDDDD
DLDTEKQKTNEDDQTAKKDKLKEGEK

>gi568815587r:78119275_78380862|GENSCAN_predicted_CDS_1|261_bp
atggctcaactagatgtcatagatggcatcttccaatcccctggccgtgcgagtccctta
ctatgtggggatgagaaggcatttgagaagagtcaccccgagcgccaaagccgaaaacca
attgccagtacccgtggcaattctacagtcaaatgggcagctgaagatgatgatgatgat
gatcttgacaccgagaagcagaagaccaatgaagatgaccagacagcaaaaaaggataag
ttaaaagaaggtgaaaaatga

>gi568815587r:78119275_78380862|GENSCAN_predicted_peptide_2|335_aa
MSDPITLNVGGKLYTTSLATLTSFPDSMLGAMFSGKMPTKRDSQGNCFIDRDGKVFRYIL
NFLRTSHLDLPEDFQEMGLLRREADFYQVQPLIEALQEKEVELSKAEKNAMLNITLNQRV
QTVHFTVREAPQIYSLSSSSMEVFNANIFSTSCLFLKLLGSKLFYCSNGNLSSITSHLQD
PNHLTLDWVANVEGLPEEEYTKQNLKRLWVVPANKQINSFQVFVEEVLKIALSDGFCIDS
SHPHALDFMNNKIIRLIRYRLGELSLLLGKSMRKTENKRSAVMAAPSTDGGLWADGGGWL
GNVACDQLQEQNNPDNRPIKGRDESLAALSAQTAW

>gi568815587r:78119275_78380862|GENSCAN_predicted_CDS_2|1008_bp
atgtccgaccccatcacgctgaacgtcggggggaagctctatacaacctcactggcgacc
ctgaccagcttccctgactccatgctaggcgccatgttcagcgggaagatgcccaccaag
agggacagccagggcaactgcttcattgaccgtgacggcaaagtgttccgctatatcctc
aacttcctgcggacctcccaccttgacctgcctgaggacttccaggagatggggctgctc
cgcagggaggccgacttctaccaggtgcagcccctgattgaggccctgcaggagaaggaa
gtggagctctccaaggccgagaagaatgccatgctcaacatcacactgaaccagcgtgtg
cagacggtccacttcactgtgcgcgaggcaccccagatctacagcctctcctcttccagc
atggaggtcttcaacgccaacatcttcagcacctcctgcctcttcctcaagctccttggc
tctaagctcttctactgctccaatggcaatctctcctccatcaccagccacttgcaggac
cccaaccacctgactctggactgggtggccaatgtggagggcctgccagaggaggagtac
accaagcagaacctcaagaggctctgggtggtgcccgccaacaagcagatcaacagcttc
caggtcttcgtggaagaggtactgaaaatcgctctgagcgatggcttctgcatcgattct
tctcacccacatgctctggattttatgaacaataagattattcgattaatacggtacagg
ttaggtgaactcagccttctgcttggcaaaagcatgagaaagacggagaacaaacgttca
gcagtgatggcagcaccatcgacagacggggggctctgggctgatgggggtggctggctg
ggaaacgtggcctgtgatcagctccaggagcaaaacaaccctgataacaggcccatcaaa
gggagggacgagagtctggcagctctgtctgcccagacagcctggtga

>gi568815587r:78119275_78380862|GENSCAN_predicted_peptide_3|100_aa
MTKSCLYNNNNNNNNNNNKAGATVSIITNPRPCYKRQNDMTRGEFHESAEVCSQKPGLPG
ETDDSWTFTGKVQAVQPWARSWADDDDHNNNKHYNGNNNG

>gi568815587r:78119275_78380862|GENSCAN_predicted_CDS_3|303_bp
atgacgaaatcctgtctctacaacaacaacaacaacaacaacaacaacaacaacaaagca
ggggccacagtgtccatcatcacaaatccacggccctgctacaaaaggcaaaatgacatg
acacgaggggagtttcatgagtcggcggaggtttgcagtcagaaaccagggcttcctgga
gaaactgatgattcctggactttcacaggaaaagttcaagctgtgcagccctgggcacgt
tcctgggcagatgatgatgatcataacaacaacaaacactacaacggcaataataatggc
taa

>gi568815587r:78119275_78380862|GENSCAN_predicted_peptide_4|1091_aa
MVTAVLRLPAWELGQVGRACLLWPGYGLSRGCRLALEIRALMEQLYLNCQLPGLCGVSAG
AMDKILEAVVTSSYPVSVKQGLVRRVLEAARQPLEREQCLALLALGARLYVGGAEELPRR
VGCQLLHVAGRHHPDVFAEFFSARRVLRLLQGGAGPPGPRALACVQLGLQLLPEGPAADE
VFALLRREVLRTVCERPGPAACAQVARLLARHPRCVPDGPHRLLFCQQLVRCLGRFRCPA
EGEEGAVEFLEQAQQVSGLLAQLWRAQPAAILPCLKELFAVISCAEEEPPSSALASVVQH
LPLELMDGVVRNLSNDDSVTDSQMLTAISRMIDWVSWPLGKNIDKWIIALLKGLAAVKKF
SILIEVSLTKIEKVFSKLLYPIVRGAALSVLKYMLLTFQHSHEAFHLLLPHIPPMVASLV
KEDSNSGTSCLEQLAELVHCMVFRFPGFPDLYEPVMEAIKDLHVPNEDRIKQLLGQDAWT
SQKSELAGFYPRLMAKSDTGKIGLINLGNTCYVNSILQALFMASDFRHCVLRLTENNSQP
LMTKLQWLFGFLEHSQRPAISPENFLSASWTPWFSPGTQQDCSEYLKYLLDRLHEEEKTG
TRICQKLKQSSSPSPPEEPPAPSSTSVEKMFGGKIVTRICCLCCLNVSSREEAFTDLSLA
FPPPERCRRRRLGSVMRPTEDITARELPPPTSAQGPGRVGPRRQRKHCITEDTPPTSLYI
EGLDSKEAGGQSSQEERIEREEEGKEERTEKEEVGEEEESTRGEGEREKEEEVEEEEEKV
EKETEKEAEQEKEEDSLGAGTHPDAAIPSGERTCGSEGSRSVLDLVNYFLSPEKLTAENR
YYCESCASLQDAEKVVELSQGPCYLILTLLRFSFDLRTMRRRKILDDVSIPLLLRLPLAG
GRGQAYDLCSVVVHSGVSSESGHYYCYAREGAARPAASLGTADRPEPENQWYLFNDTRVS
FSSFESVSNVTSFFPKDTAYVLFYRQRPREGPEAELGSSRVRTEPTLHKDLMEAISKDNI
LYLQEQEKEARSRAAYISALPTSPHWGRGFDEDKDEDEGSPGGCNPAGAACCDGGVEFGP
NVTDPQDVTSK

>gi568815587r:78119275_78380862|GENSCAN_predicted_CDS_4|3276_bp
atggtgacagctgtcctcagactgcctgcgtgggagttggggcaggtgggcagggcttgc
ttgctgtggccaggctacggactgagcagaggctgccggctggccttggagatcagagct
ctcatggagcagctctatttgaactgtcagctgccaggcttatgtggtgttagcgcgggc
gccatggacaagatcttggaggcggtggtgacgtcgtcatacccggtcagcgtgaagcag
gggctggttcggcgcgtgctggaggcggcgcggcagccgctggagcgtgagcagtgcctg
gcgctgctggcgctgggcgcgcgcctctacgtgggcggcgcggaggagctgccgcgccgc
gtgggctgccagctgctgcacgtggccggccgccaccaccccgacgtcttcgccgagttc
ttcagcgcgcgtcgcgtgctgcgcctgctgcagggtggcgccggccccccgggcccccgc
gcgctcgcctgcgtgcagctgggtctgcagctgctgcccgaggggcctgcggccgacgag
gtgttcgcgctgctgcggcgcgaggtgctgcgcaccgtgtgcgagcgcccgggccccgcg
gcctgcgcgcaggtggcacggctgctggctcgccacccgcgctgtgtgcccgacggaccc
caccgcctgctcttctgccagcagctggtgcgttgcctcggccgcttccgctgcccagcc
gaaggcgaggagggcgccgtggagttcctagagcaggcccagcaggtgagcgggctcctg
gcgcagctgtggcgcgcacagcccgccgccatcctgccctgcctcaaagagctgttcgca
gtcatctcctgcgcagaggaggagccaccatctagcgccctggccagcgtggtccagcac
ctcccattggagctcatggatggtgttgtccggaacctcagcaatgatgacagtgtgaca
gactcgcagatgctgactgccattagcaggatgattgactgggtgtcctggcccctgggg
aagaatattgacaagtggatcattgcactgctgaagggcctggctgctgttaagaagttc
agcatcttgatcgaggtttcgctcaccaaaattgagaaggttttctctaagctgctgtac
cccatcgtccggggagctgccttgtctgtgctcaagtacatgctcctgaccttccagcac
tcccacgaagccttccacctgctcctccctcacatcccccccatggtggcctctctggtc
aaggaggactcgaactcggggaccagctgcctggagcagctggcggagctggtccactgc
atggtgttccggttcccgggcttcccggatctgtatgagcctgtcatggaggccatcaag
gacctccatgttcccaatgaggaccgcatcaagcagctgctggggcaggatgcctggact
tcgcagaagagcgagctggcgggtttctatccccggctcatggccaagtcagacacgggc
aagattggtctcatcaacctgggcaacacatgctatgtcaacagcatccttcaggcctta
ttcatggcgtctgacttcagacattgtgtgctccgcttgactgagaacaactcacagccc
ctgatgaccaagctgcagtggctctttggcttcctagaacacagccagcggcctgccatt
tccccagagaacttcctctccgcatcctggacgccctggttcagccctggcacccagcag
gactgctcggagtatctgaagtacctgctggatcggctgcacgaagaggagaaaacgggc
acaaggatctgccagaaactcaagcagtccagctcgccctctccgcccgaggagcccccg
gccccaagttcaacctctgtggaaaaaatgtttggaggcaagatagtgactcggatctgc
tgtctctgctgcctcaacgtctcctcccgggaggaggccttcacggacctctctctcgcc
ttccctcctcctgagcgctgtcgccgccgccgcctgggctctgtgatgcgccccacagaa
gacatcacagcccgggagttgcccccaccaaccagtgcacaggggccaggcagggtgggt
cctcggaggcaaaggaaacactgcatcacagaggacaccccccccaccagcctgtacatc
gaaggcctggactccaaggaagctggtgggcagagcagtcaggaggaaaggatagagagg
gaggaagaagggaaggaggagagaacggagaaggaagaagtgggggaggaggaggaaagc
accagaggggaaggagagagggagaaagaggaggaggtggaagaggaagaagagaaggtg
gagaaggagacagaaaaggaggctgagcaggaaaaggaagaagacagcctgggagcgggg
acccacccggatgctgccatcccctccggggagcggacatgtggctctgagggctcccgc
tccgtcctggacctggttaactacttcctgtcccccgagaagctgacagcagaaaaccgc
tactactgcgagtcgtgtgcctccctgcaggatgccgagaaggtggtggagctgagccaa
gggccgtgctacctcatcctcacactgctgcgcttctctttcgacctgcgcaccatgcgg
cgccgcaagatcctggatgacgtctccatccccctgctgctccgcctgccactggctggt
ggccgtggccaggcctatgacctctgcagtgtggtggtgcactctggagtgtcttcggag
agtggtcactactactgctatgcccgtgagggcgctgcccgccctgccgcttctctggga
actgccgataggccagagcccgagaaccagtggtacctgttcaatgacactcgggtgtcc
ttctcttccttcgaatctgtcagcaacgtcacctccttcttccctaaggacacagcctat
gtgctgttttaccggcagcggcccagggaggggcccgaggctgagttgggctcttctaga
gtccggacagagcccaccctgcacaaggacttgatggaagccatttccaaagacaacatc
ctttacctacaggagcaggagaaggaggcccggagcagggcggcctacatctctgcactc
cccacatctccgcactgggggaggggctttgatgaagacaaggatgaggatgaaggctct
ccagggggctgcaatcctgcaggggctgcctgctgtgatggtggtgtggagtttgggccc
aatgtcacagaccctcaagatgtcacatctaagtga

>gi568815587r:78119275_78380862|GENSCAN_predicted_peptide_5|678_aa
MPWLTPVIPALWEAEAWKKRWFILRSGRMSGDPDVLEYYKNDHSKKPLRIINLNFCEQVD
AGLTFNKKELQDSFVFDIKTSERTFYLVAETEEDMNKWVQSICQICGFNQAEESTAELSS
SSQHLLRERKSSAPSHSSQPTLFTFEPPVSNHMQPTLSTSAPQEYLYLHQCISRRAENAS
LWGLIPDSSQCLSSYFKKAVKPVTDDTLMSASFSQGTRASFLMRSDTAVQKLAQGNGHCV
NGISGQVHGFYSLPKPSRHNTEFRDSTYDLPRSLASHGHTKGSLTGSETDNEDVYTFKTP
SNTLCREFGDLLVDNMDVPATPLSAYQIPRTFTLDKNHNAMTVATPGDSAIAPPPRPPKP
SQAETPRWGSPQQRPPISENSRSVAATIPRRNTLPAMDNSRLHRASSCETYEYPQRGGES
AGRSAESMSDGVGSFLPGKMIVGRSDSTNSEDNYVPMNPGSSTLLAMERAGDNSQSVYIP
MSPGAHHFDSLGYPSTTLPVHRGPSRGSEIQPPPVNRNLKPDRKAKPTPLDLRNNTVIDE
LPFKSPITKSWSRANHTFNSSSSQYCRPISTQSITSTDSGDSEENYVPMQNPVSASPVPS
GTNSPAPKKSTGSVDYLALDFQPSSPSPHRKPSTSSVTSDEKVDYVQVDKEKTQALQNTM
QEWTDVRQSSEPSKGAKL

>gi568815587r:78119275_78380862|GENSCAN_predicted_CDS_5|2037_bp
atgccgtggctcacgcctgtaatcccagcactttgggaggctgaggcctggaagaaacgc
tggtttatcctgcggagtggccggatgagcggtgacccagatgttctggaatactacaag
aacgatcactccaagaagcctctgcggatcatcaacctgaacttctgtgagcaggtagat
gcaggcctgacctttaacaagaaggagctgcaggatagttttgtgtttgacatcaagacc
agtgaacgcaccttttacctggtggctgagacagaagaggacatgaataagtgggtccag
agcatctgccagatctgtggcttcaatcaggctgaggagagcacagctgagctcagcagc
tctagccagcaccttctccgagagcgcaagtcctcagccccatcacactccagccagcca
actctgttcacgtttgaaccccctgtgtcaaaccacatgcagcccaccttgtccaccagc
gcacctcaggagtatctctacttgcaccagtgcataagccgaagagcagaaaatgcaagc
ctttggggcctcattccagattctagccagtgtctgtcttcttatttcaagaaggctgtt
aaacctgtaacagatgatacattgatgagtgccagcttctctcagggcaccagagcctct
tttctcatgaggagtgacacagctgtacaaaaacttgcccagggcaatggacactgtgtc
aacgggatcagtggtcaagtccatggcttctatagccttcccaagccgagccggcacaat
acagaattcagagacagtacctacgacctcccccgcagcctggcctcccatggccacacc
aagggcagcctcacaggctccgagacagataatgaggatgtgtacaccttcaagacgccc
agcaacaccctgtgcagggagttcggggacctcctggtagacaatatggatgttccggcc
accccactctcagcctaccagatccctaggacattcactctggacaaaaaccacaatgcc
atgacagtggccactcctggggactcagccatagctcccccaccccgcccccccaagcca
agtcaggcagaaacacctcgatggggcagtcctcagcagagaccgccaatcagtgaaaat
agcagatctgtcgctgccaccatccccagacgcaacaccctccctgcaatggacaacagc
cgacttcaccgagcttcttcctgtgagacctacgagtacccacagcgtggtggagagagt
gcaggccggtctgctgaatccatgagtgatggagttggctctttcctgccagggaaaatg
attgtgggccgatcggacagcaccaattctgaagacaactatgtgcccatgaatccaggt
tcttccaccctgttggccatggaacgagcaggtgataattcccagagcgtctacatccca
atgagcccaggggcccatcactttgactcacttggctacccatcaacaacccttcctgtg
caccgaggccccagcagaggaagtgagattcagccaccccctgtcaaccgcaacctcaaa
cctgatcggaaagcaaagccaacaccacttgacctgaggaacaacaccgtcatcgatgaa
ctccccttcaagtcacctatcaccaagtcttggtctagggccaaccacaccttcaactcc
agctcctcccagtactgccgccccatctccacccagagcatcaccagcacagactcagga
gacagcgaagagaactatgtccctatgcaaaacccagtgtctgcatctcccgttcccagt
ggcacgaacagtcctgcccctaagaagagcaccggcagcgttgattatctggccctggac
ttccagccgagctccccaagcccccaccgcaagccatctacttcatccgtcacctctgat
gagaaggtggactacgttcaggtggacaaggagaagacccaggccctgcagaacaccatg
caggagtggacagacgtgcggcagtcctcagagccttccaagggtgccaagctgtga

>gi568815587r:78119275_78380862|GENSCAN_predicted_peptide_6|121_aa
MGFRHVGQAGLELLASICPCLASQIRKSGKLKVTFTKFVCRQSANYLHSYKTGKDKKGSF
LVLYKVGSLWRGTEQTPSKTADLIHPQTDTRREEDFRKWADYVQEKISCKQLMGNLHEKQ
D

>gi568815587r:78119275_78380862|GENSCAN_predicted_CDS_6|366_bp
atggggtttcgccatgttggccaggctggtctggaactcctggcctcgatctgcccctgc
ttggcctctcaaattagaaaatctggaaaactaaaagtcacctttaccaaatttgtttgc
aggcagtctgccaattacttgcattcatataagactggaaaagacaagaaaggcagcttt
ctggtactctacaaagtggggagcctgtggcggggaacagaacaaaccccctccaaaact
gctgacttgattcatccccaaactgacacaaggcgggaggaagacttcaggaaatgggca
gattatgtccaagaaaagatttcctgcaagcagttaatgggaaacttacatgaaaagcag
gactaa

>gi568815587r:78119275_78380862|GENSCAN_predicted_peptide_7|70_aa
MLLQVYDYQAKKDIKHLLATSMSCHMLRYFFAKNGKTILFNQEVHEQQLNQCICLDGLII
QQINTVHEYD

>gi568815587r:78119275_78380862|GENSCAN_predicted_CDS_7|213_bp
atgctgctgcaggtatatgactaccaagcaaaaaaagatattaagcacttactggccaca
tccatgagttgccacatgctgaggtatttctttgctaagaatggcaagaccattttgttc
aaccaggaagtccatgaacagcaactcaaccaatgcatctgccttgatggtttgattatt
caacagattaatactgttcatgaatatgattaa