GENSCAN 1.0    Date run: 7-Nov-116    Time: 02:31:30

Sequence gi568815597r:202231838_202435754 : 203917 bp : 43.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Term +   4085   4160   76  2  1  130   47    94 0.694   6.81
1.02  PlyA +   5624   5629    6                              -0.45

2.07  PlyA -   7200   7195    6                               1.05
2.06  Term -   7382   7241  142  1  1   63   43   127 0.496   3.10
2.05  Intr -   8146   8088   59  1  2   88   48    23 0.379  -4.02
2.04  Intr -  11998  11872  127  0  1  109   68    42 0.354   4.98
2.03  Intr -  12776  12633  144  1  0   22   59   144 0.912   4.30
2.02  Intr -  12935  12836  100  0  1   90   64    32 0.648   0.17
2.01  Init -  21168  21105   64  0  1   80   69    61 0.603   4.71
2.00  Prom -  27220  27181   40                              -3.36

3.00  Prom +  28951  28990   40                              -4.96
3.01  Init +  33434  33514   81  1  0   96   37    29 0.071  -0.43
3.02  Intr +  39079  39191  113  0  2    0  110   130 0.511   5.58
3.03  Intr +  44469  44684  216  0  0  118   68   362 0.985  34.82
3.04  Intr +  48944  49015   72  2  0  111   74    97 0.987   9.12
3.05  Intr +  50695  50824  130  1  1   58   75    37 0.715   0.10
3.06  Intr +  51696  51822  127  2  1   81   80    56 0.577   4.35
3.07  Intr +  56995  57094  100  2  1   18   94    30 0.041  -4.23
3.08  Intr +  65671  65739   69  1  0  128   67   112 0.716  11.40
3.09  Intr +  69012  69083   72  0  0   99   76   111 0.981   9.52
3.10  Intr +  69327  69398   72  0  0   57   93    54 0.713   1.32
3.11  Intr +  71442  71510   69  0  0   93   76    99 0.955   7.50
3.12  Intr +  72722  72793   72  2  0  142   76    74 0.998  10.12
3.13  Intr +  73847  73912   66  2  0  120   82    32 0.917   3.82
3.14  Intr +  75031  75102   72  1  0   88   76   107 0.994   8.02
3.15  Intr +  75493  75564   72  1  0  113   87    75 0.997   8.42
3.16  Intr +  77214  77339  126  0  0   90   78    75 0.988   6.49
3.17  Intr +  78360  78520  161  0  2   83   74   162 0.997  13.93
3.18  Intr +  82965  83045   81  1  0  126   94    92 0.999  13.21
3.19  Term +  86115  87370 1256  1  2   99   36  1260 0.902 113.65
3.20  PlyA +  87901  87906    6                               1.05

4.00  Prom +  92539  92578   40                              -5.96
4.01  Sngl +  95003  95341  339  1  0  104   44   148 0.975   8.03
4.02  PlyA +  99884  99889    6                               1.05

5.05  PlyA - 100300 100295    6                               1.05
5.04  Term - 101498 101331  168  2  0   56   55   143 0.349   5.68
5.03  Intr - 101718 101613  106  2  1  107   87    17 0.999   3.72
5.02  Intr - 103221 103152   70  1  1  116   88    58 0.994   6.84
5.01  Init - 103917 103809  109  0  1   86   49    47 0.875   1.02
5.00  Prom - 112246 112207   40                              -7.76

6.00  Prom + 114177 114216   40                              -5.06
6.01  Init + 115815 115817    3  2  0  108   81     0 0.847   1.30
6.02  Intr + 116754 117305  552  2  0  -28   93   599 0.345  41.85
6.03  Intr + 172583 172633   51  1  0   95   99     6 0.024   1.50
6.04  Intr + 184950 185080  131  2  2   76   70   133 0.850   9.79
6.05  Intr + 190783 190901  119  1  2  100   86   113 0.996  12.41
6.06  Intr + 193729 193888  160  2  1   61   72   187 0.999  13.45
6.07  Intr + 195203 195347  145  2  1   83   53   110 0.997   7.28
6.08  Intr + 197018 197092   75  1  0   96   89    95 0.996  10.11
6.09  Intr + 198894 198973   80  2  2  110   63    62 0.985   4.25
6.10  Intr + 199643 199782  140  2  2   35   95   193 0.906  14.81
6.11  Intr + 202819 202931  113  2  2   72   96    39 0.570   3.20

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:202231838_202435754|GENSCAN_predicted_peptide_1|25_aa
XMLQNNQLGGIPAEALWELPSLQSL

>gi568815597r:202231838_202435754|GENSCAN_predicted_CDS_1|78_bp
nngatgctgcagaacaatcagctgggaggaatccccgcagaggcgctgtgggagctgccg
agcctgcagtcgctgtga

>gi568815597r:202231838_202435754|GENSCAN_predicted_peptide_2|211_aa
MTRECQNVFLLGALKPSIKECVGEMEAPEDIQLPEPVNGTLFRKGAFANVLRILGDHRET
QRERGESYVKAEGRDWSDAATRNASSCRKLEEAKKHPPQHPQSAQETQENDSLRKISKFM
HNWLLLLESPRAPTWALLKICFREALPHPVSITTTNDHCRSSNKRPSRQAQGYLRTPARQ
QKQEKSRPEDTYPGWKTPTPEDGERGHPGTA

>gi568815597r:202231838_202435754|GENSCAN_predicted_CDS_2|636_bp
atgactcgagagtgtcagaatgtcttcctgctgggtgcgttgaagccctcaattaaggag
tgtgtgggtgaaatggaggccccagaagatatacaacttccagaacctgtaaatgggaca
ttatttagaaaaggggcctttgcaaatgtattaaggatcttgggagaccacagagagaca
cagagagaaagaggagaaagctacgtgaaggcagaaggcagagactggagtgatgcagcc
acaaggaacgccagcagctgccggaagctggaagaggcaaagaagcacccaccccagcac cctcagagtgctcaggagacccaagaaaatgactccttgagaaaaatctccaaattcatg cacaactggctccttctcttggaatcacccagggctcccacttgggccctcttgaagatt tgcttccgggaggctctcccacatccagtcagcattactactactaatgatcattgtaga agtagcaacaaaaggcccagtcgccaggcgcagggctacttgcgtacgccagcaagacag cagaagcaggaaaagagccggccggaagacacctaccctggttggaagacacctacccct gaagatggagaaagaggccatccgggtaccgcgtag >gi568815597r:202231838_202435754|GENSCAN_predicted_peptide_3|1008_aa MGLSHQEYSGSDSPELSLLGSSGLVCSLGEYEKQFGPRQVKLFPQSLSKPELACEVPANL PHYCRRLDANLISLVPERSFEGLSSLRHLWLDDNALTEIPVRALNNLPALQAMTLALNRI SHIPDYAFQNLTSLVVLHLHNNRIQHLGTHSFEGLHNLETLHLPSGDIRAFRAVFGVAMD EQGRRESNAMNPGVWALLALLGNQQREDQTPAAQAAVWLELWFGSSADSSKWGKRPPAIP PGEEGPGPSLWERLSGHILLWSGTPQSLQVACINGPQTSRDLNYNKLQEFPVAIRTLGRL QELGFHNNNIKAIPEKAFMGNPLLQTIHFYDNPIQFVGRSAFQYLPKLHTLSLNGAMDIQ EFPDLKGTTSLEILTLTRAGIRLLPSGMCQQLPRLRVLELSHNQIEELPSLHRCQKLEEI GLQHNRIWEIGADTFSQLSSLQALDLSWNAIRSIHPEAFSTLHSLVKLDLTDNQLTTLPL AGLGGLMHLKLKGNLALSQAFSKDSFPKLRILEVPYAYQCCPYGMCASFFKASGQWEAED LHLDDEESSKRPLGLLARQAENHYDQDLDELQLEMEDSKPHPSVQCSPTPGPFKPCEYLF ESWGIRLAVWAIVLLSVLCNGLVLLTVFAGGPVPLPPVKFVVGAIAGANTLTGISCGLLA SVDALTFGQFSEYGARWETGLGCRATGFLAVLGSEASVLLLTLAAVQCSVSVSCVRAYGK SPSLGSVRAGVLGCLALAGLAAALPLASVGEYGASPLCLPYAPPEGQPAALGFTVALVMM NSFCFLVVAGAYIKLYCDLPRGDFEAVWDCAMVRHVAWLIFADGLLYCPVAFLSFASMLG LFPVTPEAVKSVLLVVLPLPACLNPLLYLLFNPHFRDDLRRLRPRAGDSGPLAYAAAGEL EKSSCDSTQALVAFSDVDLILEASEAGRPPGLETYGFPSVTLISCQQPGAPRLEGSHCVE PEGNHFGNPQPSMDGELLLRAEGSTPAGGGLSGGGGFQPSGLAFASHV >gi568815597r:202231838_202435754|GENSCAN_predicted_CDS_3|3027_bp atgggactgtcccaccaggagtactcaggctccgactccccagagttgtcacttttgggc tcttcagggcttgtgtgctctctgggggagtatgagaagcagtttggccctcgacaggtt aagctgtttccccagagcctaagcaagcccgaactggcctgtgaggtccctgctaatctg ccccattactgcaggcgcctagatgccaacctcatctccctggtcccggagaggagcttt gaggggctgtcctccctccgccacctctggctggacgacaatgcactcacggagatccct gtcagggccctcaacaacctccctgccctgcaggccatgaccctggccctcaaccgcatc agccacatccccgactacgcgttccagaatctcaccagccttgtggtgctgcatttgcat 
aacaaccgcatccagcatctggggacccacagcttcgaggggctgcacaatctggagaca ctccaccttccctcaggagatataagagcttttagggcagtgtttggagttgccatggat gagcagggcaggagggagtccaatgccatgaaccctggagtgtgggcgctgttagctttg ctgggaaatcagcagagggaagatcaaacccctgcggcccaagctgcggtttggctggag ctgtggtttggctcctctgcagactcctctaagtggggaaagcggcccccagccatacca cctggggaggaggggccaggtccaagtctctgggaaagattatcgggccacattctgctg tggtcaggaacacctcagagccttcaggtcgcctgcattaacgggccccaaacatccaga gacctgaattataacaagctgcaggagttccctgtggccatccggaccctgggcagactg caggaactggggttccataacaacaacatcaaggccatcccagaaaaggccttcatgggg aaccctctgctacagacgatacacttttatgataacccaatccagtttgtgggaagatcg gcattccagtacctgcctaaactccacacactatctctgaatggtgccatggacatccag gagtttccagatctcaaaggcaccaccagcctggagatcctgaccctgacccgcgcaggc atccggctgctcccatcggggatgtgccaacagctgcccaggctccgagtcctggaactg tctcacaatcaaattgaggagctgcccagcctgcacaggtgtcagaaattggaggaaatc ggcctccaacacaaccgcatctgggaaattggagctgacaccttcagccagctgagctcc ctgcaagccctggatcttagctggaacgccatccggtccatccaccccgaggccttctcc accctgcactccctggtcaagctggacctgacagacaaccagctgaccacactgcccctg gctggacttgggggcttgatgcatctgaagctcaaagggaaccttgctctctcccaggcc ttctccaaggacagtttcccaaaactgaggatcctggaggtgccttatgcctaccagtgc tgtccctatgggatgtgtgccagcttcttcaaggcctctgggcagtgggaggctgaagac cttcaccttgatgatgaggagtcttcaaaaaggcccctgggcctccttgccagacaagca gagaaccactatgaccaggacctggatgagctccagctggagatggaggactcaaagcca caccccagtgtccagtgtagccctactccaggccccttcaagccctgtgagtacctcttt gaaagctggggcatccgcctggccgtgtgggccatcgtgttgctctccgtgctctgcaat ggactggtgctgctgaccgtgttcgctggcgggcctgtccccctgcccccggtcaagttt gtggtaggtgcgattgcaggcgccaacaccttgactggcatttcctgtggccttctagcc tcagtcgatgccctgacctttggtcagttctctgagtacggagcccgctgggagacgggg ctaggctgccgggccactggcttcctggcagtacttgggtcggaggcatcggtgctgctg ctcactctggccgcagtgcagtgcagcgtctccgtctcctgtgtccgggcctatgggaag tccccctccctgggcagcgttcgagcaggggtcctaggctgcctggcactggcagggctg gccgccgcgctgcccctggcctcagtgggagaatacggggcctccccactctgcctgccc tacgcgccacctgagggtcagccagcagccctgggcttcaccgtggccctggtgatgatg 
aactccttctgtttcctggtcgtggccggtgcctacatcaaactgtactgtgacctgccg cggggcgactttgaggccgtgtgggactgcgccatggtgaggcacgtggcctggctcatc ttcgcagacgggctcctctactgtcccgtggccttcctcagctttgcctccatgctgggc ctcttccctgtcacgcccgaggccgtcaagtctgtcctgctggtggtgctgcccctgcct gcctgcctcaacccactgctgtacctgctcttcaacccccacttccgggatgaccttcgg cggcttcggccccgcgcaggggactcagggcccctagcctatgctgcggccggggagctg gagaagagctcctgtgattctacccaggccctggtagccttctctgatgtggatctcatt ctggaagcttctgaagctgggcggccccctgggctggagacctatggcttcccctcagtg accctcatctcctgtcagcagccaggggcccccaggctggagggcagccattgtgtagag ccagaggggaaccactttgggaacccccaaccctccatggatggagaactgctgctgagg gcagagggatctacgccagcaggtggaggcttgtcagggggtggcggctttcagccctct ggcttggcctttgcttcacacgtgtaa >gi568815597r:202231838_202435754|GENSCAN_predicted_peptide_4|112_aa MHGLNRNFNEQKYVIRQKYCQDELTGRDCGPGWKILSRLAGNLRQENAEEVKIMKREDEK EQTNLIYFITKPEIHRLSPEPLDYTGSFGIAELSSVSFGRNLREHSSHACIL >gi568815597r:202231838_202435754|GENSCAN_predicted_CDS_4|339_bp atgcatggcctgaataggaatttcaatgagcaaaagtatgtgataagacagaagtactgt caggatgagttaacaggtagagactgtgggcctgggtggaagatccttagcaggttagca gggaacctacggcaagagaatgctgaagaagtaaagataatgaaaagggaagatgagaag gaacagaccaacctaatttacttcatcactaaaccggagatccacaggctctcccctgag cccctggattacacaggaagtttcggaattgcagagctgtcgagtgtcagctttgggaga aatctcagagagcattcaagccatgcctgcattttatag >gi568815597r:202231838_202435754|GENSCAN_predicted_peptide_5|150_aa MQRASRLKRELHMLATEPPPGITCWQDKDQMDDLRAQILGGANTPYEKGVFKLEVIIPER YPFEPPQIRFLTPIYHPNIDSAGRICLDVLKLPPKGAWRPSLNIATVLTSIQLLMSEPNP DDPLMADIVNIPIFLCNTYPTLSIPLTLPV >gi568815597r:202231838_202435754|GENSCAN_predicted_CDS_5|453_bp atgcagagagcttcacgtctgaagagagagctgcacatgttagccacagagccaccccca ggcatcacatgttggcaagataaagaccaaatggatgacctgcgagctcaaatattaggt ggagccaacacaccttatgagaaaggtgtttttaagctagaagttatcattcctgagagg tacccatttgaacctcctcagatccgatttctcactccaatttatcatccaaacattgat tctgctggaaggatttgtctggatgttctcaaattgccaccaaaaggtgcttggagacca tccctcaacatcgcaactgtgttgacctctattcagctgctcatgtcagaacccaaccct 
gatgacccgctcatggctgacatagtaaatatccccattttcctctgcaacacatatcct accttgtctataccgctaactctccctgtgtga >gi568815597r:202231838_202435754|GENSCAN_predicted_peptide_6|523_aa MFPYGHQVHLGQASYYESRDDPRPGSACVLKPRERAGGTTGGRSKDGGARVSALCSGLKR SERGGSGNSSPNSNLVLVVLAAAAEAGAMAELEHLGGKRAESARMRRAEQLRRWRGSLTE QEPAERRGAGRQPLTRRGSPRVRFEDGAVFLAACSSGDTDEVRKLLARGADINTVNVDGL TALHQILFIAAVSVFSFFFITQACIDENLDMVKFLVENRANVNQQDNEGWTPLHAAASCG YLNIAEYFINHGASVGIVNSEGEVPSDLAEEPAMKDLLLEQVKKQGVDLEQSRKEEEQQM LQDARQWLNSGKIEDVRQARSGATALHVAAAKGYSEVLRLLIQAGYELNVQDYDGWTPLH AAAHWGVKEACSILAEALCDMDIRNKLGQTPFDVADEGLVEHLELLQKKQNVLRSEKETR NKLIESDLNSKIQSGFFKNKEKMLYEEETPKSQEMEEENKESSSSSSEEEEGEDEASESE TEKEADKKPEAFVNHSNSESKSSITEQIPAPAQNTFSASSARR >gi568815597r:202231838_202435754|GENSCAN_predicted_CDS_6|1569_bp atgttcccttacggccaccaagtgcacttagggcaggcctcttactacgagtcccgtgat gacccgaggccgggcagcgcctgcgtattgaagccgagggagcgtgcgggcggtactact ggcgggaggagtaaagatggcggcgcgagggtctccgccctctgctccgggctgaagcgc tctgagagaggcggcagcggcaactcgagccccaacagtaatttagtgttggtagttttg gcagcagctgccgaggccggagcaatggcggaactggagcacctaggagggaagcgggca gagtcggcgcgaatgcggcgggcagagcagcttcggcgctggcggggctcgctgacagag caggagcctgcggagcgacgaggcgcggggcggcagccgctgaccaggcgcgggagcccc agggtccgcttcgaggacggtgctgtctttctggccgcctgctctagcggggacaccgac gaggtgagaaagcttctggcaagaggtgctgatatcaacacggtcaacgtggacggcttg acagccctgcaccagattctcttcatcgctgctgtttcagtgttctctttcttctttata actcaggcatgtattgatgaaaatttggacatggtgaagtttctggtggagaacagagcc aatgtaaaccagcaagacaacgagggctggacaccccttcatgcagcagcttcctgtggc tatctcaacatagcagagtatttcattaatcacggagccagtgtaggtattgtcaatagt gaaggtgaagttccctctgaccttgcagaagagccagccatgaaggatcttcttctggag caagtaaagaagcaaggagttgatctagagcagtcaagaaaagaagaagagcagcagatg ttgcaggatgcccgccagtggctcaacagtgggaaaatagaggatgtgaggcaggctcgc tcaggggctacagcccttcatgtggctgctgccaagggctactctgaagtcctcagactt ttaattcaggctggctatgaactcaatgttcaggattatgatggctggactcccctccat gctgctgcacactggggagtgaaggaggcttgctccatcctggcagaagcactttgtgac atggatattcgaaataaactgggccagacaccatttgatgtggctgatgagggtctcgtg 
gagcatttggagttgctccagaagaagcagaatgtgcttcgaagtgaaaaggagacacgg aataaactcattgagtcagatctgaacagcaagattcagagtgggttctttaagaacaaa gagaagatgctctatgaggaggagacacctaagtcccaagaaatggaggaagaaaataaa gaatctagtagctccagctcagaggaggaggaaggtgaagatgaagcttctgagtcagaa actgagaaggaggcagataaaaagccagaagcctttgtcaatcattccaactctgaaagc aagagtagtatcacagagcagataccagcaccagctcaaaataccttctctgcctcttct gctaggagg