GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:29:40 Sequence gi568815585r:80236761_80437705 : 200945 bp : 39.69% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2380 2548 169 1 1 55 60 82 0.215 1.13 1.02 Term + 5621 5773 153 1 0 14 38 265 0.464 11.04 1.03 PlyA + 5824 5829 6 1.05 2.00 Prom + 5867 5906 40 -6.55 2.01 Init + 12128 12188 61 1 1 54 94 61 0.455 4.77 2.02 Intr + 26359 26460 102 2 0 62 103 79 0.256 6.03 2.03 Intr + 29625 29759 135 1 0 20 92 81 0.084 1.32 2.04 Term + 37263 37390 128 1 2 65 49 64 0.062 -2.34 2.05 PlyA + 37447 37452 6 1.05 3.00 Prom + 50763 50802 40 -2.35 3.01 Init + 82450 82513 64 0 1 52 107 51 0.949 4.76 3.02 Intr + 83207 83402 196 0 1 58 90 102 0.424 4.95 3.03 Intr + 93650 93688 39 2 0 120 94 16 0.238 1.82 3.04 Term + 95747 95819 73 2 1 104 36 51 0.300 -2.10 3.05 PlyA + 96388 96393 6 1.05 4.02 PlyA - 98778 98773 6 1.05 4.01 Sngl - 100945 99998 948 1 0 58 39 595 0.948 47.91 4.00 Prom - 101462 101423 40 -12.91 5.00 Prom + 101946 101985 40 -8.85 5.01 Init + 102217 102287 71 0 2 53 83 119 0.747 6.47 5.02 Intr + 102361 102581 221 1 2 31 -73 225 0.304 -2.28 5.03 Intr + 104349 104611 263 1 2 45 -12 236 0.631 5.58 5.04 Intr + 106032 106108 77 2 2 98 100 116 0.980 11.19 5.05 Term + 118968 119280 313 0 1 61 36 233 0.160 8.89 5.06 PlyA + 119565 119570 6 1.05 6.00 Prom + 124762 124801 40 -5.45 6.01 Init + 158584 158644 61 0 1 76 75 52 0.876 4.16 6.02 Intr + 162078 162281 204 1 0 36 28 174 0.502 4.45 6.03 Term + 162992 163134 143 0 2 36 28 170 0.902 2.91 6.04 PlyA + 163422 163427 6 1.05 7.03 PlyA - 164747 164742 6 1.05 7.02 Term - 167934 167647 288 0 0 47 48 175 0.548 3.79 7.01 Init - 170078 169770 309 2 0 62 6 152 0.445 1.76 7.00 Prom - 176881 176842 40 -0.65 8.03 PlyA - 176909 176904 6 1.05 8.02 Term - 198169 198061 109 0 1 76 42 118 0.919 3.00 8.01 Init - 198875 198664 212 2 2 77 69 62 0.474 1.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:80236761_80437705|GENSCAN_predicted_peptide_1|107_aa XGLNQSPSHLKQLYWGYFKDCWGITLLPPPNFFLVPPVGGTQPEARGHESLLVEFREPQA GPAGDIPEEVFIITRDDSYMYVIAPEDPPVDQDVKVEDSDIDDPDPV >gi568815585r:80236761_80437705|GENSCAN_predicted_CDS_1|324_bp nngggactcaaccagtcccccagccatcttaaacaactttactggggatatttcaaggat tgttggggtatcactctccttccaccccctaatttcttcctggtgcctcctgttggtgga acacagccagaagccagagggcatgaaagtctgctggtggagttcagggagcctcaggca ggtcccgcaggagatattccagaagaagtcttcatcatcacaagagatgacagttacatg tatgttattgcccctgaagatcctccagtggatcaagatgtgaaggtggaagacagtgat attgatgatcctgaccctgtgtag >gi568815585r:80236761_80437705|GENSCAN_predicted_peptide_2|141_aa MTATLADILTATPTLTQNQLGRNGLLGDPGSSQGLSAASSTSIFRSALQIDSAPESHKQS MRKPASHILGNSGMIVIPIANLICVESGILLFFKFAFKNNLSITGSLRIYLEKNSELGQG LASPFTMAKHDTAKRKGSNSS >gi568815585r:80236761_80437705|GENSCAN_predicted_CDS_2|426_bp atgactgcaaccctagctgatattttgactgctaccccgaccctaactcagaaccaactg ggcaggaatggcctgctaggggacccagggagctcccagggcctttctgctgcttcttct acatctatatttcgttcagctctccaaattgactcagctccagagtcccataagcagtca atgagaaaacctgcttcccacatccttggcaactctggtatgattgtcattcccattgcc aatctgatctgtgtggaaagtggcatcctgcttttctttaaatttgcatttaaaaataat ctgagtataacaggttcattgcgaatttatctggagaaaaatagtgaattaggccagggt ctagccagtccctttacaatggcaaaacatgatactgcaaagagaaaaggttctaactct tcctag >gi568815585r:80236761_80437705|GENSCAN_predicted_peptide_3|123_aa MIQVPHPNDNESSTLEKKGGRGLSHHARHLPSFKGRIKVYQVNGCGMRVKEGEFPGKDNS IVKDMWRKGTRLKLRKETGGRAAVVSGFRSKLKLWVNGKRQSPEQATTWQKFGRALKSPP QNT >gi568815585r:80236761_80437705|GENSCAN_predicted_CDS_3|372_bp atgatacaggtgccccacccaaatgacaatgaatccagtactttagagaagaaaggaggg agaggcttgagccaccatgcccggcatttaccgagttttaaaggaagaataaaagtttat caggtgaatggatgtgggatgagggttaaggaaggggagtttccaggtaaagacaacagt attgtgaaggacatgtggaggaaagggacgagactgaagctaagaaaagagacaggaggc agagcagctgtggtgagcgggttcagaagcaagctaaaactatgggttaatggcaaaaga caatccccagagcaagctacaacatggcagaaatttggcagggccttaaaatctcctcct caaaacacataa >gi568815585r:80236761_80437705|GENSCAN_predicted_peptide_4|315_aa MEARAQSGNGSQPLLQTPRDGGRQRGEPDPRDALTQQVHVLSLDQIRAIRNTNEYTEGPT VVPRPGLKPAPRPSTQHKHERLHGLPEHRQPPRLQHSQVHSSARAPLSRSISTVSSGSRS STRTSTSSSSSEQRLLGSSFSSGPVADGIIRVQPKSELKPGELKPLSKEDLGLHAYRCED CGKCKCKECTYPRPLPSDWICDKQCLCSAQNVIDYGTCVCCVKGLFYHCSNDDEDNCADN PCSCSQSHCCTRWSAMGVMSLFLPCLWCYLPAKGCLKLCQGCYDRVNRPGCRCKNSNTVC CKVPTVPPRNFEKPT >gi568815585r:80236761_80437705|GENSCAN_predicted_CDS_4|948_bp atggaggccagagctcagagtggcaacgggtcgcagcccttgctgcagacgccccgtgac ggtggcagacagcgtggggagcccgaccccagagacgccctcacccagcaggtacatgtc ttgtctctggatcagatcagagccatccgaaacaccaatgagtacacagaggggcctact gtcgtcccaagacctgggctcaagcctgctcctcgcccctccactcagcacaaacacgag agactccacggtctgcctgagcaccgccagcctcctaggctccagcactcgcaggtccat tcttctgcacgagcccctctgtccagatccataagcacggtcagctcagggtcgcggagc agtacgaggacaagtaccagcagcagctcctctgaacagagactgctaggatcatccttc tcctccgggcctgttgctgatggcataatccgggtgcaacccaaatctgagctcaagcca ggtgagcttaagccactgagcaaggaagatttgggcctgcacgcctacaggtgtgaggac tgtggcaagtgcaaatgtaaggagtgcacctacccaaggcctctgccatcagactggatc tgcgacaagcagtgcctttgctcggcccagaacgtgattgactatgggacttgtgtatgc tgtgtgaaaggtctcttctatcactgttctaatgatgatgaggacaactgtgctgacaac ccatgttcttgcagccagtctcactgttgtacacgatggtcagccatgggtgtcatgtcc ctctttttgccttgtttatggtgttaccttccagccaagggttgccttaaattgtgccag gggtgttatgaccgggttaacaggcctggttgccgctgtaaaaactcaaacacagtttgc tgcaaagttcccactgtcccccctaggaactttgaaaaaccaacatag >gi568815585r:80236761_80437705|GENSCAN_predicted_peptide_5|314_aa MGQAAPLLTIPRLRQAGAAGASEGELPSSPKEPLRPQAEVTPPTARLSAKPLGHPAQVSH PDTASTKALPETRSQDAQVQPTHTRRLHAALTEGGIASLGGADWALWRRGSEAEAGAVGA EEEEEEEEEEEEGLGPARGGLAVGLRPRLSGGGRELCGGRGSSKWRHVDPRRNRAVVVPE DISHPGFGTRDGKGSGMRYFRALAYVPRPLSASPPALRSISPRALAAFLWGRARDLQPAM PKPPTPSVGSCAARASSIPALRRPVPATTQGLRSAGTRRRTGRQLHLQARCGIHWVKPAG LLSLGEPLSLAQGL >gi568815585r:80236761_80437705|GENSCAN_predicted_CDS_5|945_bp atgggccaggctgcccccctgcttacgatccccagactcagacaggcgggggccgcgggc gcctccgaaggagaactgccttccagccccaaggagccactccgcccccaggcagaggtc acgccgcccactgccaggctttctgcaaagcccctcggacatccggcacaggtttcccac cccgacactgcgagcacgaaagccctgcctgagacacgcagccaggacgcacaagtccaa cccacgcacacacggcgactccacgctgcactgaccgaagggggcattgcctctctcggc ggtgcagactgggcgttgtggaggcgaggcagcgaggccgaggcgggggcggtgggtgcg gaggaggaggaagaggaggaggaggaggaggaggagggtctggggcccgcccgggggggg ctcgccgttggcctgcgccctcgcctttccggaggaggcagggagctctgcggcggccgc ggcagcagtaaatggcgtcatgtggatccgaggcggaacagagcagttgttgtcccagag gatatatcgcatccgggatttggaaccagagatggcaagggaagtggcatgcgttatttc cgggcgctggcctacgtcccccgacccctaagtgccagcccaccagcgctgcgctcgatt tctccccgggccttagctgccttcctgtggggcagggctcgggacctgcagcccgccatg cctaagcctcccaccccctccgtgggctcctgtgcggcccgagcctcctccatccctgct ctgaggcgcccagtcccagcgaccacccaagggctgaggagtgcaggcacacggcgcagg actggcaggcagctccacctgcaggcccggtgtgggatccactgggtgaagccagctggg ctcctgagtcttggagaacctttaagtctagctcagggattgtaa >gi568815585r:80236761_80437705|GENSCAN_predicted_peptide_6|135_aa MAPRNIHFLISRTCKCYTAKGKQEGGLDRGTSNNEGTPPPGHNSVSGVSFWRHVDSLSPG TPGSAGLDLPVRERVTLIGGNKLTKIPTGVYDHGIGDWRDPCNPTVGLVPPVRAECKDRT NQQLKPAHFAITFLN >gi568815585r:80236761_80437705|GENSCAN_predicted_CDS_6|408_bp atggccccccgaaatatccacttcctcatctccagaacctgcaaatgttatacggcaaaa gggaaacaagaagggggcctggaccggggcacctcaaacaatgagggcactcctcctcca ggccacaactccgtttcaggggtgagtttctggaggcacgtcgattccctctccccggga acacctggaagcgcgggattagatctcccagtcagagaacgggttacgttaattggagga aacaaactcactaagattcccactggtgtgtacgaccatggaataggagactggagggac ccatgcaacccaactgtgggcctggttcccccagtacgagctgaatgcaaagacagaacg aaccagcaattgaagcctgcacattttgcaattacctttctcaattaa >gi568815585r:80236761_80437705|GENSCAN_predicted_peptide_7|198_aa MRKNQHKNFGNSNGHSVVCPQNDHTSSPTRVLNQAELAGKTKIEFRMWIGRKIIEIQENG KIQSRETENQNKIQELKDETASIKKNLMNLAELNNTRISMQSQVNDEIKAEFKKLFEMNE NKDTTYQTLWDTAKAVLRGKFIALNTHIKKLERPQINNLTLKLKEFKKQEQINPKASRRQ EITKIRAELKEIGMLKTI >gi568815585r:80236761_80437705|GENSCAN_predicted_CDS_7|597_bp atgagaaagaaccaacacaagaactttggaaactcaaatggccacagtgtcgtatgtcct caaaatgatcacaccagttctccaacaagagttcttaaccaggctgagctggctggaaag acaaaaatagaattcagaatgtggataggaaggaagatcattgagattcaggagaatggc aaaatccaatccagggaaactgagaaccaaaataaaatacaggagctaaaggatgaaaca gccagcataaaaaagaacctaatgaatctggcagagctgaataatacaaggatttccatg caatcacaagtaaatgatgaaattaaggcagaattcaagaagttatttgaaatgaatgag aacaaagatacaacataccagactctctgggacacagctaaggcagtgttaagagggaaa ttcatagcactaaatacccacatcaaaaagttagaaagacctcaaattaacaacctaaca ttgaaactgaaagaattcaagaagcaagagcaaatcaaccccaaagctagcagaagacaa gaaataaccaaaatcagagctgaactgaaggaaattgggatgctaaaaaccatttga >gi568815585r:80236761_80437705|GENSCAN_predicted_peptide_8|106_aa MRHQSNTFKMYIGSVQKVRTIQSRCFQVIGRFKNFLIGNLVERVAINIKKCLGYDKRLWR PKFYQANEASRDKDEAGNHSQQTNTGTENQTPRVLTHKWELNDENA >gi568815585r:80236761_80437705|GENSCAN_predicted_CDS_8|321_bp atgagacatcagtcaaatacatttaagatgtacattggttcggtccagaaagtcagaaca attcaaagcaggtgcttccaggttataggtagatttaaaaattttctgattggcaatttg gttgaaagagttgccatcaatataaagaaatgtctgggatatgacaagaggttgtggaga ccaaagttttatcaggcaaatgaagcctccagggacaaggatgaagctggaaatcattct cagcaaactaacacaggaacagaaaaccaaacaccacgtgttctcactcataagtgggag ctgaacgatgagaatgcatag