GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:21:28 Sequence gi568815587r:73577901_73860635 : 282735 bp : 43.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1052 1047 6 1.05 1.05 Term - 2482 2320 163 0 1 113 48 71 0.065 3.01 1.04 Intr - 17753 17732 22 0 1 65 81 33 0.057 -3.10 1.03 Intr - 20235 20023 213 1 0 122 93 121 0.868 14.69 1.02 Intr - 21060 20935 126 1 0 47 85 73 0.492 3.55 1.01 Init - 40925 40832 94 2 1 77 97 97 0.805 10.04 1.00 Prom - 65718 65679 40 -2.66 2.00 Prom + 66472 66511 40 -6.46 2.01 Init + 69730 69838 109 0 1 67 98 153 0.947 12.58 2.02 Intr + 72653 72805 153 0 0 25 89 256 0.518 19.44 2.03 Intr + 73888 73990 103 2 1 82 76 145 0.989 11.93 2.04 Intr + 75075 75114 40 0 1 79 117 12 0.698 1.43 2.05 Intr + 82853 82952 100 1 1 59 111 239 0.871 22.98 2.06 Term + 83566 83702 137 2 2 114 52 139 0.994 11.08 2.07 PlyA + 84891 84896 6 1.05 3.15 PlyA - 85976 85971 6 1.05 3.14 Term - 101820 101753 68 1 2 133 35 72 0.112 4.50 3.13 Intr - 127846 127822 25 1 1 75 86 5 0.037 -3.40 3.12 Intr - 129613 129520 94 0 1 73 111 53 0.544 6.07 3.11 Intr - 133369 133259 111 0 0 25 34 139 0.279 1.69 3.10 Intr - 138462 138351 112 1 1 89 64 70 0.890 4.14 3.09 Intr - 140990 140885 106 2 1 90 58 99 0.952 6.89 3.08 Intr - 142999 142946 54 1 0 126 79 44 0.907 6.48 3.07 Intr - 163621 163554 68 2 2 101 95 33 0.051 3.92 3.06 Intr - 182285 182148 138 0 0 95 81 42 0.625 4.64 3.05 Intr - 182742 182666 77 2 2 34 101 90 0.633 4.06 3.04 Intr - 182936 182785 152 2 2 53 18 124 0.566 0.86 3.03 Intr - 183565 183433 133 0 1 152 92 50 0.731 12.45 3.02 Intr - 219428 219183 246 1 0 67 81 135 0.697 7.27 3.01 Init - 219744 219671 74 0 2 87 55 103 0.672 5.50 3.00 Prom - 224159 224120 40 -5.06 4.00 Prom + 229645 229684 40 -4.46 4.01 Init + 230600 230603 4 1 1 89 72 0 0.080 -1.34 4.02 Intr + 247808 247896 89 0 2 105 82 76 0.661 8.39 4.03 Intr + 266907 267076 170 2 2 116 86 109 0.985 12.24 4.04 Intr + 282007 282109 103 1 1 99 96 69 0.452 8.98 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 201595 201738 144 0 0 101 77 91 0.860 7.68 S.002 Term - 218497 218263 235 0 1 40 36 120 0.833 -2.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:73577901_73860635|GENSCAN_predicted_peptide_1|205_aa MKTQEYLEHQCGLKASRFETQEKLKFQPVSKGKFLKVMELGQEEDMPFVVWIQVAESSSN LQGDQRGSSGISSAERPLPPPVGLLGGYCRGFRAESLRRRLRSPRAIRHPTGHGRERRRR SLRRRQGHRRLSRRRRRFRLQLLTGQNDDNGRGFFGLGSIFGGAGLADNLVDLLCGSSFT LALSPLASPSAFLGHGGGDSDGSGT >gi568815587r:73577901_73860635|GENSCAN_predicted_CDS_1|618_bp atgaagacccaggagtacctggagcaccaatgtggcctgaaagccagcaggtttgagact caagaaaagctgaagtttcagcctgtctccaaaggtaaattcttaaaagtaatggagctg ggtcaagaagaggatatgccctttgtcgtttggattcaagttgccgaatcgtcctcaaat ttgcagggagaccagcgaggaagctccggcattagttctgccgagcgtcccctcccccca cccgtcgggctcctcggcggctactgcagaggattcagagccgagtcgctgaggaggagg ctccggtccccgcgcgccatccgccaccctactggacacgggcgagagcgtcgccgccgt agcctccggagaagacaagggcatcgccgcctcagccgccgccgccgccgttttcgcctg cagctgctcaccgggcagaacgatgataatggaaggggcttttttggcctgggctctatc tttggaggagcaggtttagcagacaaccttgtggatcttctctgtggttcatccttcacc ttggctttatctcctttagcatccccttcagcctttcttgggcatggtggcggcgacagt gacggcagcgggacgtag >gi568815587r:73577901_73860635|GENSCAN_predicted_peptide_2|213_aa MPQALRGTQSAGAAGWVAAASSSGRLGSNIRNKCRQGSILRRWKRNWFALWLDGTLGYYH DETAQDEEDRVLIHFNVRDIKIGPECHDVQPPEGRSRDGLLTVNLREGGRLHLCAETKDD ALAWKTALLEANSTPVRVYSPYQDYYEVVPPNAHEATYVRSYYGPPYAGPGVTHVIVRED PCYSAGAPLAMGMLAGAATGAALGSLMWSPCWF >gi568815587r:73577901_73860635|GENSCAN_predicted_CDS_2|642_bp atgccccaggctctccggggcacacaaagcgcaggcgcagcgggttgggtggcagcagca tcgagtagcggccgcttaggcagcaacatccgcaacaagtgtagacaaggctccatcctc cgccgctggaagcggaactggtttgccctgtggctggacgggaccctgggatactaccac gatgagacagcgcaggacgaggaggaccgtgtgctcatccacttcaatgtccgtgacata aagatcggcccagagtgccatgatgtgcagcccccagagggccggagccgagatggcctg ctgactgtgaacctacgggaaggcggccgcctgcacctctgtgcggagaccaaggatgat gccctagcatggaagacagcactgctggaggcaaactccaccccggtgcgcgtctacagc ccgtaccaagactactacgaggtggtgccccccaatgcacacgaggccacgtatgtccgc agctactacggaccgccctacgcaggccctggcgtgacgcacgtgatagtgcgggaggat ccctgctacagcgccggcgcccctctggccatgggcatgcttgcgggagccgccactggg gcggcgctgggctcgctcatgtggtcgccctgctggttctga >gi568815587r:73577901_73860635|GENSCAN_predicted_peptide_3|485_aa MPAAAAGRAAAGAGTGTGSLRGRTRSQVLVLQPEQMEYADKRRVSKTKKSFIEWQNSSEE DLGVGSSSLQAGRLNVFPALSREVSRVDSSSLQPVVPMSVQLWLSPRLRGATLLMAATVL RFPQLEGCARSHAATSLACGLALSPFRLAHYATLPGGGGGCQSVESPAALQPGSSTGPCR GRESSVPALPLAFFVSWLEQHRSTMSTGGDFGNPLRKFKLVFLGEQSVPQLCEAHSGKPH SRERALFNLGQGGWDLGVVFPRGLALRSADVVEDFCVDMFSALLGKYQGAWSLDHMATIG IDFLSKTMYLEDRTIRLQLWDTAGQERFRSLIPSYIRDSAAAVVVYDITNVNSFQQTTKW IDDVRTERGSDVIIMLVGNKTDLADKSWTAAVELSIKYFQEKLQQDLEAEHGRGRGYSPH TTVLQVSIEEGERKAKELNVMFIETSAKAGYNVKQNLAMCMERTLSTCSSSFAGNGKHTG QKQRR >gi568815587r:73577901_73860635|GENSCAN_predicted_CDS_3|1458_bp atgccggctgctgcagcagggcgggcagctgcaggtgctggcacaggcactggctctctg cgaggccggaccaggtcccaagttcttgtcctgcaaccagaacaaatggagtatgcagac aagcggagggtgagcaagacaaagaagagctttattgagtggcagaatagctcagaggag gaccttggagtgggcagctcctctctgcaggcaggtcgtctgaatgtcttcccagctctc agcagagaagtctctagagtggatagctcctctctgcagccggttgtcccgatgtctgtt cagctctggctgagcccaaggctgagaggcgccacacttcttatggccgccacagtgctc cgctttccacagctcgagggctgcgcgcgctctcacgctgccacctcactcgcgtgtggg ctcgccctcagccccttccgattggctcattatgctactctgccgggaggcggcggcggc tgccagtctgtggagagtcctgctgccctccagccgggctcctccaccgggccttgcagg ggccgagagagctcggtgcccgcccttccgctcgcctttttcgtcagctggctggagcag catcgttccacaatgtccacgggcggagacttcgggaatccgctgaggaaattcaagctg gtgttcctgggggagcaaagcgttcctcagctctgtgaggcccactcagggaagccccac tcccgagagcgggccctcttcaaccttggccaaggagggtgggatctgggcgttgttttc cccagaggcttggccctgcgaagtgctgatgtcgtggaagatttttgtgtggacatgttt tcagctcttttgggtaaataccaaggagcatggtcgctggatcatatggcaacaattggc attgactttttatcaaaaactatgtacttggaggatcgaacaatcaggcttcagctgtgg gatactgcgggtcaggaacgtttccgtagcctcattcccagttacatccgtgattctgct gcagctgtagtagtttacgatatcacaaatgttaactcattccagcaaactacaaagtgg attgatgatgtcagaacagaaagaggaagtgatgttatcatcatgctagtaggaaataaa acagatcttgctgacaagagctggactgctgctgtggagctcagcatcaagtacttccag gagaagcttcagcaggacctagaggcagagcatggtagaggtagaggatacagccctcat accaccgtgctgcaagtgtcaattgaggagggagagaggaaagccaaagagctgaatgtt atgtttattgaaactagtgcaaaagctggatacaatgtaaagcagaacctagcaatgtgt atggaacgtactctttcgacgtgtagcagcagctttgccgggaatggaaagcacacagga cagaagcagagaagatag >gi568815587r:73577901_73860635|GENSCAN_predicted_peptide_4|122_aa MGGILLSISRPYKTKPTHGIGKYKHLIKAEEPKKKKGKVEVRAINLGTDYEYGVLNIHLT AYDMTLAESYAQYVHNLCNSLSIKVEESYAMPTKTIEVLQLQDQGSKMLLDSVLTTHERV VQ >gi568815587r:73577901_73860635|GENSCAN_predicted_CDS_4|366_bp atgggtggaattctactaagtatcagtcggccctacaagacaaagcccacccacggcatt ggaaagtacaagcacttaattaaagcagaagagcccaagaagaagaagggaaaagtggaa gtgagagccattaatttggggacagattatgaatatggggttttaaatattcatctgact gcatatgatatgaccctggcagagagttatgcccagtatgttcacaacctctgcaactct ctctccattaaagtcgaggaaagttatgcaatgccaaccaaaaccatagaagtgttgcag ttgcaggaccaaggcagcaaaatgctcctggactcagtgcttaccacccatgagcgagtg gttcag