GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:06:19 Sequence gi568815587r:73773118_73973378 : 200261 bp : 44.53% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6378 6521 144 2 0 101 77 91 0.818 7.68 1.02 Intr + 14776 14875 100 0 1 41 92 52 0.011 0.58 1.03 Intr + 52591 52679 89 2 2 105 82 76 0.661 8.39 1.04 Intr + 71690 71859 170 1 2 116 86 109 0.991 12.24 1.05 Intr + 86790 86892 103 0 1 99 96 69 0.981 8.98 1.06 Intr + 90055 90144 90 0 0 91 92 24 0.750 3.29 1.07 Term + 91179 91253 75 2 0 110 37 121 0.998 7.04 1.08 PlyA + 91464 91469 6 1.05 2.03 PlyA - 91985 91980 6 1.05 2.02 Term - 100262 99998 265 1 1 68 53 231 0.111 12.58 2.01 Init - 113958 113867 92 0 2 76 81 59 0.640 3.97 2.00 Prom - 114422 114383 40 -4.86 3.00 Prom + 116907 116946 40 -5.36 3.01 Init + 127231 127303 73 0 1 68 64 74 0.456 4.23 3.02 Intr + 136282 136476 195 2 0 66 100 117 0.953 10.09 3.03 Intr + 151498 151580 83 2 2 123 87 36 0.762 6.36 3.04 Intr + 158062 158208 147 0 0 59 47 102 0.312 3.63 3.05 Term + 162940 162975 36 0 0 109 49 73 0.741 2.84 3.06 PlyA + 166519 166524 6 1.05 4.02 PlyA - 167758 167753 6 1.05 4.01 Sngl - 173437 173093 345 1 0 82 42 220 0.366 12.84 4.00 Prom - 174154 174115 40 -5.66 5.00 Prom + 177128 177167 40 -8.86 5.01 Init + 177953 178020 68 1 2 96 88 58 0.955 7.14 5.02 Intr + 184287 184451 165 0 0 76 55 62 0.529 0.78 5.03 Intr + 185200 185303 104 1 2 113 84 120 0.937 13.92 5.04 Intr + 186377 186538 162 0 0 30 83 229 0.983 16.55 5.05 Intr + 191761 191918 158 2 2 95 96 95 0.997 10.73 5.06 Intr + 193021 193134 114 0 0 57 94 204 0.962 18.54 5.07 Intr + 195228 195341 114 2 0 78 113 158 0.973 17.94 5.08 Intr + 196129 196205 77 0 2 115 81 153 0.999 15.61 5.09 Term + 196844 196997 154 2 1 93 42 211 0.999 14.39 5.10 PlyA + 197139 197144 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 23280 23046 235 2 1 40 36 120 0.833 -2.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:73773118_73973378|GENSCAN_predicted_peptide_1|256_aa MAASVHVEEGARLLSQPQRGGPWEPNELQQTNGERQGAGLAWVFGQRACFQTPWERGCRV RSSVCTARGRAAAKDERNLGKGGILLSISRPYKTKPTHGIGKYKHLIKAEEPKKKKGKVE VRAINLGTDYEYGVLNIHLTAYDMTLAESYAQYVHNLCNSLSIKVEESYAMPTKTIEVLQ LQDQGSKMLLDSVLTTHERVVQISGLSATFAEIFLEIIQSSLPEGVRLSVKEHTEEDFKG RFKARPELEELLAKLK >gi568815587r:73773118_73973378|GENSCAN_predicted_CDS_1|771_bp atggccgccagcgtccatgtggaggagggagcgcgcctcctctcgcagccccagaggggc gggccctgggaaccgaacgagcttcagcagaccaatggggagaggcaaggggcggggttg gcctgggtcttcggccagagggcgtgttttcagacgccctgggaacgcggctgcagggtc cggtcttcggtttgcacagctagaggccgcgcagcagcaaaggatgagcggaaccttgga aaaggtggaattctactaagtatcagtcggccctacaagacaaagcccacccacggcatt ggaaagtacaagcacttaattaaagcagaagagcccaagaagaagaagggaaaagtggaa gtgagagccattaatttggggacagattatgaatatggggttttaaatattcatctgact gcatatgatatgaccctggcagagagttatgcccagtatgttcacaacctctgcaactct ctctccattaaagtcgaggaaagttatgcaatgccaaccaaaaccatagaagtgttgcag ttgcaggaccaaggcagcaaaatgctcctggactcagtgcttaccacccatgagcgagtg gttcagatcagcggtttgagtgctacgtttgcagaaattttcttggaaataatccaaagc agtcttcctgaaggagtcagactgtcagtgaaggagcacactgaagaagacttcaaggga cgattcaaagctcgaccagaactggaagaactgttggccaagttgaagtag >gi568815587r:73773118_73973378|GENSCAN_predicted_peptide_2|118_aa MAPASASDESLRLLPLMAEGEGEPVYAEITWMSTSVPQGHTWTQRVKKDDEEEDPLDQLI SRSGCAASHFAVQECMAQHQDWRQCQPQVQAFKDCMSEQQARRQEELQRRQEQAGAHH >gi568815587r:73773118_73973378|GENSCAN_predicted_CDS_2|357_bp atggcaccagcatctgcttctgatgagagcctcaggctgcttccactcatggcagaagga gaaggggagccagtgtatgcagagatcacatggatgtcaacctcagtccctcaaggccat acctggacccaacgggtgaagaaagacgatgaggaggaggacccgctggaccagctgatc tcccgctctggctgtgctgcctcccactttgcagtgcaggagtgcatggcccagcaccag gactggcggcaatgccagccacaggtgcaggcgttcaaggattgcatgagtgaacagcag gcgaggcggcaagaggagctgcagaggaggcaagaacaagccggtgcccaccactga >gi568815587r:73773118_73973378|GENSCAN_predicted_peptide_3|177_aa MDAQLKIWSAEDASCVVTFKGHKGGILDTAIVDRGRNVVSASRDGTARLWDCGRSACLGV LADCGSSINGVAVGAADNSINLGSPEQMPSDGSCFIVQQDLDYVTELTGADCDPVYKGSL FYIATTFVRSFSYVSRVDVLVVIYAVTLDSCSEIQTQTEGKETSSQKTAIVIMAHSQ >gi568815587r:73773118_73973378|GENSCAN_predicted_CDS_3|534_bp atggatgcccagctgaagatatggtcagctgaagatgctagctgcgtggtgaccttcaaa ggtcacaaaggaggtatcctggatacagccatcgttgatcgggggaggaatgtggtgtct gcttctcgagatgggacagcacgactttgggattgtgggcgctcagcctgcttgggagtc cttgcagattgtggttcttctatcaatggagtggcggtgggtgctgctgacaactccata aaccttggctcccctgagcagatgcccagtgatggaagctgttttattgtccagcaagac ttagactatgtcactgagctcactggggctgactgtgaccctgtgtacaagggctcactc ttctacattgccaccacattcgtccgctccttttcatacgtctccagagtcgatgtcctg gttgtgatatatgcagtgacccttgacagctgctctgaaatccagacccaaacagaagga aaggagaccagctctcagaaaaccgccatcgtcatcatggcccattctcaatga >gi568815587r:73773118_73973378|GENSCAN_predicted_peptide_4|114_aa MELDEWERTGRDFKKAYKDGAKIPVSVWSVWALIKAAPEPFQTDDEADSDEEEEDECKKL TSDSEREAQEPEEIIEKKGKLKKTYFTGPSAPPAELSEWPPPPSPTMGEKVKQL >gi568815587r:73773118_73973378|GENSCAN_predicted_CDS_4|345_bp atggagttggatgaatgggaaagaactggcagagattttaaaaaggcatataaagatgga gccaaaattccagtttctgtttggtcagtgtgggcgttaataaaggcagctcctgagcca tttcaaacagatgatgaggcagattcagatgaagaagaggaggatgagtgtaagaaacta acatcagattctgaacgcgaggcacaggagccggaggaaatcatagaaaagaaaggaaag ctgaaaaagacatattttactggtccatcagctccgcctgctgaattaagtgaatggcca cctcctccctctcccacaatgggtgagaaagtgaagcagctgtaa >gi568815587r:73773118_73973378|GENSCAN_predicted_peptide_5|371_aa MGQDYYSVLGITRNSEDAQIKQATWANGPFIKLSTNHPIRSISPPPRVTADTYSQTLRCL PFRNRPAGTTCINPPSSRYRRLALKHHPLKSNEPSSAEIFRQIAEAYDVLSDPMKRGIYD KFGEEGLKGGIPLEFGSQTPWTTGYVFHGKPEKVFHEFFGGNNPFSEFFDAEGSEVDLNF GGLQGRGVKKQDPQVERDLYLSLEDLFFGCTKKIKISRRVLNEDGYSSTIKDKILTIDVK PGWRQGTRITFEKEGDQGPNIIPADIIFIVKEKLHPRFRRENDNLFFVNPIPLGKALTCC TVEVRTLDDRLLNIPINDIIHPKYFKKVPGEGMPLPEDPTKKGDLFIFFDIQFPTRLTPQ KKQMLRQALLT >gi568815587r:73773118_73973378|GENSCAN_predicted_CDS_5|1116_bp atgggccaggattattactctgtgctcgggatcactcgcaattcagaggatgcccagatc aagcaggccacctgggcgaatggtccctttattaagctctccacaaaccacccaatccgc tccatctctcccccaccccgggtcactgccgatacatattctcagacgctgcgatgtctg cccttcagaaacaggccagctggtacgacttgtatcaaccctccctcctccaggtaccgc agactcgcccttaagcaccacccgttgaagtcaaatgagccgtcttcagcagagattttc aggcaaatagcagaggcctacgacgtgctgagtgaccccatgaagagaggcatctacgac aagtttggagaagagggcctgaagggtgggattcctttggagtttggatcccagacccca tggacaactggttacgtcttccatggcaaacctgaaaaggtgttccacgagttctttggt ggaaacaaccccttcagtgagttttttgatgcagaaggaagtgaggtagatttgaacttt ggggggctccagggccgaggggtcaagaagcaggacccccaagtcgaacgggatctctac ctgtccctggaggacttattctttggctgcaccaaaaaaattaagatctccagaagggtg ctgaacgaggatgggtactcctccaccatcaaggacaagatcctgaccattgatgtgaag cccggttggaggcagggcacacgcatcacctttgagaaggaaggggaccagggccccaac atcatcccagcagacatcattttcatcgtaaaggagaagctacaccctcgcttccgcagg gagaatgacaacctcttcttcgtgaaccccatccctcttggcaaggctctcacctgctgc actgtggaggtgaggaccctagatgaccgtctgctcaacatccccatcaatgacatcatc caccccaaatacttcaagaaggtgccaggggaggggatgccattgccggaggaccccact aagaaaggggatctcttcatcttcttcgacatccagttccccacccgcctcacaccccag aagaagcagatgctgcgccaggcattgctgacatga